Posted On: Sep 14, 2022
Amazon SageMaker Studio is a fully integrated development environment (IDE) for machine learning (ML) that enables data scientists and developers to perform every step of the machine learning workflow, from preparing data to building, training, tuning, and deploying models. SageMaker Studio comes with fast start, collaborative notebooks. You can quickly launch notebooks in Studio, easily dial up or down the underlying compute resources without interrupting your work, and even share your notebook as a link in few simple clicks. Today, we are excited to announce that Amazon SageMaker Studio notebooks now come with built-in integration with Amazon Glue Interactive Sessions. Data scientists and data engineers can use the serverless Apache Spark runtime environment managed by Glue Interactive Sessions to interactively prepare data at scale right in their Studio notebooks.
Using Glue Interactive Session from SageMaker Studio Notebooks is easy; you simply choose the built-in Glue PySpark or Glue Spark kernel for your Studio notebook to initialize interactive, serverless Spark sessions within seconds, without having to worry about provisioning and managing complex compute cluster infrastructure. Once initialized, you can quickly browse the Glue data catalog, run large queries, and interactively analyze and prepare data using Spark, right in your Studio notebook. You can then use the prepared data to build, train, tune and deploy models using the purpose-built ML tools within SageMaker Studio.
Glue Interactive Sessions in SageMaker Studio is generally available in Amazon Web Services China (Beijing) Region, operated by Sinnet and Amazon Web Services China (Ningxia) Region, operated by NWCD. To learn more about SageMaker Studio visit the SageMaker user guide.