AWS expands its widely adopted machine learning service, combining comprehensive data, analytics, and AI capabilities
At AWS re:Invent, Amazon Web Services, Inc. (AWS), an Amazon.com, Inc. company (NASDAQ: AMZN), today announced the next generation of Amazon SageMaker, unifying the capabilities customers need for fast SQL analytics, petabyte-scale big data processing, data exploration and integration, model development and training, and generative artificial intelligence (AI) into one integrated platform.
- The new SageMaker Unified Studio makes it easy for customers to find and access data from across their organization and brings together purpose-built AWS analytics, machine learning (ML), and AI capabilities so customers can act on their data using the best tool for the job across all types of common data use cases, assisted by Amazon Q Developer along the way.
- SageMaker Catalog and built-in governance capabilities allow the right users to access the right data, models, and development artifacts for the right purpose.
- The new SageMaker Lakehouse unifies data across data lakes, data warehouses, operational databases, and enterprise applications, making it easy to access and work with data from within SageMaker Unified Studio and using familiar AI and ML tools or query engines compatible with Apache Iceberg.
- New zero-ETL integrations with leading Software-as-a-Service (SaaS) applications make it easy to access data from third-party SaaS applications in SageMaker Lakehouse and Amazon Redshift for analytics or ML without complex data pipelines.
- Customers and partners including Adastra, Confluent, Etleap, idealista, Informatica, Lennar, Natera, NatWest Group, NTT Data, Roche, Tableau, Toyota Motor North America, and more are already exploring the next generation of SageMaker to bring together their data, analytics, and AI initiatives.
"We are seeing a convergence of analytics and AI, with customers using data in increasingly interconnected ways—from historical analytics to ML model training and generative AI applications," said Swami Sivasubramanian, vice president of Data and AI at AWS. "To support these workloads, many customers already use combinations of our purpose-built analytics and ML tools, such as Amazon SageMaker—the de facto standard for working with data and building ML models—Amazon EMR, Amazon Redshift, Amazon S3 data lakes, and AWS Glue. The next generation of SageMaker brings together these capabilities—along with some exciting new features—to give customers all the tools they need for data processing, SQL analytics, ML model development and training, and generative AI, directly within SageMaker."
Collaborate and build faster with Amazon SageMaker Unified Studio
Today, hundreds of thousands of customers use SageMaker to build, train, and deploy ML models. Many customers also rely on the comprehensive set of purpose-built analytics services from AWS to support a wide range of workloads, including SQL analytics, search analytics, big data processing, and streaming analytics. Increasingly, customers are not using these tools in isolation; rather, they are using a combination of analytics, ML, and generative AI to derive insights and power new experiences for their users. These customers would benefit from a unified environment that brings together familiar AWS tools for analytics, ML, and generative AI, along with easy access to all of their data and the ability to easily collaborate on data projects with other members of their team or organization.
The next generation of SageMaker includes a new, unified studio that gives customers a single data and AI development environment where users can find and access all of the data in their organization, act on it using the best tool for the job across all types of common data use cases, and collaborate within teams and across roles to scale their data and AI initiatives. SageMaker Unified Studio brings together functionality and tools from the range of standalone “studios,” query editors, and visual tools that customers enjoy today in Amazon Bedrock, Amazon EMR, Amazon Redshift, AWS Glue, and the existing SageMaker Studio. This makes it easy for customers to access and use these capabilities to discover and prepare data, author queries or code, process data, and build ML models. Amazon Q Developer assists along the way to support development tasks such as data discovery, coding, SQL generation, and data integration. For example, a user could ask Amazon Q, “What data should I use to get a better idea of product sales?” or “Generate a SQL to calculate total revenue by product category.” Users can securely publish and share data, models, applications, and other artifacts with members of their team or organization, accelerating discoverability and usage of the data assets. With the Amazon Bedrock integrated development environment (IDE) in SageMaker Unified Studio, users can build and deploy generative AI applications quickly and easily using Amazon Bedrock’s selection of high-performing foundation models and tools such as Agents, Guardrails, Knowledge Bases, and Flows. SageMaker Unified Studio comes with data discovery, sharing, and governance capabilities built in, so analysts, data scientists, and engineers can easily search and find the right data they need for their use case, while applying desired security controls and permissions, maintaining access control, and securing their data.
NatWest Group, a leading bank in the United Kingdom serving more than 19 million customers, uses multiple tools for data engineering, SQL analytics, ML, and generative AI workloads. With SageMaker Unified Studio, NatWest Group will have a single unified environment across the organization to support these workloads and anticipates a 50% reduction in the time required for their data users to access analytics and AI capabilities, enabling them to spend less time managing multiple services and more time innovating for their customers.
Meet enterprise security needs with Amazon SageMaker data and AI governance
The next generation of SageMaker simplifies the discovery, governance, and collaboration of data and AI across an organization. With SageMaker Catalog, built on Amazon DataZone, administrators can define and implement consistent access policies using a single permission model with granular controls, while data workers from across teams can securely discover and access approved data and models enriched with business context metadata created by generative AI. Administrators can easily define and enforce permissions across models, tools, and data sources, while customized safeguards help make AI applications secure and compliant. Customers can also safeguard their AI models with data classification, toxicity detection, guardrails, and responsible AI policies within SageMaker.
Reduce data silos and unify data with Amazon SageMaker Lakehouse
Today, more than one million data lakes are built on Amazon Simple Storage Service (Amazon S3), allowing customers to centralize their data assets and derive value with AWS analytics, AI, and ML tools. Data lakes make it possible for customers to store their data as-is—making it easy to combine data from multiple sources. Customers may have data spread across multiple data lakes, as well as a data warehouse, and would benefit from a simple way to unify all of this data.
SageMaker Lakehouse provides unified access to data stored in Amazon S3 data lakes, Redshift data warehouses, and federated data sources, reducing data silos and making it easy to query data, no matter how and where it is physically stored. With this new Apache Iceberg-compatible lakehouse capability in SageMaker, customers can access and work with all of their data from within SageMaker Unified Studio, as well as with familiar AI and ML tools and query engines compatible with Apache Iceberg open standards. Now, customers can use their preferred analytics and ML tools on their data, no matter how and where it is physically stored, to support use cases including SQL analytics, ad-hoc querying, data science, ML, and generative AI. SageMaker Lakehouse provides integrated, fine-grained access controls that are consistently applied across the data in all analytics and AI tools in the lakehouse, enabling customers to define permissions once and securely share data across their organization.
Roche, a global pioneer in pharmaceuticals and diagnostics focused on advancing science to improve people's lives, will use SageMaker Lakehouse to unify data from Redshift and Amazon S3 data lakes, eliminating data silos, enhancing interoperability among teams, and allowing users to seamlessly leverage data without the need for costly data movement or duplicated security access controls. With SageMaker Lakehouse, Roche anticipates a 40% reduction in data processing time, allowing them to focus more on driving their business forward and less on data management.
Quickly and easily access SaaS data with the new zero-ETL integrations with SaaS applications
To truly leverage data across their operations, businesses need seamless access to all their data, regardless of its location. That is why AWS has invested in a zero-ETL future, where data integration is no longer a tedious, manual effort, and customers can easily get their data where they need it. This includes zero-ETL integrations for Amazon Aurora MySQL and PostgreSQL, Amazon RDS for MySQL, and Amazon DynamoDB with Amazon Redshift, which help customers quickly and easily access data from popular relational and non-relational databases in Redshift and SageMaker Lakehouse for analytics and ML. In addition to operational databases and data lakes, many customers also have critical enterprise data stored in SaaS applications and would benefit from easy access to this data for analytics and ML.
The new zero-ETL integrations with SaaS applications make it easy for customers to access their data from applications such as Zendesk and SAP in SageMaker Lakehouse and Redshift for analytics and AI. This removes the need for data pipelines, which can be challenging and costly to build, complex to manage, and prone to errors that may delay access to time-sensitive insights. Zero-ETL integrations for SaaS applications incorporate best practices for full data sync, detection of incremental updates and deletes, and target merge operations.
Organizations of all sizes and across industries, including Infosys, Intuit, and Woolworths, are already benefiting from AWS zero-ETL integrations to quickly and easily connect and analyze data without building and managing data pipelines. With the zero-ETL integrations for SaaS applications, for example, online real estate platform idealista will be able to simplify their data extraction and ingestion processes, eliminating the need for multiple pipelines to access data stored in third-party SaaS applications and freeing their data engineering team to focus on deriving actionable insights from data rather than building and managing infrastructure.
The next generation of SageMaker is available today. SageMaker Unified Studio is currently in preview and will be made generally available soon.
To learn more, visit:
- The AWS Blog for details on today’s announcement.
- The Amazon SageMaker page to learn more about the service.
- The SageMaker Unified Studio page, SageMaker data and AI governance page, and SageMaker Lakehouse page to learn how companies are using these capabilities.
- The AWS re:Invent page for more details on everything happening at AWS re:Invent.
About Amazon Web Services
Since 2006, Amazon Web Services has been the world’s most comprehensive and broadly adopted cloud. AWS has been continually expanding its services to support virtually any workload, and it now has more than 240 fully featured services for compute, storage, databases, networking, analytics, machine learning and artificial intelligence (AI), Internet of Things (IoT), mobile, security, hybrid, media, and application development, deployment, and management from 108 Availability Zones within 34 geographic regions, with announced plans for 18 more Availability Zones and six more AWS Regions in Mexico, New Zealand, the Kingdom of Saudi Arabia, Taiwan, Thailand, and the AWS European Sovereign Cloud. Millions of customers—including the fastest-growing startups, largest enterprises, and leading government agencies—trust AWS to power their infrastructure, become more agile, and lower costs. To learn more about AWS, visit aws.amazon.com.
About Amazon
Amazon is guided by four principles: customer obsession rather than competitor focus, passion for invention, commitment to operational excellence, and long-term thinking. Amazon strives to be Earth’s Most Customer-Centric Company, Earth’s Best Employer, and Earth’s Safest Place to Work. Customer reviews, 1-Click shopping, personalized recommendations, Prime, Fulfillment by Amazon, AWS, Kindle Direct Publishing, Kindle, Career Choice, Fire tablets, Fire TV, Amazon Echo, Alexa, Just Walk Out technology, Amazon Studios, and The Climate Pledge are some of the things pioneered by Amazon. For more information, visit amazon.com/about and follow @AmazonNews.
View source version on businesswire.com: https://www.businesswire.com/news/home/20241203118816/en/
Contacts
Amazon.com, Inc.
Media Hotline
Amazon-pr@amazon.com
www.amazon.com/pr