Use Kubernetes Operators for new inference capabilities in Amazon SageMaker that reduce LLM deployment costs by 50% on average

Favorite We are excited to announce a new version of the Amazon SageMaker Operators for Kubernetes using the AWS Controllers for Kubernetes (ACK). ACK is a framework for building Kubernetes custom controllers, where each controller communicates with an AWS service API. These controllers allow Kubernetes users to provision AWS resources

Read More
Shared by AWS Machine Learning April 20, 2024

Generate customized, compliant application IaC scripts for AWS Landing Zone using Amazon Bedrock

Favorite Migrating to the cloud is an essential step for modern organizations aiming to capitalize on the flexibility and scale of cloud resources. Tools like Terraform and AWS CloudFormation are pivotal for such transitions, offering infrastructure as code (IaC) capabilities that define and manage complex cloud environments with precision. However,

Read More
Shared by AWS Machine Learning April 19, 2024

Open source observability for AWS Inferentia nodes within Amazon EKS clusters

Favorite Recent developments in machine learning (ML) have led to increasingly large models, some of which require hundreds of billions of parameters. Although they are more powerful, training and inference on those models require significant computational resources. Despite the availability of advanced distributed training libraries, it’s common for training and

Read More
Shared by AWS Machine Learning April 18, 2024

Uncover hidden connections in unstructured financial data with Amazon Bedrock and Amazon Neptune

Favorite In asset management, portfolio managers need to closely monitor companies in their investment universe to identify risks and opportunities, and guide investment decisions. Tracking direct events like earnings reports or credit downgrades is straightforward—you can set up alerts to notify managers of news containing company names. However, detecting second

Read More
Shared by AWS Machine Learning April 18, 2024