Why datasets built on public domain might not be enough for AI

Favorite There is tension between copyright laws and large datasets suitable to train large language models. Common Corpus is a dataset that only uses text from copyright-expired sources to bypass the legal issues. It’s a useful achievement, paving the path to research without immediate risk of lawsuits. I also fear

Read More
Shared by voicesofopensource May 7, 2024

Get started with Amazon Titan Text Embeddings V2: A new state-of-the-art embeddings model on Amazon Bedrock

Favorite Embeddings are integral to various natural language processing (NLP) applications, and their quality is crucial for optimal performance. They are commonly used in knowledge bases to represent textual data as dense vectors, enabling efficient similarity search and retrieval. In Retrieval Augmented Generation (RAG), embeddings are used to retrieve relevant

Read More
Shared by AWS Machine Learning May 3, 2024

Amazon Personalize launches new recipes supporting larger item catalogs with lower latency

Favorite Personalized customer experiences are essential for engaging today’s users. However, delivering truly personalized experiences that adapt to changes in user behavior can be both challenging and time-consuming. Amazon Personalize makes it straightforward to personalize your website, app, emails, and more, using the same machine learning (ML) technology used by

Read More
Shared by AWS Machine Learning May 3, 2024

Revolutionize Customer Satisfaction with tailored reward models for your business on Amazon SageMaker

Favorite As more powerful large language models (LLMs) are used to perform a variety of tasks with greater accuracy, the number of applications and services that are being built with generative artificial intelligence (AI) is also growing. With great power comes responsibility, and organizations want to make sure that these

Read More
Shared by AWS Machine Learning May 3, 2024

AWS Inferentia and AWS Trainium deliver lowest cost to deploy Llama 3 models in Amazon SageMaker JumpStart

Favorite Today, we’re excited to announce the availability of Meta Llama 3 inference on AWS Trainium and AWS Inferentia based instances in Amazon SageMaker JumpStart. The Meta Llama 3 models are a collection of pre-trained and fine-tuned generative text models. Amazon Elastic Compute Cloud (Amazon EC2) Trn1 and Inf2 instances,

Read More
Shared by AWS Machine Learning May 3, 2024

CRA standards request draft published

Favorite The European Commission recently published a public draft of the standards request associated with the Cyber Resilience Act (CRA). Anyone who wants to comment on it has until May 16, after which comments will be considered and a final request to the European Standards Organizations (ESOs) will be issued.

Read More
Shared by voicesofopensource May 2, 2024

Automate chatbot for document and data retrieval using Agents and Knowledge Bases for Amazon Bedrock

Favorite Numerous customers face challenges in managing diverse data sources and seek a chatbot solution capable of orchestrating these sources to offer comprehensive answers. This post presents a solution for developing a chatbot capable of answering queries from both documentation and databases, with straightforward deployment. Amazon Bedrock is a fully

Read More
Shared by AWS Machine Learning May 2, 2024

Improving inclusion and accessibility through automated document translation with an open source app using Amazon Translate

Favorite Organizations often offer support in multiple languages, saying “contact us for translations.” However, customers who don’t speak the predominant language often don’t know that translations are available or how to request them. This can lead to poor customer experience and lost business. A better approach is proactively providing information

Read More
Shared by AWS Machine Learning May 2, 2024

Fine-tune and deploy language models with Amazon SageMaker Canvas and Amazon Bedrock

Favorite Imagine harnessing the power of advanced language models to understand and respond to your customers’ inquiries. Amazon Bedrock, a fully managed service providing access to such models, makes this possible. Fine-tuning large language models (LLMs) on domain-specific data supercharges tasks like answering product questions or generating relevant content. In

Read More
Shared by AWS Machine Learning May 2, 2024

Simple guide to training Llama 2 with AWS Trainium on Amazon SageMaker

Favorite Large language models (LLMs) are making a significant impact in the realm of artificial intelligence (AI). Their impressive generative abilities have led to widespread adoption across various sectors and use cases, including content generation, sentiment analysis, chatbot development, and virtual assistant technology. Llama2 by Meta is an example of

Read More
Shared by AWS Machine Learning May 2, 2024