Efficient continual pre-training LLMs for financial domains

Large language models (LLMs) are generally trained on large publicly available datasets that are domain agnostic. For example, Meta’s Llama models are trained on datasets such as CommonCrawl, C4, Wikipedia, and ArXiv. These datasets encompass a broad range of topics and domains. Although the resulting models yield amazingly good

Shared by AWS Machine Learning March 29, 2024

Advanced RAG patterns on Amazon SageMaker

Today, customers in all industries—whether it’s financial services, healthcare and life sciences, travel and hospitality, media and entertainment, telecommunications, software as a service (SaaS), or even proprietary model providers—are using large language models (LLMs) to build applications like question answering (QnA) chatbots, search engines, and knowledge bases. These

Shared by AWS Machine Learning March 29, 2024

AutoBNN: Probabilistic time series forecasting with compositional Bayesian neural networks

Posted by Urs Köster, Software Engineer, Google Research

Time series problems are ubiquitous, from forecasting weather and traffic patterns to understanding economic trends. Bayesian approaches start with an assumption about the data’s patterns (prior probability), collect evidence (e.g., new time series data), and continuously update that assumption to form

Shared by Google AI Technology March 28, 2024

Achieve DevOps maturity with BMC AMI zAdviser Enterprise and Amazon Bedrock

In software engineering, there is a direct correlation between team performance and building robust, stable applications. The data community aims to adopt the rigorous engineering principles commonly used in software development into their own practices, which includes systematic approaches to design, development, testing, and maintenance. This requires carefully combining

Shared by AWS Machine Learning March 28, 2024

Build a receipt and invoice processing pipeline with Amazon Textract

In today’s business landscape, organizations are constantly seeking ways to optimize their financial processes, enhance efficiency, and drive cost savings. One area that holds significant potential for improvement is accounts payable. At a high level, the accounts payable process includes receiving and scanning invoices, extracting the relevant data

Shared by AWS Machine Learning March 27, 2024

Best practices for building secure applications with Amazon Transcribe

Amazon Transcribe is an AWS service that allows customers to convert speech to text in either batch or streaming mode. It uses machine learning–powered automatic speech recognition (ASR), automatic language identification, and post-processing technologies. Amazon Transcribe can be used for transcription of customer care calls, multiparty conference calls, and

Shared by AWS Machine Learning March 26, 2024