Efficient continual pre-training LLMs for financial domains

By Admin 28/03/2024

AWS Machine Learning Blog: Large language models (LLMs) are generally trained on large publicly available datasets that are domain agnostic. For example, Meta’s Llama models are trained on datasets such as CommonCrawl, C4, Wikipedia, and ArXiv. These datasets encompass a broad range of topics and domains. Although the resulting models yield amazingly good results for […]

Large language models use a surprisingly simple mechanism to retrieve some stored knowledge

By Admin 25/03/2024

MIT News – Artificial intelligence: Large language models, such as those that power popular artificial intelligence chatbots like ChatGPT, are incredibly complex. Even though these models are being used as tools in many areas, such as customer support, code generation, and language translation, scientists still don’t fully grasp how they work. In an effort to […]

Best practices to build generative AI applications on AWS

By Admin 14/03/2024

AWS Machine Learning Blog: Generative AI applications driven by foundational models (FMs) are enabling organizations with significant business value in customer experience, productivity, process optimization, and innovations. However, adoption of these FMs involves addressing some key challenges, including quality output, data privacy, security, integration with organization data, cost, and skills to deliver. In this post, […]

Techniques and approaches for monitoring large language models on AWS

By Admin 26/02/2024

AWS Machine Learning Blog: Large Language Models (LLMs) have revolutionized the field of natural language processing (NLP), improving tasks such as language translation, text summarization, and sentiment analysis. However, as these models continue to grow in size and complexity, monitoring their performance and behavior has become increasingly challenging. Monitoring the performance and behavior of LLMs […]

Deep neural networks show promise as models of human hearing

By Admin 13/12/2023

MIT News – Artificial intelligence: Computational models that mimic the structure and function of the human auditory system could help researchers design better hearing aids, cochlear implants, and brain-machine interfaces. A new study from MIT has found that modern computational models derived from machine learning are moving closer to this goal. In the largest study […]

Build an end-to-end MLOps pipeline using Amazon SageMaker Pipelines, GitHub, and GitHub Actions

By Admin 13/12/2023

AWS Machine Learning Blog: Machine learning (ML) models do not operate in isolation. To deliver value, they must integrate into existing production systems and infrastructure, which necessitates considering the entire ML lifecycle during design and development. ML operations, known as MLOps, focus on streamlining, automating, and monitoring ML models throughout their lifecycle. Building a robust […]

Frugality meets Accuracy: Cost-efficient training of GPT NeoX and Pythia models with AWS Trainium

By Admin 12/12/2023

AWS Machine Learning Blog: Large language models (or LLMs) have become a topic of daily conversations. Their quick adoption is evident from the time required to reach 100 million users, which has gone from “4.5yrs by facebook” to an all-time low of mere “2 months by ChatGPT.” A generative pre-trained transformer (GPT) […]

Automated system teaches users when to collaborate with an AI assistant

By Admin 08/12/2023

MIT News – Artificial intelligence: Artificial intelligence models that pick out patterns in images can often do so better than human eyes, but not always. If a radiologist is using an AI model to help her determine whether a patient’s X-rays show signs of pneumonia, when should she trust the model’s advice and when […]

Scale foundation model inference to hundreds of models with Amazon SageMaker – Part 1

By Admin 30/11/2023

AWS Machine Learning Blog: As democratization of foundation models (FMs) becomes more prevalent and demand for AI-augmented services increases, software as a service (SaaS) providers are looking to use machine learning (ML) platforms that support multiple tenants, both data scientists internal to their organization and external customers. More and more companies are realizing the value of […]

Reduce model deployment costs by 50% on average using the latest features of Amazon SageMaker

By Admin 30/11/2023

AWS Machine Learning Blog: As organizations deploy models to production, they are constantly looking for ways to optimize the performance of their foundation models (FMs) running on the latest accelerators, such as AWS Inferentia and GPUs, so they can reduce their costs and decrease response latency to provide the best experience to end-users. However, some […]