Researchers developed Medusa, a framework to speed up LLM inference by adding extra heads that predict multiple tokens simultaneously. This post demonstrates how to use Medusa-1, the first version of the framework, to speed up an LLM by fine-tuning it on Amazon SageMaker AI, and confirms the speedup with deployment and a simple load test.
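The core speedup idea can be sketched in a few lines: draft tokens proposed by the extra heads are checked against the base model, and the longest agreeing prefix is accepted. This is a minimal illustrative sketch with a toy stand-in for the base model, not the actual Medusa implementation:

```python
def verify_candidates(base_model, context, candidates):
    """Greedy Medusa-style verification (illustrative): accept the longest
    prefix of the draft tokens that the base model itself would have produced."""
    accepted = []
    for tok in candidates:
        if tok == base_model(context + accepted):
            accepted.append(tok)
        else:
            break
    # The base model's own prediction for the first rejected position is
    # always correct, so at least one token is emitted per step.
    accepted.append(base_model(context + accepted))
    return accepted

# Toy "base model": always predicts the next letter of the alphabet.
base = lambda ctx: chr(ord(ctx[-1]) + 1)
print(verify_candidates(base, ["a"], ["b", "c", "x"]))  # → ['b', 'c', 'd']
```

When the draft heads are accurate, several tokens are accepted per base-model step, which is where the inference speedup comes from.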
This transcription then serves as the input for a powerful LLM, which draws upon its vast knowledge base to provide personalized, context-aware responses tailored to your specific situation. LLM integration: The preprocessed text is fed into a powerful LLM tailored for the healthcare and life sciences (HCLS) domain.
Fine-tuning a pre-trained large language model (LLM) allows users to customize the model to perform better on domain-specific tasks or align more closely with human preferences. You can use supervised fine-tuning (SFT) and instruction tuning to train the LLM to perform better on specific tasks using human-annotated datasets and instructions.
Large language models (LLMs) have achieved remarkable success in various natural language processing (NLP) tasks, but they may not always generalize well to specific domains or tasks. You may need to customize an LLM to adapt to your unique use case, improving its performance on your specific dataset or task.
Introduction to AI and Machine Learning on Google Cloud This course introduces Google Cloud’s AI and ML offerings for predictive and generative projects, covering technologies, products, and tools across the data-to-AI lifecycle. It covers how to develop NLP projects using neural networks with Vertex AI and TensorFlow.
“Building LLMs for Production: Enhancing LLM Abilities and Reliability with Prompting, Fine-Tuning, and RAG” is now available on Amazon! The application topics include prompting, RAG, agents, fine-tuning, and deployment, all essential topics in an AI Engineer’s toolkit. The de facto manual for AI Engineering.
Large Language Models (LLMs) have revolutionized the field of natural language processing (NLP), improving tasks such as language translation, text summarization, and sentiment analysis. Monitoring the performance and behavior of LLMs is a critical task for ensuring their safety and effectiveness.
This generative AI task is called text-to-SQL: generating SQL queries from natural language and converting text into semantically correct SQL. With the emergence of large language models (LLMs), NLP-based SQL generation has undergone a significant transformation. We use a model available on Amazon Bedrock as our LLM.
We formulated a text-to-SQL approach whereby a user’s natural language query is converted to a SQL statement using an LLM. The query results are then provided back to an LLM, which is asked to answer the user’s original question given that data.
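The two-call pattern described above can be sketched end to end with an in-memory SQLite table and a canned stand-in for the LLM (both hypothetical, for illustration only):

```python
import sqlite3

def answer_question(question, conn, llm):
    """Text-to-SQL in two LLM calls: one generates SQL from the question,
    one turns the returned rows into a natural-language answer."""
    sql = llm(f"Write SQL for: {question}")
    rows = conn.execute(sql).fetchall()
    return llm(f"Answer {question!r} given rows: {rows}")

# Stand-in for a real LLM: canned responses keyed on the prompt prefix.
def fake_llm(prompt):
    if prompt.startswith("Write SQL"):
        return "SELECT COUNT(*) FROM orders WHERE total > 100"
    return "There are 2 large orders."

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER, total REAL)")
conn.executemany("INSERT INTO orders VALUES (?, ?)", [(1, 50), (2, 150), (3, 200)])
print(answer_question("How many orders over $100?", conn, fake_llm))
```

In a production version, the generated SQL should be validated (or run with read-only credentials) before execution, since it comes from model output.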
In part 1 of this blog series, we discussed how a large language model (LLM) available on Amazon SageMaker JumpStart can be fine-tuned for the task of radiology report impression generation. This post then seeks to assess whether prompt engineering is more performant for clinical NLP tasks compared to the RAG pattern and fine-tuning.
Machine learning (ML) engineers have traditionally focused on striking a balance between model training and deployment cost vs. performance. This is important because training ML models and then using the trained models to make predictions (inference) can be highly energy-intensive tasks.
The LLM analysis provides a violation result (Y or N) and explains the rationale behind the model’s decision regarding policy violation. The audio moderation workflow activates the LLM’s policy evaluation only when the toxicity analysis exceeds a set threshold. LLMs, in contrast, offer a high degree of flexibility.
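The gating logic — invoking the expensive LLM policy check only above a toxicity threshold — reduces to a short function. A minimal sketch with a hypothetical policy-check callable and threshold value:

```python
def moderate_audio(transcript, toxicity_score, llm_policy_check, threshold=0.5):
    """Run the (costly) LLM policy evaluation only when the cheap toxicity
    score exceeds the threshold; otherwise pass the clip through."""
    if toxicity_score < threshold:
        return {"violation": "N", "rationale": "below toxicity threshold"}
    return llm_policy_check(transcript)

# Stand-in for the LLM policy evaluator.
check = lambda text: {"violation": "Y", "rationale": "matches harassment policy"}

print(moderate_audio("...", 0.9, check))  # LLM consulted
print(moderate_audio("...", 0.1, check))  # LLM skipped entirely
```

Only clips that trip the cheap first-stage detector pay the latency and cost of an LLM call.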
You will also find useful tools from the community, collaboration opportunities for diverse skill sets, and, in my industry-special What's AI section, I will dive into the most sought-after role: LLM developers. But who exactly is an LLM developer, and how are they different from software developers and ML engineers?
Snorkel AI held its Enterprise LLM Virtual Summit on October 26, 2023, drawing an engaged crowd of more than 1,000 attendees across three hours and eight sessions that featured 11 speakers. How to fine-tune and customize LLMs: Hoang Tran, ML Engineer at Snorkel AI, outlined how he sees LLMs creating value in enterprise environments.
Historically, natural language processing (NLP) would be a primary research and development expense. In 2024, however, organizations are using large language models (LLMs), which require relatively little focus on NLP, shifting research and development from modeling to the infrastructure needed to support LLM workflows.
Solution overview: In the following sections, we share how you can develop an example ML project with Code Editor on Amazon SageMaker Studio. We will deploy a Mistral-7B large language model (LLM) to an Amazon SageMaker real-time endpoint using a built-in container from Hugging Face.
Furthermore, we dive deep into the most common generative AI use case, text-to-text applications, and into LLM operations (LLMOps), a subset of FMOps. MLOps engineers are responsible for providing a secure environment for data scientists and ML engineers to productionize ML use cases.
Topics include: Agentic AI design patterns; LLMs and RAG for agents; agent architectures and chaining; evaluating AI agent performance; building with LangChain and LlamaIndex; real-world applications of autonomous agents. Who should attend: Data scientists, developers, AI architects, and ML engineers seeking to build cutting-edge autonomous systems.
We have included a sample project to quickly deploy an Amazon Lex bot that consumes a pre-trained open-source LLM. This mechanism allows an LLM to recall previous interactions to keep the conversation’s context and pace. We also use LangChain, a popular framework that simplifies LLM-powered applications.
Thomson Reuters Labs, the company’s dedicated innovation team, has been integral to its pioneering work in AI and natural language processing (NLP). A key milestone was the launch of Westlaw Is Natural (WIN) in 1992. This technology was one of the first of its kind, using NLP for more efficient and natural legal research.
Services: AI Solution Development, ML Engineering, Data Science Consulting, NLP, AI Model Development, AI Strategic Consulting, Computer Vision. Generative AI integration service: proposes to train generative AI on clients’ data and add new features to products.
Amazon Comprehend is a natural language processing (NLP) service that uses ML to uncover insights and relationships in unstructured data, with no infrastructure management or ML experience required. Amazon SageMaker provides purpose-built tools for ML teams to automate and standardize processes across the ML lifecycle.
TL;DR Finding an optimal set of hyperparameters is essential for efficient and effective training of Large Language Models (LLMs). The key LLM hyperparameters influence the model size, learning rate, learning behavior, and token generation process. Hyperparameters set and tuned during pre-training influence the total size of an LLM.
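The claim that pre-training hyperparameters determine model size can be made concrete with a common back-of-the-envelope rule for decoder-only transformers: roughly 12·n_layers·d_model² weights in the attention and MLP blocks (assuming a 4× MLP expansion), plus the embedding matrix. The formula below is an approximation that ignores biases and layer norms:

```python
def approx_param_count(n_layers, d_model, vocab_size):
    """Rough transformer parameter count: per layer ~4*d^2 for the attention
    projections plus ~8*d^2 for a 4x-expansion MLP, plus token embeddings."""
    return 12 * n_layers * d_model**2 + vocab_size * d_model

# A GPT-2-small-like shape (illustrative numbers): 12 layers, d_model=768,
# ~50k vocabulary lands near the familiar ~124M-parameter figure.
print(approx_param_count(12, 768, 50257))  # → 123532032
```

Scaling d_model is quadratic in parameter count while scaling depth is linear, which is why width and depth are usually tuned together.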
We hope that you will enjoy watching the videos and learning more about the impact of LLMs on the world. Closing Keynote: LLMOps: Making LLM Applications Production-Grade Large language models are fluent text generators, but they struggle at generating factual, correct content.
The emergence of Large Language Models (LLMs) like OpenAI's GPT , Meta's Llama , and Google's BERT has ushered in a new era in this field. These LLMs can generate human-like text, understand context, and perform various Natural Language Processing (NLP) tasks.
Jurassic-2 Grande Instruct is a large language model (LLM) by AI21 Labs, optimized for natural language instructions and applicable to various language tasks. By fine-tuning the model with your domain-specific data, you can optimize its performance for your particular use case, such as text summarization or any other NLP task.
Snorkel Foundry will allow customers to programmatically curate unstructured data to pre-train an LLM for a specific domain. Leveraging Data-centric AI for Document Intelligence and PDF Extraction Snorkel AI MLEngineer Ashwini Ramamoorthy highlighted the challenges of extracting entities from semi-structured documents.
Understanding and addressing LLM vulnerabilities, threats, and risks during the design and architecture phases helps teams focus on maximizing the economic and productivity benefits generative AI can bring. This post provides three guided steps to architect risk management strategies while developing generative AI applications using LLMs.
Available in SageMaker AI and SageMaker Unified Studio (preview): Data scientists and ML engineers can access these applications from Amazon SageMaker AI (formerly Amazon SageMaker) and from SageMaker Unified Studio. Deepchecks: Deepchecks specializes in LLM evaluation.
Amazon SageMaker helps data scientists and machine learning (ML) engineers build FMs from scratch, evaluate and customize FMs with advanced techniques, and deploy FMs with fine-grained controls for generative AI use cases that have stringent requirements on accuracy, latency, and cost. Of the six challenges, the LLM met only one.
The second script shows how to query those embeddings with an LLM for RAG-based Q&A. This quick workflow lets you maintain a powerful, scalable knowledge base for any LLM-powered application. He brings deep expertise in building and training models for applications like NLP, data visualization, and real-time analytics.
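The retrieval step behind that workflow — ranking stored chunks by similarity to a query embedding and handing the best matches to the LLM — can be sketched with plain cosine similarity over toy two-dimensional vectors (a real system would use an embedding model and a vector database):

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb)

def retrieve(query_vec, index, k=1):
    """Return the k stored chunks most similar to the query; their text would
    then be placed in the LLM prompt for RAG-style Q&A."""
    ranked = sorted(index, key=lambda item: cosine(query_vec, item[0]), reverse=True)
    return [text for _, text in ranked[:k]]

# Toy index of (embedding, chunk-text) pairs; embeddings are illustrative.
index = [([1.0, 0.0], "SageMaker deploys models"),
         ([0.0, 1.0], "Bedrock hosts foundation models")]
print(retrieve([0.9, 0.1], index))  # → ['SageMaker deploys models']
```

Swapping the toy list for a vector store and the toy vectors for real embeddings preserves exactly this retrieve-then-prompt shape.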
This post dives deep into Amazon Bedrock Knowledge Bases , which helps with the storage and retrieval of data in vector databases for RAG-based workflows, with the objective to improve large language model (LLM) responses for inference involving an organization’s datasets. The LLM response is passed back to the agent.
To give the LLM access to these codes without overwhelming the main prompt, we created lookup tools that the LLM can use to look up sex, race, and state codes. Because data analysts need to filter on complex combinations of factors, this list can get too long to be reliably rewritten by the LLM in the SQL query.
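A lookup tool of this kind is just a small function over a code table that the model can call instead of carrying every code in its context window. A minimal sketch, with a hypothetical table of state codes:

```python
# Hypothetical code table; the real sex/race/state tables live outside the prompt.
STATE_CODES = {"California": "06", "Texas": "48", "New York": "36"}

def lookup_state_code(name: str) -> str:
    """Tool the LLM can invoke to resolve a state name to its code, keeping
    the full table out of the main prompt."""
    try:
        return STATE_CODES[name]
    except KeyError:
        raise ValueError(f"unknown state: {name!r}")

print(lookup_state_code("Texas"))  # → 48
```

Registering such functions as tools lets the model fetch only the handful of codes a given query needs, rather than rewriting a long code list into the SQL itself.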