Evaluating large language models (LLMs) is crucial as LLM-based systems become increasingly powerful and relevant in our society. Rigorous testing allows us to understand an LLM's capabilities, limitations, and potential biases, and provides actionable feedback to identify and mitigate risks.
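As a rough illustration of what such testing can look like in practice, here is a minimal evaluation-loop sketch; the `generate` callable and the prompt/reference pairs are hypothetical stand-ins for a real model client and benchmark dataset:

```python
from typing import Callable

def exact_match_eval(generate: Callable[[str], str],
                     dataset: list[tuple[str, str]]) -> float:
    """Run a model over (prompt, reference) pairs and report exact-match accuracy."""
    correct = 0
    for prompt, reference in dataset:
        answer = generate(prompt).strip().lower()
        correct += int(answer == reference.strip().lower())
    return correct / len(dataset)

# Hypothetical usage with a stubbed model client:
dataset = [("What is 2 + 2?", "4"), ("Capital of France?", "paris")]
print(exact_match_eval(lambda p: "4" if "2 + 2" in p else "Paris", dataset))
```

Real evaluation suites add many more metrics (bias probes, refusal rates, rubric-based judging), but they share this same loop structure.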
With access to a wide range of generative AI foundation models (FMs) and the ability to build and train their own machine learning (ML) models in Amazon SageMaker, users want a seamless and secure way to experiment with and select the models that deliver the most value for their business.
Whether you're a seasoned ML engineer or a new LLM developer, these tools will help you become more productive and accelerate the development and deployment of your AI projects.
Researchers developed Medusa, a framework to speed up LLM inference by adding extra heads that predict multiple tokens simultaneously. This post demonstrates how to use Medusa-1, the first version of the framework, to speed up an LLM by fine-tuning it on Amazon SageMaker AI, and confirms the speedup with a deployment and a simple load test.
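To make the core idea concrete, here is a toy PyTorch sketch of Medusa-style decoding heads; the layer shapes and sizes are illustrative assumptions, not the framework's actual implementation:

```python
import torch
import torch.nn as nn

class MedusaStyleHeads(nn.Module):
    """Toy illustration: K extra heads each predict the token K steps ahead
    from the same last hidden state, so one forward pass proposes several
    candidate tokens that are then verified by the base model."""
    def __init__(self, hidden_size: int, vocab_size: int, num_heads: int = 3):
        super().__init__()
        self.heads = nn.ModuleList(
            nn.Linear(hidden_size, vocab_size) for _ in range(num_heads)
        )

    def forward(self, last_hidden: torch.Tensor) -> list[torch.Tensor]:
        # last_hidden: (batch, hidden_size) from the base LLM's final layer
        return [head(last_hidden) for head in self.heads]

heads = MedusaStyleHeads(hidden_size=768, vocab_size=32000)
logits_per_head = heads(torch.randn(1, 768))
candidates = [l.argmax(dim=-1) for l in logits_per_head]  # speculative tokens
```

Medusa-1 fine-tunes heads like these on top of a frozen base model, which is what makes the approach cheap enough to run as a fine-tuning job.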
Transforming AI Performance Across Industries: Future AGI is already delivering impactful results across industries. A Series E sales-tech company used Future AGI's LLM Experimentation Hub to achieve 99% accuracy in its agentic pipeline, compressing weeks of work into just hours.
Fine-tuning a pre-trained large language model (LLM) allows users to customize the model to perform better on domain-specific tasks or align more closely with human preferences. You can use supervised fine-tuning (SFT) and instruction tuning to train the LLM to perform better on specific tasks using human-annotated datasets and instructions.
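A minimal supervised fine-tuning sketch, assuming the Hugging Face transformers and datasets libraries; the model choice, toy data, and hyperparameters are illustrative, not a recommended recipe:

```python
# Minimal SFT sketch; model and hyperparameters are illustrative assumptions.
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          Trainer, TrainingArguments)
from datasets import Dataset

tokenizer = AutoTokenizer.from_pretrained("gpt2")
tokenizer.pad_token = tokenizer.eos_token
model = AutoModelForCausalLM.from_pretrained("gpt2")

# Human-annotated instruction/response pairs (toy example).
pairs = [{"text": "Instruction: Summarize the report.\nResponse: ..."}]

def tokenize(batch):
    out = tokenizer(batch["text"], truncation=True,
                    padding="max_length", max_length=128)
    out["labels"] = out["input_ids"].copy()  # causal-LM loss over the sequence
    return out

ds = Dataset.from_list(pairs).map(tokenize, batched=True, remove_columns=["text"])
trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="sft-out", num_train_epochs=1,
                           per_device_train_batch_size=1),
    train_dataset=ds,
)
trainer.train()
```

Instruction tuning follows the same pattern with the dataset formatted as instruction/response templates; preference alignment methods add a separate reward or preference signal on top.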
Amazon SageMaker is a cloud-based machine learning (ML) platform within the AWS ecosystem that offers developers a seamless and convenient way to build, train, and deploy ML models. He focuses on architecting and implementing large-scale generative AI and classic ML pipeline solutions.
The goal of this blog post is to show you how a large language model (LLM) can be used to perform tasks that require multi-step dynamic reasoning and execution.
Fig 1: Simple execution flow solution overview
In a more complex scheme, you can add multiple layers of validation and provide relevant APIs to increase the success rate of the LLM.
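A minimal sketch of such an execution loop, assuming a hypothetical `call_llm` client and a small registry of callable APIs (both are stand-ins, not the post's actual code):

```python
import json

def call_llm(prompt: str) -> str:
    """Hypothetical LLM client; replace with your provider's API."""
    raise NotImplementedError

APIS = {"get_weather": lambda city: {"city": city, "temp_c": 21}}  # toy registry

def run_agent(task: str, max_steps: int = 5):
    history = []
    for _ in range(max_steps):
        reply = call_llm(f"Task: {task}\nHistory: {history}\n"
                         'Respond with JSON: {"action": ..., "args": ..., "done": ...}')
        step = json.loads(reply)  # production code would validate/retry here
        if step.get("done"):
            return step.get("answer")
        if step.get("action") not in APIS:
            history.append({"error": "invalid action"})  # validation layer
            continue
        result = APIS[step["action"]](**step["args"])  # execute, then feed back
        history.append({"action": step["action"], "result": result})
```

Each validation layer (schema checks, action whitelists, retries) trades latency for a higher end-to-end success rate.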
Currently, I am working on Large Language Model (LLM) based autonomous agents. I have previously worked on sequence models for DNA and RNA, and benchmarks for evaluating the interpretability and fairness of ML models across domains. Specifically, I work on methods that algorithmically generate diverse training environments (i.e.,
In ML engineering, data quality isn't just critical, it's foundational. Yet this perspective often gets sidelined, and there has never been a consensus in the ML community about it, partly because of how ML practitioners were initially trained. That early obsession with algorithms was vital.
This transcription then serves as the input for a powerful LLM, which draws upon its vast knowledge base to provide personalized, context-aware responses tailored to your specific situation. LLM integration The preprocessed text is fed into a powerful LLM tailored for the healthcare and life sciences (HCLS) domain.
This enhancement allows customers running high-throughput production workloads to handle sudden traffic spikes more efficiently, providing more predictable scaling behavior and minimal impact on end-user latency across their ML infrastructure, regardless of the chosen inference framework.
Amazon SageMaker supports geospatial machine learning (ML) capabilities, allowing data scientists and ML engineers to build, train, and deploy ML models using geospatial data. SageMaker Processing provisions cluster resources for you to run city-, country-, or continent-scale geospatial ML workloads.
Large language models (LLMs) have achieved remarkable success in various natural language processing (NLP) tasks, but they may not always generalize well to specific domains or tasks. You may need to customize an LLM to adapt to your unique use case, improving its performance on your specific dataset or task.
The rapid advancements in artificial intelligence and machine learning (AI/ML) have made these technologies a transformative force across industries. An effective approach that addresses a wide range of observed issues is the establishment of an AI/ML center of excellence (CoE). What is an AI/ML CoE?
Whether an engineer is cleaning a dataset, building a recommendation engine, or troubleshooting LLM behavior, these cognitive skills form the bedrock of effective AI development. Engineers who can visualize data, explain outputs, and align their work with business objectives are consistently more valuable to their teams.
Recently, Yandex has introduced a new solution: YaFSDP, an open-source tool that promises to revolutionize LLM training by significantly reducing GPU resource consumption and training time. ML engineers can leverage this tool to enhance the efficiency of their LLM training processes. Check out the GitHub Page.
Get started with SageMaker JumpStart: SageMaker JumpStart is a machine learning (ML) hub that can help accelerate your ML journey. Marc Karp is an ML Architect with the Amazon SageMaker Service team. He focuses on helping customers design, deploy, and manage ML workloads at scale.
This post explores how Amazon SageMaker AI with MLflow can help you as a developer and a machine learning (ML) practitioner efficiently experiment, evaluate generative AI agent performance, and optimize your applications for production readiness. You can follow this example by running the code in the same aws-samples GitHub repository.
It supports multiple LLM providers, making it compatible with a wide array of hosted and local models, including OpenAI's models, Anthropic's Claude, and Google Gemini. This combination of technical depth and usability lowers the barrier for data scientists and ML engineers to generate synthetic data efficiently.
GenAI evaluation with SME-evaluator agreement: AI/ML engineers develop specialized evaluators with ground truth. Let's consider an LLM-as-a-Judge (LLMAJ) which checks to see if an AI assistant has repeated itself. It's far more likely that the AI/ML engineer needs to go back and continue iterating on the prompt.
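A minimal sketch of such a judge and the SME-agreement check; the `call_llm` helper and the judge prompt are hypothetical stand-ins:

```python
def call_llm(prompt: str) -> str:
    """Hypothetical LLM client; replace with your provider's API."""
    raise NotImplementedError

def judge_repetition(response: str) -> str:
    """LLM-as-a-Judge: returns 'Y' if the response repeats itself, else 'N'."""
    prompt = ("Did the following assistant response repeat itself? "
              f"Answer Y or N only.\n\nResponse:\n{response}")
    return call_llm(prompt).strip().upper()[:1]

def sme_agreement(responses: list[str], sme_labels: list[str]) -> float:
    """Fraction of cases where the LLM judge matches SME ground-truth labels."""
    judged = [judge_repetition(r) for r in responses]
    return sum(j == s for j, s in zip(judged, sme_labels)) / len(sme_labels)
```

Low agreement is the signal to iterate on the judge prompt (or the rubric) before trusting the evaluator at scale.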
Building Multimodal AI Agents: Agentic RAG with Image, Text, and Audio Inputs Suman Debnath, Principal AI/ML Advocate at Amazon Web Services Discover the transformative potential of Multimodal Agentic RAG systems that integrate image, audio, and text to power intelligent, real-world applications.
This innovative system employs large language models (LLMs) to streamline key stages of research, including literature review, experimentation, and report writing. Agent Laboratory comprises a pipeline of specialized agents tailored to specific research tasks.
Building Multimodal AI Agents: Agentic RAG with Vision-Language Models Suman Debnath, Principal AI/ML Advocate at Amazon Web Services Building a truly intelligent AI assistant requires overcoming the limitations of native Retrieval-Augmented Generation (RAG) models, especially when handling diverse data types like text, tables, and images.
The Top Secret Behind Effective LLM Training in 2024: Large-scale unsupervised language models (LMs) have shown remarkable capabilities in understanding and generating human-like text. Intended audience: ML Engineers (LLM), tech enthusiasts, VCs, etc. Anybody previously acquainted with ML terms should be able to follow along.
Introduction to AI and Machine Learning on Google Cloud This course introduces Google Cloud’s AI and ML offerings for predictive and generative projects, covering technologies, products, and tools across the data-to-AI lifecycle. It includes labs on feature engineering with BigQuery ML, Keras, and TensorFlow.
About the Authors: Rajesh Ramchander is a Principal ML Engineer in Professional Services at AWS. He helps customers at various stages in their AI/ML and GenAI journey, from those that are just getting started all the way to those that are leading their business with an AI-first strategy.
Libraries: STORM (Synthesis of Topic Outlines through Retrieval and Multi-perspective Question Asking) is an LLM system that writes Wikipedia-like articles from scratch based on Internet search. local-gemma provides an easy and fast way to run Gemma-2 locally, directly from your CLI or via a Python library. Hardware: Trained on H100 GPUs
AI agents, on the other hand, hold a lot of promise but are still constrained by the reliability of LLM reasoning. From an engineering perspective, the core challenge for both lies in improving accuracy and reliability to meet real-world business requirements. They have also opened up a range of new possibilities for ML engineers.
Machine learning (ML) engineers have traditionally focused on balancing model training and deployment cost against performance. This is important because training ML models and then using the trained models to make predictions (inference) can be highly energy-intensive tasks.
Our proposed architecture provides a scalable and customizable solution for online LLM monitoring, enabling teams to tailor their monitoring solution to their specific use cases and requirements. We suggest that each module take incoming inference requests to the LLM, passing prompt and completion (response) pairs to metric compute modules.
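A minimal sketch of such a metric compute module, fanning each prompt/completion pair out to registered metric functions; the metrics shown are illustrative placeholders, not the architecture's actual metrics:

```python
from typing import Callable

MetricFn = Callable[[str, str], float]

METRICS: dict[str, MetricFn] = {
    # Placeholders; swap in real metrics (toxicity, relevance, groundedness, ...).
    "completion_words": lambda prompt, completion: float(len(completion.split())),
    "length_ratio": lambda prompt, completion:
        len(completion) / max(len(prompt), 1),
}

def compute_metrics(prompt: str, completion: str) -> dict[str, float]:
    """Fan a single prompt/completion pair out to every registered metric."""
    return {name: fn(prompt, completion) for name, fn in METRICS.items()}

print(compute_metrics("Summarize this article.", "The article argues that..."))
```

Keeping each metric behind a common signature is what lets teams add or remove monitors without touching the inference path.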
We formulated a text-to-SQL approach whereby a user's natural language query is converted to a SQL statement using an LLM. This data is again provided to an LLM, which is asked to answer the user's query given the data. The relevant information is then provided to the LLM for final response generation.
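A minimal sketch of that loop against a local SQLite database; the `call_llm` helper, schema, and file name are hypothetical stand-ins:

```python
import sqlite3

def call_llm(prompt: str) -> str:
    """Hypothetical LLM client; replace with your provider's API."""
    raise NotImplementedError

SCHEMA = "CREATE TABLE sales (region TEXT, amount REAL);"  # toy schema

def answer_question(question: str, db_path: str = "sales.db") -> str:
    # Step 1: LLM turns the natural-language question into SQL.
    sql = call_llm(f"Schema:\n{SCHEMA}\nWrite one SQLite query answering: "
                   f"{question}\nReturn SQL only.")
    # Step 2: run the generated SQL (production code should validate it first).
    with sqlite3.connect(db_path) as conn:
        rows = conn.execute(sql).fetchall()
    # Step 3: LLM answers the original question from the retrieved rows.
    return call_llm(f"Question: {question}\nQuery result rows: {rows}\n"
                    "Answer the question using only this data.")
```

Executing model-generated SQL is the risky step, which is why real deployments add query validation, read-only credentials, or an allow-list before step 2.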
However, when evaluations provide deep insights into the behavior of GenAI applications, AI/ML engineers can quickly identify what improvements are needed and correctly determine the best way to implement them, resulting in a much faster and far more efficient GenAI development process.
Attackers may attempt to fine-tune surrogate models using queries to the target LLM to reverse-engineer its knowledge. Adversaries can also attempt to breach cloud environments hosting LLMs to sabotage operations or exfiltrate data. Stolen models also create additional attack surface for adversaries to mount further attacks.
The LLM analysis provides a violation result (Y or N) and explains the rationale behind the model’s decision regarding policy violation. The audio moderation workflow activates the LLM’s policy evaluation only when the toxicity analysis exceeds a set threshold. LLMs, in contrast, offer a high degree of flexibility.
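In outline, the gating logic looks something like the following sketch; the toxicity scorer, `call_llm` helper, and threshold are hypothetical stand-ins:

```python
def call_llm(prompt: str) -> str:
    """Hypothetical LLM client; replace with your provider's API."""
    raise NotImplementedError

def toxicity_score(text: str) -> float:
    """Hypothetical classifier returning a toxicity score in [0, 1]."""
    raise NotImplementedError

def moderate_transcript(transcript: str, threshold: float = 0.5) -> dict:
    score = toxicity_score(transcript)
    if score < threshold:
        return {"violation": "N", "rationale": "below toxicity threshold"}
    # Only invoke the more expensive, more flexible LLM policy check when needed.
    verdict = call_llm("Does this transcript violate the content policy? "
                       f"Answer 'Y: <rationale>' or 'N: <rationale>'.\n\n{transcript}")
    flag, _, rationale = verdict.partition(":")
    return {"violation": flag.strip().upper()[:1], "rationale": rationale.strip()}
```

The cheap classifier handles the bulk of traffic, and the LLM is reserved for the ambiguous cases where its flexibility and rationale are worth the cost.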
In part 1 of this blog series, we discussed how a large language model (LLM) available on Amazon SageMaker JumpStart can be fine-tuned for the task of radiology report impression generation. We also explore the utility of the RAG prompt engineering technique as it applies to the task of summarization.
They enable efficient context retrieval or dynamic few-shot prompting to improve the factual accuracy of LLM-generated responses. Use re-ranking or contextual compression techniques to ensure only the most relevant information is provided to the LLM, improving response accuracy and reducing cost.
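As a minimal re-ranking sketch, assuming the sentence-transformers library; the model name and candidate passages are illustrative choices:

```python
from sentence_transformers import CrossEncoder

# Cross-encoder re-ranker; the model name is an illustrative choice.
reranker = CrossEncoder("cross-encoder/ms-marco-MiniLM-L-6-v2")

def rerank(query: str, passages: list[str], top_k: int = 3) -> list[str]:
    """Score each (query, passage) pair and keep only the most relevant ones."""
    scores = reranker.predict([(query, p) for p in passages])
    ranked = sorted(zip(scores, passages), key=lambda x: x[0], reverse=True)
    return [p for _, p in ranked[:top_k]]  # only these reach the LLM prompt

top = rerank("How do I reset my password?",
             ["Billing FAQ...", "Password reset steps...", "Shipping policy..."])
```

Trimming the retrieved set this way improves answer grounding and cuts token cost, since the LLM only sees the passages most likely to matter.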
You can use Amazon SageMaker Model Building Pipelines to collaborate between multiple AI/ML teams. SageMaker Pipelines You can use SageMaker Pipelines to define and orchestrate the various steps involved in the ML lifecycle, such as data preprocessing, model training, evaluation, and deployment. We use Python to do this.
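A minimal pipeline definition sketch, assuming the sagemaker Python SDK; the script name, role, framework version, and instance settings are illustrative:

```python
import sagemaker
from sagemaker.sklearn.processing import SKLearnProcessor
from sagemaker.processing import ProcessingOutput
from sagemaker.workflow.steps import ProcessingStep
from sagemaker.workflow.pipeline import Pipeline

role = sagemaker.get_execution_role()  # assumes a SageMaker execution role

processor = SKLearnProcessor(framework_version="1.2-1", role=role,
                             instance_type="ml.m5.xlarge", instance_count=1)

preprocess = ProcessingStep(
    name="Preprocess",
    processor=processor,
    outputs=[ProcessingOutput(output_name="train",
                              source="/opt/ml/processing/train")],
    code="preprocess.py",  # illustrative script name
)

pipeline = Pipeline(name="example-pipeline", steps=[preprocess])
pipeline.upsert(role_arn=role)  # create or update the pipeline definition
pipeline.start()                # kick off an execution
```

Training, evaluation, and deployment steps chain onto this in the same way, which is what makes the pipeline definition a shared artifact across AI/ML teams.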
Snorkel AI held its Enterprise LLM Virtual Summit on October 26, 2023, drawing an engaged crowd of more than 1,000 attendees across three hours and eight sessions that featured 11 speakers. How to fine-tune and customize LLMs: Hoang Tran, ML Engineer at Snorkel AI, outlined how he saw LLMs creating value in enterprise environments.
Specialist Data Engineering at Merck, and Prabakaran Mathaiyan, Sr. ML Engineer at Tiger Analytics. The large machine learning (ML) model development lifecycle requires a scalable model release process similar to that of software development. The input to the training pipeline is the features dataset.
The team started with a collection of 15 ML engineering projects spanning various fields, with experiments that are quick and cheap to run. At a high level, they simply ask the LLMs to take the next action, using a prompt that is automatically produced based on the available information about the task and previous steps.
"Building LLMs for Production: Enhancing LLM Abilities and Reliability with Prompting, Fine-Tuning, and RAG" is now available on Amazon! "The application topics include prompting, RAG, agents, fine-tuning, and deployment, all essential topics in an AI Engineer's toolkit. The de facto manual for AI Engineering."
Furthermore, we take a deep dive into the most common generative AI use case of text-to-text applications and LLM operations (LLMOps), a subset of FMOps. The ML consumers are other business stakeholders who use the inference results (predictions) to drive decisions. The following figure illustrates the topics we discuss.