LLM and ML Engineer - Artificial Intelligence Zone

Track LLM model evaluation using Amazon SageMaker managed MLflow and FMEval

AWS Machine Learning Blog

JANUARY 28, 2025

Evaluating large language models (LLMs) is crucial as LLM-based systems become increasingly powerful and relevant in our society. Rigorous testing allows us to understand an LLMs capabilities, limitations, and potential biases, and provide actionable feedback to identify and mitigate risk.

LLM

LLM Large Language Models ML Algorithm

5 Tools to Help Build Your LLM Apps

Flipboard

DECEMBER 12, 2023

Whether you're a seasoned ML engineer or a new LLM developer, these tools will help you get more productive and accelerate the development and deployment of your AI projects.

LLM

LLM ML Engineer ML AI

Go from Engineer to ML Engineer with Declarative ML

Flipboard

MAY 31, 2023

Learn how to easily build any AI model and customize your own LLM in just a few lines of code with a declarative approach to machine learning.

ML Engineer

ML Engineer ML Machine Learning LLM

Webinars

The Intersection of AI and Sales: Personalization Without Compromise

How to Achieve High-Accuracy Results When Using LLMs

Relevance, Reach, Revenue: How to Turn Marketing Trends From Hype to High-Impact

MORE WEBINARS

Achieve ~2x speed-up in LLM inference with Medusa-1 on Amazon SageMaker AI

AWS Machine Learning Blog

FEBRUARY 12, 2025

Researchers developed Medusa , a framework to speed up LLM inference by adding extra heads to predict multiple tokens simultaneously. This post demonstrates how to use Medusa-1, the first version of the framework, to speed up an LLM by fine-tuning it on Amazon SageMaker AI and confirms the speed up with deployment and a simple load test.

LLM

LLM ML Natural Language Processing Machine Learning

LLM continuous self-instruct fine-tuning framework powered by a compound AI system on Amazon SageMaker

AWS Machine Learning Blog

FEBRUARY 21, 2025

Fine-tuning a pre-trained large language model (LLM) allows users to customize the model to perform better on domain-specific tasks or align more closely with human preferences. You can use supervised fine-tuning (SFT) and instruction tuning to train the LLM to perform better on specific tasks using human-annotated datasets and instructions.

LLM

LLM AI AI Data Scientist

Future AGI Secures $1.6M to Launch the World’s Most Accurate AI Evaluation Platform

Unite.AI

FEBRUARY 11, 2025

” Transforming AI Performance Across Industries Future AGI is already delivering impactful results across industries: A Series E sales-tech company used Future AGIs LLM Experimentation Hub to achieve 99% accuracy in its agentic pipeline, compressing weeks of work into just hours.

Auto-complete

Auto-complete ML Engineer AI AI

LLM experimentation at scale using Amazon SageMaker Pipelines and MLflow

AWS Machine Learning Blog

JULY 24, 2024

Large language models (LLMs) have achieved remarkable success in various natural language processing (NLP) tasks, but they may not always generalize well to specific domains or tasks. You may need to customize an LLM to adapt to your unique use case, improving its performance on your specific dataset or task.

LLM

LLM ML Generative AI Machine Learning

Implement Amazon SageMaker domain cross-Region disaster recovery using custom Amazon EFS instances

AWS Machine Learning Blog

OCTOBER 22, 2024

Amazon SageMaker is a cloud-based machine learning (ML) platform within the AWS ecosystem that offers developers a seamless and convenient way to build, train, and deploy ML models. Katherine Feng is a Cloud Consultant at AWS Professional Services within the Data and ML team.

ML Engineer

ML Engineer Data Scientist Machine Learning ML

Revolutionizing clinical trials with the power of voice and AI

AWS Machine Learning Blog

MARCH 18, 2025

This transcription then serves as the input for a powerful LLM, which draws upon its vast knowledge base to provide personalized, context-aware responses tailored to your specific situation. LLM integration The preprocessed text is fed into a powerful LLM tailored for the healthcare and life sciences (HCLS) domain.

LLM

LLM NLP Data Integration AI

Yandex Introduces YaFSDP: An Open-Source AI Tool that Promises to Revolutionize LLM Training by Cutting GPU Usage by 20%

Marktechpost

JUNE 14, 2024

Recently, Yandex has introduced a new solution: YaFSDP, an open-source tool that promises to revolutionize LLM training by significantly reducing GPU resource consumption and training time. ML engineers can leverage this tool to enhance the efficiency of their LLM training processes. Check out the GitHub Page.

LLM

LLM AI Tools Large Language Models ML Engineer

Using Large Language Models on Amazon Bedrock for multi-step task execution

AWS Machine Learning Blog

APRIL 2, 2025

The goal of this blog post is to show you how a large language model (LLM) can be used to perform tasks that require multi-step dynamic reasoning and execution. Fig 1: Simple execution flow solution overview In a more complex scheme, you can add multiple layers of validation and provide relevant APIs to increase the success rate of the LLM.

Large Language Models

Large Language Models LLM Machine Learning Big Data

Building AI Skills in Your Engineering Team: A 2025 Guide to Upskilling with Impact

ODSC - Open Data Science

APRIL 2, 2025

Whether an engineer is cleaning a dataset, building a recommendation engine, or troubleshooting LLM behavior, these cognitive skills form the bedrock of effective AI development. Engineers who can visualize data, explain outputs, and align their work with business objectives are consistently more valuable to theirteams.

Prompt Engineering

Prompt Engineering Prompt Engineer Large Language Models ML Engineer

Stacklock Releases Promptwright: A Python Library for Synthetic Dataset Generation Using an LLM (Local or Hosted)

Marktechpost

DECEMBER 1, 2024

It supports multiple LLM providers, making it compatible with a wide array of hosted and local models, including OpenAI’s models, Anthropic’s Claude, and Google Gemini. This combination of technical depth and usability lowers the barrier for data scientists and ML engineers to generate synthetic data efficiently.

Python

Python LLM Data Scarcity Data Scientist

12 Can’t-Miss Hands-on Training & Workshops Coming to ODSC East 2025

ODSC - Open Data Science

MARCH 10, 2025

Beyond Benchmarks: Evaluating AI Agents, Multimodal Systems, and Generative AI in the RealWorld Sinan Ozdemir, AI & LLM Expert | Author | Founder + CTO at LoopGenius As AI systems advance into autonomous agents, multimodal models, and RAG workflows, traditional evaluation methods often fall short.

Data Scientist

Data Scientist Data Science LLM Machine Learning

Why GenAI evaluation requires SME-in-the-loop for validation and trust

Snorkel AI

MARCH 20, 2025

GenAI evaluation with SME-evaluator agreement AI/ML engineers develop specialized evaluators with ground truth. Lets consider an LLM-as-a-Judge (LLMAJ) which checks to see if an AI assistant has repeated itself. Its far more likely that the AI/ML engineer needs to go back and continue iterating on the prompt.

ML Engineer

ML Engineer Automation Prompt Engineering Prompt Engineer

20 Must-Attend Sessions at ODSC East 2025: The Future of Agentic and Applied AI

ODSC - Open Data Science

APRIL 1, 2025

Adaptive RAG Systems with Knowledge Graphs: Building Smarter LLM Pipelines David vonThenen, Senior AI/ML Engineer at DigitalOcean Unlock the full potential of Retrieval-Augmented Generation by embedding adaptive reasoning with knowledge graphs.

Neural Network

Neural Network LLM Software Engineer AI

Direct Preference Optimization, Intuitively Explained

Towards AI

JANUARY 30, 2024

The Top Secret Behind Effective LLM Training in 2024 Large-scale unsupervised language models (LMs) have shown remarkable capabilities in understanding and generating human-like text. ML Engineers(LLM), Tech Enthusiasts, VCs, etc. Anybody previously acquainted with ML terms should be able to follow along.

Explainability

Explainability ML Engineer LLM ML

Techniques and approaches for monitoring large language models on AWS

AWS Machine Learning Blog

FEBRUARY 26, 2024

Our proposed architecture provides a scalable and customizable solution for online LLM monitoring, enabling teams to tailor your monitoring solution to your specific use cases and requirements. We suggest that each module take incoming inference requests to the LLM, passing prompt and completion (response) pairs to metric compute modules.

Large Language Models

Large Language Models LLM Big Data Machine Learning

Building LLM Applications With Vector Databases

The MLOps Blog

JULY 4, 2024

They enable efficient context retrieval or dynamic few-shot prompting to improve the factual accuracy of LLM-generated responses. Use re-ranking or contextual compression techniques to ensure only the most relevant information is provided to the LLM, improving response accuracy and reducing cost.

LLM

LLM Large Language Models Machine Learning Explainability

DeepSeek in My Engineer’s Eyes

Towards AI

FEBRUARY 18, 2025

AI agents, on the other hand, hold a lot of promise but are still constrained by the reliability of LLM reasoning. From an engineering perspective, the core challenge for both lies in improving accuracy and reliability to meet real-world business requirements. They also inspired a bunch of new potentials for ML engineers.

ML Engineer

ML Engineer LLM Data Quality Algorithm

Falcon 3 models now available in Amazon SageMaker JumpStart

AWS Machine Learning Blog

FEBRUARY 11, 2025

Clean up To clean up the model and endpoint, use the following code: predictor.delete_model() predictor.delete_endpoint() Conclusion In this post, we explored how SageMaker JumpStart empowers data scientists and ML engineers to discover, access, and run a wide range of pre-trained FMs for inference, including the Falcon 3 family of models.

ML

ML Machine Learning Python Computer Vision

The Vulnerabilities and Security Threats Facing Large Language Models

Unite.AI

FEBRUARY 28, 2024

Attackers may attempt to fine-tune surrogate models using queries to the target LLM to reverse-engineer its knowledge. Adversaries can also attempt to breach cloud environments hosting LLMs to sabotage operations or exfiltrate data. Stolen models also create additional attack surface for adversaries to mount further attacks.

Large Language Models

Large Language Models Machine Learning LLM Neural Network

Enterprise LLM Summit highlights the importance of data development

Snorkel AI

OCTOBER 27, 2023

Snorkel AI held its Enterprise LLM Virtual Summit on October 26, 2023, drawing an engaged crowd of more than 1,000 attendees across three hours and eight sessions that featured 11 speakers. How to fine-tune and customize LLMs Hoang Tran, ML Engineer at Snorkel AI, outlined how he saw LLMs creating value in enterprise environments.

LLM

LLM Data Scientist Machine Learning Large Language Models

Why GenAI evaluation requires fine-grained metrics to be insightful

Snorkel AI

MARCH 18, 2025

However, when evaluations provide deep insights into the behavior of GenAI applications, AI/ML engineers can quickly identify what improvements are needed and correctly determine the best way to implement them resulting in a much faster, and far more efficient, GenAI development process.

ML Engineer

ML Engineer LLM AI AI

The journey of PGA TOUR’s generative AI virtual assistant, from concept to development to prototype

AWS Machine Learning Blog

MARCH 14, 2024

We formulated a text-to-SQL approach where by a user’s natural language query is converted to a SQL statement using an LLM. This data is again provided to an LLM, which is asked to answer the user’s query given the data. The relevant information is then provided to the LLM for final response generation.

Generative AI

Generative AI LLM AI AI

AI Engineer’s Toolkit

Towards AI

MAY 30, 2024

Building LLMs for Production: Enhancing LLM Abilities and Reliability with Prompting, Fine-Tuning, and RAG” is now available on Amazon! The application topics include prompting, RAG, agents, fine-tuning, and deployment — all essential topics in an AI Engineer’s toolkit.” The defacto manual for AI Engineering.

Prompt Engineering

Prompt Engineering Prompt Engineer LLM NLP

Top Artificial Intelligence AI Courses from Google

Marktechpost

MAY 30, 2024

Introduction to AI and Machine Learning on Google Cloud This course introduces Google Cloud’s AI and ML offerings for predictive and generative projects, covering technologies, products, and tools across the data-to-AI lifecycle. It includes lessons on vector search and text embeddings, practical demos, and a hands-on lab.

Artificial Intelligence

Artificial Intelligence Artificial Intelligence BERT Computer Vision

Deploy Private LLMs using Databricks Model Serving

databricks

SEPTEMBER 28, 2023

We are excited to announce public preview of GPU and LLM optimization support for Databricks Model Serving! With this launch, you can deploy.

LLM

LLM ML Engineer ML Data Science

Key Takeaways From Week 4 of the AI Builders Summit — Building AI

ODSC - Open Data Science

FEBRUARY 13, 2025

The AI agent classified and summarized GenAI-related content from Reddit, using a structured pipeline with utility functions for API interactions, web scraping, and LLM-based reasoning. He demonstrated practical AI-powered workflows for engineers, including essay generation, research retrieval, and iterative refinement.

Large Language Models

Large Language Models AI AI Automation

Enterprise LLM Summit highlights the importance of data development

Snorkel AI

OCTOBER 27, 2023

Snorkel AI held its Enterprise LLM Virtual Summit on October 26, 2023, drawing an engaged crowd of more than 1,000 attendees across three hours and eight sessions that featured 11 speakers. How to fine-tune and customize LLMs Hoang Tran, ML Engineer at Snorkel AI, outlined how he saw LLMs creating value in enterprise environments.

LLM

LLM Data Scientist Machine Learning Large Language Models

Map Earth’s vegetation in under 20 minutes with Amazon SageMaker

AWS Machine Learning Blog

OCTOBER 16, 2024

Amazon SageMaker supports geospatial machine learning (ML) capabilities, allowing data scientists and ML engineers to build, train, and deploy ML models using geospatial data. His current area of research includes LLM evaluation and data generation. About the Author Xiong Zhou is a Senior Applied Scientist at AWS.

Machine Learning

Machine Learning ML Data Scientist Robotics

Reduce energy consumption of your machine learning workloads by up to 90% with AWS purpose-built accelerators

Flipboard

JUNE 20, 2023

Machine learning (ML) engineers have traditionally focused on striking a balance between model training and deployment cost vs. performance. This is important because training ML models and then using the trained models to make predictions (inference) can be highly energy-intensive tasks.

Machine Learning

Machine Learning BERT Deep Learning ML Engineer

Researchers from Stanford University Propose MLAgentBench: A Suite of Machine Learning Tasks for Benchmarking AI Research Agents

Marktechpost

OCTOBER 11, 2023

The team started with a collection of 15 ML engineering projects spanning various fields, with experiments that are quick and cheap to run. At a high level, they simply ask the LLMs to take the next action, using a prompt that is automatically produced based on the available information about the task and previous steps.

Machine Learning

Machine Learning AI Research AI Researcher Convolutional Neural Networks

Moderate audio and text chats using AWS AI services and LLMs

AWS Machine Learning Blog

MARCH 13, 2024

The LLM analysis provides a violation result (Y or N) and explains the rationale behind the model’s decision regarding policy violation. The audio moderation workflow activates the LLM’s policy evaluation only when the toxicity analysis exceeds a set threshold. LLMs, in contrast, offer a high degree of flexibility.

Natural Language Processing

Natural Language Processing LLM Prompt Engineering Prompt Engineer

Build a robust text-to-SQL solution generating complex queries, self-correcting, and querying diverse data sources

AWS Machine Learning Blog

FEBRUARY 28, 2024

on Amazon Bedrock as our LLM. The multi-step component allows the LLM to correct the generated SQL query for accuracy. We use Athena error messages to enrich our prompt for the LLM for more accurate and effective corrections in the generated SQL. About the Authors Sanjeeb Panda is a Data and ML engineer at Amazon.

Metadata

Metadata Generative AI LLM NLP

Top Large Language Models LLMs Courses

Marktechpost

JULY 25, 2024

Large Language Models (LLMs) Concepts Difficulty Level: Beginner This course explores Large Language Models (LLMs), their impact on AI, and real-world applications. It helps learn about LLM building blocks, training methodologies, and ethical considerations.

Large Language Models

Large Language Models Prompt Engineer Prompt Engineering Chatbots

Evaluation of generative AI techniques for clinical report summarization

AWS Machine Learning Blog

MAY 13, 2024

In part 1 of this blog series, we discussed how a large language model (LLM) available on Amazon SageMaker JumpStart can be fine-tuned for the task of radiology report impression generation. Techniques and experimentation Prompt design is the technique of creating the most effective prompt for an LLM with a clear objective.

Generative AI

Generative AI Prompt Engineering Prompt Engineer LLM

Enterprise LLM Summit highlights the importance of data development

Snorkel AI

OCTOBER 27, 2023

Snorkel AI held its Enterprise LLM Virtual Summit on October 26, 2023, drawing an engaged crowd of more than 1,000 attendees across three hours and eight sessions that featured 11 speakers. How to fine-tune and customize LLMs Hoang Tran, ML Engineer at Snorkel AI, outlined how he saw LLMs creating value in enterprise environments.

LLM

LLM Data Scientist Machine Learning Large Language Models

From concept to reality: Navigating the Journey of RAG from proof of concept to production

AWS Machine Learning Blog

FEBRUARY 12, 2025

Machine learning (ML) engineers must make trade-offs and prioritize the most important factors for their specific use case and business requirements. Optimization techniques The diagram below illustrates the tradeoffs to consider for a production-ready RAG application.

Auto-classification

Auto-classification Metadata Generative AI Machine Learning

Accelerating ML experimentation with enhanced security: AWS PrivateLink support for Amazon SageMaker with MLflow

AWS Machine Learning Blog

DECEMBER 9, 2024

MLflow , a popular open-source tool, helps data scientists organize, track, and analyze ML and generative AI experiments, making it easier to reproduce and compare results. SageMaker is a comprehensive, fully managed ML service designed to provide data scientists and ML engineers with the tools they need to handle the entire ML workflow.

ML

ML Data Scientist Machine Learning Software Engineer

Create an HCLS document summarization application with Falcon using Amazon SageMaker JumpStart

AWS Machine Learning Blog

OCTOBER 4, 2023

In this post, we walk you through deploying a Falcon large language model (LLM) using Amazon SageMaker JumpStart and using the model to summarize long documents with LangChain and Python. SageMaker is a HIPAA-eligible managed service that provides tools that enable data scientists, ML engineers, and business analysts to innovate with ML.

LLM

LLM Large Language Models ML Data Scientist

MakeBlobs + Fictional Synthetic Data, Adding Data to Domain-Specific LLMs, and What Tech Layoffs…

ODSC - Open Data Science

DECEMBER 7, 2023

How to Add Domain-Specific Knowledge to an LLM Based on Your Data In this article, we will explore one of several strategies and techniques to infuse domain knowledge into LLMs, allowing them to perform at their best within specific professional contexts by adding chunks of documentation into an LLM as context when injecting the query.

Data Scientist

Data Scientist Explainable AI Data Science Python

Excited To Bring You the E-book Version of “Building LLMs for Production”

Towards AI

JUNE 18, 2024

About Building LLMs for Production Generative AI and LLMs are transforming industries with their ability to understand and generate human-like text and images. However, building reliable and scalable LLM applications requires a lot of extra work and a deep understanding of various techniques and frameworks.

Large Language Models

Large Language Models Prompt Engineering Prompt Engineer LLM

One Week, 7 Major Foundation Model Releases

TheSequence

JULY 21, 2024

NuminaMath 7B TIR is based on a combination of an LLM reasoning agent and code generation and the architecture is totally fascinating —> Read more. Proven-Verifier Games in LLMs OpenAI published a paper unveiling a prover-verifier game to improve the legibility of LLM outputs.

ML Engineer

ML Engineer Automation LLM OpenAI

Track LLM model evaluation using Amazon SageMaker managed MLflow and FMEval

5 Tools to Help Build Your LLM Apps

Webinars

Trending Sources

Go from Engineer to ML Engineer with Declarative ML

Webinars

Achieve ~2x speed-up in LLM inference with Medusa-1 on Amazon SageMaker AI

LLM continuous self-instruct fine-tuning framework powered by a compound AI system on Amazon SageMaker

Future AGI Secures $1.6M to Launch the World’s Most Accurate AI Evaluation Platform

LLM experimentation at scale using Amazon SageMaker Pipelines and MLflow

Implement Amazon SageMaker domain cross-Region disaster recovery using custom Amazon EFS instances

Revolutionizing clinical trials with the power of voice and AI

Yandex Introduces YaFSDP: An Open-Source AI Tool that Promises to Revolutionize LLM Training by Cutting GPU Usage by 20%

Using Large Language Models on Amazon Bedrock for multi-step task execution

Building AI Skills in Your Engineering Team: A 2025 Guide to Upskilling with Impact

Stacklock Releases Promptwright: A Python Library for Synthetic Dataset Generation Using an LLM (Local or Hosted)

12 Can’t-Miss Hands-on Training & Workshops Coming to ODSC East 2025

Why GenAI evaluation requires SME-in-the-loop for validation and trust

20 Must-Attend Sessions at ODSC East 2025: The Future of Agentic and Applied AI

Direct Preference Optimization, Intuitively Explained

Techniques and approaches for monitoring large language models on AWS

Building LLM Applications With Vector Databases

DeepSeek in My Engineer’s Eyes

Falcon 3 models now available in Amazon SageMaker JumpStart

The Vulnerabilities and Security Threats Facing Large Language Models

Enterprise LLM Summit highlights the importance of data development

Why GenAI evaluation requires fine-grained metrics to be insightful

The journey of PGA TOUR’s generative AI virtual assistant, from concept to development to prototype

AI Engineer’s Toolkit

Top Artificial Intelligence AI Courses from Google

Deploy Private LLMs using Databricks Model Serving

Key Takeaways From Week 4 of the AI Builders Summit — Building AI

Enterprise LLM Summit highlights the importance of data development

Map Earth’s vegetation in under 20 minutes with Amazon SageMaker

Reduce energy consumption of your machine learning workloads by up to 90% with AWS purpose-built accelerators

Researchers from Stanford University Propose MLAgentBench: A Suite of Machine Learning Tasks for Benchmarking AI Research Agents

Moderate audio and text chats using AWS AI services and LLMs

Build a robust text-to-SQL solution generating complex queries, self-correcting, and querying diverse data sources

Top Large Language Models LLMs Courses

Evaluation of generative AI techniques for clinical report summarization

Enterprise LLM Summit highlights the importance of data development

From concept to reality: Navigating the Journey of RAG from proof of concept to production

Accelerating ML experimentation with enhanced security: AWS PrivateLink support for Amazon SageMaker with MLflow

Create an HCLS document summarization application with Falcon using Amazon SageMaker JumpStart

MakeBlobs + Fictional Synthetic Data, Adding Data to Domain-Specific LLMs, and What Tech Layoffs…

Excited To Bring You the E-book Version of “Building LLMs for Production”

One Week, 7 Major Foundation Model Releases

Stay Connected