Metadata, ML and ML Engineer - Artificial Intelligence Zone

From Solo Notebooks to Collaborative Powerhouse: VS Code Extensions for Data Science and ML Teams

Towards AI

AUGUST 7, 2024

In this article, we will explore the essential VS Code extensions that enhance productivity and collaboration for data scientists and machine learning (ML) engineers.

Data Science

Data Science ML ML Engineer Data Scientist

Governing the ML lifecycle at scale, Part 1: A framework for architecting ML workloads using Amazon SageMaker

AWS Machine Learning Blog

OCTOBER 20, 2023

Customers of every size and industry are innovating on AWS by infusing machine learning (ML) into their products and services. Recent developments in generative AI models have further accelerated the need for ML adoption across industries.

ML

ML Data Scientist ML Engineer Data Science

Customized model monitoring for near real-time batch inference with Amazon SageMaker

AWS Machine Learning Blog

OCTOBER 28, 2024

Real-world applications vary in inference requirements for their artificial intelligence and machine learning (AI/ML) solutions to optimize performance and reduce costs. SageMaker Model Monitor monitors the quality of SageMaker ML models in production. Your client applications invoke this endpoint to get inferences from the model.

ML

ML Metadata Data Scientist Machine Learning

Advanced tracing and evaluation of generative AI agents using LangChain and Amazon SageMaker AI MLFlow

AWS Machine Learning Blog

APRIL 7, 2025

This post explores how Amazon SageMaker AI with MLflow can help you, as a developer and machine learning (ML) practitioner, efficiently experiment, evaluate generative AI agent performance, and optimize your applications for production readiness.

Generative AI

Generative AI AI LLM

Centralize model governance with SageMaker Model Registry Resource Access Manager sharing

AWS Machine Learning Blog

NOVEMBER 14, 2024

We recently announced the general availability of cross-account sharing of Amazon SageMaker Model Registry using AWS Resource Access Manager (AWS RAM), making it easier to securely share and discover machine learning (ML) models across your AWS accounts.

ML

ML Machine Learning Auto-complete Auto-classification

Track LLM model evaluation using Amazon SageMaker managed MLflow and FMEval

AWS Machine Learning Blog

JANUARY 28, 2025

FMEval is an open source LLM evaluation library, designed to provide data scientists and machine learning (ML) engineers with a code-first experience to evaluate LLMs for various aspects, including accuracy, toxicity, fairness, robustness, and efficiency. This allows you to keep track of your ML experiments.

LLM

LLM Large Language Models ML Algorithm

Data4ML Preparation Guidelines (Beyond The Basics)

Towards AI

NOVEMBER 8, 2024

Data preparation isn’t just a part of the ML engineering process; it’s the heart of it. Data is a key differentiator in ML projects (more on this in my blog post below). This post dives into key steps for preparing data to build real-world ML systems.

Data Ingestion

Data Ingestion Metadata ML Engineer ML

Top Artificial Intelligence AI Courses from Google

Marktechpost

MAY 30, 2024

Introduction to AI and Machine Learning on Google Cloud: This course introduces Google Cloud’s AI and ML offerings for predictive and generative projects, covering technologies, products, and tools across the data-to-AI lifecycle. It includes labs on feature engineering with BigQuery ML, Keras, and TensorFlow.

Artificial Intelligence

Artificial Intelligence BERT Computer Vision

Use Kubernetes Operators for new inference capabilities in Amazon SageMaker that reduce LLM deployment costs by 50% on average

AWS Machine Learning Blog

APRIL 19, 2024

environment:
  HF_MODEL_ID: databricks/dolly-v2-7b
  HF_TASK: text-generation

apiVersion: sagemaker.services.k8s.aws/v1alpha1
kind: Model
metadata:
  name: flan-t5-xxl
spec:
  modelName: flan-t5-xxl
  executionRoleARN:
  containers:
    - image: 763104351884.dkr.ecr.us-east-1.amazonaws.com/huggingface-pytorch-tgi-inference:2.0.1-tgi0.9.3-gpu-py39-cu118-ubuntu20.04

Metadata

Metadata LLM Software Development Machine Learning

Scale and simplify ML workload monitoring on Amazon EKS with AWS Neuron Monitor container

AWS Machine Learning Blog

JUNE 25, 2024

This solution simplifies the integration of advanced monitoring tools such as Prometheus and Grafana, enabling you to set up and manage your machine learning (ML) workflows with AWS AI Chips. By deploying the Neuron Monitor DaemonSet across EKS nodes, developers can collect and analyze performance metrics from ML workload pods.

ML

ML Metadata Software Development Generative AI

Deploy Amazon SageMaker pipelines using AWS Controllers for Kubernetes

AWS Machine Learning Blog

SEPTEMBER 4, 2024

Its scalability and load-balancing capabilities make it ideal for handling the variable workloads typical of machine learning (ML) applications. Amazon SageMaker provides capabilities to remove the undifferentiated heavy lifting of building and deploying ML models. They often work with DevOps engineers to operate those pipelines.

DevOps

DevOps ML Engineer ML Metadata

Build a robust text-to-SQL solution generating complex queries, self-correcting, and querying diverse data sources

AWS Machine Learning Blog

FEBRUARY 28, 2024

Structured Query Language (SQL) is a complex language that requires an understanding of databases and metadata. Third, despite the larger adoption of centralized analytics solutions like data lakes and warehouses, complexity rises with different table names and other metadata that is required to create the SQL for the desired sources.

Metadata

Metadata LLM Generative AI NLP

Fine tune a generative AI application for Amazon Bedrock using Amazon SageMaker Pipeline decorators

AWS Machine Learning Blog

AUGUST 22, 2024

You can use Amazon SageMaker Model Building Pipelines to collaborate between multiple AI/ML teams. SageMaker Pipelines: You can use SageMaker Pipelines to define and orchestrate the various steps involved in the ML lifecycle, such as data preprocessing, model training, evaluation, and deployment.

Generative AI

Generative AI Metadata Python ML

Driving advanced analytics outcomes at scale using Amazon SageMaker powered PwC’s Machine Learning Ops Accelerator

AWS Machine Learning Blog

DECEMBER 19, 2023

Artificial intelligence (AI) and machine learning (ML) are becoming an integral part of systems and processes, enabling decisions in real time, thereby driving top and bottom-line improvements across organizations. However, putting an ML model into production at scale is challenging and requires a set of best practices.

Machine Learning

Machine Learning ML Engineer DevOps ML

How Thomson Reuters built an AI platform using Amazon SageMaker to accelerate delivery of ML projects

AWS Machine Learning Blog

JANUARY 13, 2023

Since then, TR has achieved many more milestones as its AI products and services are continuously growing in number and variety, supporting legal, tax, accounting, compliance, and news service professionals worldwide, with billions of machine learning (ML) insights generated every year.

ML

ML Data Scientist Machine Learning Metadata

MLOps Landscape in 2023: Top Tools and Platforms

The MLOps Blog

JUNE 27, 2023

Alignment to other tools in the organization’s tech stack Consider how well the MLOps tool integrates with your existing tools and workflows, such as data sources, data engineering platforms, code repositories, CI/CD pipelines, monitoring systems, etc. and Pandas or Apache Spark DataFrames.

Machine Learning

Machine Learning Metadata Data Scientist Data Quality

Llama 4 family of models from Meta are now available in SageMaker JumpStart

AWS Machine Learning Blog

APRIL 7, 2025

This approach allows for greater flexibility and integration with existing AI and machine learning (AI/ML) workflows and pipelines. By providing multiple access points, SageMaker JumpStart helps you seamlessly incorporate pre-trained models into your AI/ML development efforts, regardless of your preferred interface or workflow.

Machine Learning

Machine Learning Large Language Models Python Automation

Best practices for Amazon SageMaker HyperPod task governance

AWS Machine Learning Blog

FEBRUARY 19, 2025

In this example, the ML engineering team is borrowing 5 GPUs for their training task. With SageMaker HyperPod, you can additionally set up observability tools of your choice. metadata: name: job-name namespace: hyperpod-ns-researchers labels: kueue.x-k8s.io/queue-name: queue-name: hyperpod-ns-researchers-localqueue kueue.x-k8s.io/priority-class:

Data Scientist

Data Scientist Data Science ML Engineer Generative AI

Airbnb Researchers Develop Chronon: A Framework for Developing Production-Grade Features for Machine Learning Models

Marktechpost

AUGUST 8, 2023

In the ever-evolving landscape of machine learning, feature management has emerged as a key pain point for ML Engineers at Airbnb. Chronon empowers ML practitioners to define features and centralize data computation for model training and production inference, guaranteeing accuracy and consistency throughout the process.

Machine Learning

Machine Learning ML Engineer Data Ingestion ML

From concept to reality: Navigating the Journey of RAG from proof of concept to production

AWS Machine Learning Blog

FEBRUARY 12, 2025

Machine learning (ML) engineers must make trade-offs and prioritize the most important factors for their specific use case and business requirements. You can use metadata filtering to narrow down search results by specifying inclusion and exclusion criteria.
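As a rough illustration of metadata filtering with inclusion and exclusion criteria, here is a minimal sketch in plain Python. It is not taken from the linked post and does not use any vector store API; the document records, the field names (`department`, `status`), and the helper functions `matches` and `filter_documents` are all hypothetical.

```python
from typing import Any


def matches(metadata: dict[str, Any], include: dict[str, Any], exclude: dict[str, Any]) -> bool:
    """Return True when every inclusion criterion holds and no exclusion criterion holds."""
    has_all_included = all(metadata.get(key) == value for key, value in include.items())
    has_any_excluded = any(metadata.get(key) == value for key, value in exclude.items())
    return has_all_included and not has_any_excluded


def filter_documents(docs: list[dict[str, Any]], include: dict[str, Any], exclude: dict[str, Any]) -> list[str]:
    """Return the ids of documents whose metadata passes the filter."""
    return [doc["id"] for doc in docs if matches(doc["metadata"], include, exclude)]


# Hypothetical search results, each carrying a metadata dictionary.
documents = [
    {"id": "doc-1", "metadata": {"department": "legal", "status": "published"}},
    {"id": "doc-2", "metadata": {"department": "legal", "status": "draft"}},
    {"id": "doc-3", "metadata": {"department": "finance", "status": "published"}},
]

# Keep legal documents, but drop anything still in draft.
selected = filter_documents(
    documents,
    include={"department": "legal"},
    exclude={"status": "draft"},
)
print(selected)
```

In a real retrieval system the filter would typically be pushed down to the search backend rather than applied in application code; the sketch only shows the inclusion and exclusion logic itself.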

Auto-classification

Auto-classification Metadata Generative AI Machine Learning

Amazon SageMaker Feature Store now supports cross-account sharing, discovery, and access

AWS Machine Learning Blog

FEBRUARY 13, 2024

Amazon SageMaker Feature Store is a fully managed, purpose-built repository to store, share, and manage features for machine learning (ML) models. Features are inputs to ML models used during training and inference. SageMaker Feature Store now makes it effortless to share, discover, and access feature groups across AWS accounts.

ML

ML Machine Learning ML Engineer Data Scientist

Machine Learning Engineering in the Real World

ODSC - Open Data Science

SEPTEMBER 21, 2023

Secondly, to be a successful ML engineer in the real world, you cannot just understand the technology; you must understand the business. Some typical examples are given in the following table, along with some discussion as to whether or not ML would be an appropriate tool for solving the problem: Figure 1.1:

Machine Learning

Machine Learning ML Engineer ML Data Science

How to Save Trained Model in Python

The MLOps Blog

MAY 10, 2023

When working on real-world machine learning (ML) use cases, finding the best algorithm/model is not the end of your responsibilities. Reusability & reproducibility: Building ML models is time-consuming by nature. Save vs package vs store ML models Although all these terms look similar, they are not the same.
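For readers who want a concrete picture of the "save" step, the following is a minimal sketch using Python's standard `pickle` module. It is an assumption-laden illustration rather than the linked article's method: the `model` dictionary is a hypothetical stand-in for a trained model's learned parameters, and `predict` is a hypothetical helper. Real model objects from ML libraries often provide their own recommended save and load mechanisms.

```python
import pickle
import tempfile
from pathlib import Path

# Hypothetical stand-in for a trained model: parameters of a small linear model.
model = {"weights": [0.5, -1.25], "bias": 2.0}


def predict(params: dict, features: list[float]) -> float:
    """Compute a linear prediction from the stored parameters."""
    return sum(w * x for w, x in zip(params["weights"], features)) + params["bias"]


# Save the parameters to disk, then load them back as a separate object.
with tempfile.TemporaryDirectory() as tmp_dir:
    model_path = Path(tmp_dir) / "model.pkl"
    with model_path.open("wb") as f:
        pickle.dump(model, f)
    with model_path.open("rb") as f:
        restored = pickle.load(f)

# The restored parameters should produce the same predictions as the originals.
print(predict(restored, [2.0, 4.0]) == predict(model, [2.0, 4.0]))
```

Note that `pickle` should only be used to load files from trusted sources, since unpickling can execute arbitrary code.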

Python

Python Metadata ML Machine Learning

Build an Amazon SageMaker Model Registry approval and promotion workflow with human intervention

AWS Machine Learning Blog

JANUARY 10, 2024

Specialist Data Engineering at Merck, and Prabakaran Mathaiyan, Sr. ML Engineer at Tiger Analytics. The large machine learning (ML) model development lifecycle requires a scalable model release process similar to that of software development. The input to the training pipeline is the features dataset.

ML

ML Machine Learning Data Scientist ETL

How Earth.com and Provectus implemented their MLOps Infrastructure with Amazon SageMaker

AWS Machine Learning Blog

JUNE 27, 2023

When machine learning (ML) models are deployed into production and employed to drive business decisions, the challenge often lies in the operation and management of multiple models. They swiftly began to work on AI/ML capabilities by building image recognition models using Amazon SageMaker.

DevOps

DevOps ML Machine Learning ML Engineer

Build a crop segmentation machine learning model with Planet data and Amazon SageMaker geospatial capabilities

AWS Machine Learning Blog

SEPTEMBER 29, 2023

In this post, we illustrate how to use a segmentation machine learning (ML) model to identify crop and non-crop regions in an image. Identifying crop regions is a core step towards gaining agricultural insights, and the combination of rich geospatial data and ML can lead to insights that drive decisions and actions.

Machine Learning

Machine Learning Data Scientist ML Python

Set up Amazon SageMaker Studio with Jupyter Lab 3 using the AWS CDK

AWS Machine Learning Blog

JANUARY 17, 2023

Amazon SageMaker Studio is a fully integrated development environment (IDE) for machine learning (ML) partly based on JupyterLab 3. Studio provides a web-based interface to interactively perform ML development tasks required to prepare data and build, train, and deploy ML models. cdk.json – Contains metadata and feature flags.

Software Engineer

Software Engineer ML ML Engineer Machine Learning

CMU Researchers Introduce Zeno: A Framework for Behavioral Evaluation of Machine Learning (ML) Models

Marktechpost

JULY 19, 2023

In the real world, machine learning (ML) systems can embed issues such as societal biases and safety concerns. Stakeholders such as ML engineers, designers, and domain experts must work together to identify a model’s expected and potential faults. Zeno works together with other systems and combines the methods of others.

Machine Learning

Machine Learning ML Python Metadata

Top Large Language Models LLMs Courses

Marktechpost

JULY 25, 2024

It is ideal for ML engineers, data scientists, and technical leaders, providing real-world training for production-ready generative AI using Amazon Bedrock and cloud-native services.

Large Language Models

Large Language Models Prompt Engineer Prompt Engineering Chatbots

First ODSC Europe 2023 Sessions Announced

ODSC - Open Data Science

MARCH 27, 2023

Learn about the flow, difficulties, and tools for performing ML clustering at scale Ori Nakar | Principal Engineer, Threat Research | Imperva Given that there are billions of daily botnet attacks from millions of different IPs, the most difficult challenge of botnet detection is choosing the most relevant data.

Machine Learning

Machine Learning Data Science Deep Learning Data Ingestion

MLOps Is an Extension of DevOps. Not a Fork — My Thoughts on THE MLOPS Paper as an MLOps Startup CEO

The MLOps Blog

JANUARY 23, 2023

Came to ML from software. Founded neptune.ai, a modular MLOps component for ML metadata store, aka “experiment tracker + model registry”. Most of our customers are doing ML/MLOps at a reasonable scale, NOT at the hyperscale of big-tech FAANG companies. A special type of software with ML in it but software nonetheless.

DevOps

DevOps Metadata Software Engineer Data Scientist

How to Build Machine Learning Systems With a Feature Store

The MLOps Blog

JANUARY 26, 2024

Luckily, we have tried and trusted tools and architectural patterns that provide a blueprint for reliable ML systems. In this article, I’ll introduce you to a unified architecture for ML systems built around the idea of FTI pipelines and a feature store as the central component. But what is an ML pipeline?

Machine Learning

Machine Learning Metadata ML Python

How Kakao Games automates lifetime value prediction from game data using Amazon SageMaker and AWS Glue

AWS Machine Learning Blog

MARCH 1, 2023

Statistical methods and machine learning (ML) methods are actively developed and adopted to maximize the LTV. In this post, we share how Kakao Games and the Amazon Machine Learning Solutions Lab teamed up to build a scalable and reliable LTV prediction solution by using AWS data and ML services such as AWS Glue and Amazon SageMaker.

Automation

Automation ETL Data Drift ML

Learn how Amazon Ads created a generative AI-powered image generation capability using Amazon SageMaker

AWS Machine Learning Blog

MAY 15, 2024

Next, we present the solution architecture and process flows for machine learning (ML) model building, deployment, and inferencing. Here, Amazon SageMaker Ground Truth allowed ML engineers to easily build the human-in-the-loop workflow (step v). We end with lessons learned.

Generative AI

Generative AI AI Machine Learning

Philips accelerates development of AI-enabled healthcare solutions with an MLOps platform built on Amazon SageMaker

AWS Machine Learning Blog

NOVEMBER 16, 2023

Amazon SageMaker provides purpose-built tools for machine learning operations (MLOps) to help automate and standardize processes across the ML lifecycle. In this post, we describe how Philips partnered with AWS to develop AI ToolSuite—a scalable, secure, and compliant ML platform on SageMaker.

Data Scientist

Data Scientist ML Data Science Machine Learning

LLM experimentation at scale using Amazon SageMaker Pipelines and MLflow

AWS Machine Learning Blog

JULY 24, 2024

Fine-tuning an LLM can be a complex workflow for data scientists and machine learning (ML) engineers to operationalize. Solution overview: Running hundreds of experiments, comparing the results, and keeping track of the ML lifecycle can become very complex. Each iteration can be considered a run within an experiment.

LLM

LLM ML Generative AI Machine Learning

Build an end-to-end MLOps pipeline using Amazon SageMaker Pipelines, GitHub, and GitHub Actions

AWS Machine Learning Blog

DECEMBER 13, 2023

Machine learning (ML) models do not operate in isolation. To deliver value, they must integrate into existing production systems and infrastructure, which necessitates considering the entire ML lifecycle during design and development. GitHub serves as a centralized location to store, version, and manage your ML code base.

ML

ML Automation Metadata Software Development

How Did We Get to ML Model Reproducibility

The MLOps Blog

MARCH 14, 2023

When working on real-world ML projects, you come face-to-face with a series of obstacles. The ML model reproducibility problem is one of them. This is indeed an erroneous thing to do when working on ML projects at scale. Reproducibility is the key factor when it comes to stabilizing the outcomes of any ML pipeline.

ML

ML Machine Learning Metadata ML Engineer

Use Amazon SageMaker Model Card sharing to improve model governance

AWS Machine Learning Blog

AUGUST 31, 2023

As Artificial Intelligence (AI) and Machine Learning (ML) technologies have become mainstream, many enterprises have been successful in building critical business applications powered by ML models at scale in production. They provide a fact sheet of the model that is important for model governance.

ML

ML Data Scientist Machine Learning Data Science

FMOps/LLMOps: Operationalize generative AI and differences with MLOps

AWS Machine Learning Blog

SEPTEMBER 1, 2023

ML operationalization summary: As defined in the post MLOps foundation roadmap for enterprises with Amazon SageMaker, ML and operations (MLOps) is the combination of people, processes, and technology to productionize machine learning (ML) solutions efficiently.

Generative AI

Generative AI Prompt Engineering Prompt Engineer ML

The Sequence Chat: Emmanuel Turlay – CEO, Sematic

TheSequence

JULY 12, 2023

In 2018, I joined Cruise and cofounded the ML Infrastructure team there. We built many critical platform systems that enabled the ML teams to develop and ship models much faster, which contributed to the commercial launch of robotaxis in San Francisco in 2022. This required large end-to-end pipelines.

ML

ML Python Machine Learning Metadata

Definite Guide to Building a Machine Learning Platform

The MLOps Blog

MARCH 21, 2023

As the number of ML-powered apps and services grows, it gets overwhelming for data scientists and ML engineers to build and deploy models at scale. In this comprehensive guide, we’ll explore everything you need to know about machine learning platforms, including: Components that make up an ML platform.

Machine Learning

Machine Learning Data Scientist ML Metadata

Achieve operational excellence with well-architected generative AI solutions using Amazon Bedrock

AWS Machine Learning Blog

OCTOBER 2, 2024

This is often referred to as platform engineering and can be neatly summarized by the mantra “You (the developer) build and test, and we (the platform engineering team) do all the rest!” Amazon Bedrock is compatible with robust observability features to monitor and manage ML models and applications.

Generative AI

Generative AI Data Ingestion AI

Vitech uses Amazon Bedrock to revolutionize information access with AI-powered chatbot

AWS Machine Learning Blog

MAY 30, 2024

Additionally, VitechIQ includes metadata from the vector database (for example, document URLs) in the model’s output, providing users with source attribution and enhancing trust in the generated answers. Prompt engineering Prompt engineering is crucial for the knowledge retrieval system.

Chatbots

Chatbots Prompt Engineering Prompt Engineer Large Language Models
