Algorithm, Data Quality and ML Engineer - Artificial Intelligence Zone

Track LLM model evaluation using Amazon SageMaker managed MLflow and FMEval

AWS Machine Learning Blog

JANUARY 28, 2025

Furthermore, evaluation processes are important not only for LLMs, but are becoming essential for assessing prompt template quality, input data quality, and ultimately, the entire application stack. Evaluation algorithm: computes evaluation metrics on model outputs.

LLM

LLM Large Language Models ML Algorithm

DeepSeek in My Engineer’s Eyes

Towards AI

FEBRUARY 18, 2025

In this post, I want to shift the conversation to how DeepSeek is redefining the future of machine learning engineering. It has already inspired me to set new goals for 2025, and I hope it can do the same for other ML engineers. It is fascinating what DeepSeek has achieved with their top-notch engineering skills.

ML Engineer

ML Engineer LLM Data Quality Algorithm

Use a data-centric approach to minimize the amount of data required to train Amazon SageMaker models

AWS Machine Learning Blog

MARCH 9, 2023

As machine learning (ML) models have improved, data scientists, ML engineers and researchers have shifted more of their attention to defining and bettering data quality. Applying these techniques allows ML practitioners to reduce the amount of data required to train an ML model.

ML Engineer

ML Engineer Data Scientist Convolutional Neural Networks ML

Webinars

AI for Paralegals: Everything You Need to Know (and How to Use It Safely)

The Intersection of AI and Sales: Personalization Without Compromise

How to Achieve High-Accuracy Results When Using LLMs

Beyond the Buzz: How to Turn Marketing Trends into Revenue-Driving Strategies

11 Open Source Data Exploration Tools You Need to Know in 2023

ODSC - Open Data Science

FEBRUARY 24, 2023

Its goal is to help with a quick analysis of target characteristics, training vs testing data, and other such data characterization tasks. Apache Superset is a must-try project for any ML engineer, data scientist, or data analyst.

Data Analysis

Data Analysis Data Science Business Intelligence Python

MLOps Landscape in 2023: Top Tools and Platforms

The MLOps Blog

JUNE 27, 2023

Key use cases and/or user journeys: Identify the main business problems and the data scientist’s needs that you want to solve with ML, and choose a tool that can handle them effectively.

Machine Learning

Machine Learning Metadata Data Scientist Data Quality

Prioritizing employee well-being: An innovative approach with generative AI and Amazon SageMaker Canvas

AWS Machine Learning Blog

JUNE 3, 2024

In a single visual interface, you can complete each step of a data preparation workflow: data selection, cleansing, exploration, visualization, and processing. Custom Spark commands can also expand the over 300 built-in data transformations. Other analyses are also available to help you visualize and understand your data.

Generative AI

Generative AI Categorization Auto-complete Auto-classification

Philips accelerates development of AI-enabled healthcare solutions with an MLOps platform built on Amazon SageMaker

AWS Machine Learning Blog

NOVEMBER 16, 2023

Amazon SageMaker provides purpose-built tools for machine learning operations (MLOps) to help automate and standardize processes across the ML lifecycle. In this post, we describe how Philips partnered with AWS to develop AI ToolSuite—a scalable, secure, and compliant ML platform on SageMaker.

Data Scientist

Data Scientist ML Data Science Automation

Must-Have Skills for a Machine Learning Engineer

Pickl AI

NOVEMBER 28, 2024

Summary: The blog discusses essential skills for a Machine Learning Engineer, emphasising the importance of programming, mathematics, and algorithm knowledge. Understanding Machine Learning algorithms and effective data handling are also critical for success in the field. million by 2030, with a remarkable CAGR of 44.8%

Machine Learning

Machine Learning Neural Network ML Engineer Algorithm

The Age of Health Informatics: Part 1

Heartbeat

OCTOBER 23, 2023

The Role of Data Scientists and ML Engineers in Health Informatics: At the heart of the Age of Health Informatics are data scientists and ML engineers who play a critical role in harnessing the power of data and developing intelligent algorithms.

Data Scientist

Data Scientist Machine Learning Big Data Algorithm

Deliver your first ML use case in 8–12 weeks

AWS Machine Learning Blog

APRIL 26, 2023

You may have gaps in skills and technologies, including operationalizing ML solutions, implementing ML services, and managing ML projects for rapid iterations. Ensuring data quality, governance, and security may slow down or stall ML projects. Conduct exploratory analysis and data preparation.

ML

ML Machine Learning Data Science Data Drift

How to Visualize Deep Learning Models

The MLOps Blog

NOVEMBER 14, 2023

Visualizing deep learning models can help us with several different objectives: Interpretability and explainability: The performance of deep learning models is, at times, staggering, even for seasoned data scientists and ML engineers. Data scientists and ML engineers: Creating and training deep learning models is no easy feat.

Deep Learning

Deep Learning Neural Network Convolutional Neural Networks Data Scientist

MLOps: What is a Product First vs. Model First Mindset?

Mlearning.ai

MAY 23, 2023

It’s critical for beginners to learn this, since it affects everything: workflows, data quality requirements, etc. Model mindset prioritizes the ML model that you are building, while product mindset focuses on the end data product: the minimum viable product. Focusing on model architecture and algorithm development.

Machine Learning

Machine Learning Data Scientist ML Data Science

Use mobility data to derive insights using Amazon SageMaker geospatial capabilities

AWS Machine Learning Blog

JANUARY 17, 2024

Solution overview: As mentioned earlier, the AWS services that you can use for analysis of mobility data are Amazon S3, Amazon Macie, AWS Glue, S3 Object Lambda, Amazon Comprehend, and Amazon SageMaker geospatial capabilities. Example 1 – The following screenshot shows all visits to the Macy’s store.

ETL

ETL ML Machine Learning Data Scientist

What is Data Scrubbing? Unfolding the Details

Pickl AI

JUNE 6, 2024

Data scrubbing is often used interchangeably with data cleaning, but there’s a subtle difference. Cleaning is broader, improving data quality. Scrubbing is a more intensive technique within data cleaning, focusing on identifying and correcting errors. Data scrubbing is a powerful tool within this cleaning service.

Machine Learning

Machine Learning Algorithm Business Intelligence Data Quality

7 Critical Model Training Errors: What They Mean & How to Fix Them

Viso.ai

JANUARY 30, 2024

The principle of looking globally at data is important. It’s something that engineers should always build into models. Algorithms like Random Forests or Gradient Boosting Machines are less sensitive to outliers in general. Making sure that the training data is correct is imperative in the process.

Data Drift

Data Drift Machine Learning Computer Vision Algorithm

Importance of Machine Learning Model Retraining in Production

Heartbeat

OCTOBER 30, 2023

Once the best model is identified, it is usually deployed in production to make accurate predictions on real-world data (similar to the one on which the model was trained initially). Ideally, the responsibilities of the ML engineering team should be completed once the model is deployed. But this is only sometimes the case.

Machine Learning

Machine Learning Data Drift ML Data Scientist

How to Build ETL Data Pipeline in ML

The MLOps Blog

MAY 17, 2023

From data processing to quick insights, robust pipelines are a must for any ML system. Often the Data Team, comprising Data and ML Engineers, needs to build this infrastructure, and this experience can be painful. However, efficient use of ETL pipelines in ML can help make their life much easier.

ETL

ETL ML Machine Learning Data Scientist

Watch all Future of Data-Centric AI 2023 videos now!

Snorkel AI

OCTOBER 12, 2023

Leveraging Data-Centric AI for Document Intelligence and PDF Extraction: Extracting entities from semi-structured documents is often a challenging task, requiring complex and time-consuming manual processes. Typically these heuristics are applied after the unsupervised techniques identify the anomalies and outliers in the data.

Data Scientist

Data Scientist ML Computer Vision AI

How to Build a CI/CD MLOps Pipeline [Case Study]

The MLOps Blog

MARCH 15, 2023

For small-scale/low-value deployments, there might not be many items to focus on, but as the scale and reach of deployment go up, data governance becomes crucial. This includes data quality, privacy, and compliance. For an experienced Data Scientist/ML engineer, that shouldn’t come as so much of a problem.

ETL

ETL Data Drift Machine Learning ML

Deploying Conversational AI Products to Production With Jason Flaks

The MLOps Blog

JULY 18, 2023

Then we subsequently try to run audio fingerprinting type algorithms on top of it so that we can actually identify specifically who those people are if we’ve seen them in the past. We need to do that, but we don’t really know what those topics are, so we use some algorithms. We call it our “format stage.”

Conversational AI

Conversational AI Natural Language Processing Machine Learning AI

Learnings From Building the ML Platform at Stitch Fix

The MLOps Blog

AUGUST 3, 2023

This is Piotr Niedźwiedź and Aurimas Griciūnas from neptune.ai, and you’re listening to ML Platform Podcast. Stefan is a software engineer, data scientist, and has been doing work as an ML engineer. One of the features that Hamilton has is that it has a really lightweight data quality runtime check.

ML

ML Data Scientist Software Engineer Machine Learning

Centralize model governance with SageMaker Model Registry Resource Access Manager sharing

AWS Machine Learning Blog

NOVEMBER 14, 2024

Model governance involves overseeing the development, deployment, and maintenance of ML models to help ensure that they meet business objectives and are accurate, fair, and compliant with regulations. After you have completed the data preparation step, it’s time to train the classification model.

ML

ML Machine Learning Auto-complete Auto-classification

How to Build an End-To-End ML Pipeline

The MLOps Blog

MAY 9, 2023

One of the most prevalent complaints we hear from ML engineers in the community is how costly and error-prone it is to manually go through the ML workflow of building and deploying models. Building end-to-end machine learning pipelines lets ML engineers build once, rerun, and reuse many times. Data preprocessing.

ML

ML Machine Learning Metadata Data Science

LLM Agents Underscore One Truth: Data Is The Real Differentiator.

Towards AI

NOVEMBER 8, 2024

We don’t have better algorithms; we just have more data. Peter Norvig, The Unreasonable Effectiveness of Data. In ML engineering, data quality isn’t just critical; it’s foundational. Because of how ML practitioners were initially trained.

LLM

LLM ML Engineer Data Quality Data Scientist

Definite Guide to Building a Machine Learning Platform

The MLOps Blog

MARCH 21, 2023

From gathering and processing data to building models through experiments, deploying the best ones, and managing them at scale for continuous value in production—it’s a lot. As the number of ML-powered apps and services grows, it gets overwhelming for data scientists and ML engineers to build and deploy models at scale.

Machine Learning

Machine Learning Data Scientist ML Metadata
