Automation, Data Quality and ML Engineer - Artificial Intelligence Zone

Revolutionizing clinical trials with the power of voice and AI

AWS Machine Learning Blog

MARCH 18, 2025

Streamlined data collection and analysis Automating the process of extracting relevant data points from patient-physician interactions can significantly reduce the time and effort required for manual data entry and analysis, enabling more efficient clinical trial management.

LLM

LLM NLP Data Integration AI

The Weather Company enhances MLOps with Amazon SageMaker, AWS CloudFormation, and Amazon CloudWatch

AWS Machine Learning Blog

JULY 8, 2024

TWCo data scientists and ML engineers took advantage of automation, detailed experiment tracking, integrated training, and deployment pipelines to help scale MLOps effectively. Amazon CloudWatch – Collects and visualizes real-time logs that provide the basis for automation.

Data Scientist

Data Scientist ML Engineer Machine Learning Data Science

How Axfood enables accelerated machine learning throughout the organization using Amazon SageMaker

AWS Machine Learning Blog

FEBRUARY 27, 2024

However, there are many clear benefits of modernizing our ML platform and moving to Amazon SageMaker Studio and Amazon SageMaker Pipelines. Automation of building new projects based on the template is streamlined through AWS Service Catalog , where a portfolio is created, serving as an abstraction for multiple products.

Machine Learning

Machine Learning DevOps Data Scientist Data Quality

Webinars

How to Achieve High-Accuracy Results When Using LLMs

Maximizing Profit and Productivity: The New Era of AI-Powered Accounting

Relevance, Reach, Revenue: How to Turn Marketing Trends From Hype to High-Impact

Automation, Evolved: Your New Playbook For Smarter Knowledge Work

MORE WEBINARS

MLOps Landscape in 2023: Top Tools and Platforms

The MLOps Blog

JUNE 27, 2023

This includes features for hyperparameter tuning, automated model selection, and visualization of model metrics. They should also offer version control capabilities to manage the changes and revisions of ML artifacts, ensuring reproducibility and facilitating effective teamwork.

Machine Learning

Machine Learning Metadata Data Scientist Data Quality

Customized model monitoring for near real-time batch inference with Amazon SageMaker

AWS Machine Learning Blog

OCTOBER 28, 2024

Early and proactive detection of deviations in model quality enables you to take corrective actions, such as retraining models, auditing upstream systems, or fixing quality issues without having to monitor models manually or build additional tooling. Ajay Raghunathan is a Machine Learning Engineer at AWS.

ML

ML Metadata Data Scientist Machine Learning

Prioritizing employee well-being: An innovative approach with generative AI and Amazon SageMaker Canvas

AWS Machine Learning Blog

JUNE 3, 2024

In a single visual interface, you can complete each step of a data preparation workflow: data selection, cleansing, exploration, visualization, and processing. Custom Spark commands can also expand the over 300 built-in data transformations. Other analyses are also available to help you visualize and understand your data.

Categorization

Categorization Generative AI Auto-complete Auto-classification

Philips accelerates development of AI-enabled healthcare solutions with an MLOps platform built on Amazon SageMaker

AWS Machine Learning Blog

NOVEMBER 16, 2023

Amazon SageMaker provides purpose-built tools for machine learning operations (MLOps) to help automate and standardize processes across the ML lifecycle. In this post, we describe how Philips partnered with AWS to develop AI ToolSuite—a scalable, secure, and compliant ML platform on SageMaker.

Data Scientist

Data Scientist ML Data Science Machine Learning

Use a data-centric approach to minimize the amount of data required to train Amazon SageMaker models

AWS Machine Learning Blog

MARCH 9, 2023

As machine learning (ML) models have improved, data scientists, ML engineers and researchers have shifted more of their attention to defining and bettering data quality. Applying these techniques allows ML practitioners to reduce the amount of data required to train an ML model.

ML Engineer

ML Engineer Data Scientist Convolutional Neural Networks ML

Centralize model governance with SageMaker Model Registry Resource Access Manager sharing

AWS Machine Learning Blog

NOVEMBER 14, 2024

Model governance involves overseeing the development, deployment, and maintenance of ML models to help ensure that they meet business objectives and are accurate, fair, and compliant with regulations. He is focused on AI/ML technology, ML model management, and ML governance to improve overall organizational efficiency and productivity.

ML

ML Machine Learning Auto-complete Auto-classification

Deliver your first ML use case in 8–12 weeks

AWS Machine Learning Blog

APRIL 26, 2023

You may have gaps in skills and technologies, including operationalizing ML solutions, implementing ML services, and managing ML projects for rapid iterations. Ensuring data quality, governance, and security may slow down or stall ML projects. This may often be the same team as cloud engineering.

ML

ML Machine Learning Data Science Data Drift

The Age of Health Informatics: Part 1

Heartbeat

OCTOBER 23, 2023

The Role of Data Scientists and ML Engineers in Health Informatics At the heart of the Age of Health Informatics are data scientists and ML engineers who play a critical role in harnessing the power of data and developing intelligent algorithms.

Data Scientist

Data Scientist Machine Learning Big Data Algorithm

Must-Have Skills for a Machine Learning Engineer

Pickl AI

NOVEMBER 28, 2024

Fundamental Programming Skills Strong programming skills are essential for success in ML. This section will highlight the critical programming languages and concepts ML engineers should master, including Python, R , and C++, and an understanding of data structures and algorithms. during the forecast period.

Machine Learning

Machine Learning Neural Network ML Engineer Algorithm

How Vodafone Uses TensorFlow Data Validation in their Data Contracts to Elevate Data Governance at Scale

TensorFlow

MARCH 10, 2023

It can also include constraints on the data, such as: Minimum and maximum values for numerical columns Allowed values for categorical columns. Before a model is productionized, the Contract is agreed upon by the stakeholders working on the pipeline, such as the ML Engineers, Data Scientists and Data Owners.

Data Drift

Data Drift Data Scientist ML Engineer Machine Learning

Importance of Machine Learning Model Retraining in Production

Heartbeat

OCTOBER 30, 2023

Once the best model is identified, it is usually deployed in production to make accurate predictions on real-world data (similar to the one on which the model was trained initially). Ideally, the responsibilities of the ML engineering team should be completed once the model is deployed. But this is only sometimes the case.

Machine Learning

Machine Learning Data Drift ML Data Scientist

What is Data Scrubbing? Unfolding the Details

Pickl AI

JUNE 6, 2024

Data scrubbing is often used interchangeably but there’s a subtle difference. Cleaning is broader, improving data quality. This is a more intensive technique within data cleaning, focusing on identifying and correcting errors. Data scrubbing is a powerful tool within this cleaning service.

Machine Learning

Machine Learning Algorithm Business Intelligence Data Quality

How to Build ETL Data Pipeline in ML

The MLOps Blog

MAY 17, 2023

From data processing to quick insights, robust pipelines are a must for any ML system. Often the Data Team, comprising Data and ML Engineers , needs to build this infrastructure, and this experience can be painful. However, efficient use of ETL pipelines in ML can help make their life much easier.

ETL

ETL ML Machine Learning Data Scientist

The Future of Data-Centric AI Day 2: Snorkel Flow and Beyond

Snorkel AI

JUNE 9, 2023

Full session recap The Opportunity of Data-Centric AI in Insurance Alejandro Zarate Santovena, lecturer at Columbia University and Managing Director at Marsh , asserted that AI and foundation models have a lot of potential to disrupt the insurance industry.

Large Language Models

Large Language Models Data Scientist Machine Learning Computer Vision

The Future of Data-Centric AI Day 2: Snorkel Flow and Beyond

Snorkel AI

JUNE 9, 2023

Full session recap The Opportunity of Data-Centric AI in Insurance Alejandro Zarate Santovena, lecturer at Columbia University and Managing Director at Marsh , asserted that AI and foundation models have a lot of potential to disrupt the insurance industry.

Large Language Models

Large Language Models Data Scientist Machine Learning Computer Vision

How to Build a CI/CD MLOps Pipeline [Case Study]

The MLOps Blog

MARCH 15, 2023

Automation : Automating as many tasks to reduce human error and increase efficiency. Collaboration : Ensuring that all teams involved in the project, including data scientists, engineers, and operations teams, are working together effectively. This includes data quality, privacy, and compliance.

ETL

ETL Data Drift Machine Learning ML

7 Critical Model Training Errors: What They Mean & How to Fix Them

Viso.ai

JANUARY 30, 2024

They use automation tools like the caret package in R and Pipelines in scikit-learn. Data leakage occurs when training data is not truly representative of the population at large – source. This is a bigger deal with raw or unstructured data that engineers and developers might be using to feed the machine learning program.

Data Drift

Data Drift Machine Learning Computer Vision Algorithm

Google experts on practical paths to data-centricity in applied AI

Snorkel AI

JULY 5, 2023

Organizations struggle in multiple aspects, especially in modern-day data engineering practices and getting ready for successful AI outcomes. One of them is that it is really hard to maintain high data quality with rigorous validation. More features mean more data consumed upstream.

Large Language Models

Large Language Models Metadata Machine Learning AI

Google experts on practical paths to data-centricity in applied AI

Snorkel AI

JULY 5, 2023

Organizations struggle in multiple aspects, especially in modern-day data engineering practices and getting ready for successful AI outcomes. One of them is that it is really hard to maintain high data quality with rigorous validation. More features mean more data consumed upstream.

Large Language Models

Large Language Models Metadata Machine Learning AI

Google experts on practical paths to data-centricity in applied AI

Snorkel AI

JULY 5, 2023

Organizations struggle in multiple aspects, especially in modern-day data engineering practices and getting ready for successful AI outcomes. One of them is that it is really hard to maintain high data quality with rigorous validation. More features mean more data consumed upstream.

Large Language Models

Large Language Models Metadata Machine Learning AI

Watch all Future of Data-Centric AI 2023 videos now!

Snorkel AI

OCTOBER 12, 2023

Leveraging Data-Centric AI for Document Intelligence and PDF Extraction Extracting entities from semi-structured documents is often a challenging task, requiring complex and time-consuming manual processes. Wayfair does this by automating image tagging using a data-centric approach.

Data Scientist

Data Scientist ML Computer Vision AI

Watch all Future of Data-Centric AI 2023 videos now!

Snorkel AI

OCTOBER 12, 2023

Leveraging Data-Centric AI for Document Intelligence and PDF Extraction Extracting entities from semi-structured documents is often a challenging task, requiring complex and time-consuming manual processes. Wayfair does this by automating image tagging using a data-centric approach.

Data Scientist

Data Scientist ML Computer Vision AI

Watch all Future of Data-Centric AI 2023 videos now!

Snorkel AI

OCTOBER 12, 2023

Leveraging Data-Centric AI for Document Intelligence and PDF Extraction Extracting entities from semi-structured documents is often a challenging task, requiring complex and time-consuming manual processes. Wayfair does this by automating image tagging using a data-centric approach.

Data Scientist

Data Scientist NLP ML Computer Vision

Deploying Conversational AI Products to Production With Jason Flaks

The MLOps Blog

JULY 18, 2023

It’s an automated chief of staff that automates conversational tasks. We are aiming to automate that functionality so that every worker in an organization can have access to that help, just like a CEO or someone else in the company would. How do you ensure data quality when building NLP products?

Conversational AI

Conversational AI Natural Language Processing Machine Learning AI

Architect defense-in-depth security for generative AI applications using the OWASP Top 10 for LLMs

AWS Machine Learning Blog

JANUARY 26, 2024

The goal of this post is to empower AI and machine learning (ML) engineers, data scientists, solutions architects, security teams, and other stakeholders to have a common mental model and framework to apply security best practices, allowing AI/ML teams to move fast without trading off security for speed.

Generative AI

Generative AI ML LLM AI

How to Build an End-To-End ML Pipeline

The MLOps Blog

MAY 9, 2023

One of the most prevalent complaints we hear from ML engineers in the community is how costly and error-prone it is to manually go through the ML workflow of building and deploying models. Building end-to-end machine learning pipelines lets ML engineers build once, rerun, and reuse many times. Data preprocessing.

ML

ML Machine Learning Metadata Data Science

Definite Guide to Building a Machine Learning Platform

The MLOps Blog

MARCH 21, 2023

From gathering and processing data to building models through experiments, deploying the best ones, and managing them at scale for continuous value in production—it’s a lot. As the number of ML-powered apps and services grows, it gets overwhelming for data scientists and ML engineers to build and deploy models at scale.

Machine Learning

Machine Learning Data Scientist ML Metadata

Mikiko Bazeley: What I Learned Building the ML Platform at Mailchimp

The MLOps Blog

JANUARY 26, 2024

Responsibility for MLOps and the ML platform was split across three teams: 1 One team focused on making tools, setting up the environment for development and training for data scientists, and helping with the productionization work. This included maintaining the underlying infrastructure and working on model deployment automation.

ML

ML Data Scientist Machine Learning ML Engineer

Improve governance of models with Amazon SageMaker unified Model Cards and Model Registry

AWS Machine Learning Blog

NOVEMBER 13, 2024

With the unification of SageMaker Model Cards and SageMaker Model Registry, architects, data scientists, ML engineers, or platform engineers (depending on the organization’s hierarchy) can now seamlessly register ML model versions early in the development lifecycle, including essential business details and technical metadata.

Metadata

Metadata ML Software Engineer Machine Learning

Artificial Intelligence Zone

Revolutionizing clinical trials with the power of voice and AI

The Weather Company enhances MLOps with Amazon SageMaker, AWS CloudFormation, and Amazon CloudWatch

Webinars

Trending Sources

How Axfood enables accelerated machine learning throughout the organization using Amazon SageMaker

Webinars

MLOps Landscape in 2023: Top Tools and Platforms

Customized model monitoring for near real-time batch inference with Amazon SageMaker

Prioritizing employee well-being: An innovative approach with generative AI and Amazon SageMaker Canvas

Philips accelerates development of AI-enabled healthcare solutions with an MLOps platform built on Amazon SageMaker

Use a data-centric approach to minimize the amount of data required to train Amazon SageMaker models

Centralize model governance with SageMaker Model Registry Resource Access Manager sharing

Deliver your first ML use case in 8–12 weeks

The Age of Health Informatics: Part 1

Must-Have Skills for a Machine Learning Engineer

How Vodafone Uses TensorFlow Data Validation in their Data Contracts to Elevate Data Governance at Scale

Importance of Machine Learning Model Retraining in Production

What is Data Scrubbing? Unfolding the Details

How to Build ETL Data Pipeline in ML

The Future of Data-Centric AI Day 2: Snorkel Flow and Beyond

The Future of Data-Centric AI Day 2: Snorkel Flow and Beyond

How to Build a CI/CD MLOps Pipeline [Case Study]

7 Critical Model Training Errors: What They Mean & How to Fix Them

Google experts on practical paths to data-centricity in applied AI

Google experts on practical paths to data-centricity in applied AI

Google experts on practical paths to data-centricity in applied AI

Watch all Future of Data-Centric AI 2023 videos now!

Watch all Future of Data-Centric AI 2023 videos now!

Watch all Future of Data-Centric AI 2023 videos now!

Deploying Conversational AI Products to Production With Jason Flaks

Architect defense-in-depth security for generative AI applications using the OWASP Top 10 for LLMs

How to Build an End-To-End ML Pipeline

Definite Guide to Building a Machine Learning Platform

Mikiko Bazeley: What I Learned Building the ML Platform at Mailchimp

Improve governance of models with Amazon SageMaker unified Model Cards and Model Registry

Stay Connected