Uncomfortable reality: In the era of large language models (LLMs) and AutoML, traditional skills like Python scripting, SQL, and building predictive models are no longer enough for data scientists to remain competitive in the market. You have to understand the data, know how to extract value from it, and know how to monitor model performance.
Key Challenges in ML Model Monitoring in Production: Data Drift and Concept Drift. Data drift and concept drift are two common types of drift that can occur in machine-learning models over time. Data drift refers to a change in the distribution of the input data that the model receives.
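The article itself does not include code; as a rough illustration of the idea, a two-sample Kolmogorov-Smirnov test from SciPy can flag when a feature's live distribution has shifted away from the training distribution (the data and the alert threshold below are invented for the example):

```python
import numpy as np
from scipy import stats

# Hypothetical example: compare a feature's training-time (reference) distribution
# against the distribution seen in production to flag data drift.
rng = np.random.default_rng(0)
reference = rng.normal(loc=0.0, scale=1.0, size=5_000)   # training-time feature values
production = rng.normal(loc=0.4, scale=1.0, size=5_000)  # live feature values (shifted mean)

statistic, p_value = stats.ks_2samp(reference, production)
if p_value < 0.01:  # threshold is an arbitrary choice for illustration
    print(f"Possible data drift detected (KS={statistic:.3f}, p={p_value:.4f})")
else:
    print("No significant drift detected")
```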
True to its name, Explainable AI refers to the tools and methods that explain AI systems and how they arrive at a certain output. In this blog, we’ll dive into the need for AI explainability, the various methods available currently, and their applications. Why do we need Explainable AI (XAI)?
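As a concrete taste of what such methods look like in practice, here is a minimal sketch (not taken from the article) that uses the SHAP library to attribute a tree model's predictions to its input features; the dataset and model are placeholders:

```python
import shap
from sklearn.datasets import load_diabetes
from sklearn.ensemble import RandomForestRegressor

# Placeholder model and data purely for illustration.
X, y = load_diabetes(return_X_y=True, as_frame=True)
model = RandomForestRegressor(n_estimators=200, random_state=0).fit(X, y)

# SHAP attributes each individual prediction to the input features,
# turning the fitted model into something a reviewer can inspect.
explainer = shap.TreeExplainer(model)
shap_values = explainer.shap_values(X.iloc[:100])
print(shap_values.shape)  # (100 samples, n_features) attribution matrix
```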
I am often asked by prospective clients to explain the artificial intelligence (AI) software process, and I have recently been asked by managers with extensive software development and data science experience who wanted to implement MLOps.
For example, if your team is proficient in Python and R, you may want an MLOps tool that supports open data formats like Parquet, JSON, and CSV, and that works with Pandas or Apache Spark DataFrames. This includes features for model explainability, fairness assessment, privacy preservation, and compliance tracking.
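For instance (a trivial sketch, not from the article; the path and column names are made up), a Parquet file written with Pandas can be read back by Spark with no conversion step:

```python
import pandas as pd
from pyspark.sql import SparkSession

# Write a small frame with Pandas (requires pyarrow or fastparquet to be installed) ...
pd.DataFrame({"user_id": [1, 2, 3], "score": [0.2, 0.9, 0.5]}).to_parquet("scores.parquet")

# ... and read the same file with Spark, no format conversion needed.
spark = SparkSession.builder.appName("parquet-interop").getOrCreate()
spark.read.parquet("scores.parquet").show()
```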
Uber wrote about how they built a data drift detection system. pyribs is a bare-bones Python library for quality diversity (QD) optimization. In our case that meant prioritizing stability, performance, and flexibility above all else. Don't be afraid to use boring technology.
Building out a machine learning operations (MLOps) platform is essential for organizations in the rapidly evolving landscape of artificial intelligence (AI) and machine learning (ML): it bridges the gap between data science experimentation and deployment while meeting the requirements around model performance, security, and compliance.
For code-first users, we offer a code experience too, using the API—both in Python and R—for your convenience. The model training process is not a black box—it includes trust and explainability. DataRobot Blueprint—from data to predictions. Model Performance, Insights, and Explainability. Setting up a Time Series Project.
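The post itself walks through the product UI; purely as a sketch of what the code-first route can look like with the datarobot Python client (the exact method names and arguments should be checked against the client docs, and the file, target, and token below are placeholders):

```python
import datarobot as dr

# Placeholder credentials, dataset, and target; treat the calls as approximate.
dr.Client(endpoint="https://app.datarobot.com/api/v2", token="YOUR_API_TOKEN")

project = dr.Project.create(sourcedata="train.csv", project_name="demand-forecast-demo")
project.set_target(target="units_sold", mode=dr.AUTOPILOT_MODE.QUICK)
project.wait_for_autopilot()

best_model = project.get_models()[0]  # leaderboard is returned sorted by the project metric
print(best_model)
```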
This post explains the functions based on a modular pipeline approach. SageMaker has developed the distributed data parallel library, which splits the data per node and optimizes the communication between the nodes. Each node has a copy of the DNN.
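The library is switched on through the estimator's distribution setting; a hedged sketch follows (the entry point, role, instance type, and framework versions are placeholders and should be checked against the SageMaker docs):

```python
from sagemaker.pytorch import PyTorch

# Illustrative configuration only: train.py, the role ARN, and the versions are placeholders.
estimator = PyTorch(
    entry_point="train.py",
    role="arn:aws:iam::123456789012:role/SageMakerRole",
    framework_version="1.13",
    py_version="py39",
    instance_count=2,                 # the data is sharded across these nodes
    instance_type="ml.p4d.24xlarge",
    # Enables SageMaker's distributed data parallel library; each node keeps
    # a full copy of the DNN and trains on its own shard of the data.
    distribution={"smdistributed": {"dataparallel": {"enabled": True}}},
)
estimator.fit({"training": "s3://my-bucket/training-data/"})
```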
GitLab CI/CD serves as the macro-orchestrator for the model build and model deploy pipelines, which include sourcing, building, and provisioning Amazon SageMaker Pipelines and supporting resources using the SageMaker Python SDK and Terraform.
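On the SageMaker side, the pipelines themselves are defined with the Python SDK, roughly along these lines (a sketch only; the estimator, role ARN, and S3 paths are placeholders, and the CI job or Terraform would supply the real values):

```python
from sagemaker.inputs import TrainingInput
from sagemaker.sklearn.estimator import SKLearn
from sagemaker.workflow.pipeline import Pipeline
from sagemaker.workflow.steps import TrainingStep

ROLE = "arn:aws:iam::123456789012:role/SageMakerRole"  # placeholder

# A minimal training estimator; train.py and the versions are placeholders.
estimator = SKLearn(
    entry_point="train.py",
    role=ROLE,
    instance_type="ml.m5.xlarge",
    instance_count=1,
    framework_version="1.2-1",
    py_version="py3",
)

train_step = TrainingStep(
    name="TrainModel",
    estimator=estimator,
    inputs={"training": TrainingInput(s3_data="s3://my-bucket/features/")},
)

pipeline = Pipeline(name="model-build-pipeline", steps=[train_step])
pipeline.upsert(role_arn=ROLE)  # create or update the pipeline definition
pipeline.start()                # in practice GitLab CI/CD would trigger this
```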
Articles: Netflix explained how they built a federated search over their heterogeneous content for content engineering. PyTerrier is a Python framework for performing information retrieval experiments, built on Terrier. We're eager to collect user feedback to aid our ongoing work to improve this system.
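For a flavor of what a PyTerrier experiment looks like, here is a sketch based on its standard quick-start pattern (the test collection, index path, and metrics are just examples):

```python
import pyterrier as pt

if not pt.started():
    pt.init()

# Small classic test collection that ships with topics and relevance judgements.
dataset = pt.get_dataset("vaswani")
indexref = pt.IterDictIndexer("./vaswani-index").index(dataset.get_corpus_iter())

bm25 = pt.BatchRetrieve(indexref, wmodel="BM25")
tf_idf = pt.BatchRetrieve(indexref, wmodel="TF_IDF")

# Run both retrieval pipelines over the topics and score them against the qrels.
results = pt.Experiment(
    [bm25, tf_idf],
    dataset.get_topics(),
    dataset.get_qrels(),
    eval_metrics=["map", "ndcg"],
)
print(results)
```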
For small-scale/low-value deployments, there might not be many items to focus on, but as the scale and reach of deployment go up, data governance becomes crucial. This includes data quality, privacy, and compliance. Another type of data was images with specific event IDs being dumped to an S3 location, with the rest of the data spread across Redshift, S3, and so on.
There are several techniques used for model monitoring with time series data, including: Data Drift Detection: This involves monitoring the distribution of the input data over time to detect any changes that may impact the model's performance. You can learn more about Comet here.
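One simple technique in that family is the Population Stability Index (PSI), computed between a reference window and each new time window of a feature. The article does not give code; the sketch below uses a conventional but arbitrary bin count and alert threshold:

```python
import numpy as np

def population_stability_index(reference, current, bins=10):
    """PSI between a reference sample and a current sample of one feature."""
    edges = np.histogram_bin_edges(reference, bins=bins)
    ref_pct = np.histogram(reference, bins=edges)[0] / len(reference)
    cur_pct = np.histogram(current, bins=edges)[0] / len(current)
    # Clip to avoid log(0); values outside the reference range are simply ignored here.
    ref_pct = np.clip(ref_pct, 1e-6, None)
    cur_pct = np.clip(cur_pct, 1e-6, None)
    return float(np.sum((cur_pct - ref_pct) * np.log(cur_pct / ref_pct)))

rng = np.random.default_rng(42)
reference = rng.normal(0.0, 1.0, 10_000)   # e.g. the training window
this_week = rng.normal(0.3, 1.2, 2_000)    # hypothetical recent window, shifted

psi = population_stability_index(reference, this_week)
print(f"PSI = {psi:.3f}")  # > 0.2 is a common rule-of-thumb alert level
```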
Together, these data ops efforts ensure that model development time is efficient, model performance is robust, and teams focus more on innovation and customer experience, which is what matters. The piece that connects the model to the application and the data is the explainability of the model. Bayan Bruss: Thanks Kishore.
Data validation: This step collects the transformed data as input and, through a series of tests and validators, ensures that it meets the criteria for the next component. It checks the data for quality issues and detects outliers and anomalies. Is it a black-box model, or can the decisions be explained?
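A bare-bones version of such a validator with Pandas might look like this (the column names, allowed ranges, and z-score cutoff are invented for the example):

```python
import pandas as pd

def validate(df: pd.DataFrame) -> list[str]:
    """Return a list of data-quality issues found in the transformed data."""
    issues = []

    # Schema / completeness checks.
    for col in ("age", "income"):
        if col not in df.columns:
            issues.append(f"missing column: {col}")
        elif df[col].isna().any():
            issues.append(f"nulls in column: {col}")

    # Simple range check.
    if "age" in df.columns and not df["age"].between(0, 120).all():
        issues.append("age outside [0, 120]")

    # Crude outlier / anomaly detection via z-scores.
    if "income" in df.columns:
        z = (df["income"] - df["income"].mean()) / df["income"].std()
        if (z.abs() > 4).any():
            issues.append("possible income outliers (|z| > 4)")

    return issues

df = pd.DataFrame({"age": [34, 45, 29], "income": [52_000, 61_000, 48_000]})
print(validate(df) or "all checks passed")
```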
However, as of now, unleashing the full potential of organisational data is often a privilege of a handful of data scientists and analysts. Most employees don't master the conventional data science toolkit (SQL, Python, R, etc.). Adaptability over time: to use Text2SQL in a durable way, you need to adapt to data drift.
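As a rough sketch of the Text2SQL idea (not the article's implementation; the schema, question, and model name are placeholders, using the OpenAI Python client):

```python
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

schema = "CREATE TABLE orders (id INT, customer TEXT, amount NUMERIC, created_at DATE);"
question = "Total order amount per customer in 2023"

# Ask the model to translate the natural-language question into SQL over the schema.
response = client.chat.completions.create(
    model="gpt-4o-mini",  # placeholder model name
    messages=[
        {"role": "system", "content": f"Translate the user's question into SQL for this schema:\n{schema}"},
        {"role": "user", "content": question},
    ],
)
print(response.choices[0].message.content)
```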