Data Ingestion, ML and ML Engineer - Artificial Intelligence Zone

How Rocket Companies modernized their data science solution on AWS

AWS Machine Learning Blog

FEBRUARY 21, 2025

Data exploration and model development were conducted using well-known machine learning (ML) tools such as Jupyter or Apache Zeppelin notebooks. Apache Hive was used to provide a tabular interface to data stored in HDFS, and to integrate with Apache Spark SQL. This created a challenge for data scientists to become productive.

Data Science

Data Science Data Scientist Data Ingestion DevOps

Deliver your first ML use case in 8–12 weeks

AWS Machine Learning Blog

APRIL 26, 2023

Do you need help to move your organization’s Machine Learning (ML) journey from pilot to production? Most executives think ML can apply to any business decision, but on average only half of the ML projects make it to production. Challenges Customers may face several challenges when implementing machine learning (ML) solutions.

ML

ML Machine Learning Data Science Data Drift

Data4ML Preparation Guidelines (Beyond The Basics)

Towards AI

NOVEMBER 8, 2024

Data preparation isn’t just a part of the ML engineering process — it’s the heart of it. Photo by Myriam Jessier on Unsplash To set the stage, let’s examine the nuances between research-phase data and production-phase data. Data is a key differentiator in ML projects (more on this in my blog post below).

Data Ingestion

Data Ingestion Metadata ML Engineer ML

Webinars

How to Achieve High-Accuracy Results When Using LLMs

Relevance, Reach, Revenue: How to Turn Marketing Trends From Hype to High-Impact

MORE WEBINARS

Airbnb Researchers Develop Chronon: A Framework for Developing Production-Grade Features for Machine Learning Models

Marktechpost

AUGUST 8, 2023

In the ever-evolving landscape of machine learning, feature management has emerged as a key pain point for ML Engineers at Airbnb. Airbnb recognized the need for a solution that could streamline feature data management, provide real-time updates, and ensure consistency between training and production environments.

Machine Learning

Machine Learning ML Engineer Data Ingestion ML

Governing the ML lifecycle at scale, Part 4: Scaling MLOps with security and governance controls

AWS Machine Learning Blog

FEBRUARY 7, 2025

This post, part of the Governing the ML lifecycle at scale series ( Part 1 , Part 2 , Part 3 ), explains how to set up and govern a multi-account ML platform that addresses these challenges. An enterprise might have the following roles involved in the ML lifecycles. This ML platform provides several key benefits.

ML

ML Data Scientist ML Engineer Data Science

How Earth.com and Provectus implemented their MLOps Infrastructure with Amazon SageMaker

AWS Machine Learning Blog

JUNE 27, 2023

When machine learning (ML) models are deployed into production and employed to drive business decisions, the challenge often lies in the operation and management of multiple models. That is where Provectus , an AWS Premier Consulting Partner with competencies in Machine Learning, Data & Analytics, and DevOps, stepped in.

DevOps

DevOps ML Machine Learning ML Engineer

How Axfood enables accelerated machine learning throughout the organization using Amazon SageMaker

AWS Machine Learning Blog

FEBRUARY 27, 2024

In this post, we share how Axfood, a large Swedish food retailer, improved operations and scalability of their existing artificial intelligence (AI) and machine learning (ML) operations by prototyping in close collaboration with AWS experts and using Amazon SageMaker. This is a guest post written by Axfood AB.

Machine Learning

Machine Learning DevOps Data Scientist Data Quality

First ODSC Europe 2023 Sessions Announced

ODSC - Open Data Science

MARCH 27, 2023

Learn about the flow, difficulties, and tools for performing ML clustering at scale Ori Nakar | Principal Engineer, Threat Research | Imperva Given that there are billions of daily botnet attacks from millions of different IPs, the most difficult challenge of botnet detection is choosing the most relevant data.

Machine Learning

Machine Learning Data Science Deep Learning Data Ingestion

Achieve operational excellence with well-architected generative AI solutions using Amazon Bedrock

AWS Machine Learning Blog

OCTOBER 2, 2024

This is often referred to as platform engineering and can be neatly summarized by the mantra “You (the developer) build and test, and we (the platform engineering team) do all the rest!” Amazon Bedrock is compatible with robust observability features to monitor and manage ML models and applications.

Generative AI

Generative AI Data Ingestion AI AI

Orchestrate Ray-based machine learning workflows using Amazon SageMaker

AWS Machine Learning Blog

SEPTEMBER 18, 2023

Machine learning (ML) is becoming increasingly complex as customers try to solve more and more challenging problems. This complexity often leads to the need for distributed ML, where multiple machines are used to train a single model. SageMaker is a fully managed service for building, training, and deploying ML models.

Machine Learning

Machine Learning ML Python Auto-complete

MLOps Landscape in 2023: Top Tools and Platforms

The MLOps Blog

JUNE 27, 2023

Alignment to other tools in the organization’s tech stack Consider how well the MLOps tool integrates with your existing tools and workflows, such as data sources, data engineering platforms, code repositories, CI/CD pipelines, monitoring systems, etc. and Pandas or Apache Spark DataFrames.

Machine Learning

Machine Learning Metadata Data Scientist Data Quality

Vertex AI: Guide to Google’s Unified Machine Learning Platform

Pickl AI

AUGUST 28, 2024

Introduction In the rapidly evolving landscape of Machine Learning , Google Cloud’s Vertex AI stands out as a unified platform designed to streamline the entire Machine Learning (ML) workflow. This unified approach enables seamless collaboration among data scientists, data engineers, and ML engineers.

Machine Learning

Machine Learning ML Engineer ML Automation

Introducing the Topic Tracks for ODSC East 2025: Spotlight on Gen AI, AI Agents, LLMs, & More

ODSC - Open Data Science

FEBRUARY 25, 2025

Topics Include: Agentic AI DesignPatterns LLMs & RAG forAgents Agent Architectures &Chaining Evaluating AI Agent Performance Building with LangChain and LlamaIndex Real-World Applications of Autonomous Agents Who Should Attend: Data Scientists, Developers, AI Architects, and ML Engineers seeking to build cutting-edge autonomous systems.

Data Scientist

Data Scientist Machine Learning Large Language Models ML Engineer

How to Build Machine Learning Systems With a Feature Store

The MLOps Blog

JANUARY 26, 2024

Luckily, we have tried and trusted tools and architectural patterns that provide a blueprint for reliable ML systems. In this article, I’ll introduce you to a unified architecture for ML systems built around the idea of FTI pipelines and a feature store as the central component. But what is an ML pipeline?

Machine Learning

Machine Learning Metadata ML Python

What Do Data Scientists Do? A Guide to AI Maturity, Challenges, and Solutions

DataRobot Blog

SEPTEMBER 13, 2022

Data scientists drive business outcomes. Many implement machine learning and artificial intelligence to tackle challenges in the age of Big Data. They develop and continuously optimize AI/ML models , collaborating with stakeholders across the enterprise to inform decisions that drive strategic business value. Download Now.

Data Scientist

Data Scientist Automation ML Machine Learning

Machine Learning Operations (MLOPs) with Azure Machine Learning

ODSC - Open Data Science

JULY 19, 2023

Machine Learning Operations (MLOps) can significantly accelerate how data scientists and ML engineers meet organizational needs. A well-implemented MLOps process not only expedites the transition from testing to production but also offers ownership, lineage, and historical data about ML artifacts used within the team.

Machine Learning

Machine Learning Data Drift Data Science Data Scientist

Up Your Machine Learning Game With These ODSC East 2024 Sessions

ODSC - Open Data Science

FEBRUARY 22, 2024

Andre Franca | CTO | connectedFlow Join this session to demystify the world of Causal AI, with a focus on understanding cause-and-effect relationships within data to drive optimal decisions. By the end of this session, you’ll have a practical blueprint to efficiently harness feature stores within ML workflows.

Machine Learning

Machine Learning Data Science Python ML

Announcing the First Sessions for ODSC East 2024

ODSC - Open Data Science

JANUARY 10, 2024

Andre Franca | CTO | connectedFlow Explore the world of Causal AI for data science practitioners, with a focus on understanding cause-and-effect relationships within data to drive optimal decisions. No-Code and Low-Code AI: A Practical Project-Driven Approach to ML Gwendolyn D.

Large Language Models

Large Language Models Deep Learning LLM Data Science

How to Build an End-To-End ML Pipeline

The MLOps Blog

MAY 9, 2023

One of the most prevalent complaints we hear from ML engineers in the community is how costly and error-prone it is to manually go through the ML workflow of building and deploying models. Building end-to-end machine learning pipelines lets ML engineers build once, rerun, and reuse many times.

ML

ML Machine Learning Metadata Data Science

ML Pipeline Architecture Design Patterns (With 10 Real-World Examples)

The MLOps Blog

AUGUST 11, 2023

There comes a time when every ML practitioner realizes that training a model in Jupyter Notebook is just one small part of the entire project. Getting a workflow ready which takes your data from its raw form to predictions while maintaining responsiveness and flexibility is the real deal.

ML

ML Machine Learning Data Ingestion Deep Learning

Migrating to Amazon SageMaker: Karini AI Cut Costs by 23%

AWS Machine Learning Blog

SEPTEMBER 24, 2024

For production deployment, the no-code recipes enable easy assembly of the data ingestion pipeline to create a knowledge base and deployment of RAG or agentic chains. These solutions include two primary components: a data ingestion pipeline for building a knowledge base and a system for knowledge retrieval and summarization.

Data Ingestion

Data Ingestion Machine Learning Large Language Models Generative AI

Definite Guide to Building a Machine Learning Platform

The MLOps Blog

MARCH 21, 2023

From gathering and processing data to building models through experiments, deploying the best ones, and managing them at scale for continuous value in production—it’s a lot. As the number of ML-powered apps and services grows, it gets overwhelming for data scientists and ML engineers to build and deploy models at scale.

Machine Learning

Machine Learning Data Scientist ML Metadata

Dive deep into vector data stores using Amazon Bedrock Knowledge Bases

AWS Machine Learning Blog

OCTOBER 11, 2024

Data lineage and auditing – Metadata can provide information about the provenance and lineage of documents, such as the source system, data ingestion pipeline, or other transformations applied to the data. This information can be valuable for data governance, auditing, and compliance purposes.

Metadata

Metadata Generative AI LLM Data Ingestion

Artificial Intelligence Zone

How Rocket Companies modernized their data science solution on AWS

Deliver your first ML use case in 8–12 weeks

Webinars

Trending Sources

Data4ML Preparation Guidelines (Beyond The Basics)

Webinars

Airbnb Researchers Develop Chronon: A Framework for Developing Production-Grade Features for Machine Learning Models

Governing the ML lifecycle at scale, Part 4: Scaling MLOps with security and governance controls

How Earth.com and Provectus implemented their MLOps Infrastructure with Amazon SageMaker

How Axfood enables accelerated machine learning throughout the organization using Amazon SageMaker

First ODSC Europe 2023 Sessions Announced

Achieve operational excellence with well-architected generative AI solutions using Amazon Bedrock

Orchestrate Ray-based machine learning workflows using Amazon SageMaker

MLOps Landscape in 2023: Top Tools and Platforms

Vertex AI: Guide to Google’s Unified Machine Learning Platform

Introducing the Topic Tracks for ODSC East 2025: Spotlight on Gen AI, AI Agents, LLMs, & More

How to Build Machine Learning Systems With a Feature Store

What Do Data Scientists Do? A Guide to AI Maturity, Challenges, and Solutions

Machine Learning Operations (MLOPs) with Azure Machine Learning

Up Your Machine Learning Game With These ODSC East 2024 Sessions

Announcing the First Sessions for ODSC East 2024

How to Build an End-To-End ML Pipeline

ML Pipeline Architecture Design Patterns (With 10 Real-World Examples)

Migrating to Amazon SageMaker: Karini AI Cut Costs by 23%

Definite Guide to Building a Machine Learning Platform

Dive deep into vector data stores using Amazon Bedrock Knowledge Bases

Stay Connected