Data Platform, ETL and ML - Artificial Intelligence Zone

Data platform trinity: Competitive or complementary?

IBM Journey to AI blog

JANUARY 18, 2023

Data platform architecture has an interesting history. A read-optimized platform that can integrate data from multiple applications emerged. In another decade, the internet and mobile started the generate data of unforeseen volume, variety and velocity. It required a different data platform solution.

Data Platform

Data Platform ETL Metadata Data Discovery

Twilio Segment: Transforming customer experiences with AI

AI News

SEPTEMBER 26, 2023

AI and machine learning (ML) models are incredibly effective at doing this but are complex to build and require data science expertise. HT: When companies rely on managing data in a customer data platform (CDP) in tandem with AI, they can create strong, personalised campaigns that reach and inspire their customers.

Big Data

Big Data AI AI ETL

The Rise and Fall of Data Science Trends: A 2018–2024 Conference Perspective

ODSC - Open Data Science

MARCH 12, 2025

20212024: Interest declined as deep learning and pre-trained models took over, automating many tasks previously handled by classical ML techniques. This shift suggests that while traditional ML is still relevant, its role is now more supportive rather than cutting-edge.

Data Science

Data Science ETL Machine Learning AI Engineer

Webinars

How to Achieve High-Accuracy Results When Using LLMs

Maximizing Profit and Productivity: The New Era of AI-Powered Accounting

Relevance, Reach, Revenue: How to Turn Marketing Trends From Hype to High-Impact

Automation, Evolved: Your New Playbook For Smarter Knowledge Work

MORE WEBINARS

How Rocket Companies modernized their data science solution on AWS

AWS Machine Learning Blog

FEBRUARY 21, 2025

Data exploration and model development were conducted using well-known machine learning (ML) tools such as Jupyter or Apache Zeppelin notebooks. Apache Hive was used to provide a tabular interface to data stored in HDFS, and to integrate with Apache Spark SQL. This created a challenge for data scientists to become productive.

Data Science

Data Science Data Scientist Data Ingestion DevOps

Building ML Platform in Retail and eCommerce

The MLOps Blog

MAY 31, 2023

And eCommerce companies have a ton of use cases where ML can help. The problem is, with more ML models and systems in production, you need to set up more infrastructure to reliably manage everything. And because of that, many companies decide to centralize this effort in an internal ML platform. But how to build it?

ML

ML Algorithm Data Drift Machine Learning

Exploring the AI and data capabilities of watsonx

IBM Journey to AI blog

JULY 17, 2023

is our enterprise-ready next-generation studio for AI builders, bringing together traditional machine learning (ML) and new generative AI capabilities powered by foundation models. Watsonx.data allows customers to augment data warehouses such as Db2 Warehouse and Netezza and optimize workloads for performance and cost. IBM watsonx.ai

Machine Learning

Machine Learning Metadata Automation AI

Improving air quality with generative AI

AWS Machine Learning Blog

JUNE 18, 2024

Despite the challenges, Afri-SET, with limited resources, envisions a comprehensive data management solution for stakeholders seeking sensor hosting on their platform, aiming to deliver accurate data from low-cost sensors. Qiong (Jo) Zhang , PhD, is a Senior Partner Solutions Architect at AWS, specializing in AI/ML.

Generative AI

Generative AI Data Ingestion Python LLM

Bring your own AI using Amazon SageMaker with Salesforce Data Cloud

AWS Machine Learning Blog

AUGUST 4, 2023

As a result, businesses can accelerate time to market while maintaining data integrity and security, and reduce the operational burden of moving data from one location to another. With Einstein Studio, a gateway to AI tools on the data platform, admins and data scientists can effortlessly create models with a few clicks or using code.

Data Scientist

Data Scientist ML ETL Data Platform

AI that’s ready for business starts with data that’s ready for AI

IBM Journey to AI blog

JULY 3, 2024

Your data strategy should incorporate databases designed with open and integrated components, allowing for seamless unification and access to data for advanced analytics and AI applications within a data platform. With an open data lakehouse, you can access a single copy of data wherever your data resides.

Data Quality

Data Quality Metadata Business Intelligence AI

Learnings From Building the ML Platform at Stitch Fix

The MLOps Blog

AUGUST 3, 2023

This article was originally an episode of the ML Platform Podcast , a show where Piotr Niedźwiedź and Aurimas Griciūnas, together with ML platform professionals, discuss design choices, best practices, example tool stacks, and real-world learnings from some of the best ML platform professionals.

ML

ML Data Scientist Software Engineer Machine Learning

How to Build Machine Learning Systems With a Feature Store

The MLOps Blog

JANUARY 26, 2024

Luckily, we have tried and trusted tools and architectural patterns that provide a blueprint for reliable ML systems. In this article, I’ll introduce you to a unified architecture for ML systems built around the idea of FTI pipelines and a feature store as the central component. But what is an ML pipeline?

Machine Learning

Machine Learning Metadata ML Python

Comparing Tools For Data Processing Pipelines

The MLOps Blog

MARCH 15, 2023

Data pipeline stages But before delving deeper into the technical aspects of these tools, let’s quickly understand the core components of a data pipeline succinctly captured in the image below: Data pipeline stages | Source: Author What does a good data pipeline look like? Uses secure protocols for data security.

Categorization

Categorization ETL Data Integration Automation

Ground truth generation and review best practices for evaluating generative AI question-answering with FMEval

AWS Machine Learning Blog

MARCH 5, 2025

About the authors Samantha Stuart is a Data Scientist with AWS Professional Services, and has delivered for customers across generative AI, MLOps, and ETL engagements. He has touched on most aspects of these projects, from infrastructure and DevOps to software development and AI/ML.

Generative AI

Generative AI LLM AI AI

Top Predictive Analytics Tools/Platforms (2023)

Marktechpost

JULY 17, 2023

IBM merged the critical capabilities of the vendor into its more contemporary Watson Studio running on the IBM Cloud Pak for Data platform as it continues to innovate. The platform makes collaborative data science better for corporate users and simplifies predictive analytics for professional data scientists.

Machine Learning

Machine Learning Data Mining Data Scientist Data Science

Why Software Engineers Should Be Embracing AI: A Guide to Staying Ahead

ODSC - Open Data Science

OCTOBER 9, 2024

So what are you waiting for? Get your pass today !

Software Engineer

Software Engineer Software Development DevOps Machine Learning

A brief history of Data Engineering: From IDS to Real-Time streaming

Artificial Corner

JUNE 6, 2023

Cloud-based data storage solutions, such as Amazon S3 (Simple Storage Service) and Google Cloud Storage, provide highly durable and scalable repositories for storing large volumes of data. It’s optimized with performance features like indexing, and customers have seen ETL workloads execute up to 48x faster.

Data Mining

Data Mining Big Data ETL Machine Learning

Ground truth curation and metric interpretation best practices for evaluating generative AI question answering using FMEval

AWS Machine Learning Blog

SEPTEMBER 6, 2024

About the Authors Samantha Stuart is a Data Scientist with AWS Professional Services, and has delivered for customers across generative AI, MLOps, and ETL engagements. Rahul Jani is a Data Architect with AWS Professional Services. Beyond work, he values quality time with family and embraces opportunities for travel.

Generative AI

Generative AI LLM AI AI

Exploring the Power of Data Warehouse Functionality

Pickl AI

JUNE 11, 2024

Let’s delve into the key components that form the backbone of a data warehouse: Source Systems These are the operational databases, CRM systems, and other applications that generate the raw data feeding the data warehouse. Data Extraction, Transformation, and Loading (ETL) This is the workhorse of architecture.

ETL

ETL Data Mining Data Integration Actionable Intelligence

TransOrg’s Cloud Data Engineering Services on AWS, GCP & Snowflake

TransOrg Analytics

SEPTEMBER 24, 2024

Data Foundation on AWS Amazon S3: Scalable storage foundation for data lakes. AWS Lake Formation: Simplify the process of creating and managing a secure data lake. Amazon Redshift: Fast, scalable data warehouse for analytics. AWS Glue: Fully managed ETL service for easy data preparation and integration.

ETL

ETL LLM Data Ingestion Automation

TransOrg’s Cloud Data Engineering Services on AWS, GCP & Snowflake

TransOrg Analytics

SEPTEMBER 24, 2024

Data Foundation on AWS Amazon S3: Scalable storage foundation for data lakes. AWS Lake Formation: Simplify the process of creating and managing a secure data lake. Amazon Redshift: Fast, scalable data warehouse for analytics. AWS Glue: Fully managed ETL service for easy data preparation and integration.

ETL

ETL LLM Data Ingestion Automation

Data democratization: How data architecture can drive business decisions and AI initiatives

IBM Journey to AI blog

AUGUST 4, 2023

It’s often described as a way to simply increase data access, but the transition is about far more than that. When effectively implemented, a data democracy simplifies the data stack, eliminates data gatekeepers, and makes the company’s comprehensive data platform easily accessible by different teams via a user-friendly dashboard.

Machine Learning

Machine Learning Metadata Automation AI

Search enterprise data assets using LLMs backed by knowledge graphs

Flipboard

NOVEMBER 27, 2024

His mission is to enable customers achieve their business goals and create value with data and AI. He helps architect solutions across AI/ML applications, enterprise data platforms, data governance, and unified search in enterprises.

Metadata

Metadata Auto-complete Data Discovery ML Engineer

Artificial Intelligence Zone

Data platform trinity: Competitive or complementary?

Twilio Segment: Transforming customer experiences with AI

Webinars

Trending Sources

The Rise and Fall of Data Science Trends: A 2018–2024 Conference Perspective

Webinars

How Rocket Companies modernized their data science solution on AWS

Building ML Platform in Retail and eCommerce

Exploring the AI and data capabilities of watsonx

Improving air quality with generative AI

Bring your own AI using Amazon SageMaker with Salesforce Data Cloud

AI that’s ready for business starts with data that’s ready for AI

Learnings From Building the ML Platform at Stitch Fix

How to Build Machine Learning Systems With a Feature Store

Comparing Tools For Data Processing Pipelines

Ground truth generation and review best practices for evaluating generative AI question-answering with FMEval

Top Predictive Analytics Tools/Platforms (2023)

Why Software Engineers Should Be Embracing AI: A Guide to Staying Ahead

A brief history of Data Engineering: From IDS to Real-Time streaming

Ground truth curation and metric interpretation best practices for evaluating generative AI question answering using FMEval

Exploring the Power of Data Warehouse Functionality

TransOrg’s Cloud Data Engineering Services on AWS, GCP & Snowflake

TransOrg’s Cloud Data Engineering Services on AWS, GCP & Snowflake

Data democratization: How data architecture can drive business decisions and AI initiatives

Search enterprise data assets using LLMs backed by knowledge graphs

Stay Connected