Rocket's legacy data science environment challenges: Rocket's previous data science solution was built around Apache Spark and combined a legacy version of the Hadoop environment with vendor-provided Data Science Experience development tools.
The field of data science has evolved dramatically over the past several years, driven by technological breakthroughs, industry demands, and shifting priorities within the community. Data Engineering's Steady Growth (2018–2021): data engineering was often mentioned but overshadowed by modeling advancements.
AI and machine learning (ML) models are incredibly effective at doing this but are complex to build and require data science expertise. HT: When companies rely on managing data in a customer data platform (CDP) in tandem with AI, they can create strong, personalised campaigns that reach and inspire their customers.
The right data architecture can help your organization improve data quality because it provides the framework that determines how data is collected, transported, stored, secured, used and shared for business intelligence and data science use cases.
By supporting open-source frameworks and tools for code-based, automated and visual data science capabilities — all in a secure, trusted studio environment — we’re already seeing excitement from companies ready to use both foundation models and machine learning to accomplish key tasks.
IBM merged the critical capabilities of the vendor into its more contemporary Watson Studio, running on the IBM Cloud Pak for Data platform, as it continues to innovate. The platform makes collaborative data science better for corporate users and simplifies predictive analytics for professional data scientists.
In the realm of data management and analytics, businesses face a myriad of options to store, manage, and utilize their data effectively. Understanding their differences, advantages, and ideal use cases is crucial for making informed decisions about your data strategy. Cons: can be costly to implement and maintain.
About the authors: Samantha Stuart is a Data Scientist with AWS Professional Services, and has delivered for customers across generative AI, MLOps, and ETL engagements. He collaborates closely with enterprise customers building modern data platforms, generative AI applications, and MLOps.
Arjuna Chala, associate vice president, HPCC Systems: For those not familiar with the HPCC Systems data lake platform, can you describe your organization and the development history behind HPCC Systems? They were interested in creating a data platform capable of managing a sizable number of datasets.
Confirmed sessions related to software engineering include: Building Data Contracts with Open-Source Tools; Chronon — Open Source Data Platform for AI/ML; Creating APIs That Data Scientists Will Love with FastAPI, SQLAlchemy, and Pydantic; Using APIs in Data Science Without Breaking Anything; Don’t Go Over the Deep End: Building an Effective OSS Management (..)
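The FastAPI session title hints at a now-common pattern: expose data to data scientists through typed endpoints whose request/response contracts are Pydantic models. The sketch below is illustrative only, not the session's code; the feature-store theme, endpoint path, and the in-memory dictionary standing in for a SQLAlchemy-backed store are all assumptions.

```python
# A minimal sketch of a data-science-friendly API with FastAPI and Pydantic.
# The model, route, and FAKE_STORE contents are hypothetical examples.
from fastapi import FastAPI, HTTPException
from pydantic import BaseModel

app = FastAPI(title="Example feature-serving API")

class FeatureRow(BaseModel):
    entity_id: str
    features: dict[str, float]

# Stand-in for a SQLAlchemy-backed store; a real service would query a database.
FAKE_STORE = {"user_42": {"clicks_7d": 3.0, "spend_30d": 120.5}}

@app.get("/features/{entity_id}", response_model=FeatureRow)
def read_features(entity_id: str) -> FeatureRow:
    """Return the feature vector for one entity, or 404 if unknown."""
    if entity_id not in FAKE_STORE:
        raise HTTPException(status_code=404, detail="entity not found")
    return FeatureRow(entity_id=entity_id, features=FAKE_STORE[entity_id])
```

Served with `uvicorn example:app`, FastAPI also generates interactive documentation at /docs from the Pydantic schema, which is much of why such APIs are pleasant for data scientists to consume.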
It supports real-time data processing and has built-in security protocols to ensure data integrity. Some common use cases for Apache Nifi include streaming data from IoT devices, ingesting data into big data platforms, and transferring data between cloud environments.
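NiFi flows are assembled in its UI rather than written as code, but a common way to feed the IoT ingestion pattern above is to front a flow with a ListenHTTP processor and push records to it over HTTP. A minimal sketch, assuming a flow whose ListenHTTP processor is configured on port 8081 with base path `contentListener` (both values are hypothetical and depend on your flow's configuration):

```python
# Push one IoT reading into a NiFi flow fronted by a ListenHTTP processor.
# The host, port, and base path are assumptions; match them to whatever the
# ListenHTTP processor in your own flow is configured with.
import json
import urllib.request

NIFI_ENDPOINT = "http://localhost:8081/contentListener"  # hypothetical

reading = {"device_id": "sensor-17", "temp_c": 21.4, "ts": "2024-01-01T00:00:00Z"}
req = urllib.request.Request(
    NIFI_ENDPOINT,
    data=json.dumps(reading).encode("utf-8"),
    headers={"Content-Type": "application/json"},
    method="POST",
)
with urllib.request.urlopen(req) as resp:
    print(resp.status)  # ListenHTTP replies 200 when the flowfile is accepted
```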
Whether you aim for comprehensive data integration or impactful visual insights, this comparison will clarify the best fit for your goals. Key Takeaways: Microsoft Fabric is a full-scale data platform, while Power BI focuses on visualising insights. Fabric suits large enterprises; Power BI fits team-level reporting needs.
Data Warehousing and ETL Processes: What is a data warehouse, and why is it important? A data warehouse is a centralised repository that consolidates data from various sources for reporting and analysis. It is essential for providing a unified view of data and enabling business intelligence and analytics.
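To make the "unified view" idea concrete, here is a toy sketch: rows from two hypothetical source systems (a CRM and an order system) are consolidated into one analysis-ready table. SQLite stands in for the warehouse engine; the table and column names are invented for illustration.

```python
# Consolidate two hypothetical source systems into one reporting table.
import sqlite3

con = sqlite3.connect(":memory:")
con.executescript("""
    CREATE TABLE crm_customers (customer_id INTEGER, region TEXT);
    CREATE TABLE erp_orders    (customer_id INTEGER, amount REAL);
    INSERT INTO crm_customers VALUES (1, 'EMEA'), (2, 'AMER');
    INSERT INTO erp_orders    VALUES (1, 100.0), (1, 50.0), (2, 75.0);

    -- The warehouse table joins both sources into a single unified view.
    CREATE TABLE dw_sales_by_region AS
    SELECT c.region, SUM(o.amount) AS total_sales
    FROM crm_customers c JOIN erp_orders o USING (customer_id)
    GROUP BY c.region;
""")
print(con.execute(
    "SELECT * FROM dw_sales_by_region ORDER BY region"
).fetchall())
# [('AMER', 75.0), ('EMEA', 150.0)]
```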
Cloud-based data storage solutions, such as Amazon S3 (Simple Storage Service) and Google Cloud Storage, provide highly durable and scalable repositories for storing large volumes of data. It provided a platform for big data processing and machine learning, simplifying the process of building and deploying data pipelines.
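As a small illustration of using S3 as that durable repository, here is a hedged sketch with boto3. The bucket name and object keys are hypothetical, and AWS credentials are assumed to be configured already (for example via environment variables or ~/.aws/credentials).

```python
# Minimal sketch of durable object storage on Amazon S3 via boto3.
import boto3

s3 = boto3.client("s3")

# Upload a local file into a (hypothetical) data-lake bucket under a raw/ prefix.
s3.upload_file(
    Filename="events.parquet",
    Bucket="example-data-lake",
    Key="raw/events/2024/events.parquet",
)

# Downstream pipelines can discover objects by listing a prefix.
listing = s3.list_objects_v2(Bucket="example-data-lake", Prefix="raw/events/")
for obj in listing.get("Contents", []):
    print(obj["Key"], obj["Size"])
```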
Stefan is a software engineer and data scientist, and has also worked as an ML engineer. He ran the data platform at his previous company and is a co-creator of the open-source framework Hamilton. As you’ve been running the ML data platform team, how do you do that? Stefan: Yeah. Thanks for having me.
Uber’s prowess as a transportation, logistics and analytics company hinges on its ability to leverage data effectively. The pursuit of hyperscale analytics: the scale of Uber’s analytical endeavor requires careful selection of data platforms with high regard for limitless analytical processing.
Let’s delve into the key components that form the backbone of a data warehouse. Source Systems: these are the operational databases, CRM systems, and other applications that generate the raw data feeding the data warehouse. Data Extraction, Transformation, and Loading (ETL): this is the workhorse of the architecture.
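The three ETL stages the excerpt names map naturally onto three small functions, sketched below. This is a schematic under assumed inputs: the CSV file names and field names are illustrative, and a real pipeline would load into a warehouse rather than another file.

```python
# Sketch of the extract / transform / load stages, one function per stage.
import csv

def extract(path: str) -> list[dict]:
    """Pull raw rows out of a source-system export (here, a CSV file)."""
    with open(path, newline="") as f:
        return list(csv.DictReader(f))

def transform(rows: list[dict]) -> list[dict]:
    """Clean and conform: drop incomplete rows, normalize types and casing."""
    return [
        {"customer_id": int(r["customer_id"]), "region": r["region"].strip().upper()}
        for r in rows
        if r.get("customer_id") and r.get("region")
    ]

def load(rows: list[dict], out_path: str) -> None:
    """Write conformed rows to the warehouse staging area (here, a CSV file)."""
    with open(out_path, "w", newline="") as f:
        writer = csv.DictWriter(f, fieldnames=["customer_id", "region"])
        writer.writeheader()
        writer.writerows(rows)

load(transform(extract("crm_export.csv")), "staging_customers.csv")
```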
Data Foundation on AWS
Amazon S3: Scalable storage foundation for data lakes.
AWS Lake Formation: Simplifies the process of creating and managing a secure data lake.
Amazon Redshift: Fast, scalable data warehouse for analytics.
AWS Glue: Fully managed ETL service for easy data preparation and integration.
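One way these pieces connect in practice is to catalog data already landed in S3 so the warehouse can query it. The boto3 sketch below registers a Glue database and a crawler over an S3 prefix; the bucket path, IAM role ARN, and resource names are placeholders, not real resources.

```python
# Hypothetical sketch: register a Glue database and a crawler that catalogs
# files already sitting in an S3 data lake. All names below are placeholders.
import boto3

glue = boto3.client("glue")

glue.create_database(DatabaseInput={"Name": "analytics_db"})

glue.create_crawler(
    Name="events-crawler",
    Role="arn:aws:iam::123456789012:role/GlueCrawlerRole",  # placeholder role
    DatabaseName="analytics_db",
    Targets={"S3Targets": [{"Path": "s3://example-data-lake/raw/events/"}]},
)
glue.start_crawler(Name="events-crawler")
# Tables the crawler registers in the Glue Data Catalog become queryable from
# services such as Athena or Redshift Spectrum without moving the data.
```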