As emerging DevOps trends redefine software development, companies leverage advanced capabilities to speed up their AI adoption. When unstructured data surfaces during AI development, the DevOps process plays a crucial role in data cleansing, ultimately enhancing the overall model quality.
Businesses are under pressure to show return on investment (ROI) from AI use cases, whether predictive machine learning (ML) or generative AI. Only 54% of ML prototypes make it to production, and only 5% of generative AI use cases do. Using SageMaker, you can build, train, and deploy ML models.
We recently announced the general availability of cross-account sharing of Amazon SageMaker Model Registry using AWS Resource Access Manager (AWS RAM), making it easier to securely share and discover machine learning (ML) models across your AWS accounts.
Deep learning is a branch of machine learning that makes use of neural networks with numerous layers to discover intricate data patterns. Deep learning models use artificial neural networks to learn from data. Online learning: incremental training of the model on new data as it arrives.
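As a rough sketch of what online learning means in practice, the snippet below updates a classifier one mini-batch at a time as data "arrives", instead of retraining from scratch. The data is synthetic and the model choice (scikit-learn's `SGDClassifier` with `partial_fit`) is one common option, not something prescribed by the excerpt above.

```python
# Online (incremental) learning sketch: the model is updated on each
# new mini-batch via partial_fit rather than refit on the full dataset.
import numpy as np
from sklearn.linear_model import SGDClassifier

rng = np.random.default_rng(0)
model = SGDClassifier(random_state=0)
classes = np.array([0, 1])  # must be declared up front for partial_fit

for _ in range(50):  # simulate a stream of mini-batches
    X_batch = rng.normal(size=(32, 4))
    y_batch = (X_batch[:, 0] + X_batch[:, 1] > 0).astype(int)
    model.partial_fit(X_batch, y_batch, classes=classes)

# Evaluate on held-out data drawn from the same distribution
X_test = rng.normal(size=(200, 4))
y_test = (X_test[:, 0] + X_test[:, 1] > 0).astype(int)
accuracy = model.score(X_test, y_test)
```

Because the problem is linearly separable, accuracy climbs quickly; the same loop shape applies when batches come from a queue or stream instead of a random generator.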
SageMaker JumpStart is a machine learning (ML) hub that provides a wide range of publicly available and proprietary FMs from providers such as AI21 Labs, Cohere, Hugging Face, Meta, and Stability AI, which you can deploy to SageMaker endpoints in your own AWS account. It’s serverless so you don’t have to manage the infrastructure.
Previously, he was a Data & Machine Learning Engineer at AWS, where he worked closely with customers to develop enterprise-scale data infrastructure, including data lakes, analytics dashboards, and ETL pipelines. He specializes in designing, building, and optimizing large-scale data solutions.
As industries begin adopting processes dependent on machine learning (ML) technologies, it is critical to establish machine learning operations (MLOps) that scale to support growth and utilization of this technology. There were noticeable challenges when running ML workflows in the cloud.
In addition to traditional custom-tailored deep learning models, SageMaker Ground Truth also supports generative AI use cases, enabling the generation of high-quality training data for artificial intelligence and machine learning (AI/ML) models. To learn more, see Use Amazon SageMaker Ground Truth Plus to Label Data.
Michael Dziedzic on Unsplash I am often asked by prospective clients to explain the artificial intelligence (AI) software process, and I have recently been asked by managers with extensive software development and data science experience who wanted to implement MLOps.
Recognizing this challenge as an opportunity for innovation, F1 partnered with Amazon Web Services (AWS) to develop an AI-driven solution using Amazon Bedrock to streamline issue resolution. Creating ETL pipelines to transform log data Preparing your data to provide quality results is the first step in an AI project.
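The "transform" stage of an ETL pipeline for log data often amounts to parsing raw lines into structured records and discarding malformed ones. The sketch below is purely illustrative: the log format and field names are invented, not taken from the F1/AWS solution described above.

```python
# Hypothetical ETL transform step for logs: parse raw lines into
# structured records and drop malformed entries (a basic quality gate).
import re

LOG_PATTERN = re.compile(r"^(?P<ts>\S+) (?P<level>[A-Z]+) (?P<msg>.*)$")

def transform(lines):
    records = []
    for line in lines:
        m = LOG_PATTERN.match(line.strip())
        if m:  # keep only well-formed entries
            records.append(m.groupdict())
    return records

raw = [
    "2024-05-01T10:00:00Z ERROR sensor timeout on car 44",
    "garbled line without structure??",
    "2024-05-01T10:00:05Z INFO lap completed",
]
records = transform(raw)
```

The same pattern scales up: swap the list for a stream reader and the regex for whatever schema the real logs follow.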
Amazon SageMaker Data Wrangler is a single visual interface that reduces the time required to prepare data and perform feature engineering from weeks to minutes with the ability to select and clean data, create features, and automate data preparation in machine learning (ML) workflows without writing any code.
Steven Hillion is the Senior Vice President of Data and AI at Astronomer , where he leverages his extensive academic background in research mathematics and over 15 years of experience in Silicon Valley's machine learning platform development. Can you elaborate on the use of synthetic data to fine-tune smaller models for accuracy?
LLMs in comparison with traditional ML models Unlike traditional machine learning models, which often require extensive feature engineering and domain-specific adjustments, LLMs can generalize from vast datasets without the need for such tailored configurations. This makes them versatile and highly adaptable across different use cases.
In the world of machine learning (ML), the quality of the dataset is of significant importance to model predictability. Although more data is usually better, large datasets with a high number of features can sometimes lead to non-optimal model performance due to the curse of dimensionality. For Target column, choose label.
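One common way to mitigate the curse of dimensionality is to project a wide feature set down to the few directions that actually carry signal. The sketch below uses scikit-learn's PCA on synthetic data; the dimensions and threshold are illustrative assumptions, not values from the excerpt.

```python
# Dimensionality-reduction sketch: 50 raw features, but only 3 latent
# directions carry real signal. PCA with a variance threshold recovers
# a compact representation before model training.
import numpy as np
from sklearn.decomposition import PCA

rng = np.random.default_rng(42)
signal = rng.normal(size=(200, 3))              # 3 true latent factors
mixing = rng.normal(size=(3, 50))               # spread across 50 features
X = signal @ mixing + 0.01 * rng.normal(size=(200, 50))

# Keep the smallest number of components explaining >= 95% of variance
pca = PCA(n_components=0.95)
X_reduced = pca.fit_transform(X)
```

Training a model on `X_reduced` instead of `X` keeps the informative structure while discarding dozens of near-redundant dimensions.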
As machine learning (ML) models have improved, data scientists, ML engineers, and researchers have shifted more of their attention to defining and bettering data quality. Applying these techniques allows ML practitioners to reduce the amount of data required to train an ML model.
Amazon SageMaker provides purpose-built tools for machine learning operations (MLOps) to help automate and standardize processes across the ML lifecycle. In this post, we describe how Philips partnered with AWS to develop AI ToolSuite—a scalable, secure, and compliant ML platform on SageMaker.
After you build, train, and evaluate your machine learning (ML) model to ensure it’s solving the intended business problem proposed, you want to deploy that model to enable decision-making in business operations. SageMaker deployment guardrails Guardrails are an essential part of software development.
Therefore, when the Principal team started tackling this project, they knew that ensuring the highest standard of data security such as regulatory compliance, data privacy, and data quality would be a non-negotiable, key requirement.
Few nonusers (2%) report that lack of data or data quality is an issue, and only 1.3% of AI users are definitely facing these problems: 7% report that data quality has hindered further adoption, and 4% cite the difficulty of training a model on their data. Deploying and managing AI products isn’t simple.
Although machine learning (ML) can provide valuable insights, ML experts were needed to build customer churn prediction models until the introduction of Amazon SageMaker Canvas. Additional key topics Advanced metrics are not the only important tools available to you for evaluating and improving ML model performance.
This is the common belief that if you just build cool software, people will line up to buy it. This never works, and the solution is a robust marketing process connected with your software development process. Bad data for them could mean a provider gets more shifts than they can handle, leading to burnout.
Understanding Machine Learning algorithms and effective data handling are also critical for success in the field. Introduction Machine Learning ( ML ) is revolutionising industries, from healthcare and finance to retail and manufacturing. Fundamental Programming Skills Strong programming skills are essential for success in ML.
As an MLOps engineer on your team, you are often tasked with improving the workflow of your data scientists by adding capabilities to your ML platform or by building standalone tools for them to use. And since you are reading this article, the data scientists you support have probably reached out for help.
Compilation or integration to optimized runtime ML compilers, such as Amazon SageMaker Neo, apply techniques such as operator fusion, memory planning, graph optimizations, and automatic integration to optimized inference libraries. Rohith Nallamaddi is a Software Development Engineer at AWS.
It provides a detailed overview of each library’s unique contributions and explains how they can be combined to create a functional system that can detect and correct linguistic errors in text data. Training data quality and bias: ML-based grammar checkers heavily rely on training data to learn patterns and make predictions.
This includes the tools and techniques we used to streamline the ML model development and deployment processes, as well as the measures taken to monitor and maintain models in a production environment. Costs: Oftentimes, cost is the most important aspect of any ML model deployment. I would say the same happened in our case.
His presentation also highlights the ways that Snorkel’s platform, Snorkel Flow, enables users to rapidly and programmatically label and develop datasets and then use them to train ML models. So all of this points to the pain or pessimistic bottleneck “takes” around data.
YData By enhancing the caliber of training datasets, YData offers a data-centric platform that speeds up the creation and raises the return on investment of AI solutions. Data scientists can now enhance datasets using cutting-edge synthetic data generation and automated data quality profiling.
According to Gartner, demand forecasting is the most widely used ML application in supply chain planning. We started with data loading and preprocessing, fixing optimization issues to allow the model to process years of historical data across all 8,000 stores. Several breakthroughs enabled us to fix data quality issues within the dataset.
There are also a variety of capabilities that can be very useful for ML/Data Science Practitioners for data related or feature related tasks. Data Tasks ChatGPT can handle a wide range of data-related tasks by writing and executing Python code behind the scenes, without users needing coding expertise.
Once the data is loaded into the data warehouse, it can be queried by business analysts and data scientists to perform various analyses such as customer segmentation, product recommendations, and trend analysis.
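To make the "queried by business analysts" step concrete, here is a minimal, self-contained illustration of a segmentation query. The table, columns, and spend threshold are all invented for the example (using Python's built-in sqlite3 as a stand-in for a real warehouse engine).

```python
# Hypothetical segmentation query of the kind analysts run against a
# warehouse; sqlite3 stands in for the real engine here.
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (customer TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?)",
    [("alice", 120.0), ("alice", 80.0), ("bob", 30.0), ("carol", 300.0)],
)

# Segment customers by total spend
rows = conn.execute(
    """
    SELECT customer,
           SUM(amount) AS total,
           CASE WHEN SUM(amount) >= 200 THEN 'high' ELSE 'standard' END AS segment
    FROM orders
    GROUP BY customer
    ORDER BY total DESC
    """
).fetchall()
```

The same GROUP BY / CASE shape underlies most segmentation, recommendation-input, and trend queries, whatever the warehouse.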
While each of them offers exciting perspectives for research, a real-life product needs to combine the data, the model, and the human-machine interaction into a coherent system. AI development is a highly collaborative enterprise. Train your ML model from scratch.
Automation eliminates potential mistakes and enhances the data quality of the system. Step 3: Comprehensive Strategy Development While the two previous steps shape the background for your automation path, at this stage, you should start creating a strategy relying on the collected information.
This article was originally an episode of the MLOps Live , an interactive Q&A session where ML practitioners answer questions from other ML practitioners. Every episode is focused on one specific ML topic, and during this one, we talked to Jason Falks about deploying conversational AI products to production. Stephen: Great.
Many customers are looking for guidance on how to manage security, privacy, and compliance as they develop generative AI applications. Build organizational resiliency around generative AI Organizations can start adopting ways to build their capacity and capabilities for AI/ML and generative AI security within their organizations.
Techniques such as distillation, RAG, quantization, and, of course, data quality curation have been developed to empower smaller and more efficient models. WATCH VIDEOS 🔎 ML Research Who is Harry Potter? Anysphere raised $8 million to build an AI-native software development environment.
He joined the company as a software developer in 2004 after studying computer science with a heavy focus on databases, distributed systems, software development processes, and genetic algorithms. By 2005, he was responsible for the Database Optimizer team and in 2007 he became Head of Research & Development.
From gathering and processing data to building models through experiments, deploying the best ones, and managing them at scale for continuous value in production—it’s a lot. As the number of ML-powered apps and services grows, it gets overwhelming for data scientists and ML engineers to build and deploy models at scale.
Generative artificial intelligence (AI) has revolutionized this by allowing users to interact with data through natural language queries, providing instant insights and visualizations without needing technical expertise. This can democratize data access and speed up analysis.
LLMs have revolutionized software development by automating coding tasks and bridging the gap between natural language and programming. This limitation arises from the scarcity of high-quality parallel code data in pre-training datasets and the inherent complexity of parallel programming.
Technical Innovations and Benefits rStar-Math's success is underpinned by three core innovations: Code-Augmented CoT Data Synthesis: The system uses MCTS rollouts to generate step-by-step verified reasoning trajectories. Don't forget to join our 60k+ ML SubReddit. Check out the Paper.
The benefits of this solution are: You can flexibly achieve data cleaning, sanitizing, and dataquality management in addition to chunking and embedding. You can build and manage an incremental data pipeline to update embeddings on Vectorstore at scale. You can choose a wide variety of embedding models.
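The chunking step mentioned above is simple to sketch: split text into fixed-size, overlapping windows before computing embeddings, so that context at chunk boundaries is not lost. The chunk size and overlap below are illustrative parameters, not values from the post, and the embedding call itself is omitted.

```python
# Chunking sketch for an embedding pipeline: fixed-size chunks with
# overlap, so adjacent chunks share boundary context.
def chunk_text(text: str, chunk_size: int = 20, overlap: int = 5) -> list:
    if overlap >= chunk_size:
        raise ValueError("overlap must be smaller than chunk_size")
    step = chunk_size - overlap
    return [text[i : i + chunk_size]
            for i in range(0, max(len(text) - overlap, 1), step)]

chunks = chunk_text("The quick brown fox jumps over the lazy dog",
                    chunk_size=20, overlap=5)
```

Each chunk would then be passed to whatever embedding model the pipeline uses; because the windows overlap, stitching the chunks back together (dropping each chunk's first `overlap` characters after the first chunk) reproduces the original text.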
There are various technologies that help operationalize and optimize the process of field trials, including data management and analytics, IoT, remote sensing, robotics, machine learning (ML), and now generative AI. Multi-source data is initially received and stored in an Amazon Simple Storage Service (Amazon S3) data lake.