They use real-time data and machine learning (ML) to offer customized loans that fuel sustainable growth and solve the challenges of accessing capital. The classification process needed to operate with low latency to support Lumi's market-leading speed-to-decision commitment. This post is co-written with Paul Pagnan from Lumi.
We recently announced the general availability of cross-account sharing of Amazon SageMaker Model Registry using AWS Resource Access Manager (AWS RAM), making it easier to securely share and discover machine learning (ML) models across your AWS accounts.
Amazon Redshift is the most popular cloud data warehouse, used by tens of thousands of customers to analyze exabytes of data every day. SageMaker Studio is the first fully integrated development environment (IDE) for ML. You can use query_string to filter your dataset with SQL and unload it to Amazon S3.
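The query_string pattern, filter with SQL first and then unload the result, can be sketched locally with the standard library. Here sqlite3 stands in for Redshift and an in-memory buffer stands in for Amazon S3; the table and column names are invented for illustration.

```python
import csv
import io
import sqlite3

# Hypothetical local stand-in: filter a table with a SQL query_string,
# then "unload" the filtered result to CSV (an in-memory buffer here,
# rather than an S3 object).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER, region TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [(1, "us-east", 10.0), (2, "eu-west", 25.0), (3, "us-east", 7.5)],
)

query_string = "SELECT id, amount FROM orders WHERE region = 'us-east'"
rows = conn.execute(query_string).fetchall()

buffer = io.StringIO()  # stand-in for the S3 destination
writer = csv.writer(buffer)
writer.writerow(["id", "amount"])
writer.writerows(rows)
print(buffer.getvalue())
```

The point is the ordering: the SQL predicate runs before any data leaves the warehouse, so only the filtered rows are unloaded.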
Finally, H2O AutoML supports a wide range of machine learning tasks such as regression, time-series forecasting, anomaly detection, and classification. Auto-ViML: Like PyCaret, Auto-ViML is an open-source machine learning library in Python. This makes Auto-ViML an ideal tool for beginners and experts alike.
Here's what you need to know: sktime is a Python package for time series tasks like forecasting, classification, and transformations, with a familiar and user-friendly scikit-learn-like API. You can build tuned auto-ML pipelines with a common interface to well-known libraries (scikit-learn, statsmodels, tsfresh, PyOD, fbprophet, and more).
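The scikit-learn-like fit/predict pattern that sktime follows can be illustrated with a minimal hand-rolled naive forecaster. This class is a hypothetical stand-in for the interface shape, not the actual sktime implementation.

```python
# A minimal sketch of the fit/predict pattern: fit() learns from history,
# predict() takes a forecasting horizon fh (steps ahead).
class NaiveForecaster:
    def fit(self, y):
        self.last_ = y[-1]  # remember the most recent observation
        return self

    def predict(self, fh):
        # A naive forecaster repeats the last observed value
        # for every requested step ahead.
        return [self.last_ for _ in fh]

y = [112, 118, 132, 129, 121]
forecaster = NaiveForecaster().fit(y)
print(forecaster.predict(fh=[1, 2, 3]))  # [121, 121, 121]
```

Swapping in a smarter model changes only the internals; the fit/predict surface stays the same, which is what makes pipelines composable.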
Alignment to other tools in the organization's tech stack: Consider how well the MLOps tool integrates with your existing tools and workflows, such as data sources, data engineering platforms, code repositories, CI/CD pipelines, monitoring systems, and data structures such as Pandas or Apache Spark DataFrames.
Many organizations are implementing machine learning (ML) to enhance their business decision-making through automation and the use of large distributed datasets. With increased access to data, ML has the potential to provide unparalleled business insights and opportunities.
What is MLOps? MLOps, or Machine Learning Operations, is a multidisciplinary field that combines the principles of ML, software engineering, and DevOps practices to streamline the deployment, monitoring, and maintenance of ML models in production environments.
A guide to performing end-to-end computer vision projects with PyTorch-Lightning, Comet ML and Gradio. Image by Freepik. Computer vision is the buzzword at the moment. Today, I'll walk you through how to implement an end-to-end image classification project with the Lightning, Comet ML, and Gradio libraries.
Hey guys, in this blog we will see some of the most asked Data Science interview questions by interviewers in [year]. Data science has become an integral part of many industries, and as a result, the demand for skilled data scientists is soaring. What is Data Science?
Purina used artificial intelligence (AI) and machine learning (ML) to automate animal breed detection at scale. The solution focuses on the fundamental principles of developing an AI/ML application workflow of data preparation, model training, model evaluation, and model monitoring. DynamoDB is used to store the pet attributes.
I was fascinated by how much human knowledge—anything anyone had ever deemed patentable—was readily available, yet so inaccessible because it was so hard to do even the simplest analysis over complex technical text and multi-modal data. When that’s the case, the best way to improve these models is to supply them with more and better data.
For any machine learning (ML) problem, the data scientist begins by working with data. This includes gathering, exploring, and understanding the business and technical aspects of the data, along with evaluation of any manipulations that may be needed for the model building process.
The Falcon 2 11B model is available on SageMaker JumpStart, a machine learning (ML) hub that provides access to built-in algorithms, FMs, and pre-built ML solutions that you can deploy quickly to get started with ML faster. The model has 11 billion parameters and was trained on a multi-trillion-token dataset consisting primarily of web data from RefinedWeb.
Statistical methods and machine learning (ML) methods are actively developed and adopted to maximize the LTV. In this post, we share how Kakao Games and the Amazon Machine Learning Solutions Lab teamed up to build a scalable and reliable LTV prediction solution by using AWS data and ML services such as AWS Glue and Amazon SageMaker.
Don’t think you have to manually do all of the data curation work yourself! New algorithms/software can help you systematically curate your data via automation. In this post, I’ll give a high-level overview of how AI/ML can be used to automatically detect various issues common in real-world datasets.
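One of the simplest checks such automated-curation tools can run is flagging statistical outliers. A minimal standard-library sketch, using a z-score rule on toy data:

```python
import statistics

def flag_outliers(values, z_thresh=3.0):
    """Flag indices whose z-score exceeds the threshold -- one simple,
    automatable check of the kind data-curation tools run at scale."""
    mean = statistics.fmean(values)
    stdev = statistics.stdev(values)
    return [i for i, v in enumerate(values) if abs(v - mean) / stdev > z_thresh]

readings = [10.1, 9.8, 10.3, 9.9, 10.0, 55.0]  # last value looks corrupted
print(flag_outliers(readings, z_thresh=2.0))   # [5]
```

Real curation systems layer many such detectors (duplicates, label errors, drift), but each one follows this shape: compute a score per example, then flag examples past a threshold for review.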
Although machine learning (ML) can provide valuable insights, ML experts were needed to build customer churn prediction models until the introduction of Amazon SageMaker Canvas. It also enables you to evaluate the models using advanced metrics as if you were a data scientist.
They are as follows: Node-level tasks refer to tasks that concentrate on nodes, such as node classification, node regression, and node clustering. Edge-level tasks , on the other hand, entail edge classification and link prediction. Graph-level tasks involve graph classification, graph regression, and graph matching.
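As a toy illustration of a node-level task, here is node classification by propagating labels from neighbors on a small path graph. This hand-rolled majority-vote loop is only a stand-in for the neighborhood aggregation a GNN learns, not a GNN itself.

```python
# A toy node-classification task: spread known labels across an undirected
# path graph 0-1-2-3-4 by majority vote of already-labeled neighbors.
edges = [(0, 1), (1, 2), (2, 3), (3, 4)]
labels = {0: "A", 4: "B"}  # only two nodes start labeled

neighbors = {n: set() for n in range(5)}
for u, v in edges:
    neighbors[u].add(v)
    neighbors[v].add(u)

for _ in range(3):  # a few propagation rounds
    for node in range(5):
        if node in labels:
            continue
        votes = [labels[m] for m in neighbors[node] if m in labels]
        if votes:
            # sorted() makes tie-breaking deterministic (alphabetical)
            labels[node] = max(sorted(set(votes)), key=votes.count)
print(labels)
```

Edge-level tasks would instead score pairs (u, v), e.g. predicting whether a missing edge should exist, and graph-level tasks would pool all node states into one prediction for the whole graph.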
You can deploy this solution with just a few clicks using Amazon SageMaker JumpStart , a fully managed platform that offers state-of-the-art foundation models for various use cases such as content writing, code generation, question answering, copywriting, summarization, classification, and information retrieval.
SageMaker provides different options for model inference. If you're not actively using an endpoint for an extended period, you can delete it or set up an auto scaling policy to reduce your costs on model endpoints.
Scaling clinical trial screening with document classification Memorial Sloan Kettering Cancer Center, the world's oldest and largest private cancer center, provides care to increase the quality of life of more than 150,000 cancer patients annually. However, a lack of labeled training data bottlenecked their progress.
Most, if not all, machine learning (ML) models in production today were born in notebooks before they were put into production. 42% of data scientists are solo practitioners or on teams of five or fewer people. Auto-scale compute.
Harness a flywheel approach, wherein continuous data feedback is utilized to routinely orchestrate and evaluate enhancements to your models and processes. Common stages include data capture, document classification, document text extraction, content enrichment, document review and validation , and data consumption.
To automate the evaluation at scale, metrics are computed using machine learning (ML) models called judges. Skip the preamble or explanation, and provide the classification. Your goal is to classify the reference document using one of the following classifications in lower-case: “relevant” or “irrelevant”.
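The judge pattern above reduces to two pieces: a prompt that constrains the model to a fixed label set, and strict parsing of the reply. A sketch, where the prompt text mirrors the instructions quoted above and the parsing function is a hypothetical helper:

```python
# Sketch of the LLM-as-judge pattern: a constrained classification prompt
# plus strict parsing of the model's reply. The actual model call is
# omitted; parse_judgement handles whatever text the judge returns.
PROMPT = (
    "Your goal is to classify the reference document using one of the "
    'following classifications in lower-case: "relevant" or "irrelevant". '
    "Skip the preamble or explanation, and provide the classification.\n\n"
    "Reference document:\n{document}"
)

def parse_judgement(reply):
    label = reply.strip().lower()
    if label not in ("relevant", "irrelevant"):
        raise ValueError(f"unexpected judge output: {reply!r}")
    return label

print(parse_judgement("  Relevant\n"))  # "relevant"
```

Raising on unexpected output matters at scale: a judge that occasionally ignores the format should surface as an error to retry, not silently corrupt the computed metrics.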
Photo by Scott Webb on Unsplash. Determining the value of housing is a classic example of using machine learning (ML). Almost 50 years later, the estimation of housing prices has become an important teaching tool for students and professionals interested in using data and ML in business decision-making.
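The classic setup, predicting price from a single feature, can be sketched as closed-form ordinary least squares. The numbers below are toy values chosen for illustration.

```python
# Fit price = slope * area + intercept by ordinary least squares,
# using the closed-form solution for one feature.
areas = [50.0, 80.0, 100.0, 120.0]      # square meters (toy data)
prices = [150.0, 240.0, 300.0, 360.0]   # price in thousands (toy data)

n = len(areas)
mean_x = sum(areas) / n
mean_y = sum(prices) / n
slope = sum((x - mean_x) * (y - mean_y) for x, y in zip(areas, prices)) / \
        sum((x - mean_x) ** 2 for x in areas)
intercept = mean_y - slope * mean_x

print(slope, intercept)  # 3.0 0.0 for this exactly linear toy data
```

Real housing models add many features (location, age, condition), but the teaching value is the same: a transparent mapping from observed data to a price estimate.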
Snorkel Flow supports tasks such as classification and information extraction using programmatic labeling, fine-tuning, and distillation. Latest features and platform improvements for Snorkel Flow: Snorkel Flow provides an end-to-end machine learning solution designed around a data-centric approach. It allows you to dive deep into each LF and understand it in detail.
Make sure that you import the Comet library before PyTorch to benefit from auto-logging features. Choosing Models for Classification: When it comes to choosing a computer vision model for a classification task, there are several factors to consider, such as accuracy, speed, and model size. Pre-trained models, such as VGG and ResNet, are common starting points.
Using Snorkel Flow, Pixability leveraged foundation models to build small, deployable classification models capable of categorizing videos across more than 600 different classes with 90% accuracy in just a few weeks. To help brands maximize their reach, they need to constantly and accurately categorize billions of YouTube videos.
Tracking your image classification experiments with Comet ML. Photo from nmedia on Shutterstock.com. Introduction: Image classification is a task that involves training a neural network to recognize and classify items in images. A convolutional neural network (CNN) is primarily used for image classification.
Python is arguably the most broadly used programming language in the data science community. DataRobot will automatically perform a data quality assessment and determine the problem domain to solve for, whether that be binary classification, regression, etc. Consuming AI/ML Insights for Faster Decision Making.
In this article, we discuss key Snorkel Flow features and capabilities that help data science and machine learning teams adapt NLP models to non-English languages. For text classification, however, there are many similarities. This may require extensive customization and fine-tuning of the model.
But deep down, we know we could achieve better results with a different approach; after all, in ML there's no one-size-fits-all solution. For this post, we'll be using LazyRegressor() because we're working on a regression task, but the steps are the same for classification problems (we'd just use LazyClassifier() instead).
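The idea behind LazyRegressor, fitting many candidate models at once and ranking them on a single metric, can be sketched with two trivial hand-rolled baselines. These stand-ins are for illustration only and are not the lazypredict API.

```python
# Fit several candidate regressors on the same data and rank them by MSE,
# mimicking the "try everything, then compare" workflow of LazyRegressor.
X = [1.0, 2.0, 3.0, 4.0]
y = [2.1, 4.0, 6.2, 7.9]  # roughly y = 2x (toy data)

def mean_model(X, y):
    m = sum(y) / len(y)
    return lambda x: m  # always predict the training mean

def linear_model(X, y):
    mx, my = sum(X) / len(X), sum(y) / len(y)
    slope = sum((a - mx) * (b - my) for a, b in zip(X, y)) / \
            sum((a - mx) ** 2 for a in X)
    return lambda x: my + slope * (x - mx)  # one-feature least squares

results = {}
for name, builder in [("mean", mean_model), ("linear", linear_model)]:
    predict = builder(X, y)
    results[name] = sum((predict(a) - b) ** 2 for a, b in zip(X, y)) / len(y)

best = min(results, key=results.get)
print(best)  # "linear" -- it fits this near-linear data far better
```

Libraries like lazypredict do exactly this loop over dozens of scikit-learn estimators and return the leaderboard as a DataFrame.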
Adherence to such public health programs is a prevalent challenge, so researchers from Google Research and the Indian Institute of Technology, Madras worked with ARMMAN to design an ML system that alerts healthcare providers about participants at risk of dropping out of the health information program.
Streamlit is a good choice for developers and teams that are well-versed in data science and want to deploy AI models easily and quickly with a few lines of code. Viso Suite doesn't just cover model training but extends to the entire ML pipeline, from sourcing data to security.
Build and deploy your own sentiment classification app using Python and Streamlit. Source: Author. Nowadays, working on tabular data is not the only thing in machine learning (ML); data formats like image, video, and text are also common. This approach is mostly used for small datasets where ML models cannot be effective.
This article was originally an episode of MLOps Live, an interactive Q&A session where ML practitioners answer questions from other ML practitioners. Every episode is focused on one specific ML topic, and during this one, we talked to Michal Tadeusiak about managing computer vision projects. Then we are there to help.
This piece of data that my mentor found is called the "SemCor Corpus" [5] (we access the dataset via NLTK's SemcorCorpusReader [6]). The reformatted version of the dataset looks something like this. It might look quite overwhelming, but this is what data science and computer engineering are about.
You can easily try out these models and use them with SageMaker JumpStart, which is a machine learning (ML) hub that provides access to algorithms, models, and ML solutions so you can quickly get started with ML. What is Llama 2 Llama 2 is an auto-regressive language model that uses an optimized transformer architecture.
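"Auto-regressive" means each new token is conditioned on everything generated so far. A toy greedy-decoding loop over a made-up bigram table shows the shape of the process (real models replace the table lookup with a transformer's next-token distribution):

```python
# Toy auto-regressive generation: repeatedly predict the next token from
# the sequence so far and append it. The bigram table is a stand-in for
# a learned next-token model.
bigram = {"the": "cat", "cat": "sat", "sat": "down"}

def generate(prompt, max_new_tokens=3):
    tokens = prompt.split()
    for _ in range(max_new_tokens):
        nxt = bigram.get(tokens[-1])  # condition on the sequence so far
        if nxt is None:               # stop when there is no continuation
            break
        tokens.append(nxt)
    return " ".join(tokens)

print(generate("the"))  # "the cat sat down"
```

The loop structure is why generation cost grows with output length: every new token requires another forward pass over the (growing) context.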
This article will walk you through how to process large medical images efficiently using Apache Beam, and we'll use a specific example to explore the following: how to approach using huge images in ML/AI, different libraries for dealing with such images, and how to create efficient parallel processing pipelines. Ready for some serious knowledge-sharing?
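The core pattern for huge images, split into tiles, process tiles in parallel, then stitch the results, can be shown in miniature with the standard library. A nested list stands in for the image; Apache Beam generalizes the same map step to a distributed pipeline.

```python
from concurrent.futures import ThreadPoolExecutor

# Toy 8x8 "image" where pixel (i, j) has value i + j.
image = [[i + j for j in range(8)] for i in range(8)]
TILE = 4

def tiles(img, size):
    """Yield square tiles of the image, row-major order."""
    for r in range(0, len(img), size):
        for c in range(0, len(img[0]), size):
            yield [row[c:c + size] for row in img[r:r + size]]

def tile_mean(tile):
    """Per-tile work: here just the mean pixel value."""
    flat = [px for row in tile for px in row]
    return sum(flat) / len(flat)

# Process tiles in parallel, then collect results in tile order.
with ThreadPoolExecutor() as pool:
    means = list(pool.map(tile_mean, tiles(image, TILE)))
print(means)  # [3.0, 7.0, 7.0, 11.0]
```

Because each tile is independent, the per-tile function can be handed to any parallel map, whether threads, processes, or a Beam ParDo over a cluster, without changing the tiling or stitching logic.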
The machine learning (ML) lifecycle defines steps to derive value and meet business objectives using ML and artificial intelligence (AI). Use case: To drive understanding of the containerization of machine learning applications, we will build an end-to-end ML classification application. Prerequisite: Python 3.8
In the first part of this three-part series, we presented a solution that demonstrates how you can automate detecting document tampering and fraud at scale using AWS AI and machine learning (ML) services for a mortgage underwriting use case. Set up the notebook environment with the Data Science 3.0 image and an ml.t3.medium instance.
Machine learning (ML) applications are complex to deploy: they often require the ability to hyper-scale while meeting ultra-low latency requirements and stringent cost budgets. Deploying ML models at scale with optimized cost and compute efficiency can be a daunting and cumbersome task. Design patterns for building ML applications.