Over the past decade, data science has undergone a remarkable evolution, driven by rapid advancements in machine learning, artificial intelligence, and big data technologies. This blog dives deep into these shifting trends in data science, spotlighting how conference topics mirror the broader evolution of the field.
The field of data science has evolved dramatically over the past several years, driven by technological breakthroughs, industry demands, and shifting priorities within the community. 2021–2024: With automated insights and AI-driven analytics improving, the emphasis shifted from visualization to explainability and storytelling.
Their comparative analysis included decoder-only transformers like Pythia, encoder-only models like BERT, and state space models (SSMs) like Mamba. The team's latest research expands the analysis to more models and training regimes while offering a comprehensive theoretical framework to explain why intermediate representations excel.
In this post, we demonstrate how to use neural architecture search (NAS) based structural pruning to compress a fine-tuned BERT model to improve model performance and reduce inference times. Solution overview: in this section, we present the overall workflow and explain the approach.
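The post's pruning procedure is driven by neural architecture search, which is not reproduced here. As a minimal sketch of what structural pruning of a BERT model can look like in general, the Hugging Face Transformers library exposes a prune_heads utility that removes whole attention heads; the layer and head indices below are arbitrary placeholders, not the output of any search.

```python
# Minimal structural-pruning sketch with Hugging Face Transformers.
# NOTE: this is NOT the NAS-based method from the post; the pruned heads are
# arbitrary placeholders chosen purely for illustration.
from transformers import BertForSequenceClassification

model = BertForSequenceClassification.from_pretrained("bert-base-uncased")
print(f"Parameters before pruning: {model.num_parameters():,}")

# Remove heads 0 and 1 in layer 0 and head 5 in layer 11 (hypothetical choices;
# a real pruning pipeline would pick these from importance scores or a search).
model.prune_heads({0: [0, 1], 11: [5]})
print(f"Parameters after pruning:  {model.num_parameters():,}")
```

A real workflow would re-evaluate accuracy and latency after each pruning step and keep only the architectures that preserve task performance.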
Explainability of machine learning (ML) models used in the medical domain is becoming increasingly important because models need to be explained from a number of perspectives in order to gain adoption. Explainability of model predictions is required in order for clinicians to make the correct choices on a patient-by-patient basis.
The following is a brief tutorial on how BERT and Transformers work in NLP-based analysis using the masked language model (MLM). In this tutorial, we will provide a little background on the BERT model and how it works; the BERT model was pre-trained using text from Wikipedia.
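To make the MLM idea concrete, here is a minimal sketch using the Hugging Face fill-mask pipeline (the example sentence is made up; it is not from the tutorial itself):

```python
# Masked language modeling with BERT: the model predicts the token hidden
# behind [MASK]. Example sentence is illustrative only.
from transformers import pipeline

fill_mask = pipeline("fill-mask", model="bert-base-uncased")
for pred in fill_mask("The capital of France is [MASK]."):
    print(f"{pred['token_str']:>10}  score={pred['score']:.3f}")
```

Each prediction comes back with a score, so you can see how confident BERT is in each candidate token.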
An open-source machine learning model called BERT was developed by Google in 2018 for NLP, but it had some limitations; to address them, a modified BERT model called RoBERTa (Robustly Optimized BERT Pretraining Approach) was developed by the team at Facebook in 2019.
In this post, we explain how we built an end-to-end product category prediction pipeline to help commercial teams by using Amazon SageMaker and AWS Batch, reducing model training duration by 90%. An important aspect of our strategy has been the use of SageMaker and AWS Batch to refine pre-trained BERT models for seven different languages.
In a recent interview, Chen explained the importance of studying interpretability artifacts not just at the end of a model’s training but throughout its entire learning process. The paper is a case study of syntax acquisition in BERT (Bidirectional Encoder Representations from Transformers).
A specific kind of foundation model known as a large language model (LLM) is trained on vast amounts of text data for NLP tasks. BERT (Bidirectional Encoder Representations from Transformers) is one of the earliest LLM foundation models developed. Google created BERT as an open-source model in 2018.
Prominent transformer models include BERT, GPT-4, and T5. These techniques explain complex ML models and provide insights about their predictions, thus helping ML practitioners understand their models even better. These models are making an impact on industries ranging from healthcare and retail to marketing and finance.
Based on the BERT model, it has been fine-tuned on a dataset of biomedical text. But if we’ve learned anything, climate science and all the data produced by researchers could also benefit from LLMs. Part of the BERT family of models, ClimateBERT is specifically trained on climate-related text. Get your pass today!
Implementing end-to-end deep learning projects has never been easier with these awesome tools (image by Freepik). LLMs such as GPT, BERT, and Llama 2 are a game changer in AI. Here are the topics we’ll cover in this article: fine-tuning the BERT model with the Transformers library for text classification, and monitoring this app with Comet.
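As a rough sketch of the first topic, fine-tuning BERT for text classification with the Transformers Trainer API looks roughly like the following; the two-example dataset and the hyperparameters are placeholders, not the ones used in the article, and Comet experiment tracking is omitted:

```python
# Condensed fine-tuning sketch for BERT text classification (toy data,
# placeholder hyperparameters; not the article's exact setup).
from datasets import Dataset
from transformers import (AutoModelForSequenceClassification, AutoTokenizer,
                          Trainer, TrainingArguments)

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=2)

# Tiny illustrative dataset: label 1 = positive, 0 = negative.
data = Dataset.from_dict({
    "text": ["I loved this movie", "Terrible, would not recommend"],
    "label": [1, 0],
})
data = data.map(lambda ex: tokenizer(ex["text"], truncation=True,
                                     padding="max_length", max_length=64))

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="bert-clf", num_train_epochs=1,
                           per_device_train_batch_size=2),
    train_dataset=data,
)
trainer.train()
```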
Some examples of large language models include GPT (Generative Pre-trained Transformer), BERT (Bidirectional Encoder Representations from Transformers), and RoBERTa (Robustly Optimized BERT Approach). You can also get data science training on-demand wherever you are with our Ai+ Training platform.
The medical industry is exploding with data. The increasing universality of electronic health records (EHRs), the maturity of genomic data science, and the growing popularity of wearable devices and health apps have created an enormous influx of data for both practitioners and researchers. Chat with us today!
ODSC West 2024 showcased a wide range of talks and workshops from leading data science, AI, and machine learning experts. This blog highlights some of the most impactful AI slides from the world’s best data science instructors, focusing on cutting-edge advancements in AI, data modeling, and deployment strategies.
FMs can also have low explainability, making them hard to understand, adjust, or improve. The Snorkel advantage for claims processing Snorkel offers a data-centric AI framework that insurance providers can use to generate high-quality training data for ML models and create custom models to streamline claims processing.
In Part 1 (fine-tuning a BERT model), I explained what a transformer model is and the various open-source model types that are available from Hugging Face’s free transformers library. We also walked through how to fine-tune a BERT model to conduct sentiment analysis. In Part… Read the full blog for free on Medium.
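For readers who just want to see the end result, a fine-tuned sentiment model can be called in a couple of lines; the checkpoint below is a stock English sentiment model used as a stand-in, not necessarily the exact model fine-tuned in the series:

```python
# Sentiment inference with a fine-tuned checkpoint (stand-in model, not the
# one trained in the blog series).
from transformers import pipeline

classifier = pipeline("sentiment-analysis",
                      model="distilbert-base-uncased-finetuned-sst-2-english")
print(classifier("The fine-tuning walkthrough in Part 1 was easy to follow."))
```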
RoBERTa. RoBERTa (Robustly Optimized BERT Approach) is a natural language processing (NLP) model based on the BERT (Bidirectional Encoder Representations from Transformers) architecture. This refers to the fact that BERT was pre-trained on one set of tasks but fine-tuned on a different set of tasks for downstream NLP applications.
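A small practical difference worth knowing: RoBERTa uses a byte-level BPE tokenizer and a "<mask>" token rather than BERT's "[MASK]". A quick sketch (the example sentence is illustrative):

```python
# RoBERTa masked-token prediction; note the "<mask>" token instead of "[MASK]".
from transformers import pipeline

fill_mask = pipeline("fill-mask", model="roberta-base")
print(fill_mask("RoBERTa is an optimized variant of <mask>.")[0]["token_str"])
```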
In this article, we will explore ALBERT (a lightweight version of the BERT machine learning model). What is ALBERT? ALBERT (A Lite BERT) is a language model developed by Google Research in 2019. BERT, GPT-2, and XLNet are some examples of models that can be used as teacher models for ALBERT.
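One way to see what "lite" means in practice is to compare parameter counts; the exact numbers depend on the checkpoints, but albert-base-v2 is roughly an order of magnitude smaller than bert-base-uncased thanks to embedding factorization and cross-layer parameter sharing:

```python
# Compare parameter counts of an ALBERT and a BERT base checkpoint.
from transformers import AutoModel

for name in ["albert-base-v2", "bert-base-uncased"]:
    model = AutoModel.from_pretrained(name)
    print(f"{name:20s} {model.num_parameters():,} parameters")
```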
Machine Learning for Data Science and Analytics. Authors: Ansaf Salleb-Aouissi, Cliff Stein, David Blei, Itsik Pe'er, Mihalis Yannakakis, Peter Orbanz. If you ever dreamt of attending classes at Columbia University but never had the chance, this artificial intelligence course focused on ML is the next best thing.
Data teams can fine-tune LLMs like BERT and GPT-3.5. These could be risk prediction models, future earning models, or robust consumer profiles that incorporate traditional and alternative data. Lenders and credit agencies can use Snorkel to quickly and programmatically develop training data for credit scoring models.
The recommendations cover everything from data science to data analysis, programming, and general business. That means you’ll have a better understanding of all the mechanisms that make you a more effective data scientist if you read even just a few of these books. This works the other way round, too.
They design intricate sequences of prompts, leveraging their knowledge of AI, machine learning, and data science to guide powerful LLMs (large language models) towards complex tasks. Data science methodologies and skills can be leveraged to design these experiments, analyze results, and iteratively improve prompt strategies.
BERT, the first breakout large language model. In 2018, a team of researchers at Google introduced BERT (which stands for bidirectional encoder representations from transformers). Making BERT bidirectional allowed the inputs and outputs to take each other’s context into account.
Sequence models have gained traction in the past five years, and they remain a very active area of research. Even the GPT models on which ChatGPT was built, as well as BERT, are based on the Transformer, whose self-attention mechanism builds on the encoder-decoder family of sequence models.
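For reference, the self-attention operation at the core of the Transformer (from the original "Attention Is All You Need" paper) is:

```latex
\mathrm{Attention}(Q, K, V) = \mathrm{softmax}\!\left(\frac{Q K^{\top}}{\sqrt{d_k}}\right) V
```

where Q, K, and V are the query, key, and value matrices and d_k is the key dimension.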
Both of these computations have a complexity that scales with the cube of the data’s number of features. This explains the statement made at the NeurIPS 2017 Test-of-Time Award talk: "It seems easier to train a bi-directional LSTM with attention than to compute the PCA of a large matrix." — Rahimi
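As a rough back-of-the-envelope accounting of that cubic cost (one standard way to compute PCA, not the only one): for an n x p data matrix X, forming the covariance matrix and taking its eigendecomposition cost roughly

```latex
\underbrace{O(n p^{2})}_{\text{form } X^{\top} X} \;+\; \underbrace{O(p^{3})}_{\text{eigendecomposition of the } p \times p \text{ covariance}}
```

so the p^3 term dominates once the number of features is large; randomized and truncated SVD methods exist precisely to avoid it.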
Currently, there’s a lot of development in this field, from BERT to GPT-2, and these models are pre-trained on very large corpora. Editor’s Note: Heartbeat is a contributor-driven online publication and community dedicated to providing premier educational resources for data science, machine learning, and deep learning practitioners.
Hugging Face transformer models BERT, GPT-2, RoBERTa, and T5 are included in the library. BERT (Bidirectional Encoder Representations from Transformers) is one of the most popular Hugging Face transformer models; it was trained on a massive corpus of text. Next, we move on to the model inference step.
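A bare-bones version of that inference step with BERT itself might look like this (the input sentence is arbitrary):

```python
# Basic inference: tokenize, run a forward pass, inspect the hidden states.
import torch
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModel.from_pretrained("bert-base-uncased")

inputs = tokenizer("Transformers make inference straightforward.",
                   return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)

# One 768-dimensional contextual embedding per input token.
print(outputs.last_hidden_state.shape)
```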
This technique is commonly used in neural network-based models such as BERT, where it helps to handle out-of-vocabulary words. (Image: three examples of tokenization methods, from FreeCodeCamp.) Tokenization is a fundamental step in data preparation for NLP tasks.
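A quick way to see subword (WordPiece) tokenization in action is to tokenize a word that is unlikely to be in BERT's vocabulary; the exact split may vary by checkpoint, but rare words are broken into known pieces rather than mapped to an unknown token:

```python
# WordPiece tokenization: rare words are split into subword pieces ("##" marks
# a continuation piece), which is how BERT handles out-of-vocabulary words.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
print(tokenizer.tokenize("tokenization handles hyperparameterization"))
```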
The emergence of Large Language Models (LLMs) like OpenAI's GPT, Meta's Llama, and Google's BERT has ushered in a new era in this field. Interpretability and Explainability: As LLMs become more powerful, the focus on understanding model decision-making processes will intensify.
The update fixed outstanding bugs on the tracker, gave the docs a huge makeover, improved both speed and accuracy, made installation significantly easier and faster, and added some exciting new features, like ULMFiT/BERT/ELMo-style language model pretraining. Sep 24: Data science instructor Vincent returned for “Intro to NLP with spaCy #2”.
Initially introduced for Natural Language Processing (NLP) applications like translation, this type of network was used in both Google’s BERT and OpenAI’s GPT-2 and GPT-3. ADSP is a London-based consultancy that implements end-to-end data science solutions for businesses, delivering measurable value. But at what cost?
Revolutionizing Healthcare through Data Science and Machine Learning (image by Cai Fang on Unsplash). In the digital transformation era, healthcare is experiencing a paradigm shift driven by the integration of data science, machine learning, and information technology.
are all ‘unstructured data’, and an advanced AI is required to understand the information in order to obtain our answers. This is why you would have heard of new QA technologies such as BERT, GPT-3, and ELECTRA. If not, here’s a quick summary: Hugging Face is a data science platform that helps us use pre-trained models for our purposes.
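As a small illustration of this kind of QA over unstructured text, here is a sketch using a publicly available SQuAD-fine-tuned checkpoint on Hugging Face (chosen as an example; any extractive QA model would do):

```python
# Extractive question answering over unstructured text.
from transformers import pipeline

qa = pipeline("question-answering", model="deepset/roberta-base-squad2")
context = ("BERT is an open-source language model released by Google in 2018. "
           "It is pre-trained with a masked language modeling objective.")
print(qa(question="Who released BERT?", context=context))
```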
We compare the existing solutions and explain how they work behind the scenes. In this article we will focus solely on AI agents capable of solving software engineering and data science problems. A comprehensive review of the state of the art in terms of code-writing agents.
Editorially independent, Heartbeat is sponsored and published by Comet, an MLOps platform that enables data scientists & ML teams to track, compare, explain, & optimize their experiments. We’re committed to supporting and inspiring developers and engineers from all walks of life.