Natural Language Processing (NLP) is a rapidly growing field that deals with the interaction between computers and human language. Transformers is a state-of-the-art library developed by Hugging Face that provides pre-trained models and tools for a wide range of natural language processing (NLP) tasks.
Knowledge-intensive Natural Language Processing (NLP) involves tasks requiring deep understanding and manipulation of extensive factual information. Researchers from Facebook AI Research, University College London, and New York University introduced Retrieval-Augmented Generation (RAG) models to address the limitations of purely parametric models on such tasks.
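The retrieve-then-generate idea behind RAG can be sketched in a few lines. This is a toy stand-in, not the actual RAG model: real RAG uses a dense retriever and a seq2seq generator, whereas here `retrieve` is naive term overlap and `generate` is a placeholder that just shows how retrieved passages condition the output.

```python
# Minimal retrieve-then-generate sketch (toy stand-ins, not the actual RAG model).

def retrieve(query, corpus, k=2):
    """Rank documents by naive term overlap with the query and keep the top k."""
    q_terms = set(query.lower().split())
    scored = sorted(corpus, key=lambda d: -len(q_terms & set(d.lower().split())))
    return scored[:k]

def generate(query, passages):
    """Stand-in for a seq2seq generator conditioned on retrieved passages."""
    context = " ".join(passages)
    return f"Answer to '{query}' grounded in: {context}"

corpus = [
    "RAG combines a retriever with a seq2seq generator.",
    "Paris is the capital of France.",
    "Quantization reduces model precision.",
]
passages = retrieve("what is the capital of France", corpus)
print(generate("what is the capital of France", passages))
```

The design point survives the simplification: the generator never has to memorize the fact, it only has to read it from the retrieved context.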
A model’s capacity to generalize, or effectively apply its learned knowledge to new contexts, is essential to the ongoing success of Natural Language Processing (NLP). Having the taxonomy in place makes it easier to get good generalizations, which further fosters the growth of Natural Language Processing.
The machine learning community faces a significant challenge in audio and music applications: the lack of a diverse, open, and large-scale dataset that researchers can freely access for developing foundation models. The dataset introduced in this work gives researchers worldwide access to comprehensive data, free from licensing fees or restricted access.
Natural Language Processing (NLP) is useful in many fields, bringing about transformative changes in communication, information processing, and decision-making.
In particular, instances of irreproducible findings, such as in a review of 62 studies diagnosing COVID-19 with AI, emphasize the necessity to reevaluate practices and highlight the significance of transparency. Multiple factors contribute to the reproducibility crisis in AI research.
The rise of large language models (LLMs) has transformed natural language processing, but training these models comes with significant challenges. Llama-3.1-405B, for instance, has 405 billion parameters.
In the ever-evolving field of Natural Language Processing (NLP), the development of machine translation and language models has been primarily driven by the availability of vast training datasets in languages like English.
The emergence of Large Language Models (LLMs) in natural language processing represents a groundbreaking development. Ultimately, the team hopes to empower researchers and developers to harness the potential of long-context LLMs for a wide array of applications, ushering in a new era of natural language processing.
The well-known Large Language Models (LLMs) like GPT, BERT, PaLM, and LLaMA have brought about great advancements in Natural Language Processing (NLP) and Natural Language Generation (NLG).
Machine learning (ML) is a powerful technology that can solve complex problems and deliver customer value. However, ML models are challenging to develop and deploy. This is why Machine Learning Operations (MLOps) has emerged as a paradigm to offer scalable and measurable values to Artificial Intelligence (AI) driven businesses.
In the ever-evolving landscape of Natural Language Processing (NLP) and Artificial Intelligence (AI), Large Language Models (LLMs) have emerged as powerful tools, demonstrating remarkable capabilities in various NLP tasks.
Large Language Models (LLMs) have advanced significantly in natural language processing, yet reasoning remains a persistent challenge. DeepSeek AI Research presents CODEI/O, an approach that converts code-based reasoning into natural language.
Central to Natural Language Processing (NLP) advancements are large language models (LLMs), which have set new benchmarks for what machines can achieve in understanding and generating human language.
theguardian.com: Sarah Silverman sues OpenAI and Meta, claiming AI training infringed copyright. The US comedian and author Sarah Silverman is suing the ChatGPT developer OpenAI and Mark Zuckerberg’s Meta for copyright infringement over claims that their artificial intelligence models were trained on her work without permission.
By reimagining the architecture of these models and integrating innovative techniques for efficient parameter use, the research team has achieved remarkable performance gains and broadened the horizon for the deployment of LLMs.
The efficiency of Large Language Models (LLMs) is a focal point for researchers in AI. A groundbreaking study by Qualcomm AI Research introduces a method known as GPTVQ, which leverages vector quantization (VQ) to significantly improve the size-accuracy trade-off in neural network quantization.
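The core of vector quantization is easy to see in miniature: groups of weights are replaced by the index of the nearest codeword in a shared codebook, so storage drops from several floats to one small integer per group. The sketch below is purely illustrative and is not Qualcomm's GPTVQ algorithm; the codebook, group size, and weight values are made up for the example.

```python
# Toy vector quantization of weight groups (illustrative; not the GPTVQ algorithm).

def nearest(codebook, vec):
    """Index of the codeword closest to vec in squared Euclidean distance."""
    def dist(c):
        return sum((a - b) ** 2 for a, b in zip(c, vec))
    return min(range(len(codebook)), key=lambda i: dist(codebook[i]))

def vq_compress(weights, codebook, dim=2):
    """Split a flat weight list into dim-sized groups; store one index per group."""
    groups = [weights[i:i + dim] for i in range(0, len(weights), dim)]
    return [nearest(codebook, g) for g in groups]

def vq_decompress(indices, codebook):
    """Reconstruct approximate weights by looking indices back up in the codebook."""
    return [w for i in indices for w in codebook[i]]

codebook = [(0.0, 0.0), (0.5, 0.5), (1.0, 1.0)]   # shared 2-D codewords
weights = [0.1, -0.05, 0.48, 0.52, 0.9, 1.1]
idx = vq_compress(weights, codebook)
print(idx)
print(vq_decompress(idx, codebook))
```

The size-accuracy trade-off mentioned above lives in the codebook: more (or higher-dimensional) codewords mean better reconstruction but less compression.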
When it comes to downstream natural language processing (NLP) tasks, large language models (LLMs) have proven to be exceptionally effective. Their text comprehension and generation abilities make them extremely flexible for use in a wide range of NLP applications.
Large language models (LLMs) such as ChatGPT and Llama have garnered substantial attention due to their exceptional natural language processing capabilities, enabling various applications ranging from text generation to code completion.
LG AI Research has recently announced the release of EXAONE 3.0, a 7.8B-parameter large language model released as open source with strong results. With this release, LG AI Research is driving a new development direction, making it competitive with the latest technology trends.
Large Language Models (LLMs), the latest innovation in Artificial Intelligence (AI), use deep learning techniques to produce human-like text and perform various Natural Language Processing (NLP) and Natural Language Generation (NLG) tasks.
Currently existing techniques for instruction tuning frequently rely on Natural Language Processing (NLP) datasets, which are scarce, or on self-instruct approaches that produce artificial datasets that struggle with diversity.
Mixture of Experts (MoE) models are becoming critical in advancing AI, particularly in natural language processing. MoE architectures differ from traditional dense models by selectively activating subsets of specialized expert networks for each input.
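The selective activation described above comes down to a learned gate scoring the experts and running only the top-scoring one(s). Here is a minimal top-1 routing sketch; the gating matrix, the two toy experts, and the input features are invented for illustration and do not correspond to any specific MoE implementation.

```python
# Minimal top-1 expert routing sketch (illustrative, not a specific MoE library).
import math

def softmax(xs):
    """Numerically stable softmax over a list of scores."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def route(token_features, gate_weights, experts):
    """Gate scores pick one expert per token; only that expert is executed."""
    scores = [sum(w * x for w, x in zip(row, token_features)) for row in gate_weights]
    probs = softmax(scores)
    best = max(range(len(probs)), key=lambda i: probs[i])
    return best, experts[best](token_features)

experts = [
    lambda x: [v * 2 for v in x],   # "doubling" expert
    lambda x: [v + 1 for v in x],   # "shifting" expert
]
gate = [[1.0, 0.0], [0.0, 1.0]]     # toy gating matrix
chosen, out = route([3.0, 0.5], gate, experts)
print(chosen, out)
```

This is why MoE models can grow total parameter count without growing per-token compute: only the chosen expert's parameters are touched for each input.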
The recently popularized Transformer architecture has become the standard method for Natural Language Processing (NLP) tasks, particularly Machine Translation (MT).
Natural language processing (NLP) in artificial intelligence focuses on enabling machines to understand and generate human language. This field encompasses a variety of tasks, including language translation, sentiment analysis, and text summarization.
Task-agnostic model pre-training is now the norm in Natural Language Processing, driven by the recent revolution in large language models (LLMs) like ChatGPT. These models showcase proficiency in tackling intricate reasoning tasks, adhering to instructions, and serving as the backbone for widely used AI assistants.
In this post, we dive into how organizations can use Amazon SageMaker AI, a fully managed service that allows you to build, train, and deploy ML models at scale, to build AI agents using CrewAI, a popular agentic framework, and open source models like DeepSeek-R1.
These findings highlight the potential for continued advancements in natural language processing and its application to problem-solving. Future research directions include evaluating the MCMC-EM fine-tuning technique on diverse tasks and datasets to assess its generalizability.
The performance of large language models (LLMs) has been impressive across many different natural language processing (NLP) applications.
Natural language processing is advancing rapidly, focusing on optimizing large language models (LLMs) for specific tasks. These models, often containing billions of parameters, pose a significant challenge in customization.
Though it has always played an essential part in natural language processing, textual data processing now sees new uses in the field. Multiple teams working on different natural language processing (NLP) tasks have already used Unitxt as a core utility for LLMs at IBM.
Encoder models like BERT and RoBERTa have long been cornerstones of natural language processing (NLP), powering tasks such as text classification, retrieval, and toxicity detection.
Top 10 AI Research Papers of 2023. 1. Sparks of AGI, by Microsoft. In this research paper, a team from Microsoft Research analyzes an early version of OpenAI’s GPT-4, which was still under active development at the time.
Generative models have emerged as transformative tools across various domains, including computer vision and natural language processing, by learning data distributions and generating samples from them. Among these models, Diffusion Models (DMs) have garnered attention for their ability to produce high-quality images.
The ascent of large language models (LLMs) has redefined natural language processing. Quantization, the process of reducing model weights and activations to lower bit precision, is crucial for deploying models on resource-constrained devices.
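The basic mechanics of reducing weights to lower bit precision can be shown with symmetric per-tensor 8-bit quantization: pick one scale so the largest weight maps to 127, round each weight to an integer, and multiply back by the scale to recover an approximation. This is a minimal sketch of uniform quantization in general, not the scheme of any particular paper; the weight values are invented.

```python
# Uniform symmetric 8-bit quantization sketch (per-tensor; illustrative only).

def quantize(weights, bits=8):
    """Map floats to signed integers sharing a single scale factor."""
    qmax = 2 ** (bits - 1) - 1            # 127 for int8
    scale = max(abs(w) for w in weights) / qmax
    q = [round(w / scale) for w in weights]
    return q, scale

def dequantize(q, scale):
    """Recover approximate float weights from integers and the scale."""
    return [v * scale for v in q]

weights = [0.1, -0.4, 0.25, 0.127]
q, scale = quantize(weights)
approx = dequantize(q, scale)
print(q)
print([round(a, 3) for a in approx])
```

Each recovered weight is off by at most about one scale step, which is the precision-for-memory trade the technique makes: 8 bits stored per weight instead of 32.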
In the consumer technology sector, AI began to gain prominence with features like voice recognition and automated tasks. Over the past decade, advancements in machine learning, Natural Language Processing (NLP), and neural networks have transformed the field.
Researchers continually strive to build models that can understand, reason, and generate text like humans in the rapidly evolving field of natural language processing. These models must grapple with complex linguistic nuances, bridge language gaps, and adapt to diverse tasks.
Large language models (LLMs) have become crucial in natural language processing, particularly for solving complex reasoning tasks. However, while LLMs can process and generate responses based on vast amounts of data, improving their reasoning capabilities is an ongoing challenge.
The demand for powerful and versatile language models has become more pressing in natural language processing and artificial intelligence. However, building language models that can excel in various language tasks remains a complex challenge.
ProGen’s underlying methodology involves a next-token prediction mechanism similar to the predictive algorithms utilized in natural language processing.
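Next-token prediction itself is simple to demonstrate: a model learns which token tends to follow which, then generates by repeatedly emitting its best guess. The sketch below uses a count-based bigram model over invented toy sequences; ProGen is a far larger neural model trained on protein sequences, so this only illustrates the prediction loop, not the model.

```python
# Toy next-token predictor (a bigram count model; illustrative, not ProGen itself).
from collections import Counter, defaultdict

def train_bigram(sequences):
    """Count which token follows which across the training sequences."""
    counts = defaultdict(Counter)
    for seq in sequences:
        for a, b in zip(seq, seq[1:]):
            counts[a][b] += 1
    return counts

def predict_next(counts, token):
    """Greedy next-token prediction: the most frequent successor."""
    return counts[token].most_common(1)[0][0]

def generate(counts, start, length):
    """Emit tokens one at a time, each conditioned on the previous one."""
    out = [start]
    for _ in range(length - 1):
        out.append(predict_next(counts, out[-1]))
    return "".join(out)

# Invented toy sequences standing in for training data.
model = train_bigram(["MAGMA", "MAGIC", "AGMA"])
print(predict_next(model, "M"))
print(generate(model, "M", 4))
```

Swapping characters for amino-acid tokens and the count table for a neural network gives the general shape of sequence models like ProGen.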
These models have revolutionized the field of natural language processing and are being increasingly utilized across various domains.
The exploration of natural language processing has been revolutionized with the advent of LLMs like GPT. These models showcase exceptional language comprehension and generation abilities but encounter significant hurdles.
Large language models (LLMs) have made tremendous strides in the last several months, crushing state-of-the-art benchmarks in many different areas. There has been a meteoric rise in people using and researching LLMs, particularly in Natural Language Processing (NLP).
A number of research efforts, including those by OpenAI and Google, have emphasized these developments. LLMs have revolutionized the way humans interact with machines and are one of the greatest advancements in the field of Artificial Intelligence (AI).