In this tutorial, we will build an efficient Legal AI Chatbot using open-source tools, with a step-by-step guide covering the bigscience/T0pp LLM, Hugging Face Transformers, and PyTorch.
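As a taste of the preprocessing step, here is a minimal, self-contained sketch that tokenizes and rejoins a sample clause before it would be handed to a model such as bigscience/T0pp. The `preprocess` helper is hypothetical, not taken from the tutorial itself:

```python
import re

def preprocess(text: str) -> str:
    """Hypothetical helper: split text into tokens, drop stray
    characters, and rejoin with single spaces."""
    tokens = re.findall(r"[A-Za-z0-9$%.,-]+", text)
    return " ".join(tokens)

sample_text = "The contract is valid for 5 years, terminating on December 31, 2025."
print(preprocess(sample_text))
```

In the full tutorial, the cleaned text would then be passed to the tokenizer of the chosen Hugging Face model.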
Researchers from University College London, the University of Wisconsin-Madison, the University of Oxford, Meta, and other institutes have introduced a new framework and benchmark for evaluating and developing LLM agents in AI research. It comprises four key components: Agents, Environment, Datasets, and Tasks.
A critical challenge in multilingual NLP is the uneven distribution of linguistic resources. Addressing it requires innovative approaches to training and optimizing multilingual LLMs to deliver consistent performance across languages with varying resource availability. In the reported evaluations, Babel-83B set a new benchmark at 73.2.
LLMs are deep neural networks that can generate natural language texts for various purposes, such as answering questions, summarizing documents, or writing code. LLMs such as GPT-4, BERT, and T5 are very powerful and versatile in Natural Language Processing (NLP). However, LLMs are also very different from other models.
Although NLP models have demonstrated extraordinary strengths, they still face challenges. Researchers from Microsoft describe the Collaborative Development of NLP Models (CoDev) in this study. The LLM is directed to provide instances where the local and global models conflict.
DeepSeek-R1 is an advanced LLM developed by the AI startup DeepSeek. You must have access to the Hugging Face Hub's deepseek-ai/DeepSeek-R1-Distill-Llama-8B model weights from your environment. The code used in this post is available in the following GitHub repo.
The performance of large language models (LLMs) has been impressive across many different natural language processing (NLP) applications. In recent studies, LLMs have been proposed as task-specific training data generators to reduce the necessity of task-specific data and annotations, especially for text classification.
In the ever-evolving landscape of Natural Language Processing (NLP) and Artificial Intelligence (AI), Large Language Models (LLMs) have emerged as powerful tools, demonstrating remarkable capabilities in various NLP tasks. Within the field of IT, the importance of NLP and LLM technologies is on the rise.
Central to Natural Language Processing (NLP) advancements are large language models (LLMs), which have set new benchmarks for what machines can achieve in understanding and generating human language. One of the primary challenges in NLP is the computational demand for autoregressive decoding in LLMs.
The emerging discipline of clinical natural language processing (NLP) encompasses medical data extraction, analysis, and interpretation from unstructured clinical literature. Despite its importance, particular difficulties arise when developing methodologies for clinical NLP.
The shift across John Snow Labs' product suite has resulted in several notable company milestones over the past year, including 82 million downloads of the open-source Spark NLP library. The no-code NLP Lab platform has experienced 5x growth by teams training, tuning, and publishing AI models.
Also, in place of expensive retraining or fine-tuning of an LLM, this approach allows for quick data updates at low cost; see "Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks" by Patrick Lewis et al. The idea is to convert an incoming prompt to a graph query, then use the result set to select chunks for the LLM.
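The prompt-to-graph-query step can be sketched as follows. The graph, chunk store, and helper names below are hypothetical toy stand-ins for illustration, not a real graph-retrieval implementation:

```python
from typing import Dict, List

# Toy knowledge graph: entity -> related chunk ids (illustrative data).
GRAPH: Dict[str, List[str]] = {
    "contract": ["chunk-1", "chunk-3"],
    "termination": ["chunk-3", "chunk-7"],
}
CHUNKS = {
    "chunk-1": "A contract is a legally binding agreement.",
    "chunk-3": "Termination clauses define when a contract ends.",
    "chunk-7": "Notice periods apply before termination.",
}

def prompt_to_query(prompt: str) -> List[str]:
    """Naive 'graph query': keep prompt words that are graph nodes."""
    return [w for w in prompt.lower().split() if w in GRAPH]

def select_chunks(prompt: str) -> List[str]:
    """Resolve matched entities to chunk texts, in order, without duplicates."""
    seen, out = set(), []
    for entity in prompt_to_query(prompt):
        for cid in GRAPH[entity]:
            if cid not in seen:
                seen.add(cid)
                out.append(CHUNKS[cid])
    return out

print(select_chunks("When does termination occur?"))
```

The selected chunks would then be placed into the LLM's context window alongside the original prompt.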
The Microsoft AI London outpost will focus on advancing state-of-the-art language models, supporting infrastructure, and tooling for foundation models (techcrunch.com). Applied use cases: Can AI Find Its Way Into Accounts Payable? No legacy process is safe.
Lately, large language models (LLMs) have been excelling in NLP and multimodal tasks but face two significant challenges: high computational costs and difficulties in conducting fair evaluations. These costs limit LLM development to a few major players, restricting research and applications.
Large Language Models (LLMs) have driven remarkable advancements across various Natural Language Processing (NLP) tasks. The progression in this field continues to transform how machines comprehend and process language, opening new avenues for research and development.
It is a General AI Assistant that focuses on real-world questions, avoiding LLM evaluation pitfalls. With human-crafted questions that reflect AI assistant use cases, GAIA ensures practicality. By targeting open-ended generation in NLP, GAIA aims to redefine evaluation benchmarks and advance the next generation of AI systems.
Natural language processing (NLP) has seen a paradigm shift in recent years with the advent of Large Language Models (LLMs) that outperform formerly relatively small Language Models (LMs), like GPT-2 and T5 (Raffel et al.), on a variety of NLP tasks. Figure 1 depicts a sample of the summarization task.
Particularly after reinforcement learning from human feedback, the intrinsic confidence score from generative LLMs is often unavailable or poorly calibrated with respect to the intended aim. Heuristic techniques, such as sampling an ensemble of LLM answers, are costly to compute and subject to bias from the LLM itself.
These Natural Language Processing (NLP) based models handle large and complicated datasets, which poses a unique challenge in the finance industry. They are drawn from both self-constructed and publicly available NLP datasets. The researchers have conducted multiple assessment benchmarks for evaluating DISC-FinLLM.
Generative Large Language Models (LLMs) are well known for their remarkable performance in a variety of tasks, including complex Natural Language Processing (NLP), creative writing, question answering, and code generation. Two current strategies to deal with these memory problems are offloading and model compression.
Transformer architectures have revolutionized Natural Language Processing (NLP), enabling significant language understanding and generation progress. However, the efficiency of LLMs in real-world deployment remains a challenge due to their substantial resource demands, particularly in tasks requiring sequential token generation.
Unlike earlier methods, it aligns task-specific requirements with a systematic optimization process, offering an efficient and scalable solution for diverse NLP applications. During the generation phase, the system uses LLMs to create multiple variations of a base prompt by applying cognitive heuristics.
Text embeddings (TEs) are low-dimensional vector representations of texts of different sizes, which are important for many natural language processing (NLP) tasks. Pre-trained language models, like BERT and GPT, have shown great success in various NLP tasks.
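To make the idea concrete, here is a toy sketch of comparing texts via vector representations and cosine similarity. A real system would use a learned model such as BERT; this self-contained stand-in uses term-frequency vectors instead, so the numbers are illustrative only:

```python
import math
from collections import Counter

def embed(text: str) -> Counter:
    """Toy 'embedding': term-frequency vector over lowercase tokens."""
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse term-frequency vectors."""
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

q = embed("large language models")
d1 = embed("language models generate text")
d2 = embed("the weather is sunny")
print(cosine(q, d1) > cosine(q, d2))  # related text scores higher
```

With learned embeddings, the same cosine comparison captures semantic rather than purely lexical similarity.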
Transformer-based generative Large Language Models (LLMs) have shown considerable strength in a broad range of Natural Language Processing (NLP) tasks. For this, top AI firms like OpenAI, Google, and Baidu offer a language model-as-a-service (LMaaS) by granting access to their LLMs through APIs.
Large language models (LLMs) have made tremendous strides in the last several months, crushing state-of-the-art benchmarks in many different areas. There has been a meteoric rise in people using and researching LLMs, particularly in Natural Language Processing (NLP).
Effective methods allowing for better control, or steerability, of large-scale AI systems are currently in extremely high demand in AI research. But how does one determine how much data is needed to train an LLM? RLHF is perhaps the most popular of the current methods.
Large Language Models (LLMs), the latest innovation in Artificial Intelligence (AI), use deep learning techniques to produce human-like text and perform various Natural Language Processing (NLP) and Natural Language Generation (NLG) tasks.
Setting the Stage: Why Augmentation Matters. Imagine you're chatting with an LLM about complex topics like medical research or historical events. As we continue to push the boundaries of AI, hybrid models combining the best of CAG and RAG may well become the standard, offering unparalleled efficiency and accuracy.
Instruction tuning offers a solution: fine-tuning LLMs on instructions paired with responses that humans prefer. The input, a taxonomy, has been created with minimal human effort through LLM prompting and verification.
In implementing CUT, researchers conducted experiments in two settings: offline alignment, using pre-existing model-agnostic judgment data, and online alignment, where the model learns from judgments on its own generated responses. The results of implementing CUT were remarkable.
NVIDIA's NIM (NVIDIA Inference Microservices) is a significant leap forward in the integration of AI into modern software systems. Built for the new GeForce RTX 50 Series GPUs, NIM offers pre-built containers powered by NVIDIA's inference software, including Triton Inference Server and TensorRT-LLM.
The quest to refine AI's understanding of extensive textual data has recently been advanced by two papers from CDS PhD student Jason Phang, who is the first author of NLP papers that secured "best paper" accolades at ICML 2023 and EMNLP 2023.
The rapid development of Large Language Models (LLMs) has transformed natural language processing (NLP). Tackling these barriers is crucial for fostering trust, collaboration, and progress in the AI ecosystem. It is a valuable tool for researchers, developers, and businesses seeking flexible and high-performing solutions.
In natural language processing (NLP), researchers constantly strive to enhance language models’ capabilities, which play a crucial role in text generation, translation, and sentiment analysis. Researchers can now assess their models more confidently, knowing they have a comprehensive and accessible tool.
They divide an LLM’s capacity for in-context learning into two components: the acquisition of effective task representations and the execution of probabilistic inference, or reasoning, over these representations. Is the gap caused by a lack of information in the representations or by the LLMs’ inability to analyze them?
Modern LLM training frameworks demand a large amount of data to achieve state-of-the-art performance. Because of this, analyzing textual data for LLMs is becoming more complicated: it involves several non-trivial design decisions and characteristics, which make it more difficult to keep LLM research flexible and reproducible.
They considered all projects that fit these criteria: created eight months ago or less (approximately November 2022 to June 2023, at the time of the paper's publication); related to the topics LLM, ChatGPT, Open-AI, GPT-3.5, or GPT-4; and having at least 3,000 stars on GitHub.
Microsoft AI Research has recently introduced a new framework called Automatic Prompt Optimization (APO) to significantly improve the performance of large language models (LLMs). This framework is designed to help users create better prompts with minimal manual intervention and to optimize prompt engineering for better results.
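Conceptually, automatic prompt optimization searches over candidate prompts and keeps the best-scoring one on a small labeled set. The sketch below is an illustrative toy loop with a hypothetical `fake_llm` stand-in, not Microsoft's actual APO algorithm:

```python
from typing import List, Tuple

# Tiny labeled set: (question, expected answer) pairs (illustrative data).
EXAMPLES: List[Tuple[str, str]] = [
    ("2+2", "4"),
    ("3+5", "8"),
]

def fake_llm(prompt: str, question: str) -> str:
    """Stand-in model: returns a bare number only when the prompt asks
    for one; otherwise it replies verbosely."""
    answer = str(eval(question))  # toy arithmetic questions only
    return answer if "number only" in prompt else f"The answer is {answer}."

def score(prompt: str) -> float:
    """Fraction of examples the model answers exactly under this prompt."""
    hits = sum(fake_llm(prompt, q) == a for q, a in EXAMPLES)
    return hits / len(EXAMPLES)

candidates = [
    "Answer the question.",
    "Answer with the number only.",
]
best = max(candidates, key=score)
print(best)
```

A real system would generate candidate edits with the LLM itself instead of a fixed list, but the score-and-select loop is the same shape.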
This year, a paper presented at the Association for Computational Linguistics (ACL) meeting delves into the importance of model scale for in-context learning and examines the interpretability of LLM architectures. The study focuses on the OPT-66B model, a 66-billion-parameter LLM developed by Meta as an open replica of GPT-3.
Encoder models like BERT and RoBERTa have long been cornerstones of natural language processing (NLP), powering tasks such as text classification, retrieval, and toxicity detection. In conclusion, NeoBERT represents a paradigm shift for encoder models, bridging the gap between stagnant architectures and modern LLM advancements.
Top LLM Research Papers 2023. 1. LLaMA by Meta AI. Summary: The Meta AI team asserts that smaller models trained on more tokens are easier to retrain and fine-tune for specific product applications. The instruction tuning involves fine-tuning the Q-Former while keeping the image encoder and LLM frozen.