AI Research, Inference Engine and Large Language Models

Deci Introduces DeciCoder: An Open-Source 1B-Parameter Large Language Model For Code Generation

Marktechpost

SEPTEMBER 1, 2023

With the advent of increasingly complex models, the demand for accurate code generation has surged, but so have concerns about energy consumption and operational costs. Existing code generation models have grappled with the delicate balance between accuracy and efficiency. The implications of this development are profound.

Large Language Models

Large Language Models Inference Engine LLM Automation

Salesforce AI Introduces ReGenesis: A Novel AI Approach to Improving Large Language Model Reasoning Capabilities

Marktechpost

OCTOBER 18, 2024

Large language models (LLMs) have revolutionized how machines process and generate human language, but their ability to reason effectively across diverse tasks remains a significant challenge. In response to these limitations, researchers from Salesforce AI Research introduced a novel method called ReGenesis.

Large Language Models

Large Language Models Inference Engine AI AI

Meet PowerInfer: A Fast Large Language Model (LLM) on a Single Consumer-Grade GPU that Speeds up Machine Learning Model Inference By 11 Times

Marktechpost

DECEMBER 23, 2023

Generative Large Language Models (LLMs) are well known for their remarkable performance in a variety of tasks, including complex Natural Language Processing (NLP), creative writing, question answering, and code generation.

Large Language Models

Large Language Models Machine Learning LLM Natural Language Processing

Deci Introduces DeciCoder: An Open-Source 1B-Parameter Large Language Model For Code Generation

Marktechpost

AUGUST 25, 2023

With the advent of increasingly complex models, the demand for accurate code generation has surged, but so have concerns about energy consumption and operational costs. Existing code generation models have grappled with the delicate balance between accuracy and efficiency. The implications of this development are profound.

Large Language Models

Large Language Models Inference Engine LLM Automation

Salesforce AI Research Propose Programmatic VLM Evaluation (PROVE): A New Benchmarking Paradigm for Evaluating VLM Responses to Open-Ended Queries

Marktechpost

OCTOBER 24, 2024

Researchers from Salesforce AI Research have proposed Programmatic VLM Evaluation (PROVE), a new benchmarking paradigm that evaluates VLM responses to open-ended visual queries.

AI Research

AI Research AI Researcher Inference Engine Large Language Models

SGLang: An Open-Source Inference Engine Transforming LLM Deployment through CPU Scheduling, Cache-Aware Load Balancing, and Rapid Structured Output Generation

Marktechpost

FEBRUARY 21, 2025

SGLang is an open-source inference engine designed by the SGLang team to address these challenges. It optimizes CPU and GPU resources during inference, achieving significantly higher throughput than many competitive solutions.

Inference Engine

Inference Engine LLM Large Language Models Metadata

Google AI Researchers Propose ‘MODEL SWARMS’: A Collaborative Search Algorithm to Flexibly Adapt Diverse LLM Experts to Wide-Ranging Purposes

Marktechpost

OCTOBER 17, 2024

There is a need for flexible and efficient adaptation of large language models (LLMs) to various tasks. Existing approaches, such as mixture-of-experts (MoE) and model arithmetic, struggle with requiring substantial tuning data, inflexible model composition, or strong assumptions about how models should be used.

LLM

LLM Algorithm AI Researcher AI Research

Google AI Research Introduces Process Advantage Verifiers: A Novel Machine Learning Approach to Improving LLM Reasoning Capabilities

Marktechpost

OCTOBER 15, 2024

Large language models (LLMs) have become crucial in natural language processing, particularly for solving complex reasoning tasks. These models are designed to handle mathematical problem-solving, decision-making, and multi-step logical deductions.

Machine Learning

Machine Learning LLM AI Research AI Researcher

This AI Research from Cohere for AI Compares Merging vs Data Mixing as a Recipe for Building High-Performant Aligned LLMs

Marktechpost

OCTOBER 20, 2024

Large language models (LLMs) have revolutionized the field of artificial intelligence by performing a wide range of tasks across different domains. These models are expected to work seamlessly in multiple languages, solving complex problems while ensuring safety.

AI Research

AI Research AI Researcher Inference Engine AI

Allen Institute for AI Released olmOCR: A High-Performance Open Source Toolkit Designed to Convert PDFs and Document Images into Clean and Structured Plain Text

Marktechpost

FEBRUARY 26, 2025

It outperforms traditional OCR tools in structured data recognition and large-scale processing and has the highest ELO score in human evaluations. It also improves language model training, increasing accuracy by 1.3 percentage points on AI benchmark datasets such as ARC Challenge and DROP.

Metadata

Metadata Inference Engine Deep Learning AI

Google AI Researchers Introduced a Set of New Methods for Enhancing Long-Context LLM Performance in Retrieval-Augmented Generation

Marktechpost

OCTOBER 16, 2024

Large language models (LLMs) have revolutionized various fields by enabling more effective data processing, complex problem-solving, and natural language understanding.

LLM

LLM AI Research AI Researcher Inference Engine

Meta AI Researchers Introduce Token-Level Detective Reward Model (TLDR) to Provide Fine-Grained Annotations for Large Vision Language Models

Marktechpost

OCTOBER 26, 2024

The training data is enhanced through a systematic perturbation process using large language models, specifically targeting eight key taxonomies: spatial relationships, visual attributes, attribute binding, object identification, counting, small object detection, text OCR, and counterfactual scenarios.

AI Research

AI Research AI Researcher Data Scarcity Inference Engine

Meta AI Releases Meta Lingua: A Minimal and Fast LLM Training and Inference Library for Research

Marktechpost

OCTOBER 18, 2024

Meta Lingua’s importance lies in its ability to simplify the experimentation process for NLP researchers. In an era where large language models are at the forefront of AI research, having access to a robust yet simple-to-use tool can make all the difference.

LLM

LLM NLP Inference Engine Large Language Models

CMU Researchers Introduce ReLM: An AI System For Validating And Querying LLMs Using Standard Regular Expressions

Marktechpost

JUNE 8, 2023

There are rising worries about the potential negative impacts of large language models (LLMs), such as data memorization, bias, and unsuitable language, despite LLMs’ widespread praise for their capacity to generate natural-sounding text.

Large Language Models

Large Language Models LLM Inference Engine AI

Meta AI Releases New Quantized Versions of Llama 3.2 (1B & 3B): Delivering Up To 2-4x Increases in Inference Speed and 56% Reduction in Model Size

Marktechpost

OCTOBER 24, 2024

The rapid growth of large language models (LLMs) has brought significant advancements across various sectors, but it has also presented considerable challenges. High energy costs, lengthy training times, and the need for expensive hardware are barriers to accessibility for many organizations and researchers.

Large Language Models

Large Language Models NLP Natural Language Processing Inference Engine

MIND (Math Informed syNthetic Dialogue): How Structured Synthetic Data Improves the Mathematical and Logical Capabilities of AI-Powered Language Models

Marktechpost

OCTOBER 21, 2024

Large language models (LLMs) can understand and generate human-like text across various applications. This research field is evolving rapidly as AI researchers explore new methods to enhance LLMs’ capabilities in handling advanced reasoning tasks, particularly in mathematics.

Large Language Models

Large Language Models Inference Engine AI AI

Agent-as-a-Judge: An Advanced AI Framework for Scalable and Accurate Evaluation of AI Systems Through Continuous Feedback and Human-level Judgments

Marktechpost

OCTOBER 18, 2024

The lack of effective evaluation methods poses a serious problem for AI research and development. Current evaluation frameworks, such as LLM-as-a-Judge, which uses large language models to judge outputs from other AI systems, must account for the entire task-solving process.

Large Language Models

Large Language Models LLM AI Development AI Developer

DIFFUSEARCH: Revolutionizing Chess AI with Implicit Search and Discrete Diffusion Modeling

Marktechpost

OCTOBER 21, 2024

Large Language Models (LLMs) have gained significant attention in AI research due to their impressive capabilities. However, their limitation lies with long-term planning and complex problem-solving.

Neural Network

Neural Network Inference Engine Large Language Models AI

Controllable Safety Alignment (CoSA): An AI Framework Designed to Adapt Models to Diverse Safety Requirements without Re-Training

Marktechpost

OCTOBER 21, 2024

As large language models (LLMs) become increasingly capable, their safety has become a critical research topic. To create a safe model, model providers usually pre-define a policy or a set of rules. The adapted strategy first produces an LLM that is easily controllable for safety.

Large Language Models

Large Language Models Inference Engine LLM AI

Setting Up a Training, Fine-Tuning, and Inferencing of LLMs with NVIDIA GPUs and CUDA

Unite.AI

JUNE 21, 2024

According to NVIDIA's benchmarks, TensorRT can provide up to 8x faster inference performance and 5x lower total cost of ownership compared to CPU-based inference for large language models like GPT-3. Accelerating LLM Training with GPUs and CUDA.

Deep Learning

Deep Learning Neural Network Convolutional Neural Networks Large Language Models

TOPS of the Class: Decoding AI Performance on RTX AI PCs and Workstations

NVIDIA

JUNE 12, 2024

This is the kind of horsepower needed to handle AI-assisted digital content creation, AI super resolution in PC gaming, generating images from text or video, querying local large language models (LLMs) and more. LLM performance is measured in the number of tokens generated by the model.

LLM

LLM Generative AI AI AI

Lin Qiao, CEO & Co-Founder of Fireworks AI – Interview Series

Unite.AI

APRIL 24, 2024

Lin Qiao was formerly head of Meta's PyTorch and is the Co-Founder and CEO of Fireworks AI. Fireworks AI is a production AI platform built for developers; Fireworks partners with the world's leading generative AI researchers to serve the best models at the fastest speeds.

AI

AI AI OpenAI Inference Engine

Winners of the Essay competition on the Automation of Wisdom and Philosophy

AI Impacts

OCTOBER 28, 2024

The result of using these methods and technologies would be an AI-powered inference engine we can query to see the rational support, empirical or otherwise, of key premises to arguments that bear on important practical decisions.

Automation

Automation Explainability AI AI
