Large Language Models (LLMs) have shown remarkable capabilities across diverse natural language processing tasks, from text generation to contextual reasoning. However, their efficiency is often hampered by the quadratic complexity of the self-attention mechanism.
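To make the quadratic-complexity point concrete, here is a minimal sketch of single-head self-attention with random toy matrices: scoring every token against every other token materializes an n-by-n matrix, so cost grows as O(n^2) in sequence length. The shapes are illustrative; real models add learned projections, multiple heads, and masking.

```python
import numpy as np

n, d = 512, 64                    # sequence length, head dimension
Q = np.random.randn(n, d)         # queries
K = np.random.randn(n, d)         # keys
V = np.random.randn(n, d)         # values

scores = Q @ K.T / np.sqrt(d)     # (n, n): the quadratic bottleneck
weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
weights /= weights.sum(axis=-1, keepdims=True)   # row-wise softmax
out = weights @ V                 # (n, d) attended representations
print(scores.shape)               # (512, 512) -> n**2 entries
```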
As we approach a new year filled with potential, the landscape of technology, particularly artificial intelligence (AI) and machine learning (ML), is on the brink of significant transformation.
The post LogLLM: Leveraging Large Language Models for Enhanced Log-Based Anomaly Detection appeared first on MarkTechPost.
LLM Agents Learning Platform: a unique course focusing on leveraging large language models (LLMs) to create advanced AI agents for diverse applications.
Large Language Models (LLMs), such as the ByT5 model, offer promising potential for enhancing OCR post-correction. These models are trained on extensive text data and can understand and generate human-like language.
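As an illustration of why ByT5 suits this task, here is a minimal sketch using the Hugging Face Transformers API. The "google/byt5-small" checkpoint is real, but using it for correction assumes it has first been fine-tuned on (noisy OCR, clean text) pairs; the base checkpoint will not correct text out of the box.

```python
from transformers import AutoTokenizer, T5ForConditionalGeneration

# ByT5 tokenizes raw bytes, so character-level OCR noise does not
# explode into unknown subword tokens.
tokenizer = AutoTokenizer.from_pretrained("google/byt5-small")
model = T5ForConditionalGeneration.from_pretrained("google/byt5-small")

noisy = "Tbe qnick brown f0x jumps ovcr the lazy d0g."
inputs = tokenizer(noisy, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```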
Hebrew University researchers addressed the challenge of understanding how information flows through different layers of decoder-based large language models (LLMs). Current LLMs, such as transformer-based models, use the attention mechanism to process tokens by attending to all previous tokens in every layer.
Utilizing Large Language Models (LLMs) through different prompting strategies has become popular in recent years. Differentiating prompts in multi-turn interactions, which involve several exchanges between the user and model, is a crucial problem that remains mostly unresolved.
One of Databricks’ notable achievements is the DBRX model, which set a new standard for open large language models (LLMs). “Upon release, DBRX outperformed all other leading open models on standard benchmarks and has up to 2x faster inference than models like Llama2-70B,” Everts explains.
Large Language Models (LLMs) are a subset of artificial intelligence focusing on understanding and generating human language. These models leverage complex architectures to comprehend and produce human-like text, facilitating applications in customer service, content creation, and beyond.
Large Language Models (LLMs) have revolutionized natural language processing, demonstrating remarkable capabilities in various applications. Fine-tuning techniques enhance Large Language Models’ performance for specific tasks.
Proposed frameworks for RAG-based large language models (LLMs) omitted crucial training components. Novel approaches, such as treating LLM prompting as a programming language, emerged but introduced complexity. Evaluation methodologies using synthetic data and LLM critics were developed to assess RAG performance.
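For readers new to the pattern being evaluated here, a minimal retrieval-augmented generation (RAG) sketch looks like the following: retrieve relevant passages, then condition the model on them. The toy bag-of-words scoring, the example corpus, and the final prompt hand-off are assumptions for illustration; production systems use dense embeddings and a real model client.

```python
import math
from collections import Counter

corpus = [
    "RAG grounds model answers in retrieved documents.",
    "ByT5 operates on raw bytes rather than subword tokens.",
    "Self-attention cost grows quadratically with sequence length.",
]

def embed(text: str) -> Counter:
    return Counter(text.lower().split())   # toy sparse "embedding"

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * \
           math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

def retrieve(query: str, k: int = 2) -> list[str]:
    q = embed(query)
    return sorted(corpus, key=lambda doc: cosine(q, embed(doc)), reverse=True)[:k]

query = "How do I ground LLM answers in documents?"
context = "\n".join(retrieve(query))
prompt = f"Answer using only this context:\n{context}\n\nQuestion: {query}"
print(prompt)   # pass to the LLM client of your choice
```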
Prior research on Large Language Models (LLMs) demonstrated significant advancements in fluency and accuracy across various tasks, influencing sectors like healthcare and education. This progress sparked investigations into LLMs’ language understanding capabilities and associated risks.
Introducing Lagent, a new open-source framework that simplifies the process of building large language model (LLM)-based agents. Lagent stands out by offering a lightweight and flexible solution that supports various models and provides tools to enhance the capabilities of LLMs.
Until recently, existing large language models (LLMs) have lacked the precision, reliability, and domain-specific knowledge required to effectively support defense and security operations. By leveraging sophisticated models fine-tuned for defense-related applications, this collaboration is poised to provide the U.S.
This innovative approach combines traditional symbolic regression with large language models (LLMs) to introduce a new layer of efficiency and accuracy. A key finding of the LASR method was its ability to discover novel scaling laws for large language models, a crucial aspect in improving LLM performance.
Large language models (LLMs) like GPT-4, PaLM, Bard, and Copilot have made a huge impact in natural language processing (NLP). These models require vast computational resources, making them expensive to train and deploy.
Large Language Models (LLMs) have demonstrated remarkable capabilities in various natural language processing tasks. However, they face a significant challenge: hallucinations, where the models generate responses that are not grounded in the source material.
Agent-based modeling (ABM) emerged to overcome these limitations, progressing from rule-based to machine learning-based agents. The advent of Large Language Models (LLMs) has enabled the creation of autonomous agents for social simulations.
Artificial Intelligence (AI) and Machine Learning (ML) are rapidly advancing fields that have significantly impacted various industries. This innovative framework leverages large language models (LLMs) to generate plans and guide agents through complex tasks.
Large Language Models (LLMs) have demonstrated impressive capabilities in handling knowledge-intensive tasks through their parametric knowledge stored within model parameters.
Large language models (LLMs) have become a pivotal part of artificial intelligence, enabling systems to understand, generate, and respond to human language. These models are used across various domains, including natural language reasoning, code generation, and problem-solving.
Zhipu AI recently released GLM-4-Voice, an open-source end-to-end speech large language model designed to address these limitations. It’s the latest addition to Zhipu’s extensive multi-modal large model family, which includes models capable of image understanding, video generation, and more.
Multimodal large language models (MLLMs) focus on creating artificial intelligence (AI) systems that can interpret textual and visual data seamlessly. The NVLM-H model, in particular, strikes a balance between image processing efficiency and multimodal reasoning accuracy, making it one of the most promising models in this field.
Researchers from the University of Manitoba and Washington State University introduced a novel approach called VulScribeR, designed to address these challenges. VulScribeR employs large language models (LLMs) to generate diverse and realistic vulnerable code samples through three strategies: Mutation, Injection, and Extension.
Previous research on reasoning frameworks in large language models (LLMs) has explored various approaches to enhance problem-solving capabilities. The DoT framework enhances reasoning capabilities in large language models by modeling iterative reasoning as a directed acyclic graph within a single LLM.
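To picture what reasoning-as-a-DAG means, here is a minimal sketch in the spirit of the description above. The node roles ("proposal", "critique", "refinement") and the topological traversal are illustrative assumptions, not the DoT paper's actual implementation.

```python
from dataclasses import dataclass, field

@dataclass
class Node:
    text: str
    role: str                                  # e.g. "proposal", "critique"
    parents: list = field(default_factory=list)

def topological_context(node: Node) -> str:
    """Collect ancestor steps in dependency order to build the next prompt."""
    seen, order = set(), []
    def visit(n: Node) -> None:
        for p in n.parents:
            if id(p) not in seen:
                seen.add(id(p))
                visit(p)
        order.append(n)
    visit(node)
    return "\n".join(f"[{n.role}] {n.text}" for n in order)

p1 = Node("Assume x + 2 = 5, so x = 3.", "proposal")
c1 = Node("Check: 3 + 2 = 5 holds.", "critique", parents=[p1])
r1 = Node("Conclude x = 3.", "refinement", parents=[p1, c1])
print(topological_context(r1))   # context the single LLM conditions on next
```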
Large Language Models (LLMs) have shown remarkable potential in solving complex real-world problems, from function calls to embodied planning and code generation.
Accelerating inference in large language models (LLMs) is challenging due to their high computational and memory requirements, leading to significant financial and energy costs.
Large Language Models (LLMs) excel in various tasks, including text generation, translation, and summarization. However, a growing challenge within NLP is how these models can effectively interact with external tools to perform tasks beyond their inherent capabilities.
The ability to reason over these long contexts is essential for functions like document summarization, code generation, and large-scale data analysis, all of which are central to advancements in AI. A key challenge researchers face is the need for more effective tools to evaluate long-context understanding in large language models.
Despite recent advances in multimodal large language models (MLLMs), the development of these models has largely centered around English and Western-centric datasets.
Recent advancements in large language models (LLMs) have significantly enhanced their ability to handle long contexts, making them highly effective in various tasks, from answering questions to complex reasoning.
Large language models (LLMs) have revolutionized how machines process and generate human language, but their ability to reason effectively across diverse tasks remains a significant challenge.
The deployment and optimization of large language models (LLMs) have become critical for various applications. GuideLLM is a comprehensive solution that helps users gauge the performance, resource needs, and cost implications of deploying large language models on various hardware configurations.
Formal theorem proving has emerged as a critical benchmark for assessing the reasoning capabilities of large language models (LLMs), with significant implications for mathematical automation.
Arcee AI released DistillKit: an open-source, easy-to-use tool transforming model distillation for creating efficient, high-performance small language models. The post Balancing Act: The Impact of Format Restrictions on Reasoning in Large Language Models appeared first on MarkTechPost.
Researchers from ByteDance have introduced an innovative model known as the Hierarchical Large Language Model (HLLM) to improve recommendation accuracy and efficiency. The HLLM architecture is designed to enhance sequential recommendation systems by utilizing the powerful capabilities of large language models (LLMs).
Large language models (LLMs) have gained significant attention due to their potential to enhance various artificial intelligence applications, particularly in natural language processing. In conclusion, this research addresses a critical issue in deploying large language models in real-world applications.
VideoLLaMA 2 retains the dual-branch architecture of its predecessor, with separate Vision-Language and Audio-Language branches that connect pre-trained visual and audio encoders to a large language model.
In recent years, large language models (LLMs) have demonstrated significant progress in various applications, from text generation to question answering. However, one critical area of improvement is ensuring these models accurately follow specific instructions during tasks, such as adjusting format, tone, or content length.
One of the biggest hurdles organizations face is implementing Large Language Models (LLMs) to handle intricate workflows effectively. Issues of speed, flexibility, and scalability often hinder the automation of complex workflows requiring coordination across multiple systems.
The advent of Large Language Models (LLMs) offers a potential solution to this enduring problem.
This framework consists of four distinct modules, including the Document Distiller, which reforms raw documents into semantic blocks using large language models (LLMs) guided by a flexible, user-defined schema, and the Incremental Entity Extractor, which extracts unique entities from the semantic blocks, ensuring no duplications or semantic ambiguities.
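To make the module boundaries concrete, here is a minimal sketch of the two modules described above. The class and method names and the `llm(prompt)` stub are hypothetical stand-ins for illustration, not the framework's actual API.

```python
import json

def llm(prompt: str) -> str:
    # Placeholder: plug in any chat-completion client here.
    raise NotImplementedError

class DocumentDistiller:
    """Reforms a raw document into schema-guided semantic blocks."""
    def __init__(self, schema: dict):
        self.schema = schema            # flexible, user-defined schema

    def distill(self, raw_document: str) -> dict:
        prompt = (
            "Rewrite the document as semantic blocks matching this JSON schema:\n"
            f"{json.dumps(self.schema)}\n\nDocument:\n{raw_document}"
        )
        return json.loads(llm(prompt))  # assumes the model returns valid JSON

class IncrementalEntityExtractor:
    """Extracts entities block by block, deduplicating against prior blocks."""
    def __init__(self):
        self.entities: dict[str, str] = {}   # canonical key -> surface form

    def extract(self, block: str) -> None:
        prompt = f"List the unique named entities in:\n{block}\nOne per line."
        for name in llm(prompt).splitlines():
            key = name.strip().lower()        # naive canonicalization
            if key and key not in self.entities:
                self.entities[key] = name.strip()
```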
Generative Large Language Models (LLMs) are capable of in-context learning (ICL), which is the process of learning from examples given within a prompt. However, research on the precise principles underlying these models’ ICL performance is still underway.
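As a concrete picture of ICL, a few-shot prompt of the following shape teaches a task with no weight updates at all; the model infers the pattern from the examples alone. The sentiment-labeling task and the completion hand-off are illustrative assumptions.

```python
# Few-shot in-context learning: the examples in the prompt define the task.
examples = [
    ("The movie was fantastic!", "positive"),
    ("I want my money back.", "negative"),
    ("A perfectly average afternoon.", "neutral"),
]

def build_icl_prompt(query: str) -> str:
    shots = "\n".join(f"Review: {x}\nSentiment: {y}" for x, y in examples)
    return f"{shots}\nReview: {query}\nSentiment:"

prompt = build_icl_prompt("Best concert I have seen in years.")
print(prompt)   # send to any text-completion endpoint; it should answer "positive"
```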