However, as the reach of live streams expands globally, language barriers and accessibility challenges have emerged, limiting viewers' ability to fully comprehend and participate in these immersive experiences. For the complete list of model IDs, see Amazon Bedrock model IDs.
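A minimal sketch of how a model ID from that list is used in practice, here calling the Bedrock Converse API to translate a live-stream caption; the model ID, region, and prompt are illustrative assumptions:

```python
import boto3

# Hypothetical example: translating a live-stream caption with a Bedrock model.
# The model ID is illustrative; substitute any ID from the Amazon Bedrock
# model IDs list referenced above.
bedrock = boto3.client("bedrock-runtime", region_name="us-east-1")

response = bedrock.converse(
    modelId="anthropic.claude-3-haiku-20240307-v1:0",  # assumption: any supported model ID
    messages=[{
        "role": "user",
        "content": [{"text": "Translate this live-stream caption to Spanish: "
                             "'Welcome back, everyone, the match resumes in two minutes.'"}],
    }],
    inferenceConfig={"maxTokens": 256, "temperature": 0.2},
)
print(response["output"]["message"]["content"][0]["text"])
```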
Today, Amazon Web Services (AWS) announced the general availability of GraphRAG, a capability in Amazon Bedrock Knowledge Bases that enhances Retrieval-Augmented Generation (RAG) with graph data in Amazon Neptune Analytics.
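A hedged sketch of querying such a knowledge base, assuming one already exists with a Neptune Analytics (GraphRAG) backend; the knowledgeBaseId, model ARN, and question are placeholders:

```python
import boto3

# Query an existing Bedrock Knowledge Base through the agent runtime.
runtime = boto3.client("bedrock-agent-runtime", region_name="us-east-1")

response = runtime.retrieve_and_generate(
    input={"text": "Which suppliers are linked to the delayed shipment?"},
    retrieveAndGenerateConfiguration={
        "type": "KNOWLEDGE_BASE",
        "knowledgeBaseConfiguration": {
            "knowledgeBaseId": "KB1234567890",  # placeholder ID
            "modelArn": "arn:aws:bedrock:us-east-1::foundation-model/"
                        "anthropic.claude-3-sonnet-20240229-v1:0",  # placeholder model
        },
    },
)
print(response["output"]["text"])
```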
Open foundation models (FMs) have become a cornerstone of generative AI innovation, enabling organizations to build and customize AI applications while maintaining control over their costs and deployment strategies. You can access your imported custom models on demand, without the need to manage underlying infrastructure.
The user enters a text prompt describing what the code should do, and the generative AI code development tool automatically creates the code. It can also modernize legacy code and translate code from one programming language to another. How does generative AI code generation work?
To build a generative AI-based conversational application integrated with relevant data sources, an enterprise needs to invest time, money, and people. Amazon Q Business offers multiple pre-built data source connectors that can connect to your data sources and help you create your generative AI solution with minimal configuration.
With advancements in deep learning, natural language processing (NLP), and AI, we have reached a point where AI agents could form a significant portion of the global workforce. These AI agents, transcending chatbots and voice assistants, are shaping a new paradigm for both industries and our daily lives.
This advancement has spurred the commercial use of generative AI in natural language processing (NLP) and computer vision, enabling automated and intelligent data extraction. Typically, the generative AI model is given a prompt describing the desired data, and the ensuing response contains the extracted data.
NVIDIA GTC, running this week at the San Jose Convention Center, will spotlight the groundbreaking work NVIDIA and its partners are doing to bring the transformative power of generative AI, large language models, and visual language models to the mobility sector.
Foundation models (FMs) and generative AI are transforming how financial service institutions (FSIs) operate their core business functions. FMs are probabilistic in nature and produce a range of outcomes. This is where the combination of generative AI and Automated Reasoning comes into play.
Retrieval Augmented Generation (RAG) allows you to provide a large language model (LLM) with access to data from external knowledge sources such as repositories, databases, and APIs without the need to fine-tune it. Use the deployed models in your question answering generative AI applications.
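To make the mechanism concrete, here is a minimal conceptual sketch of the RAG pattern: retrieved passages are injected into the prompt, so the LLM answers from external data without fine-tuning. retrieve_documents() and call_llm() are hypothetical stand-ins for your knowledge source and model endpoint:

```python
def retrieve_documents(query: str, top_k: int = 3) -> list[str]:
    """Stand-in for a search over repositories, databases, or APIs."""
    raise NotImplementedError

def call_llm(prompt: str) -> str:
    """Stand-in for an LLM invocation (for example, a hosted endpoint)."""
    raise NotImplementedError

def answer_with_rag(question: str) -> str:
    # Augment the prompt with retrieved context instead of fine-tuning the model.
    context = "\n\n".join(retrieve_documents(question))
    prompt = (
        "Answer the question using only the context below.\n\n"
        f"Context:\n{context}\n\nQuestion: {question}\nAnswer:"
    )
    return call_llm(prompt)
```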
It features natural language understanding capabilities that identify user intent more accurately and fulfill it faster. Amazon Bedrock simplifies the process of developing and scaling generative AI applications powered by large language models (LLMs) and other foundation models (FMs).
Additionally, we cover the seamless integration of generative AI tools like Amazon CodeWhisperer and Jupyter AI within SageMaker Studio JupyterLab Spaces, illustrating how they empower developers to use AI for coding assistance and innovative problem-solving. Choose Create JupyterLab space, then choose Create space.
By surrounding unparalleled human expertise with proven technology, data and AI tools, Octus unlocks powerful truths that fuel decisive action across financial markets. Visit octus.com to learn how we deliver rigorously verified intelligence at speed and create a complete picture for professionals across the entire credit lifecycle.
Using machine learning (ML) and natural language processing (NLP) to automate product description generation has the potential to save manual effort and transform the way ecommerce platforms operate. The example pairs a product image (.jpg) with its complete metadata from styles/38642.json, and the fine-tuning code sets lora_alpha=32, the alpha parameter for LoRA scaling.
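A hedged sketch of the LoRA configuration the lora_alpha=32 fragment comes from, using the Hugging Face peft library; the rank, target modules, and dropout are illustrative assumptions, not the post's exact settings:

```python
from peft import LoraConfig, get_peft_model

lora_config = LoraConfig(
    r=16,                     # rank of the low-rank update matrices (assumption)
    lora_alpha=32,            # the alpha parameter for LoRA scaling (from the excerpt)
    target_modules=["q_proj", "v_proj"],  # assumption: attention projections
    lora_dropout=0.05,
    bias="none",
    task_type="CAUSAL_LM",
)
# model = get_peft_model(base_model, lora_config)  # base_model loaded elsewhere
```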
To address this challenge, the contact center team at DoorDash wanted to harness the power of generative AI to deploy a solution quickly and at scale, while maintaining their high standards for issue resolution and customer satisfaction. The stack takes about a minute or two to deploy.
MAX_BATCH_PREFILL_TOKENS: This parameter caps the total number of tokens processed during the prefill stage across all batched requests. The prefill phase is both memory-intensive and compute-bound, so capping it optimizes resource utilization and prevents out-of-memory errors. The best performance was observed on instances such as ml.p4d.24xlarge and ml.g6e.12xlarge.
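A sketch of how this parameter might be passed to a Text Generation Inference (TGI) container on SageMaker, assuming the Hugging Face LLM container; the model ID, token budgets, role ARN, and instance type are placeholders:

```python
from sagemaker.huggingface import HuggingFaceModel, get_huggingface_llm_image_uri

llm_image = get_huggingface_llm_image_uri("huggingface")  # latest TGI image

model = HuggingFaceModel(
    image_uri=llm_image,
    role="arn:aws:iam::123456789012:role/SageMakerRole",  # placeholder role
    env={
        "HF_MODEL_ID": "tiiuae/falcon-7b-instruct",   # assumption: any TGI-supported model
        "MAX_BATCH_PREFILL_TOKENS": "4096",           # caps prefill tokens across the batch
        "MAX_INPUT_LENGTH": "2048",
        "MAX_TOTAL_TOKENS": "4096",
    },
)
predictor = model.deploy(initial_instance_count=1, instance_type="ml.g5.12xlarge")
```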
The added benefit of asynchronous inference is the cost savings from auto scaling the instance count to zero when there are no requests to process. SageMaker features and capabilities help developers and data scientists get started with natural language processing (NLP) on AWS with ease.
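A hedged sketch of that scale-to-zero setup, assuming an asynchronous endpoint; the bucket, endpoint name, and capacity values are placeholders:

```python
import boto3
from sagemaker.async_inference import AsyncInferenceConfig

# Deploy with async inference so requests queue while instances scale.
async_config = AsyncInferenceConfig(output_path="s3://my-bucket/async-results/")
# predictor = model.deploy(initial_instance_count=1,
#                          instance_type="ml.g5.xlarge",
#                          async_inference_config=async_config)

# Register a scaling target whose minimum capacity is zero.
autoscaling = boto3.client("application-autoscaling")
autoscaling.register_scalable_target(
    ServiceNamespace="sagemaker",
    ResourceId="endpoint/my-async-endpoint/variant/AllTraffic",  # placeholder
    ScalableDimension="sagemaker:variant:DesiredInstanceCount",
    MinCapacity=0,   # scale to zero when the queue is empty
    MaxCapacity=4,
)
```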
Second, this graph database is used along with generative AI to detect second- and third-order impacts from news events. For instance, the solution can highlight that delays at a parts supplier may disrupt production for downstream auto manufacturers in a portfolio, even though none of them are directly referenced.
The success of generative AI applications across a wide range of industries has attracted the attention and interest of companies worldwide that are looking to reproduce and surpass the achievements of competitors or solve new and exciting use cases, with foundation models serving as the engines that power generative AI innovation.
Advertising agencies can use generative AI and text-to-image foundation models to create innovative ad creatives and content. In this post, we demonstrate how you can generate new images from existing base images using Amazon SageMaker, a fully managed service to build, train, and deploy ML models at scale.
The world of artificial intelligence (AI) and machine learning (ML) has been witnessing a paradigm shift with the rise of generative AI models that can create human-like text, images, code, and audio. Compared to classical ML models, generative AI models are significantly bigger and more complex.
Solution overview: Training a custom moderation adapter involves five steps that you can complete using the AWS Management Console or the API: (1) create a project, (2) upload the training data, (3) assign ground truth labels to images, (4) train the adapter, and (5) use the adapter. Let's walk through these steps in more detail using the console.
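As a hedged sketch of the final step (using the adapter) through the API, assuming the adapter has already been trained; the project version ARN, bucket, and confidence threshold are placeholders:

```python
import boto3

rekognition = boto3.client("rekognition")

# Run moderation with the custom adapter by passing its project version ARN.
response = rekognition.detect_moderation_labels(
    Image={"S3Object": {"Bucket": "my-bucket", "Name": "images/sample.jpg"}},
    ProjectVersion="arn:aws:rekognition:us-east-1:123456789012:"
                   "project/my-adapter/version/1",  # placeholder adapter ARN
    MinConfidence=60,
)
for label in response["ModerationLabels"]:
    print(label["Name"], label["Confidence"])
```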
These generative AI applications are not only used to automate existing business processes, but also have the ability to transform the experience for customers using these applications. This identity is called the AWS account root user.
What is the optimal framework and configuration for hosting large language models (LLMs) for text generation generative AI applications? This condition can be a maximum length for the generated text, a specific token that signals the end of the text, or any other criteria set by the user or the application.
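A minimal sketch of those stopping conditions using the Hugging Face transformers generate API; the model choice and the newline-based custom criterion are illustrative assumptions:

```python
import torch
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          StoppingCriteria, StoppingCriteriaList)

tok = AutoTokenizer.from_pretrained("gpt2")       # assumption: small demo model
model = AutoModelForCausalLM.from_pretrained("gpt2")

class StopOnNewline(StoppingCriteria):
    """Custom criterion: stop as soon as a newline token is generated."""
    def __call__(self, input_ids, scores, **kwargs):
        return input_ids[0, -1].item() == tok.encode("\n")[0]

inputs = tok("The three laws of robotics are", return_tensors="pt")
out = model.generate(
    **inputs,
    max_new_tokens=100,                      # maximum-length condition
    eos_token_id=tok.eos_token_id,           # end-of-text token condition
    stopping_criteria=StoppingCriteriaList([StopOnNewline()]),  # custom condition
)
print(tok.decode(out[0], skip_special_tokens=True))
```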
We’re at an exciting inflection point in the widespread adoption of machine learning (ML), and we believe most customer experiences and applications will be reinvented with generative AI. Generative AI can create new content and ideas, including conversations, stories, images, videos, and music.
These models aren't just large in terms of size—they're also massive in their capacity to understand human prompts and generate vast amounts of original text. Earlier natural language processing (NLP) models were limited in their understanding of language. Want to dive deeper?
Their impressive generative abilities have led to widespread adoption across various sectors and use cases, including content generation, sentiment analysis, chatbot development, and virtual assistant technology. This results in faster restarts and workload completion. Llama 2 by Meta is an example of an LLM offered by AWS.
Yes, we’re all still trying to get our heads around the fact that ChatGPT and AI like it are going to do more and more of our writing for us in the coming years, along with more and more of our everyday jobs and, increasingly, our day-to-day thinking.
It’s a next-generation model in the Falcon family—a more efficient and accessible large language model (LLM) trained on a 5.5 trillion token dataset. It’s built on a causal decoder-only architecture, making it powerful for auto-regressive tasks. After deployment is complete, you will see that an endpoint is created.
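A hedged sketch of deploying it through SageMaker JumpStart; the model ID follows JumpStart naming conventions but is an assumption, so verify it against the model catalog:

```python
from sagemaker.jumpstart.model import JumpStartModel

# Deploy the model from JumpStart; this creates the endpoint mentioned above.
model = JumpStartModel(model_id="huggingface-llm-falcon2-11b")  # assumed model ID
predictor = model.deploy()

response = predictor.predict({"inputs": "What is a causal decoder-only architecture?"})
print(response)
```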
Customers can create the custom metadata using Amazon Comprehend, a natural language processing (NLP) service managed by AWS, to extract insights about the content of documents, and ingest it into Amazon Kendra along with their data into the index. For example, metadata can be used for filtering and searching.
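A minimal sketch of this pattern, reconstructing the excerpt's stray append(e["Text"].upper()) code fragment: extract entities with Comprehend and collect them, uppercased, as metadata values. The sample text and normalization choice are assumptions:

```python
import boto3

comprehend = boto3.client("comprehend")

text = "Amazon Kendra was announced at AWS re:Invent in Las Vegas."
entities = []
resp = comprehend.detect_entities(Text=text, LanguageCode="en")
for e in resp["Entities"]:
    entities.append(e["Text"].upper())  # normalize casing for consistent filtering

# These values can then be attached to the Kendra document as a custom
# attribute used for filtering and faceted search.
print(entities)
```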
In recent years, we have seen a big increase in the size of large language models (LLMs) used to solve natural language processing (NLP) tasks such as question answering and text summarization. Modern language models are based on the transformer architecture.
In this post, we showcase how to build an end-to-end generative AI application for enterprise search with Retrieval Augmented Generation (RAG) by using Haystack pipelines and the Falcon-40b-instruct model from Amazon SageMaker JumpStart and Amazon OpenSearch Service.
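A simplified stand-in for that pipeline, querying OpenSearch directly with opensearch-py and invoking the SageMaker endpoint with boto3 rather than through Haystack; hosts, index, endpoint name, and auth (omitted here) are placeholders:

```python
import json
import boto3
from opensearchpy import OpenSearch

# Retrieve candidate passages from OpenSearch (auth omitted for brevity).
os_client = OpenSearch(
    hosts=[{"host": "my-domain.us-east-1.es.amazonaws.com", "port": 443}],
    use_ssl=True,
)
smr = boto3.client("sagemaker-runtime")

question = "What is our parental leave policy?"
hits = os_client.search(index="enterprise-docs",
                        body={"query": {"match": {"content": question}}, "size": 3})
context = "\n".join(h["_source"]["content"] for h in hits["hits"]["hits"])

# Send the augmented prompt to the Falcon-40b-instruct endpoint.
payload = {"inputs": f"Context:\n{context}\n\nQuestion: {question}\nAnswer:"}
resp = smr.invoke_endpoint(EndpointName="falcon-40b-instruct",  # placeholder name
                           ContentType="application/json",
                           Body=json.dumps(payload))
print(json.loads(resp["Body"].read()))
```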
Salesforce Einstein is a set of AI technologies that integrate with Salesforce’s Customer Success Platform to help businesses improve productivity and client engagement. SageMaker allowed the Einstein team to use auto-scaling of these GPUs to meet demand without manual intervention.
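A hedged sketch of that kind of auto scaling, using a target-tracking policy on a SageMaker endpoint variant; the resource names and target value are assumptions:

```python
import boto3

autoscaling = boto3.client("application-autoscaling")

# Make the endpoint variant's instance count scalable.
autoscaling.register_scalable_target(
    ServiceNamespace="sagemaker",
    ResourceId="endpoint/einstein-endpoint/variant/AllTraffic",  # placeholder
    ScalableDimension="sagemaker:variant:DesiredInstanceCount",
    MinCapacity=1,
    MaxCapacity=8,
)
# Track invocations per instance so capacity follows demand automatically.
autoscaling.put_scaling_policy(
    PolicyName="gpu-demand-tracking",
    ServiceNamespace="sagemaker",
    ResourceId="endpoint/einstein-endpoint/variant/AllTraffic",
    ScalableDimension="sagemaker:variant:DesiredInstanceCount",
    PolicyType="TargetTrackingScaling",
    TargetTrackingScalingPolicyConfiguration={
        "TargetValue": 100.0,  # invocations per instance (assumption)
        "PredefinedMetricSpecification": {
            "PredefinedMetricType": "SageMakerVariantInvocationsPerInstance"
        },
    },
)
```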
Generative AI is evolving and gaining popularity. The major reason for this exponentially increasing popularity is the development of large language models. LLMs, the artificial intelligence models designed to process natural language and generate human-like responses, are trending.
Limited options for auto-QA: Many companies use automated QA (auto-QA) services to monitor customer interactions. However, this is a relatively small market with limited solutions, and most auto-QA tools fail to deliver actionable results. Level AI offers QA-GPT, a powerful QA auditor you can tailor to your exact business.
Auto-recording and transcription: With MeetGeek AI, you can automatically record and transcribe your meetings for free without taking notes! MeetGeek generates AI-powered meeting summaries with highlights and actionable tasks that can automatically be shared with others.
To learn more about SageMaker Studio JupyterLab Spaces, refer to Boost productivity on Amazon SageMaker Studio: Introducing JupyterLab Spaces and generative AI tools. To store information in Secrets Manager, complete the following steps: On the Secrets Manager console, choose Store a new secret.
When the script ends, a completion status along with the time taken is returned to the SageMaker Studio console. Clean up: When the Python script is complete, you can save costs by shutting down or stopping the SageMaker Studio notebook or container that you spun up. We have packaged this solution in a .ipynb notebook and a .py script.
An intelligent document processing (IDP) project usually combines optical character recognition (OCR) and natural language processing (NLP) to read and understand a document and extract specific terms or words. About the Authors: Suyin Wang is an AI/ML Specialist Solutions Architect at AWS.
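A minimal sketch of that OCR plus NLP combination, pairing Amazon Textract with Amazon Comprehend; the bucket and document key are placeholders:

```python
import boto3

textract = boto3.client("textract")
comprehend = boto3.client("comprehend")

# OCR: read the document image from S3 into plain text.
ocr = textract.detect_document_text(
    Document={"S3Object": {"Bucket": "my-bucket", "Name": "invoices/inv-001.png"}}
)
text = " ".join(b["Text"] for b in ocr["Blocks"] if b["BlockType"] == "LINE")

# NLP: extract specific terms (entities) from the recognized text.
nlp = comprehend.detect_entities(Text=text, LanguageCode="en")
for entity in nlp["Entities"]:
    print(entity["Type"], entity["Text"])
```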
Deploy the trained ByteTrack model with different deployment options depending on your use case: real-time processing, asynchronous, or batch prediction. Prerequisites Before getting started, complete the following prerequisites: Create an AWS account or use an existing AWS account. Create a SageMaker notebook instance.
Be sure to check out their talk, “Evolving Trends in Prompt Engineering for Large Language Models (LLMs) with Built-in Responsible AI Practices,” there! The emergence of Large Language Models (LLMs) has inaugurated a new era in the realm of artificial intelligence, reshaping the possibilities for organizations across diverse sectors.
Now you can also fine-tune the 7-billion-, 13-billion-, and 70-billion-parameter Llama 2 text generation models on SageMaker JumpStart using the Amazon SageMaker Studio UI with a few clicks, or using the SageMaker Python SDK. In this post, we walk through how to fine-tune Llama 2 pre-trained text generation models via SageMaker JumpStart.
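A hedged sketch of the SDK path, assuming the JumpStart model ID for the 7B variant; the S3 training path is a placeholder:

```python
from sagemaker.jumpstart.estimator import JumpStartEstimator

estimator = JumpStartEstimator(
    model_id="meta-textgeneration-llama-2-7b",  # assumed JumpStart model ID
    environment={"accept_eula": "true"},        # Llama 2 requires accepting the EULA
)
estimator.fit({"training": "s3://my-bucket/llama2-finetune/train/"})  # placeholder path
predictor = estimator.deploy()  # host the fine-tuned model on an endpoint
```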
The SageMaker option offers several advantages, including easy integration of image generation APIs with video generation endpoints to create end-to-end pipelines. Once the SageMaker HyperPod cluster deletion is complete, delete the CloudFormation stack. He is passionate about computer vision, NLP, generative AI, and MLOps.