Today, Amazon Web Services (AWS) announced the general availability of Amazon Bedrock Knowledge Bases GraphRAG (GraphRAG), a capability in Amazon Bedrock Knowledge Bases that enhances Retrieval-Augmented Generation (RAG) with graph data in Amazon Neptune Analytics.
Enterprises may want to add custom metadata, such as document types (W-2 forms or paystubs) and entity types (names, organizations, and addresses), in addition to standard metadata like file type, creation date, or size, to extend intelligent search while ingesting documents.
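As a rough sketch of the idea, custom attributes can be described in a JSON sidecar stored alongside each document at ingestion time; the helper below builds one. The attribute names (`document_type`, `person_name`, `organization`, `address`) are illustrative, not a fixed schema.

```python
import json

def build_metadata_sidecar(doc_path, doc_type, entities):
    """Build a <doc>.metadata.json sidecar describing custom attributes.

    Standard metadata (file type, creation date, size) is typically added
    by the ingestion service itself; only the custom attributes go here.
    """
    sidecar = {
        "metadataAttributes": {
            "document_type": doc_type,                      # e.g. "W-2" or "paystub"
            "person_name": entities.get("name", ""),
            "organization": entities.get("organization", ""),
            "address": entities.get("address", ""),
        }
    }
    return doc_path + ".metadata.json", json.dumps(sidecar, indent=2)

name, body = build_metadata_sidecar(
    "s3://docs/w2_2023.pdf",
    "W-2",
    {"name": "Jane Doe", "organization": "Example Corp", "address": "123 Main St"},
)
```

These attributes can then be used as filters at query time, narrowing retrieval to, say, only W-2 documents for a given person.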
When using the FAISS adapter, translation units are stored in a local FAISS index along with their metadata, and the request is sent to the prompt generator. Also note the completion metrics in the left pane, which display latency, input/output tokens, and quality scores. Cohere Embed supports 108 languages. Rerun the translation.
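The core idea of keeping metadata next to each vector can be sketched in plain Python. This is a toy stand-in for a real FAISS index, with made-up translation pairs; a production setup would use the `faiss` library and real embeddings.

```python
import math

class TinyVectorIndex:
    """Toy stand-in for a FAISS index: stores vectors with attached
    metadata and retrieves the closest entries by cosine similarity."""

    def __init__(self):
        self.entries = []  # list of (vector, metadata) pairs

    def add(self, vector, metadata):
        self.entries.append((vector, metadata))

    @staticmethod
    def _cosine(a, b):
        dot = sum(x * y for x, y in zip(a, b))
        na = math.sqrt(sum(x * x for x in a))
        nb = math.sqrt(sum(x * x for x in b))
        return dot / (na * nb) if na and nb else 0.0

    def search(self, query, k=1):
        ranked = sorted(self.entries,
                        key=lambda e: self._cosine(query, e[0]),
                        reverse=True)
        return [meta for _, meta in ranked[:k]]

index = TinyVectorIndex()
index.add([1.0, 0.0], {"source": "Hello world", "target": "Hallo Welt"})
index.add([0.0, 1.0], {"source": "Good morning", "target": "Guten Morgen"})
best = index.search([0.9, 0.1], k=1)[0]  # nearest stored translation unit
```

Because the metadata rides along with each vector, a nearest-neighbor hit immediately yields the stored translation pair, not just an embedding.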
By surrounding unparalleled human expertise with proven technology, data and AI tools, Octus unlocks powerful truths that fuel decisive action across financial markets. Visit octus.com to learn how we deliver rigorously verified intelligence at speed and create a complete picture for professionals across the entire credit lifecycle.
At the same time, our generative AI models automatically design molecules targeting improvements across numerous properties, searching millions of candidates and requiring enormous throughput at moderate latency. We wanted to build a scalable system to support both AI training and inference.
This data is used to enrich the generative AI prompt to deliver more context-specific and accurate responses without continuously retraining the FM, while also improving transparency and minimizing hallucinations. Prerequisites: complete the following steps. Make sure you have model access in Amazon Bedrock.
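The enrichment step itself is simple string assembly: retrieved passages are prepended to the user's question so the model answers from grounded context. A minimal sketch (the prompt wording is an assumption, not any service's actual template):

```python
def enrich_prompt(question, retrieved_chunks):
    """Assemble a RAG prompt: retrieved passages ground the model's
    answer so the FM does not need retraining on the underlying data."""
    context = "\n\n".join(
        f"[{i + 1}] {chunk}" for i, chunk in enumerate(retrieved_chunks)
    )
    return (
        "Answer using only the context below. "
        "If the answer is not in the context, say so.\n\n"
        f"Context:\n{context}\n\nQuestion: {question}\nAnswer:"
    )

prompt = enrich_prompt(
    "What is the refund window?",
    ["Refunds are accepted within 30 days of purchase."],
)
```

The instruction to refuse when the context lacks the answer is one common tactic for reducing hallucinations; numbered passages also make it easier to cite sources for transparency.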
For years, Rad AI has been a reliable partner to radiology practices and health systems, consistently delivering high availability and generating complete results seamlessly in 0.5–3 seconds, with minimal latency. In this post, we share how Rad AI reduced real-time inference latency by 50% using Amazon SageMaker.
Using generative AI and new multimodal foundation models (FMs) could be very strategic for Veritone and the businesses it serves, because it would significantly improve media indexing and retrieval based on contextual meaning, a critical first step toward eventually generating new content.
However, model governance functions in an organization are centralized, and to perform those functions, teams need access to metadata about model lifecycle activities across those accounts for validation, approval, auditing, and monitoring to manage risk and compliance. It can take up to 20 minutes for the setup to complete.
To address this challenge, the contact center team at DoorDash wanted to harness the power of generative AI to deploy a solution quickly and at scale, while maintaining their high standards for issue resolution and customer satisfaction. This represents about a full page of text. For the percentage overlap, use 10%. Choose Next.
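The chunking settings above can be sketched as code. Assuming a "full page" is roughly 2,000 characters (an illustrative figure, not a spec), a 10% overlap means consecutive chunks share their boundary text so no sentence is split without context:

```python
def chunk_text(text, chunk_size=2000, overlap_pct=10):
    """Split text into fixed-size character chunks with a percentage
    overlap between consecutive chunks. chunk_size of 2,000 characters
    is roughly a page; both numbers here are illustrative."""
    step = chunk_size - chunk_size * overlap_pct // 100  # advance per chunk
    chunks = []
    start = 0
    while start < len(text):
        chunks.append(text[start:start + chunk_size])
        start += step
    return chunks

pieces = chunk_text("x" * 5000, chunk_size=2000, overlap_pct=10)
# 5,000 characters with step 1,800 yields three chunks, the last one short
```

The overlap keeps retrieval robust when an answer straddles a chunk boundary, at the cost of slightly more stored vectors.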
Forethought is a leading generative AI suite for customer service. Once these gaps are identified, SupportGPT can automatically generate articles and other content to fill these knowledge voids, ensuring the support knowledge base remains customer-centric and up to date. The following diagram illustrates our legacy architecture.
Similarly, a study by Meta AI and Carnegie Mellon University found that, in the worst cases, 43 percent of compute time was wasted because of overheads due to hardware failures. This can adversely impact a customer's ability to keep up with the pace of innovation in generative AI and can also increase the time-to-market for their models.
In early trials, cuOpt delivered routing solutions in 10 seconds, achieving a 90% reduction in cloud costs and enabling technicians to complete more service calls daily. The company found that data scientists were having to remove features from algorithms just so they would run to completion.
With the advancement of generative AI, we can use vision-language models (VLMs) to predict product attributes directly from images. You can use a managed service, such as Amazon Rekognition, to predict product attributes as explained in Automating product description generation with Amazon Bedrock.
Launch the instance using the Neuron DLAMI. Complete the following steps: On the Amazon EC2 console, choose your desired AWS Region and choose Launch Instance. You can update your Auto Scaling groups to use new AMI IDs without needing to create new launch templates or new versions of launch templates each time an AMI ID changes.
AWS delivers services that meet customers' artificial intelligence (AI) and machine learning (ML) needs, ranging from custom hardware like AWS Trainium and AWS Inferentia to generative AI foundation models (FMs) on Amazon Bedrock. Download the generated text file to view the transcription.
Amazon Bedrock is a fully managed service that offers a choice of high-performing foundation models (FMs) from leading AI companies like AI21 Labs, Anthropic, Cohere, Meta, Mistral AI, Stability AI, and Amazon through a single API, along with a broad set of capabilities to build generative AI applications with security, privacy, and responsible AI.
When the script ends, a completion status along with the time taken is returned to the SageMaker Studio console. These JSON files contain all the Amazon Textract metadata, including the text that was extracted from within the documents. This is a key prerequisite to using your documents for generative AI and search.
To learn more about SageMaker Studio JupyterLab Spaces, refer to Boost productivity on Amazon SageMaker Studio: Introducing JupyterLab Spaces and generative AI tools. To store information in Secrets Manager, complete the following steps: On the Secrets Manager console, choose Store a new secret.
LLMs such as GPT-4, PaLM-2, Llama-2, and others are propelling the surge of generative AI, ushering in novel applications that are reshaping both technological and business landscapes. Evaluating prompt completion: the goal is to establish effective evaluation criteria to gauge LLMs' performance across tasks and domains.
I am Ali Arsanjani, and I lead partner engineering for Google Cloud, specializing in the area of AI-ML, and I’m very happy to be here today with everyone. Others, toward language completion and further downstream tasks. In retail: generating product descriptions and recommendations and customer churn and these types of things.
But nowadays, it is used for various tasks, ranging from language modeling to computer vision and generative AI. The encoder receives the inputs and generates a contextualized interpretation of the inputs, called embeddings. This helps in training large AI models, even on computers with little memory.
NVIDIA NeMo Framework NVIDIA NeMo is an end-to-end, cloud-native framework for training and deploying generative AI models with billions to trillions of parameters at scale. NVIDIA NeMo simplifies generative AI model development, making it more cost-effective and efficient for enterprises. 24xlarge instances.
We normalize these images into a set of uniform thumbnails, which constitute the functional input for the active learning pipeline (auto-labeling and inference). The auto-labeling pipeline focuses on automating SageMaker Ground Truth jobs and sampling images for labeling through those jobs.
Generative AI models are revolutionizing music creation and consumption. Originating from advancements in artificial intelligence (AI) and deep learning, these models are designed to understand and translate descriptive text into coherent, aesthetically pleasing music. Create a Hugging Face model. Deploy the model on SageMaker.
Training job resiliency with the job auto resume functionality – In this section, we demonstrate how scientists can submit and manage their distributed training jobs using either the native Kubernetes CLI (kubectl) or optionally the new HyperPod CLI (hyperpod) with automatic job recovery enabled.
The application needs to search through the catalog and show the metadata information related to all of the data assets that are relevant to the search context. This allows FMs to retain their inductive abilities while grounding their language understanding and generation in well-structured domain knowledge and logical reasoning.
Generative AI provides the ability to take relevant information from a data source and deliver well-constructed answers back to the user. Building a generative AI-based conversational application that is integrated with the data sources containing relevant content requires time, money, and people.
They proceed to verify the accuracy of the generated answer by selecting the buttons, which auto-play the source video starting at that timestamp. The knowledge base sync process handles chunking and embedding of the transcript, and stores the embedding vectors and file metadata in an Amazon OpenSearch Serverless vector database.
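A sketch of what such a record might look like before indexing: each transcript segment keeps its start timestamp so the UI can auto-play the source video at the right point. The field names and the toy embedding function are assumptions; a real pipeline would call an embedding model and write to OpenSearch.

```python
def transcript_to_documents(segments, embed):
    """Turn timestamped transcript segments into vector-store records.

    `embed` stands in for a real embedding model; the record field
    names (vector, text, video_id, start_seconds) are illustrative.
    """
    docs = []
    for seg in segments:
        docs.append({
            "vector": embed(seg["text"]),   # embedding of the segment text
            "text": seg["text"],            # kept for display/citation
            "video_id": seg["video_id"],    # which source video
            "start_seconds": seg["start"],  # where to start playback
        })
    return docs

# Toy embedding: (character count, word-gap count) — purely for illustration.
toy_embed = lambda text: [float(len(text)), float(text.count(" "))]

docs = transcript_to_documents(
    [{"video_id": "v1", "start": 12.5, "text": "welcome to the demo"}],
    toy_embed,
)
```

When a retrieved chunk is cited in an answer, its `start_seconds` value is what the playback button uses to jump into the video.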
Amazon Q Business is a conversational assistant powered by generative AI that enhances workforce productivity by answering questions and completing tasks based on information in your enterprise systems that each user is authorized to access. On the Settings tab, note the Metadata URI. For Name, enter [link].
DSX provides unmatched prevention and explainability by using a powerful combination of the deep learning-based DSX Brain and the generative AI DSX Companion to protect systems from known and unknown malware and ransomware in real time.