Remove Auto-classification Remove BERT Remove Hybrid AI
article thumbnail

Amazon EC2 DL2q instance for cost-efficient, high-performance AI inference is now generally available

AWS Machine Learning Blog

Model category Number of models Examples​ NLP​ 157 BERT, BART, FasterTransformer, T5, Z-code MOE Generative AI – NLP 40 LLaMA, CodeGen, GPT, OPT, BLOOM, Jais, Luminous, StarCoder, XGen Generative AI – Image 3 Stable diffusion v1.5 opt/qti-aic/exec/qaic-exec -m=bert-base-cased/generatedModels/bert-base-cased_fix_outofrange_fp16.onnx

BERT 128