Amazon EC2 DL2q instance for cost-efficient, high-performance AI inference is now generally available
AWS Machine Learning Blog
NOVEMBER 22, 2023
Model category Number of models Examples NLP 157 BERT, BART, FasterTransformer, T5, Z-code MOE Generative AI – NLP 40 LLaMA, CodeGen, GPT, OPT, BLOOM, Jais, Luminous, StarCoder, XGen Generative AI – Image 3 Stable diffusion v1.5 opt/qti-aic/exec/qaic-exec -m=bert-base-cased/generatedModels/bert-base-cased_fix_outofrange_fp16.onnx
Let's personalize your content