Models
Providers
Benchmarks
MCP
Compare
Guides

Product

Models
Providers
Benchmarks
Compare
Prompts
Find a model
Trending
Collections
News
Changelog

Learn

New to AI?
Best AI by use case
Blog
Pricing
About
Support

Legal

Privacy
Terms
Cookies

Connect

GitHub
X / Twitter
Contact

© 2026 Modeldex — the AI model registry.

Press ? for keyboard shortcuts.

Home/News

News & Analysis

Editorial coverage, in-depth analysis, and developer guides — 27 articles.

All Analysis Guide News Research

Filtered by tag:#Artificial IntelligenceClear

NewsNews
Cost-efficient custom text-to-SQL using Amazon Nova Micro and Amazon Bedrock on-demand inference
In this post, we demonstrate two approaches to fine-tune Amazon Nova Micro for custom SQL dialect generation to deliver both cost efficiency and production ready performance.
Apr 16, 2026Zeek Granston
NewsNews
How Automated Reasoning checks in Amazon Bedrock transform generative AI compliance
In this post, you'll learn why probabilistic AI validation falls short in regulated industries and how Automated Reasoning checks use formal verification to deliver mathematically proven results. You'll also see how customers across six industries use this technology to produce formally verified, auditable AI outputs, and how to get started.
Apr 16, 2026Nafi Diallo
NewsNews
Accelerating decode-heavy LLM inference with speculative decoding on AWS Trainium and vLLM
In this post, you will learn how speculative decoding works and why it helps reduce cost per generated token on AWS Trainium2.
Apr 15, 2026Yahav Biran
NewsNews
New Adobe Premiere Color Grading Mode Accelerated on NVIDIA GPUs
The NAB Show 2026 trade show, running April 18-22 in Las Vegas, is set to showcase a wave of new features and optimizations for top video editing applications. Bringing together over 60,000 content professionals from across the broadcast and media and entertainment industries, the event highlights how video editors, livestreamers and professional creators are exploring […]
Apr 15, 2026Joel Pennington
NewsNews
Use-case based deployments on SageMaker JumpStart
We're excited to announce the launch of Amazon SageMaker JumpStart optimized deployments. SageMaker JumpStart improved deployments address the need for rich and straightforward deployment customization on SageMaker JumpStart by offering pre-defined deployment configurations, designed for specific use cases. Customers maintain the same level of visibility into the details of their proposed deployments, but now deployments are optimized for their specific use case and performance constraint.
Apr 14, 2026Dan Ferguson
NewsNews
Best practices to run inference on Amazon SageMaker HyperPod
This post explores how Amazon SageMaker HyperPod provides a comprehensive solution for inference workloads. We walk you through the platform’s key capabilities for dynamic scaling, simplified deployment, and intelligent resource management. By the end of this post, you’ll understand how to use the HyperPod automated infrastructure, cost optimization features, and performance enhancements to reduce your total cost of ownership by up to 40% while accelerating your generative AI deployments from concept to production.
Apr 14, 2026Vinay Arora
NewsNews
Understanding Amazon Bedrock model lifecycle
This post shows you how to manage FM transitions in Amazon Bedrock, so you can make sure your AI applications remain operational as models evolve. We discuss the three lifecycle states, how to plan migrations with the new extended access feature, and practical strategies to transition your applications to newer models without disruption.
Apr 9, 2026Saurabh Trikande
NewsNews
The future of managing agents at scale: AWS Agent Registry now in preview
Today, we're announcing AWS Agent Registry (preview) in AgentCore, a single place to discover, share, and reuse AI agents, tools, and agent skills across your enterprise.
Apr 9, 2026Preethi C N
NewsNews
Embed a live AI browser agent in your React app with Amazon Bedrock AgentCore
This post walks you through three steps: starting a session and generating the Live View URL, rendering the stream in your React application, and wiring up an AI agent that drives the browser while your users watch. At the end, you will have a working sample application you can clone and run.
Apr 9, 2026Sundar Raghavan
NewsNews
From RTX to Spark: NVIDIA Accelerates Gemma 4 for Local Agentic AI
Open models are driving a new wave of on-device AI, extending innovation beyond the cloud to everyday devices. As these models advance, their value increasingly depends on access to local, real-time context that can turn meaningful insights into action. Designed for this shift, Google’s latest additions to the Gemma 4 family introduce a class of small, fast and omni-capable models built for efficient local execution across a wide range […]
Apr 2, 2026Michael Fukuyama
NewsNews
The Future of AI Is Open and Proprietary
AI is the defining technology of our time, quickly becoming core business infrastructure. It’s fueled by a diverse ecosystem of models: large and small, open and proprietary, generalist and specialist. This variety is essential for a future where every application will be powered by AI, every country will build it and every company will use […]
Mar 25, 2026Kari Briski
NewsNews
Blowing Off Steam: How Power-Flexible AI Factories Can Stabilize the Global Energy Grid
At the half-time whistle of the UEFA EURO 2020 round of 16 football match between England and Germany, millions of viewers stepped away from their screens in the U.K. to do the same thing at the same time — turn on their kettles. National Grid, which provides electricity for England and Wales, saw a demand […]
Mar 25, 2026Josh Parker

Tags

#AI #AI Factory #AI Infrastructure #AI for Good #AWS Trainium #Advanced (300)#Agentic AI #Amazon Bedrock #Amazon Bedrock AgentCore #Amazon Elastic Kubernetes Service #Amazon Machine Learning #Amazon Nova

← PreviousPage 2 of 3Next →

#Amazon SageMaker

#Amazon SageMaker AI

#Amazon SageMaker HyperPod

#Amazon SageMaker JumpStart

#Artificial Intelligence

#Best Practices

#Conversational AI

#Responsible AI