COMPANY
Uber
Overview
Uber is the global ride-hailing and delivery platform whose AI/ML stack handles dispatch, ETA prediction, and matching across billions of trips and deliveries per year. In 2026, Uber became one of the most-cited enterprise reference customers for hyperscaler custom AI silicon, signaling a broader industry shift away from merchant NVIDIA GPUs for the largest production AI workloads.
Timeline
- 2026-04-09-AI-Digest — Uber expanded its AWS contract on April 7, migrating its core Trip Serving Zones (the latency-critical rider–driver matching layer) onto AWS Graviton4 instances and starting a pilot to train AI models on AWS Trainium3, Amazon’s third-generation training accelerator. Uber’s models analyze data from billions of trips and deliveries to handle dispatch, ETA prediction, and recommendation. Notably, this walks back Uber’s 2023 commitment to migrate significant infrastructure to Google Cloud and Oracle — at least for AI workloads — and places Uber alongside Anthropic, OpenAI, and Apple as anchor customers cited by AWS for its custom-chip lineup.
Key Developments
-
Custom Silicon Migration: Uber’s move from merchant GPUs and general-purpose CPU instances to AWS Graviton4 and Trainium3 is one of the most concrete enterprise validations to date that hyperscaler-designed accelerators can serve production-critical AI workloads at scale.
-
Trip Serving on Graviton4: Running the matching layer that pairs riders and drivers in milliseconds on Graviton4 is a meaningful proof point for ARM-based hyperscaler chips in latency-critical workloads.
-
Trainium3 Training Pilot: The decision to begin training AI models on Trainium3 — rather than just inference — pushes the custom-silicon story into territory previously dominated almost exclusively by NVIDIA training clusters.
-
Strategic Walk-Back: Uber’s 2023 plan to lean on Google Cloud and Oracle is being reversed for AI workloads specifically, demonstrating that AI infrastructure spend is now reshaping multi-cloud commitments at the largest enterprise customers.
Related
See also: Amazon, NVIDIA, Anthropic, Apple, OpenAI, MOC - AI Infrastructure, MOC - Major Companies.