Microsoft Maia 200 Cuts AI Token Costs 30% with 140 Billion Transistors—3nm Chip Deployed January 27 in US Data Centers, Outperforms Amazon Trainium and Google TPU on Inference
Microsoft just made running AI inference 30% cheaper—but only for Microsoft. The Maia 200 chip powers their own products first, and the…