MAI-Image-2 multimodal model hits Arena top 3 and rolls out to Copilot and Microsoft Foundry; lineup expands with MAI-Transcribe-1 and MAI-Voice-1 as new foundation models
Key Questions
What is MAI-Image-2 and how does it perform?
MAI-Image-2 is Microsoft's multimodal image model, ranked in the Arena top 3 while generating at roughly 2x speed. It rolls out to Copilot, Bing, PowerPoint, Teams, and Microsoft Foundry, with general availability April 2-5, 2026.
What are the specs for MAI-Transcribe-1?
MAI-Transcribe-1 posts a 3.88% word error rate (WER) across 25 languages, transcribes 2.5x faster, and costs $0.36 per hour of audio. It is one of three new MAI models covering speech, image, and voice, available on Microsoft Foundry and in the Playground.
What is MAI-Voice-1 and its pricing?
MAI-Voice-1 is a text-to-speech model priced at $22 per million characters. It expands Microsoft's in-house AI lineup, a push toward self-sufficiency relative to OpenAI, and is optimized for human-centered use cases.
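The per-unit prices quoted above can be turned into rough workload estimates. A minimal sketch, using only the figures stated in this briefing (actual billing granularity, minimums, and rounding are assumptions, not confirmed):

```python
# Rough cost estimator from the prices quoted in this briefing.
# Billing granularity and rounding behavior are assumptions.
TRANSCRIBE_PER_HOUR = 0.36     # MAI-Transcribe-1: USD per hour of audio
TTS_PER_MILLION_CHARS = 22.0   # MAI-Voice-1: USD per 1M characters
IMAGE_PER_GEN = 0.034          # MAI-Image-2: approximate USD per image

def transcription_cost(audio_hours: float) -> float:
    """Cost to transcribe the given hours of audio."""
    return audio_hours * TRANSCRIBE_PER_HOUR

def tts_cost(characters: int) -> float:
    """Cost to synthesize speech for the given character count."""
    return characters / 1_000_000 * TTS_PER_MILLION_CHARS

def image_cost(images: int) -> float:
    """Approximate cost to generate the given number of images."""
    return images * IMAGE_PER_GEN

# Example workload: 10 hours of audio, 500k characters of speech, 100 images
print(round(transcription_cost(10), 2))  # 3.6
print(round(tts_cost(500_000), 2))       # 11.0
print(round(image_cost(100), 2))         # 3.4
```

At these rates, a mixed workload of that size would run under $20, which is the basis for the "undercut rivals" framing below.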
Where are the new MAI models available?
General availability runs April 2-5, 2026 across Copilot, Bing, PowerPoint, Teams, and Microsoft Foundry (formerly Azure AI Studio). The models compete with mid-size offerings from Google and OpenAI.
Why is Microsoft building its own AI models?
To reduce reliance on OpenAI after revisions to their partnership. The new in-house stack of MAI models serves marketing and enterprise scenarios, with an emphasis on cost-effective, proprietary technology.
How do MAI models undercut rivals?
They are priced below rivals while delivering strong performance, such as the Arena #3 image ranking and fast transcription. The three models target speech-to-text, voice generation, and image generation, and form part of a strategy to operate within compute limits.
What is the rollout timeline for MAI models?
General availability begins April 2-5, 2026 via Microsoft Foundry. The launch includes MAI-Transcribe-1, MAI-Voice-1, and MAI-Image-2, signaling Microsoft's push for AI independence.
How do MAI models integrate with Microsoft products?
They roll out to Copilot, Bing, PowerPoint, Teams, and Microsoft Foundry. Built around human-centered design for transcription, voice, and image generation, they reduce dependency on OpenAI while maintaining the partnership.
Summary: MAI-Image-2 at Arena #3 (2x speed, ~$0.034/image); MAI-Transcribe-1 (3.88% WER, 25 languages); MAI-Voice-1 TTS ($22 per million characters); GA April 2-5, 2026 for Copilot, Bing, PowerPoint, Teams, and Microsoft Foundry; a Suleyman-led in-house multimodal push to reduce OpenAI dependence.