Thursday, April 9, 2026
ASX 200: 8,412 +0.43% | AUD/USD: 0.638 | RBA: 4.10% | BTC: $87.2K
← Back to home
Tech

Microsoft Launches Three In-House AI Models in Challenge to OpenAI

Microsoft launched three in-house AI models on April 2, MAI-Transcribe-1 for speech-to-text, MAI-Voice-1 for expressive voice synthesis, and MAI-Image-2 for image generation, available through Microsoft Foundry. The release is less about the models themselves and more about what they mean for the co

5 min read
Modern data centre server room with illuminated racks, enterprise AI infrastructure
Microsoft's MAI models are available through Microsoft Foundry, the company's enterprise platform for deploying AI capabilities.
Editor
Apr 8, 2026 · 5 min read
By Alex Mercer · 2026-04-07

TLDR

Microsoft launched three in-house AI models on April 2, MAI-Transcribe-1 for speech-to-text, MAI-Voice-1 for expressive voice synthesis, and MAI-Image-2 for image generation, available through Microsoft Foundry. The release is less about the models themselves and more about what they mean for the company's relationship with OpenAI.

KEY TAKEAWAYS

01Microsoft announced MAI-Transcribe-1, MAI-Voice-1, and MAI-Image-2 on April 2, 2026, available in Microsoft Foundry
02MAI-Transcribe-1 now powers Copilot's Voice Mode transcriptions, replacing the OpenAI-provided transcription layer
03MAI-Voice-1 is described as expressive, meaning it handles prosody and emphasis rather than just text-to-speech conversion
04MAI-Image-2 competes directly with OpenAI's image generation and Google's Imagen models
05The release represents Microsoft's clearest public signal yet that it is building AI capabilities independent of its OpenAI investment

On April 2, Mustafa Suleyman, CEO of Microsoft AI, posted on X: "Today we're announcing 3 new world class MAI models, available in Foundry." The three models are MAI-Transcribe-1 for speech-to-text, MAI-Voice-1 for voice synthesis, and MAI-Image-2 for image generation. All three are available through Microsoft Foundry, the company's enterprise developer platform.

Suleyman described the approach behind the models in a blog post: "At Microsoft AI, we're building Humanist AI. We have a distinct view when creating our AI models, putting humans at the center, optimizing for how people actually communicate, training for practical use." He told VentureBeat that MAI-Transcribe-1 delivers transcription capability at "half the GPUs of the state-of-the-art competition."

MAI-Transcribe-1 supports transcription across 25 languages and runs at 2.5 times the speed of Microsoft's existing Azure Fast transcription service. It now powers Copilot's Voice Mode, replacing the previous transcription layer. At launch it supports batch transcription of pre-recorded audio rather than real-time streaming. MAI-Voice-1 generates 60 seconds of audio per second of processing time and supports custom voice creation. MAI-Image-2 ranks in the top three on the Arena.ai image generation leaderboard and is rolling out across Bing and PowerPoint.

The OpenAI Relationship, Reread

Microsoft invested $13 billion into OpenAI across multiple rounds. In return it got the right to deploy OpenAI models through Azure and a share of profits above a cap. That arrangement looked straightforward when GPT-4 was the dominant model. It is more complicated now that Google's Gemini models are strong, Anthropic's Claude is a serious enterprise option, and Microsoft is shipping models it built in-house.

TechCrunch described the release as Microsoft "taking on AI rivals." The MAI prefix is explicit branding. MAI stands for Microsoft AI, and labelling these models under that brand rather than integrating them invisibly into Copilot is a positioning decision. The MAI Superintelligence team, which built these models, was formed and announced in November 2025.

The OpenAI partnership remains active. OpenAI models are still available through Azure and the commercial relationship generates revenue for both companies. The MAI launch makes explicit what has been building for months: Microsoft wants to be a model developer, not only a distribution platform for others.

What This Changes for Developers

Developers building on Microsoft Foundry now have Microsoft's own models as a native option alongside OpenAI and third-party alternatives. The practical effect is optionality: an enterprise can run a Microsoft AI stack end-to-end without touching OpenAI's API. Latency, pricing, and quality in production conditions will determine how many do.

GeekWire said the models represent Microsoft's clearest move yet to expand beyond its reliance on OpenAI's technology. The batch-only constraint on MAI-Transcribe-1 at launch is a genuine limitation for real-time applications, but the model's speed and language coverage make it competitive for the document and media processing workloads that account for most enterprise transcription volume.

The Bigger Picture

OpenAI's $852 billion valuation is built on ChatGPT's consumer reach and API revenue. A large portion of that API revenue flows through enterprise agreements, the same enterprises Microsoft is now offering native alternatives to. Enterprise AI procurement cycles run 18 months. The MAI models will not displace OpenAI overnight. But the negotiating dynamic between Microsoft, OpenAI, and enterprise buyers has changed with the announcement.

FREQUENTLY ASKED QUESTIONS

What are Microsoft's MAI models?
MAI-Transcribe-1 is a speech-to-text model now powering Copilot's Voice Mode. MAI-Voice-1 is an expressive voice synthesis model. MAI-Image-2 is an image generation model. All three were announced April 2, 2026, and are available through Microsoft Foundry.
Does this mean Microsoft is ending its relationship with OpenAI?
No. The OpenAI partnership remains active and OpenAI models are still available through Azure. The MAI launch signals that Microsoft is building parallel in-house capability, not that it's abandoning the OpenAI relationship.
What is Microsoft Foundry?
Microsoft Foundry is Microsoft's enterprise developer platform for deploying AI capabilities. It hosts both Microsoft's own MAI models and third-party models including OpenAI's GPT series.
Are Microsoft's MAI models available to external developers?
Microsoft has made the MAI models available through its Azure AI Foundry platform to enterprise customers and selected developers. Unlike some competitor releases, Microsoft has not published weights or open-sourced the models.
Editor

Editor

The Bushletter editorial team. Independent business journalism covering markets, technology, policy, and culture.
What's your reaction?