Industry

Microsoft Expands Copilot Cowork With Multi-Model AI Verification — One Model Checks Another's Work

Microsoft's Wave 3 Copilot update introduces autonomous workflow handling across Microsoft 365 and a new dual-model 'Researcher' tool in which one AI drafts and a second AI critiques. The system uses both Anthropic and OpenAI models, and internal benchmarks show it, with Claude Opus 4.6 in the research role, outperforming Perplexity by 7 points on deep research tasks.

D.O.T.S AI Newsroom

AI News Desk


Microsoft has rolled out the third wave of its Microsoft 365 Copilot expansion, broadening the availability of Copilot Cowork — a feature that enables AI systems to autonomously handle multi-step workflows across the Microsoft 365 suite — and introducing a new dual-model verification approach to agentic research tasks.

The most technically notable addition in Wave 3 is the Researcher tool, which implements a critique function: one AI model drafts a response, and a separate AI model reviews and challenges it. Microsoft's implementation routes both Anthropic and OpenAI models through the same workflow, allowing the system to draw on different model strengths at different stages of a task. Internal benchmarks published alongside the announcement show the system, with Claude Opus 4.6 in the research role, scoring 7 points above Perplexity on deep research evaluation tasks.
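To make the draft-and-critique pattern concrete, here is a minimal sketch in Python. It is not Microsoft's implementation: the `Model` type, the prompt wording, the stub models, and the final revision step are all assumptions made for illustration, and no real vendor API is called.

```python
from dataclasses import dataclass
from typing import Callable

# Hypothetical abstraction: a "model" is any function from prompt text to response text.
Model = Callable[[str], str]


@dataclass
class ReviewedAnswer:
    draft: str
    critique: str
    final: str


def draft_then_critique(question: str, drafter: Model, critic: Model) -> ReviewedAnswer:
    """One model drafts, a second model critiques, then the drafter revises.

    The revision step is an assumption; the article only describes drafting
    and critiquing.
    """
    draft = drafter(f"Answer the question:\n{question}")
    critique = critic(
        f"Question:\n{question}\n\nDraft:\n{draft}\n\nList errors or gaps."
    )
    final = drafter(
        f"Question:\n{question}\n\nDraft:\n{draft}\n\n"
        f"Critique:\n{critique}\n\nRevise the draft."
    )
    return ReviewedAnswer(draft=draft, critique=critique, final=final)


# Stand-in models so the sketch runs without any network access.
def stub_drafter(prompt: str) -> str:
    return "revised draft" if "Critique:" in prompt else "first draft"


def stub_critic(prompt: str) -> str:
    return "cite a source for claim 2"


result = draft_then_critique("What changed in Wave 3?", stub_drafter, stub_critic)
print(result)
```

Because the two roles are plain callables, the drafter and critic could be backed by different vendors' models, which is the property the article attributes to the Researcher tool.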

What Cowork Actually Does

Cowork handles what Microsoft describes as "complete workflows" — not single responses, but sequences of actions involving file access, calendar management, document generation, and cross-application coordination. In practice, a Cowork task might involve pulling data from a SharePoint folder, drafting a briefing document, scheduling a review meeting, and sending a summary to a distribution list — all initiated by a single natural language prompt.

Wave 3 also introduces a Model Council feature, which surfaces responses from multiple AI models side by side, allowing users to see where the models agree and where they diverge before acting on AI-generated output. The feature is positioned as a trust mechanism for high-stakes decisions where users want to stress-test AI conclusions across multiple systems simultaneously.
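The side-by-side comparison can be sketched as grouping models by the answer they give. This is an illustrative assumption, not a description of how Model Council works internally: the `council` function, the model names, and the normalized exact-match comparison are hypothetical, and a real system would likely need a semantic comparison instead of string matching.

```python
from collections import defaultdict
from typing import Callable, Dict, List

# Hypothetical abstraction: a "model" is any function from prompt text to response text.
Model = Callable[[str], str]


def council(prompt: str, models: Dict[str, Model]) -> Dict[str, List[str]]:
    """Group model names by their normalized answer to the same prompt."""
    groups: Dict[str, List[str]] = defaultdict(list)
    for name, model in models.items():
        # Crude normalization: lowercase and collapse whitespace.
        answer = " ".join(model(prompt).lower().split())
        groups[answer].append(name)
    return dict(groups)


def is_unanimous(groups: Dict[str, List[str]]) -> bool:
    """True when every model produced the same normalized answer."""
    return len(groups) == 1


# Stand-in models so the sketch runs without any network access.
models: Dict[str, Model] = {
    "model_a": lambda p: "Approve the budget",
    "model_b": lambda p: "approve  the budget",
    "model_c": lambda p: "Delay the decision",
}

groups = council("Should we approve the Q3 budget?", models)
for answer, names in groups.items():
    print(f"{answer!r}: {names}")
print("unanimous:", is_unanimous(groups))
```

A split result like this one is the case the feature is meant to surface: two models agree and one dissents, so the user can inspect the disagreement before acting.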

The Competitive Framing

Microsoft's benchmark claims come with a notable gap: the comparison does not include OpenAI's GPT-5-based Deep Research tool, which launched after the Wave 3 evaluation was conducted. That omission limits the utility of the published performance data. The agentic research market is moving fast enough that benchmarks even a few months old may not reflect the current competitive landscape.

The broader signal from Wave 3 is that Microsoft is now treating multi-model orchestration (routing different AI systems to different sub-tasks within a single workflow) as a product differentiator rather than an implementation detail. This represents a meaningful shift from the earlier Copilot architecture, which defaulted to a single underlying model. Whether enterprise users will accept the complexity that multi-model systems introduce or fall back on single-model simplicity remains an open question.

Wave 3 features are available through Microsoft's Frontier program, which provides early access to Copilot capabilities ahead of general availability.
