One of the biggest problems with AI at work is not that it writes badly. It is that it can sound polished and confident while still missing something important. A summary can look tidy, a briefing can feel persuasive, and a research note can appear complete right up until a human spots the gap.
That is why Microsoft’s latest update to Researcher in Microsoft 365 Copilot is interesting. Instead of asking one model to do the whole job alone, Microsoft says its deep-research tool can now use more than one AI model in two ways: Critique, where one model produces a draft and another reviews it, and Council, where multiple models produce separate reports side by side so you can compare where they agree and where they do not.
For ordinary UK workers, that is a more sensible direction than the usual “just trust the bot” pitch. If AI is going to help with background research, policy notes or project prep, having a second model challenge the first is better than one system marking its own homework. Even so, this is not truth on tap, and many UK readers may not see it soon.
What Microsoft is actually changing
According to Microsoft’s announcement, Researcher is the company’s deep-research agent inside Microsoft 365 Copilot. The new Critique mode separates generation from review: one model plans the task, gathers material and writes the first report, while a second model checks source reliability, completeness, evidence grounding and structure before the final version appears.
The second new option, Council, takes a slightly different approach. Rather than refining a single report, it runs multiple models in parallel and shows their reports side by side, with an extra summary explaining where the answers line up and where they diverge. In theory, that gives the user a clearer sense of whether the AI is pointing in one direction confidently or whether the same question produces materially different interpretations.
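For readers curious what that looks like under the hood, here is a minimal sketch of the two patterns in Python. To be clear, this is not Microsoft’s code: the model names and the call_model() stub are placeholders for whatever API a developer would actually wire in. It simply shows the shape of the idea: Critique chains a drafting call into a reviewing call, while Council fans the same question out to several models and adds a comparison step.

```python
# A minimal sketch of the two patterns, assuming each model is reachable
# through a single text-in, text-out call. call_model() is a placeholder
# stub; a real version would call an actual LLM API.

def call_model(model_name: str, prompt: str) -> str:
    """Stand-in for a real model call; returns canned text so the sketch runs."""
    return f"[{model_name}] response to: {prompt[:50]}..."


def critique(question: str) -> str:
    """Critique pattern: one model drafts the report, a second model reviews it."""
    draft = call_model("drafting-model", f"Write a research report on: {question}")
    return call_model(
        "review-model",
        "Check this draft for source reliability, completeness, evidence "
        f"grounding and structure, then return an improved version:\n\n{draft}",
    )


def council(question: str, models: list[str]) -> dict[str, str]:
    """Council pattern: each model answers the same question separately;
    a final call summarises where the reports agree and diverge."""
    reports = {
        m: call_model(m, f"Write a research report on: {question}") for m in models
    }
    joined = "\n\n".join(f"--- {m} ---\n{text}" for m, text in reports.items())
    reports["agreement summary"] = call_model(
        "summary-model",
        f"Compare these reports and note agreements and divergences:\n\n{joined}",
    )
    return reports


if __name__ == "__main__":
    print(critique("new UK flexible-working guidance"))
    for name, text in council("new UK flexible-working guidance",
                              ["model-a", "model-b"]).items():
        print(f"\n{name}:\n{text}")
```

Whatever the real implementation looks like, the design choice both modes share is the interesting part: the model that writes is never the only model that judges.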
That sounds helpful because it reflects a real problem with AI at work. People are not only asking for first drafts now. They are using these tools to compare products, scan long documents, pull out risks, and prepare briefing notes quickly. As we wrote in our earlier piece on making AI more useful at work, the best use cases usually start with a real bottleneck. Research and synthesis sit squarely in that category.
Why this is appealing in real life
If you have ever used AI to get up to speed on a topic before a meeting, you will probably recognise the appeal straight away. A tool that gives you one neat answer can save time, but it can also give false confidence. A tool that shows its working more clearly, highlights disagreements and tightens the sourcing is at least moving in a grown-up direction.
A small team might use this to compare software suppliers, summarise new guidance, or pull together the basics before a client conversation. The value is not that AI magically knows the truth. It is that it helps a person organise a messy pile of information faster.
That is also why Microsoft’s framing matters. It is saying evaluation deserves as much weight as generation. That is useful, because one of the most common workplace AI failures is letting a tidy draft arrive without anyone questioning it.
Why UK workers should keep their expectations in check
There are two big catches. The first is access. Microsoft says these features currently sit in its Frontier programme, the early-access space for preview AI features. Microsoft Support says Frontier features are experimental and that many initially roll out in English to U.S.-based subscribers. So if you are in the UK and cannot see this yet, that is not surprising.
The second catch is that multi-model does not mean mistake-proof. Two systems can still rely on weak source material, repeat the same bad assumption, or overlook local context. A polished summary of U.S. guidance is still not much use if your actual question is about a UK policy, regulator or workplace rule.
There is also the usual permissions issue. If a research agent is pulling from your emails, files, meeting notes or internal documents, you still need to know what it can access and whether that level of access is appropriate. As we noted in our piece on checking AI tools before giving them more access, a smarter-looking interface is not the same thing as a smaller privacy risk.
What to check before trusting the result
If this kind of feature lands in your workplace, the sensible approach is still fairly boring:
- Check the sources. Do not stop at the smooth summary. Open the citations and make sure they really support the claim.
- Check the geography. If the answer sounds legal, HR-related or compliance-heavy, make sure it is drawing on UK-relevant material rather than default U.S. examples.
- Check the permissions. Understand what work data the tool can see before you start feeding it sensitive tasks.
- Check the disagreements. If the models diverge, that is useful information in itself, not something to hide.
- Check who signs off. AI research can help you prepare, but someone human still needs to own the final decision.
The calm takeaway
Microsoft’s update is worth noticing because it points toward a more realistic version of workplace AI. Instead of pretending one chatbot can do flawless research in isolation, it builds in challenge, comparison and review.
But it is still a work assistant, not an authority. For UK readers, the practical message is simple: if your employer already pays for Microsoft 365 Copilot and eventually gets these Frontier features, they may be useful for messy research tasks. Just do not confuse “two AIs looked at it” with “this has definitely been checked properly”. A second opinion helps. It is not the end of the job.
Sources:
Microsoft Tech Community — Introducing multi-model intelligence in Researcher
Microsoft Support — What is Frontier?
Engadget — Microsoft’s research assistant can now use multiple AI models simultaneously
