M365 Copilot + GPT-5 = big improvement

less than 1 minute read

Have you tried M365 Copilot lately? It has gotten seriously good.

Click on the “Try GPT-5” button on the top right, and you’ll get what seems to be the same features and models as in ChatGPT Teams, with autorouting to fast or reasoning models.

Click on the “Researcher” agent on the left sidebar, and you’ll get what feels like Deep Research mode on ChatGPT or Claude – it will create a research plan, then go away and search the web for 10 minutes and synthesize the results.

Notebooks lets you create a mini-RAG like Google NotebookLM – drop in files from OneDrive or your desktop and it will use those specifically to answer questions.

In the old version of Copilot especially the Work tab seemed underpowered, using some old model like GPT-3.5. The Work tab also uses GPT-5 now, so you can use the most current model with work documents, emails, chats, transcripts, notebooks, …

I dismissed Copilot as “ChatGPT Lite” in the past, but since the update I’ve switched to it as my daily driver, it’s that useful. Give it a try if you haven’t used it in a while.

Piloting Claude for Chrome 🔗

less than 1 minute read

I’m not sure if we’re ready for agentic browser control. Yes, you can click each time to accept the risk, but how many of us read the T&Cs before we click accept?

Their 123 adversarial prompt injection test cases saw a 23.6% attack success rate when operating in “autonomous mode”. They added mitigations:

When we added safety mitigations to autonomous mode, we reduced the attack success rate of 23.6% to 11.2%

I would argue that 11.2% is still a catastrophic failure rate. In the absence of 100% reliable protection I have trouble imagining a world in which it’s a good idea to unleash this pattern.

Chinese universities want students to use more AI, not less 🔗

less than 1 minute read

While many educators in the West see AI as a threat they have to manage, more Chinese classrooms are treating it as a skill to be mastered. In fact, as the Chinese-developed model DeepSeek gains in popularity globally, people increasingly see it as a source of national pride. The conversation in Chinese universities has gradually shifted from worrying about the implications for academic integrity to encouraging literacy, productivity, and staying ahead.

FDA’s artificial intelligence is supposed to revolutionize drug approvals. It’s making up studies 🔗

less than 1 minute read

The FDA’s head of AI, Jeremy Walsh, admitted that Elsa can hallucinate nonexistent studies.

“Elsa is no different from lots of [large language models] and generative AI,” he told CNN. “They could potentially hallucinate.”

Sounds like a need for some kind of tool to verify that studies are at least real, if not accurately represented, in the answers government scientists are using to make critical decisions.

Markitdown 🔗

less than 1 minute read

This looks like a handy package for converting documents (PDF, .docx, .pptx, and more) to .md. There’s also a MCP server so you can use it with your LLM.

To install as a uv tool:

uv tool install markitdown --with 'markitdown[all]'