The Tech Roundup

Anthropic●Covered by 6 sources

Claude now writes 80% of Anthropic's code — and Anthropic says that's the scary part

Anthropic published research reporting that Claude authored over 80% of the production code merged at the company in May 2026, with the average engineer shipping 8x more code per day than in 2024 — and Claude's success rate on internal engineering tasks reportedly jumped from 26% to 76% in six months. The company frames this as an early sign of recursive self-improvement: AI helping build the next AI, fast enough that it could arrive before institutions are ready. The pitch comes with a pledge to slow or pause frontier development — but only if rival labs agree to do the same.

Why it matters

The convenient thing about a safety warning that doubles as a flex about your own product is that it can't lose: either Claude is dangerously capable, or it's an impressive marketing line, and Anthropic gets credit either way. Worth noting the alarming numbers are all Anthropic's own and self-reported, and the proposed pause hinges on competitors who have shown zero interest in slowing down — which makes it less a brake and more a press release.

Source: Anthropic

Google DeepMind●Covered by 6 sources

Google's Gemma 4 12B runs text, vision, and audio on a 16GB laptop — no cloud required

Google DeepMind released Gemma 4 12B, an Apache 2.0-licensed multimodal model that handles text, vision, and audio and runs locally on a standard 16GB laptop (about 8GB with quantization), with a 256K context window and tool calling. The trick is an encoder-free design that folds vision and audio straight into the LLM backbone, which Google says makes it nearly as capable as the bigger Gemma 4 26B MoE — and it's the first Gemma of this size to do native audio. Via Google AI Edge, it analyzes data and generates scripts on-device, so your files never leave the machine.

Why it matters

"Open weights you can actually run on the laptop you already own" is the part that matters here — no API bill, no upload, no someone-else's-server holding your documents. For anyone who balked at sending personal or work data to the cloud just to use AI, the privacy math finally tilts the other way: the model comes to your data instead of the reverse.

Source: Google DeepMind

OpenAI●Covered by 6 sources

OpenAI's Codex hits 5M weekly users — and non-coders are the fastest-growing crowd

Codex crossed 5 million weekly users, and OpenAI is rebuilding it from a developer tool into a general-purpose work platform: 110 skills across 62 apps, six role-specific plug-ins (sales, analytics, creative, product design, equity investing, investment banking), plus a new Sites feature that spins prompts into hosted, live-data web apps via partners like Figma and Wix. The tell is in the usage split — non-developers are now about 20% of users and growing 3x faster than the coders who got there first.

Why it matters

OpenAI started by automating the engineers; the growth curve says the spreadsheet-and-slides crowd is next, not safely behind them. When the fastest-growing users of a coding tool can't code, the company has quietly stopped building a dev product and started building a replacement for the desk job.

Source: OpenAI

OpenAI●Covered by 5 sources

ChatGPT now 'dreams' about your old chats — and doubles down on remembering you, doubling factual recall to 82.8%

OpenAI is rolling out a new ChatGPT memory system, internally dubbed "Dreaming," that processes your past conversations in the background to build a running profile sorted into categories like travel and hobbies. By OpenAI's own numbers, it lifts factual recall from 41.5% to 82.8% and preference-following from 31.4% to 71.3%, while reportedly cutting backend compute costs by 5x and doubling memory capacity. It's live for Plus and Pro users in the US, with free tiers and broader availability promised in the coming weeks — and yes, there's a reviewable summary so you can see and edit what it's hoarding.

Why it matters

"It dreams about your chats to build a dossier on you" is a sentence that sounds adorable until you read it twice — the whole point is that ChatGPT now quietly knows more about you across every session. The reviewable summary is the one genuinely useful bit here: if a tool is keeping a permanent file on your travel plans and hobbies, you want the door to walk in and delete things, and for once you get it.

Source: OpenAI

Ideogram●Covered by 5 sources

Ideogram 4.0 ships open-weight, fits on one 24GB GPU, and ranks #1 among open models

Ideogram released Ideogram 4.0 as an open-weight text-to-image model — a 9.3B Diffusion Transformer trained from scratch with a frozen 8B VLM text encoder, shipped in fp8 and nf4 checkpoints where the nf4 variant squeezes onto a single 24GB GPU. It landed #8 overall and #1 among open models on Arena, with native 2K rendering, multilingual text, and a structured JSON prompting interface for explicit layout control. It dropped the same day as Reve 2.0, which leans into editing specific image regions and rewriting layouts instead of regenerating the whole thing — both bets on steering, not re-rolling.

Why it matters

Prompt-roulette is the reason most people bounced off AI images — you can't art-direct a slot machine. An open-weight model strong at text and typography that runs on one consumer-grade GPU means making a flyer, a slide, or a birthday invite stops being a cloud subscription and a prayer, and starts being something you actually control on your own machine.

Source: Ideogram

Perplexity●Covered by 5 sources

Perplexity's new trick: keep the easy stuff on your laptop, ship the hard stuff to the cloud

Perplexity unveiled "hybrid agentic inference" for its Personal Computer agent (a Windows on-device app-and-file orchestrator), splitting work between a compact model running locally and frontier models in the cloud — lightweight tasks stay on your device, complex reasoning gets sent off. The pitch is better privacy and fewer wasted tokens, and it's paired with a shift from old-school API orchestration to a "Search as Code" approach. It was shown at Computex 2026, so file the privacy claims under "announced, not yet independently poked at."

Why it matters

The whole idea is that your private files never leave your machine for the routine stuff — which is genuinely the thing people are nervous about when an AI agent gets the keys to their computer. If the routing actually works as advertised, it's a real answer to "why would I let this thing rummage through my files"; if it doesn't, it's just your data taking a longer trip to the cloud.

Source: Perplexity

Microsoft●Covered by 4 sources

Microsoft's new Surface packs 1 PFLOP of AI compute so the model never leaves your laptop

At Build 2026 Microsoft unveiled the Surface Laptop Ultra — up to 1 PFLOP of AI compute, 128GB unified memory, and an RTX GPU — plus a Surface RTX Spark Dev Box built to run models fully offline. The bigger story is the software stack around it: Microsoft's first in-house reasoning model MAI-Thinking-1, localized Aion 1.0 small models for Windows, a Scout assistant for Microsoft 365, and kernel-level Execution Containers to sandbox AI agents at the OS level. The whole pitch is an "agent-first" Windows where the AI runs on your machine, not someone's cloud.

Why it matters

"On-device" is the quiet privacy upgrade buried under the spec sheet — if the model runs locally, your files and prompts don't have to take a round trip through a data center to get answered. The catch is the same sentence read differently: Microsoft is now building agents straight into the OS with kernel-level access, which is great until you ask what an agent with kernel-level access can do when it misbehaves.

Source: Microsoft

Microsoft●Covered by 4 sources

Microsoft just shipped seven of its own AI models — and the message to OpenAI is in the homework

Microsoft dropped seven in-house MAI models at Build, headlined by MAI-Thinking-1: a 35B active-parameter MoE reasoner with a 256K context window, pre-trained on 30T tokens across 8,192 GB200 GPUs, hitting 97% on AIME 2025 and 53% on SWE-Bench Pro. The flashier-than-usual flex is a 109-page technical report disclosing the full pipeline — and Microsoft's claim of zero third-party distillation and zero synthetic data, a pointed jab given how many rivals quietly train on each other. The suite also includes MAI-Code-1-Flash (5B params, 51% on SWE-Bench Pro, baked into GitHub Copilot), MAI-Image-2.5 (#2 on editing leaderboards), and a 43-language transcription model, all said to run 30% cheaper per dollar on Microsoft's own MAIA 200 chip than on Nvidia's GB200.

Why it matters

Microsoft owns the biggest stake in OpenAI and is now building the whole stack itself — models, tuning, and silicon — which is corporate-speak for "we'd like to stop paying our partner." If it works, the AI baked into your Excel, Outlook, and Copilot starts getting cheaper to run, and that cost cut is the kind of thing that eventually shows up on the bill for everyone downstream.

Source: Microsoft

Trump Administration●Covered by 4 sources

Trump's AI order shrinks the review to 30 days — and makes the whole thing optional

President Trump signed an AI security executive order that asks labs to voluntarily hand over frontier models for a government safety review 30 days before release — down from the 90-day window that was previously expected. The order sets up classified cyber benchmarks and federal support for AI vulnerability detection, but explicitly rules out mandatory licensing or permits for new models. So: shorter window, no teeth, and "please" doing the heavy lifting.

Why it matters

"Voluntary" plus "30 days" is the regulatory equivalent of a suggestion box bolted to the side of a moving train — the labs racing fastest are exactly the ones least likely to slow down for an optional checkup. For everyone downstream of these models, the safety net here is whatever the companies feel like volunteering, which is a thin thing to stand under.

Source: Trump Administration

github.blog●Covered by 4 sources

GitHub's new Copilot desktop app wants you running a swarm of AI agents at once

GitHub shipped a Copilot desktop app pitched as the "agent-native" home for coding, built to orchestrate multiple AI agents in parallel using git worktrees so they don't trip over each other. It throws in interactive canvases to visualize the workflows, cross-device continuity, and a model menu that pulls from OpenAI, Anthropic, and Google rather than locking you to one. The framing — straight from Microsoft-owned GitHub — is that the agents, not the human, are now the main thing the interface is built around.

Why it matters

The quiet shift here is in the word "orchestrating": the developer is being repositioned from the person writing the code to the person managing a crew of bots that do. If parallel agents become the normal way software gets built, the skill that pays stops being typing fast and starts being herding machines — which is a different job than the one a lot of programmers signed up for.

Source: github.blog

Anthropic●Covered by 4 sources

Anthropic's AI bug-hunter has found 10,000+ critical flaws — now it's going after the power grid

Anthropic is expanding Project Glasswing, its AI vulnerability-hunting program, to 150 more organizations across 15+ countries — including Apple, Nvidia, Microsoft, CrowdStrike, and Palo Alto Networks — and pointing it at critical infrastructure: power, water, healthcare, communications, and hardware. Partners have already turned up more than 10,000 high- or critical-severity flaws since launch, now backed by access to Anthropic's Claude Mythos model. Analysts are already raising a hand: all those findings have to be validated and patched by vendor and SOC teams that may not have the bandwidth.

Why it matters

Finding 10,000 holes is only the good news if someone actually plugs them — and the same maintenance teams who let two-year-old bugs rot are the ones now drinking from this firehose. When the AI points at the water plant and the grid, the bottleneck stops being discovery and becomes whether anyone downstream can keep up, which is exactly the boring gap real attacks slip through.

Source: Anthropic

Google●Covered by 4 sources

Google's Dreambeans reads your Gmail, Photos and Calendar — and you have to pay for the privilege

Google Labs shipped Dreambeans on iOS and Android, an experimental app that connects to your Gmail, Calendar, Photos, and Search and uses "Personal Intelligence" to spin your data into a feed of daily AI-illustrated stories — each with a beginning, middle, and end, pitched as an antidote to endless scrolling. The catch: it's locked behind a Google AI Ultra subscription, so the thing scanning your entire Google footprint is also a paid product.

Why it matters

Strip off the cozy storybook framing and Dreambeans' core feature is *ingesting everything you've emailed, photographed, and scheduled* and narrating it back to you — and you're paying for it. The illustrations are the spoonful of sugar; "let an AI read your whole life" is what you actually clicked allow on.

Source: Google

NVIDIA●Covered by 3 sources

Nvidia drops a fully open 550B model and a 1M-token memory to match — agents are the whole point

Nvidia released Nemotron 3 Ultra, a 550B-parameter mixture-of-experts model (with 55B active params and a 1M-token context window) under the OpenMDW 1.1 license — and it's open all the way down: weights, synthetic data, and training recipes. Nvidia's own numbers claim 5x faster inference and up to 30% lower cost on long-running agentic tasks, with quality it pegs as roughly even with the top open rivals. It also rolled out a "Nemotron Coalition" of partners like Nous and Prime Intellect, and Perplexity already wired it in for Pro/Max users running long agents.

Why it matters

The headline isn't the size, it's the giveaway — Nvidia handing out the recipe for free while it sells the chips everyone needs to run it is the company quietly making sure the next AI boom happens on its hardware no matter who wins. For regular users, an open model built to run cheaper agents over a million tokens means the assistants that actually finish multi-step jobs get less likely to be locked behind one closed lab's pricing.

Source: NVIDIA

the-decoder.com●1 source · panel-picked

xAI reportedly trained its coding models on Claude — and kept going after Anthropic pulled the plug

Per The Decoder, Elon Musk's xAI leaned on Anthropic's Claude to train its own coding models for months, and reportedly didn't stop when Anthropic cut off access — instead routing around the block via private accounts and the Blackbox AI service. The internals look rough: xAI's pretraining team has reportedly shrunk to fewer than five people, several leads have walked, and the compute Musk hoarded is now being rented out to Anthropic and Google rather than running his own models. All of this is reporting we can't independently verify, so file it accordingly.

Why it matters

The whole pitch of a frontier AI lab is that it builds its own brains — so a rival's model quietly doing the homework, plus a gutted pretraining team, suggests the gap between Musk's hype and Musk's models is wider than the launch events let on. If you're picking which AI to trust with your code, it's worth knowing whose intelligence is actually under the hood.

Source: the-decoder.com

cnbc.com●1 source · panel-picked

Morgan Stanley is letting AI agents from thousands of companies into your stock platform

Morgan Stanley plans to open its wealth management platforms — including ShareWorks and Equity Edge — to AI agents deployed by thousands of corporations. That means software acting on a company's behalf will be able to reach into the systems that hold employee equity and stock-plan data, not just human reps clicking through dashboards. The bank hasn't detailed what guardrails sit between those agents and the underlying account data.

Why it matters

If you've got vested shares or an employee stock plan, this is the part of your financial life people forget about — and now autonomous software gets a key to it. The convenience pitch and the new attack surface are, as usual, the exact same sentence, so the question worth asking is who's accountable when an agent does something you didn't authorize.

Source: cnbc.com

The Tech Roundup

Edition

Top Stories

Claude now writes 80% of Anthropic's code — and Anthropic says that's the scary part

Google's Gemma 4 12B runs text, vision, and audio on a 16GB laptop — no cloud required

OpenAI's Codex hits 5M weekly users — and non-coders are the fastest-growing crowd

ChatGPT now 'dreams' about your old chats — and doubles down on remembering you, doubling factual recall to 82.8%

Ideogram 4.0 ships open-weight, fits on one 24GB GPU, and ranks #1 among open models

Perplexity's new trick: keep the easy stuff on your laptop, ship the hard stuff to the cloud

Microsoft's new Surface packs 1 PFLOP of AI compute so the model never leaves your laptop

Microsoft just shipped seven of its own AI models — and the message to OpenAI is in the homework

Trump's AI order shrinks the review to 30 days — and makes the whole thing optional

GitHub's new Copilot desktop app wants you running a swarm of AI agents at once

Anthropic's AI bug-hunter has found 10,000+ critical flaws — now it's going after the power grid

Google's Dreambeans reads your Gmail, Photos and Calendar — and you have to pay for the privilege

Nvidia drops a fully open 550B model and a 1M-token memory to match — agents are the whole point

xAI reportedly trained its coding models on Claude — and kept going after Anthropic pulled the plug

Morgan Stanley is letting AI agents from thousands of companies into your stock platform

Under the Radar

Researchers chained a Zapier sandbox escape all the way to NPM publish tokens

A debug flag Microsoft forgot to flip let any Android app raid your 365 account — with 15 lines of code

Workday drops $1.1B on an AI "interface layer" because the software it sells is becoming the thing AI talks to