Claude Mythos 🛡️, GLM-5.1 🤖, warp decode ⚡

From:

TLDR AI <dan@tldrnewsletter.com>

To:

Hidden Recipient <hidden@emailshot.io>

Date:

4/8/2026, 1:39 PM

TLDR

Together With

TLDR AI 2026-04-08

Close the sim-to-real gap: Your guide to advancing physical AI (Sponsor)

Physical AI and robotics are the next frontier, but complex real-world variables and specialized multimodal experimentation make building embodied systems challenging.
Weights & Biases' new guide, Advancing physical AI: From learning to embodied intelligence, offers the strategies you need to optimize your physical AI workflow, including:

Why a full-stack solution is essential to design, test, and deploy physical AI systems
How to train and fine-tune multimodal models
Techniques to close the sim-to-real gap through real-time iteration
Tools to govern your physical AI systems

Get the guide to Advancing physical AI.

🚀

Headlines & Launches

Project Glasswing: Securing critical software for the AI era (10 minute read)

Anthropic's Claude Mythos Preview autonomously identified thousands of zero-day vulnerabilities across major operating systems and browsers. Project Glasswing, in partnership with major tech companies, uses these capabilities to enhance cybersecurity by detecting and fixing vulnerabilities at scale. Anthropic plans to develop safeguards and broaden industry cooperation to address security challenges in the AI era.

GLM-5.1: Towards Long-Horizon Tasks (14 minute read)

GLM-5.1 is a flagship model for agentic engineering created by Z.ai. It achieves state-of-the-art performance on SWE-Bench Pro. The model is built to stay effective on agent tasks over much longer horizons than previous generations. It can sustain optimization over hundreds of rounds and thousands of tool calls. The model breaks complex problems down, runs experiments, reads results, and identifies blockers with real precision.

🧠

Deep Dives & Analysis

My picture of the present in AI (11 minute read)

Ryan Greenblatt is the chief scientist at Redwood Research, a research organization with the mission of aligning superhuman AI. This post goes through some of his best guesses for the current situation of AI. The scenario forecast discusses R&D access regulations, engineering capabilities and qualitative abilities, misalignment and misalignment-related properties, cyber, bioweapons, and economic effects. Some of the claims are highly speculative, while others are better grounded.

Claude Mythos (31 minute read)

Anthropic detailed early evaluations of Claude Mythos Preview, highlighting strong performance in discovering zero-day vulnerabilities and reverse-engineering exploits, prompting a coordinated security initiative called Project Glasswing.

AI Can't Read an Investor Deck (6 minute read)

Current AI models struggle with interpreting complex financial documents, especially with visual data extraction. GPT-5.4, Gemini 3.1 Pro, and Claude Opus 4.6 consistently falter when processing dense charts and images, only achieving 56% to 64% accuracy compared to 72% to 80% with text-only inputs. These findings highlight significant gaps in AI's ability to perform real-world financial reasoning tasks, making the displacement of financial analysts with AI appear premature.

🧑‍💻

Engineering & Research

AI inference conference in SF + $5K in credits if you attend (Sponsor)

DigitalOcean Deploy is April 28 in SF. One day of technical deep dives on production inference infrastructure, from serverless to dedicated GPUs. Qualifying in-person attendees can receive up to $5,000 in inference credits*. Free to attend, limited spots. Register now!

Faster MoE Inference with Warp Decode (16 minute read)

Cursor's “warp decode” is a kernel design that reorganizes MoE inference around output neurons instead of experts. It achieves ~1.8x higher throughput and improved numerical accuracy on Blackwell GPUs.

TorchTPU: Running PyTorch Natively on TPUs at Google Scale (10 minute read)

Google's Tensor Processing Units (TPUs) are foundational to the company's supercomputing infrastructure. The company's custom ASICs power training and serving for Google and its Cloud customers. TorchTPU is a stack that makes it easy for the AI community to access the full capabilities of TPUs. It provides the APIs and tools needed to extract every ounce of compute from Google's hardware. This post takes a look under the hood at the engineering principles behind TorchTPU.

TriAttention for KV Cache Compression (GitHub Repo)

TriAttention estimates KV importance in pre-RoPE space using stable Q/K centers and distance-based scoring, preserving long-context reasoning quality while sharply reducing KV memory use and improving throughput.

Meta AI Scales RL for ML Engineering Agents (24 minute read)

SandMLE is a framework for building small but structurally realistic MLE environments that made on-policy RL practical for ML engineering agents by cutting execution cost more than 13x.

🎁

Miscellaneous

We're actually running out of benchmarks to upper bound AI capabilities (7 minute read)

METR's Time Horizon suite is being saturated. Frontier AI models can reliably do all but maybe a dozen or so tasks in the suite, making it hard to upper bound their time horizon. New benchmarks are becoming more expensive to grade and create. The situation will likely get worse as AI progress continues. It is likely that, by mid-2027, no benchmark score from a 2026 or earlier benchmark will be able to rule out dangerous capabilities from frontier AI systems.

Elon Musk Asks for OpenAI's Nonprofit to Get Any Damages From His Lawsuit (3 minute read)

Elon Musk's lawsuit against OpenAI is expected to go to trial later this month in Oakland, California. Musk has amended the lawsuit to ask that any damages he might win be awarded to OpenAI's charitable arm rather than to himself. The amendment also asks that OpenAI CEO Sam Altman be removed from the OpenAI nonprofit's board. Musk is seeking more than $150 billion in damages from both OpenAI and Microsoft as he believes OpenAI strayed from its non-profit mission and defrauded him as a donor in seeking to convert to a for-profit company.

⚡

Quick Links

TLDR is hiring a Senior Software Engineer, Applied AI ($250k-$350k, Fully Remote)

TLDR's Applied AI team is tasked with making every process at TLDR legible to code, runnable by anyone, and composable into larger workflows. Join a small, fast moving team using the latest AI tools with an unlimited token budget. Learn more.

Google controls the most AI computing power, driven by its custom TPUs (1 minute read)

Google holds around 25% of all compute sold since 2022.

Gemma Multimodal Fine-Tuner (GitHub Repo)

Gemma Multimodal Fine-Tuner lets Mac users fine-tune Gemma on text, images, and audio on remote data.

When Will Anthropic Surpass NVIDIA? (1 minute read)

Anthropic reached $10 billion in revenue in under four years, far outpacing other software companies like ServiceNow and Shopify.

Love TLDR? Tell your friends and get rewards!

Share your referral link below with friends to get free TLDR swag!

https://refer.tldr.tech/39389a05/2

Track your referrals here.

Want to advertise in TLDR? 📰

If your company is interested in reaching an audience of AI professionals and decision makers, you may want to advertise with us.

Want to work at TLDR? 💼

Apply here, create your own role or send a friend's resume to jobs@tldr.tech and get $1k if we hire them! TLDR is one of Inc.'s Best Bootstrapped businesses of 2025.

If you have any comments or feedback, just respond to this email!

Thanks for reading,
Andrew Tan, Ali Aminian, & Jacob Turner

Manage your subscriptions to our other newsletters on tech, startups, and programming. Or if TLDR AI isn't for you, please unsubscribe.

Similar newsletters

There are other similar shared emails that you might be interested in: