Qwen 3.7 🤖, Cursor Composer 2.5 👨‍💻, Anthropic acquires Stainless 🛠️

From: TLDR AI <dan@tldrnewsletter.com>
To: Hidden Recipient <hidden@emailshot.io>
Date: 5/19/2026, 1:39 PM

TLDR AI 2026-05-19

Your architecture blueprint for AI-powered search at scale (Sponsor)

Are you asking users to put up with search timeouts, empty results, or irrelevant answers? This Algolia whitepaper lays out the full stack of architecture & data foundations for AI search.

Read it to learn how to:

Combine lexical precision with semantic recall via hybrid retrieval
Engineer p95 and p99 as product features
Set up reranking and recommendations grounded in retrieved sources

Complete with code snippets, paste-ready RAG prompt contracts, and prod anti-patterns, this is your technical guide to scaling AI-powered search.

Download the whitepaper

🚀

Headlines & Launches

Qwen3.7 Preview lands on Arena (1 minute read)

Qwen3.7 Preview is now on Arena for Text and Vision. Qwen3.7 Max Preview ranks 13th overall in Text Arena, while Qwen3.7 Plus Preview ranks 16th overall in Vision Arena.

Anthropic Acquires SDK Startup Stainless (4 minute read)

Anthropic acquired developer tools startup Stainless, whose SDK automation platform was widely used by AI companies, including OpenAI, Google, and Cloudflare.

Cursor Released Composer 2.5 (7 minute read)

Cursor introduced Composer 2.5, an updated coding agent trained with targeted reinforcement learning, synthetic data, and new distributed training techniques.

🧠

Deep Dives & Analysis

What political censorship looks like inside an LLM's weights (109 minute read)

Qwen3.5-9B's political censorship is a small circuit that can be read and turned off. The factual knowledge is already present from pretraining, and the censorship behavior is layered on top of those facts. The model never loses the knowledge; it just learns to route around it.

Agent Evaluation: A Detailed Guide (53 minute read)

LLM evaluation has shifted from static benchmarks to more dynamic, real-world agent systems. Effective evaluation now requires realistic harnesses that test agents over long time horizons in complex environments. This is crucial as agents increasingly take on high-stakes work in areas such as coding and medicine, which calls for rigorous performance measurement and outcome-oriented evaluation.

🧑‍💻

Engineering & Research

🌉 AI Agent Security Summit | San Francisco (Sponsor)

AI agents are everywhere, and so are the risks. Join AI security experts from Microsoft, Google, and Amazon at Zenity Labs' AI Agent Security Summit on May 27th in SF to see what's next. Want to test your knowledge? Register for free for Foundations of AI Security and earn your 🎓 professional certification.

Generalization Dynamics of LM Pre-training (17 minute read)

Language models (LMs) undergo unpredictable switches between parroting patterns and exhibiting adaptive intelligence during pre-training, a phenomenon termed "mode-hopping." This behavior cannot be corrected by standard optimization techniques and presents as a competition for model capacity, influenced by data from each training window. Researchers propose leveraging these dynamics to better select pre-training checkpoints, curate data for stable generalization, and evaluate metrics predicting LM behavior.

Fine-Tuning NVIDIA Cosmos Predict 2.5 with LoRA/DoRA for Robot Video Generation (9 minute read)

NVIDIA Cosmos Predict 2.5 generates videos from text and can be adapted to specific tasks such as robot manipulation using LoRA or DoRA, which inject trainable adapters to minimize memory use. These methods allow efficient fine-tuning on a single GPU and prevent catastrophic forgetting while generating synthetic trajectories quickly. Fine-tuning with LoRA and DoRA significantly improves video quality, with LoRA better suited to tight memory budgets and DoRA preferred when training instability is a concern.
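
To see why adapters reduce memory, it helps to count trainable parameters. In standard LoRA, a frozen weight matrix of shape d_out x d_in is augmented with two low-rank factors of rank r, so only r x (d_in + d_out) parameters are trained for that layer. The sketch below is a rough illustration: the layer size and rank are hypothetical and are not taken from Cosmos Predict 2.5, and DoRA's additional learned magnitude vector is not modeled.

```python
def full_params(d_in: int, d_out: int) -> int:
    """Trainable parameters when the whole weight matrix is fine-tuned."""
    return d_in * d_out


def lora_params(d_in: int, d_out: int, rank: int) -> int:
    """Trainable parameters for a rank-`rank` LoRA adapter on the same layer.

    LoRA trains two factors, A (rank x d_in) and B (d_out x rank),
    while the original weight matrix stays frozen.
    """
    return rank * (d_in + d_out)


# Hypothetical layer size and rank, chosen only for illustration.
d_in, d_out, rank = 4096, 4096, 16

full = full_params(d_in, d_out)
lora = lora_params(d_in, d_out, rank)
print(f"full fine-tune: {full:,} trainable parameters")
print(f"LoRA (r={rank}): {lora:,} trainable parameters ({lora / full:.2%} of full)")
```

Because gradients and optimizer state are only kept for trainable parameters, a smaller trainable set is the main reason adapter fine-tuning can fit on a single GPU.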

HRM-Text (GitHub Repo)

HRM-Text is a 1B text generation model based on the HRM architecture. It can be trained with 130-600x less compute and 150-900x less data than foundation models, making foundation model pretraining accessible. The 0.6B parameter version of the model can be trained on 8 H100s on a single node in about 50 hours for around $800. The 1B parameter model can be trained on 16 H100s on two nodes in about 46 hours for around $1,472.
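
The quoted training costs can be sanity-checked with a quick calculation. The sketch below derives GPU-hours and the implied hourly rate from the figures in the summary; the function name is illustrative, and it assumes cost scales linearly with GPU-hours, which the summary does not state.

```python
def implied_rate(num_gpus: int, hours: float, total_cost: float) -> tuple[float, float]:
    """Return (gpu_hours, dollars_per_gpu_hour) for a training run."""
    gpu_hours = num_gpus * hours
    return gpu_hours, total_cost / gpu_hours


# (number of H100s, wall-clock hours, quoted cost in USD), as given in the summary.
runs = {
    "0.6B": (8, 50, 800),
    "1B": (16, 46, 1472),
}

for name, (gpus, hours, cost) in runs.items():
    gpu_hours, rate = implied_rate(gpus, hours, cost)
    print(f"{name}: {gpu_hours:.0f} GPU-hours at ${rate:.2f} per GPU-hour")
```

Both runs work out to the same rate of about $2 per H100-hour, so the two cost figures are consistent with each other.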

🎁

Miscellaneous

Vera Arrives: NVIDIA's First CPU Built for Agents Lands at Top AI Labs (4 minute read)

The first Nvidia Vera CPUs recently arrived at Anthropic, OpenAI, SpaceXAI, and Oracle. They were hand-delivered by Ian Buck, Nvidia's Vice President of Hyperscale and High-Performance Computing. Vera features 88 custom Nvidia-designed Olympus cores, 1.2 TB/s of memory bandwidth, and 50% faster per-core performance. It is the host processor for Vera Rubin NVL72, where it connects to a pair of Rubin GPUs via second-generation Nvidia NVLink-C2C.

Jury dismisses all claims in Elon Musk's lawsuit against OpenAI CEO Sam Altman (6 minute read)

Elon Musk's lawsuit against Sam Altman and OpenAI has been dismissed. A jury has decided that Musk waited too long to file his lawsuit. Musk says he plans to appeal.

⚡

Quick Links

Running long-horizon agents in production [LangChain Webinar] (Sponsor)

Production agents need durable execution: the ability to resume from where they left off without starting over. Join LangChain to learn how to make it work in real deployments. Save your seat.

Skills in web, iOS, and Android (2 minute read)

xAI launched "Skills" for Grok, allowing users to teach it functions once, which it remembers across interactions.

LLM Wiki v2 (16 minute read)

This post contains a pattern for building personal knowledge bases using LLMs.

TLDR is hiring a Senior Software Engineer, Applied AI ($250k-$350k, Fully Remote)

TLDR's Applied AI team is tasked with making every process at TLDR legible to code, runnable by anyone, and composable into larger workflows. Join a small, fast-moving team using the latest AI tools with an unlimited token budget. Learn more.

Turn repeated instructions into reusable skills in Lovable (14 minute read)

Skills in Lovable allow users to create reusable, markdown-based instructions to eliminate repetitive explanations.

Introducing Scheduled Tasks 2.0 (7 minute read)

Scheduled Tasks 2.0 enhances automation by allowing tasks to run with context, ensuring continuity in workflows across different projects and apps.

Love TLDR? Tell your friends and get rewards!

Share your referral link below with friends to get free TLDR swag!

https://refer.tldr.tech/39389a05/2

Track your referrals here.

Want to advertise in TLDR? 📰

If your company is interested in reaching an audience of AI professionals and decision makers, you may want to advertise with us.

Want to work at TLDR? 💼

Apply here, create your own role or send a friend's resume to jobs@tldr.tech and get $1k if we hire them! TLDR is one of Inc.'s Best Bootstrapped businesses of 2025.

If you have any comments or feedback, just respond to this email!

Thanks for reading,
Andrew Tan, Ali Aminian, & Jacob Turner

Manage your subscriptions to our other newsletters on tech, startups, and programming. Or if TLDR AI isn't for you, please unsubscribe.
