Saturday 18 April 2026


🌐 World & Tech Pulse

🔥 Hacker News Buzz


🚀 Models


🧠 Agents & Tools


🔬 Research & Engineering


💼 Industry & Security


📊 The Opus 4.7 Deep Dive

The story of the day is Claude Opus 4.7. Here’s the consensus from AINews and community discussion:

What changed: A new base model with a different tokenizer (possibly a fresh pretrain), a new xhigh reasoning tier, support for roughly 3x larger images, and across-the-board benchmark improvements. Claude Code now defaults to xhigh and scores 64.3% on SWE-Bench Pro (+11 points).

The token economics debate: The new tokenizer maps the same input to 1.0–1.35x as many tokens. Anthropic raised subscriber limits to compensate. Even so, reasoning efficiency gains mean overall token usage is reportedly still down ~50% compared with equivalent effort levels on the prior model.
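To make the tokenizer range concrete, here is a minimal sketch of what 1.0–1.35x means for a single prompt. The function name and the 10,000-token example are illustrative assumptions; only the 1.0–1.35x range comes from the reporting above.

```python
# Hypothetical helper for illustration only.
# The 1.0-1.35x range is the one figure taken from the text above.
TOKENIZER_FACTOR_RANGE = (1.0, 1.35)


def inflated_token_range(old_tokens: int) -> tuple[int, int]:
    """Return the (low, high) token count the same input might map to
    under the new tokenizer, rounded to whole tokens."""
    low, high = TOKENIZER_FACTOR_RANGE
    return round(old_tokens * low), round(old_tokens * high)


# Assumed example: a prompt that was 10,000 tokens under the old tokenizer.
print(inflated_token_range(10_000))
```

Under these assumptions, a 10,000-token prompt lands somewhere between 10,000 and 13,500 tokens, which is why the per-request count can rise even if total usage falls.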

Expert takes: Jeremy Howard called it the first model that “gets” what he’s doing. Cat Wu (Anthropic) says to treat it like an engineer you delegate to, not a pair programmer. Cursor’s internal benchmark jumped 12 points. Notion saw a 14% improvement with one-third fewer tool errors.

The Mythos question: Multiple researchers believe Opus 4.7 is a distilled version of Mythos (Anthropic’s internal cyber-rated model). The system card acknowledges experiments with differentially reducing cyber capabilities, yet Opus 4.7 still scores higher than Opus 4.6 on some exploitation evals.

Document understanding: LlamaIndex reports a large jump in chart understanding (13.5% → 55.8%), but at a cost of ~7¢/page, versus ~1.25¢/page for agentic mode. Good for quality-sensitive workflows, not bulk OCR.
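The "not bulk OCR" conclusion follows from simple arithmetic, sketched below. The constant and function names are hypothetical, the 10,000-page batch is an assumed example, and the sketch assumes the ~7¢ figure applies to the higher-quality Opus 4.7 path; only the two per-page prices come from the text.

```python
# Per-page prices quoted above, in US cents (both approximate).
QUALITY_CENTS_PER_PAGE = 7.0    # assumed: the higher-quality Opus 4.7 path
AGENTIC_CENTS_PER_PAGE = 1.25   # agentic mode


def batch_cost_dollars(pages: int, cents_per_page: float) -> float:
    """Total cost in dollars for parsing `pages` pages at a flat per-page rate."""
    return pages * cents_per_page / 100


# Assumed example batch size, for illustration only.
pages = 10_000
quality_cost = batch_cost_dollars(pages, QUALITY_CENTS_PER_PAGE)
agentic_cost = batch_cost_dollars(pages, AGENTIC_CENTS_PER_PAGE)
cost_ratio = QUALITY_CENTS_PER_PAGE / AGENTIC_CENTS_PER_PAGE

print(f"quality path: ${quality_cost:,.2f}")
print(f"agentic mode: ${agentic_cost:,.2f}")
print(f"ratio: {cost_ratio:.1f}x")
```

At these rates the higher-quality path costs about 5.6x as much per page, so a 10,000-page batch comes to roughly $700 versus $125.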


Sources: TLDR AI, TLDR, AINews (Latent Space), The Guardian, Hacker News