AI
17 posts

Did Claude Opus 4.7 Secretly Raise Prices? 497 Developers Reveal the Truth
497 anonymous developer submissions reveal Claude Opus 4.7 consumes 37.3% more tokens than 4.6 on average, with API costs rising proportionally. Here’s what caused the ‘hidden price hike’ and what you can do about it.

The Claude Code Leak: How an Accident Sparked a Rust Rewrite Wave
In March 2026, Claude Code’s 500k+ lines of TypeScript source code were accidentally leaked. Instead of panic, the developer community responded by rewriting the entire project in Rust. How did this mess turn into Rust’s best advertisement for AI infrastructure?

Your AI Agent Can Think, But It Can't Remember
AI agents can reason, plan, and converse—but forget everything once the session ends. The Ghost project solves this with a pure PostgreSQL-based infrastructure, turning the database into the agent’s memory palace.

Cramming a 400B Model into 48GB: The Magic Behind LLM in a Flash
An Apple paper from 2023 made it possible to run a 400 billion parameter model on an ordinary MacBook. The core technologies—MoE and quantization—hide an engineering philosophy built around on-demand loading.

90 Seconds of Waiting, Gone: How oMLX Buries Ollama on Mac
oMLX is built for Apple Silicon, using the MLX framework, SSD-backed KV cache, and continuous batching to cut TTFT from 90 seconds to 1-3 seconds in long-context scenarios, comprehensively outperforming Ollama.

Claude Code Channels: Give Your AI Coding Assistant a Phone Number
A hands-on look at Claude Code Channels, letting you control your AI coding assistant through Telegram, Discord, and more — no matter where you are.

Don't Build a Thousand Agents: How Ramp Automates Finance with One Agent
Ramp, America’s fastest-growing enterprise finance platform valued at $32B with 50,000+ customers and $100B+ in annual transaction volume, chose a ‘one Agent + a thousand skills’ architecture over building many agents. This is a deep dive into Ramp’s AI实战经验.

Mistral Forge Deep Dive: The Nuclear Weapon for Enterprise Fine-tuning
Spent 3 hours reading the official documentation. Forge wants to turn fine-tuning into an all-in-one service - you just feed it data, and it handles everything else. But how low is the barrier really?

AI Programming Hits the 'Review Bottleneck'? Mistral Drops an Open-Source Bomb
Mistral AI releases Leanstral, the first open-source Lean 4 code agent that lets AI both write code and prove its correctness. Let’s talk about formal verification and AI programming.

AI Agents Finally Make Literate Programming Worth Trying
Literate programming has been around for 40 years but never caught on — because maintaining parallel narratives of code and prose is exhausting. AI agents change that equation entirely.
