Claude Code Architecture Analysis: Living System vs Dead Machine

Claude Code Is Not a CLI Tool, But a 'Living' System

A deep dive into Claude Code’s underlying architecture: why it’s not a traditional CLI tool but an intelligent system running in a ‘distributed state.’ From TypeScript to Bun runtime, three-tier architecture to Feature Flags, understand the core design philosophy of this AI coding assistant.

April 4, 2026 · 9 min · 1776 words · Monster Programming
Claude Code Reverse Engineering: xxHash64 Signing Mechanism Explained

Reverse Engineering Claude Code's API Request Signing

An in-depth reverse engineering analysis of Claude Code’s API request signing mechanism, revealing how the cch hash and xxHash64 are implemented, the secrets of Bun’s runtime, and how Anthropic protects API calls with native code.

April 2, 2026 · 7 min · 1444 words · 梦兽编程
AI Agent memory infrastructure toolkit - Ghost + Memory Engine + PostgreSQL

Your AI Agent Can Think, But It Can't Remember

AI agents can reason, plan, and converse—but forget everything once the session ends. The Ghost project solves this with a pure PostgreSQL-based infrastructure, turning the database into the agent’s memory palace.

March 26, 2026 · 6 min · 1196 words · Dream Beast Programming
Mac mini connected to SSD freezer and DRAM fridge, illustrating the layered architecture of LLM in a Flash

Cramming a 400B Model into 48GB: The Magic Behind LLM in a Flash

An Apple paper from 2023 made it possible to run a 400 billion parameter model on an ordinary MacBook. The core technologies—MoE and quantization—hide an engineering philosophy built around on-demand loading.

March 24, 2026 · 5 min · 857 words · Dream Beast Programming
oMLX runs local LLMs on Mac Apple Silicon, dramatically outperforming Ollama with TTFT dropping from 90s to 1-3s

90 Seconds of Waiting, Gone: How oMLX Buries Ollama on Mac

oMLX is built for Apple Silicon, using the MLX framework, SSD-backed KV cache, and continuous batching to cut TTFT from 90 seconds to 1-3 seconds in long-context scenarios, comprehensively outperforming Ollama.

March 23, 2026 · 6 min · 1133 words · Mengshou Programming
Ramp AI Agent Enterprise Finance Automation: One Agent + A Thousand Skills

Don't Build a Thousand Agents: How Ramp Automates Finance with One Agent

Ramp, America’s fastest-growing enterprise finance platform valued at $32B with 50,000+ customers and $100B+ in annual transaction volume, chose a ‘one Agent + a thousand skills’ architecture over building many agents. This is a deep dive into Ramp’s AI实战经验.

March 19, 2026 · 17 min · 3428 words · 梦兽编程
Literate Programming in the AI Agent Era

AI Agents Finally Make Literate Programming Worth Trying

Literate programming has been around for 40 years but never caught on — because maintaining parallel narratives of code and prose is exhausting. AI agents change that equation entirely.

March 9, 2026 · 5 min · 898 words · rex
High‑value AI Toolkit Less than a coffee/month →
扫码关注公众号
微信公众号二维码

第一时间获取技术干货