LLM
2 posts

Did Claude Opus 4.7 Secretly Raise Prices? 497 Developers Reveal the Truth
497 anonymous developer submissions reveal Claude Opus 4.7 consumes 37.3% more tokens than 4.6 on average, with API costs rising proportionally. Here’s what caused the ‘hidden price hike’ and what you can do about it.
April 19, 2026 · 5 min · 1064 words · Mengshou

90 Seconds of Waiting, Gone: How oMLX Buries Ollama on Mac
oMLX is built for Apple Silicon, using the MLX framework, SSD-backed KV cache, and continuous batching to cut TTFT from 90 seconds to 1-3 seconds in long-context scenarios, comprehensively outperforming Ollama.
March 23, 2026 · 6 min · 1133 words · Mengshou Programming
