vLLM v0.21.0 Production Update: KV Offload and Multi-Server Port Bug

v0.22.0 doesn't exist yet. v0.21.0 ships KV offload, spec decode, and a multi-server port bug still under review.

Claude Code v2.1.144–v2.1.154: What Shipped in Nine Days

Ten patches in nine days: pinned sessions, four security fixes, /code-review --fix, and skill-level tool gating.

SuperGrok Subscription Now Unlocks grok-build-0.1 in Kilo Code

SuperGrok and X Premium+ subscribers can now authenticate into Kilo Code and run grok-build-0.1 inside VS Code or JetBrains — no API key management required.

Codex CLI 0.134.0 and 0.135.0: Two Stable Releases in 48 Hours

OpenAI shipped two Codex CLI stable releases in 48 hours. What changed, what broke, and why the cadence matters.

Anthropic Python SDK 0.105: Opus 4.8 and Mid-Session System Prompts

Three SDK releases in 7.5 hours ship claude-opus-4-8 support, mid-conversation system blocks, and finer output usage reporting.

xAI grok-build-0.1 API Public Beta: Token Costs and SDK Support

xAI's coding model exits the $299 CLI gate. Here's what the public API beta actually offers developers.

Grok Build Lands in OpenCode and Kilo Code: xAI's 13-Day Rollout

xAI shipped grok-build-0.1 to three developer tools in 13 days. Here's what each integration covers and how to pick the right surface.

Codex CLI 0.135.0: Doctor, Vim Text Objects, and the 0.136 Alpha

OpenAI's 0.135.0 stable is a diagnostics and polish cycle. What moved in the TUI, Vim mode, and remote transport.

Command A+ 2026: Benchmark Results, Citation Tags, and Enterprise Fit

Cohere's first open-weight frontier model: benchmark gaps, native citation design, and the enterprise sovereignty case.

grok-build-0.1: Model Spec, Pricing, and the Beta Rollout Story

xAI's grok-build-0.1 hit public beta in May 2026. Here's what the spec says — and what the caching incident revealed.

Claude Opus 4.8: Coding Benchmarks and Agentic Upgrades 2026

Anthropic ships Opus 4.8 with 69.2% SWE-Bench Pro, mid-conversation system messages, and adaptive thinking.

Gemini 3.5 Flash vs Pro: Model Selection Guide 2026

Flash is GA. Pro isn't. Here's the benchmark data and decision framework developers need before choosing or migrating.

Showing of 108 posts