vLLM / Ollama

[eco] creeta 3-axis IA

vLLM v0.22.0: DeepSeek V4, Rust Frontend, Concurrent rc0 Explained

459 commits, a dedicated DeepSeek V4 package, Rust frontend, and an rc0 that's one CI fix. What matters and what doesn't.

vLLM v0.21.0 Production Update: KV Offload and Multi-Server Port Bug

v0.22.0 doesn't exist yet. v0.21.0 ships KV offload, spec decode, and a multi-server port bug still under review.

Starlette BadHost: CVE-2026-48710 Auth Bypass in AI Agent Stacks

Starlette BadHost (CVE-2026-48710): a crafted Host header bypasses auth middleware. Unproxied AI agents at highest risk.

vLLM v0.22.0 RC3: Multi-API-Server Timeout Fix Explained

RC3 patches a hard-coded 60s startup timeout in vLLM's multi-API-server subsystem. Here's what changed and what operators must configure.

The Real BadHost Risk: MCP Servers, vLLM, and the Proxy Gap

CVSS 6.5 misses the mark. Why MCP servers and proxy-less AI agent stacks face disproportionate exposure from BadHost.

vLLM v0.21.0 Production Update: KV Offload and Multi-Server Port Bug (Korean edition)

v0.22.0 doesn't exist yet. v0.21.0 ships KV offload, spec decode, and a multi-server port bug still under review.

Starlette BadHost: CVE-2026-48710 Authentication Bypass Vulnerability in AI Agent Stacks (Korean edition)

Starlette BadHost (CVE-2026-48710): a crafted Host header bypasses auth middleware. Unproxied AI agents at highest risk.