5 posts 3 posts

Research & Benchmarks

[cat] creeta 3-axis IA

Gemini for Science: What Google I/O 2026 Introduced for Researchers

Google's I/O 2026 AI research suite: literature triage, hypothesis tournaments, and ERA outperforming CDC forecasts.

DiffusionBlocks: Sakana AI's Block-Wise Training for ICLR 2026

DiffusionBlocks trains one residual block per step, reducing activation memory B× with competitive or better accuracy.

Gemini for Science at I/O 2026: How Each Research Tool Works

Three experimental AI research tools launched at I/O 2026. What Literature Insights, Co-Scientist, and AlphaEvolve each actually do.

xAI Grok Build: Sub-Agents, MCP Compat, and the SWE-Bench Numbers

xAI shipped its terminal coding agent on May 14, 2026. Here's what the CLI actually does, where the benchmark numbers hold, and what $299/month buys.

Erdős Unit Distance Conjecture Disproved: Inside the OpenAI Proof

OpenAI's reasoning model disproved an 80-year-old geometry conjecture — verified by a nine-mathematician team including a Fields Medalist.

과학을 위한 Gemini: Google I/O 2026이 연구자들을 위해 선보인 것들

Google's I/O 2026 AI research suite: literature triage, hypothesis tournaments, and ERA outperforming CDC forecasts.

DiffusionBlocks: Sakana AI의 블록 단위 학습, ICLR 2026

DiffusionBlocks trains one residual block per step, reducing activation memory B× with competitive or better accuracy.

I/O 2026의 Gemini for Science: 각 연구 도구는 어떻게 작동하나

Three experimental AI research tools launched at I/O 2026. What Literature Insights, Co-Scientist, and AlphaEvolve each actually do.