Research & Benchmarks
[cat] creeta 3-axis IA
Gemini for Science: What Google I/O 2026 Introduced for Researchers
Google's I/O 2026 AI research suite: literature triage, hypothesis tournaments, and ERA outperforming CDC forecasts.
DiffusionBlocks: Sakana AI's Block-Wise Training for ICLR 2026
DiffusionBlocks trains one residual block per step, reducing activation memory B× with competitive or better accuracy.
Gemini for Science at I/O 2026: How Each Research Tool Works
Three experimental AI research tools launched at I/O 2026. What Literature Insights, Co-Scientist, and AlphaEvolve each actually do.
xAI Grok Build: Sub-Agents, MCP Compat, and the SWE-Bench Numbers
xAI shipped its terminal coding agent on May 14, 2026. Here's what the CLI actually does, where the benchmark numbers hold, and what $299/month buys.
Erdős Unit Distance Conjecture Disproved: Inside the OpenAI Proof
OpenAI's reasoning model disproved an 80-year-old geometry conjecture — verified by a nine-mathematician team including a Fields Medalist.
과학을 위한 Gemini: Google I/O 2026이 연구자들을 위해 선보인 것들
Google's I/O 2026 AI research suite: literature triage, hypothesis tournaments, and ERA outperforming CDC forecasts.
DiffusionBlocks: Sakana AI의 블록 단위 학습, ICLR 2026
DiffusionBlocks trains one residual block per step, reducing activation memory B× with competitive or better accuracy.
I/O 2026의 Gemini for Science: 각 연구 도구는 어떻게 작동하나
Three experimental AI research tools launched at I/O 2026. What Literature Insights, Co-Scientist, and AlphaEvolve each actually do.