AI Breakthroughs Digest

greaterwrong.com

May 20, 2026

OpenComputer: Verifiable Software Worlds for AI Agents

OpenComputer introduces verifiable software worlds built for computer-use agents, with discussion open on their potential for building reliable AI systems.

OpenComputer: Verifiable Software Worlds for Computer-Use Agents

GoLongRL Tackles Long-Context RL via Multitask Alignment

GoLongRL introduces a capability-oriented approach to long-context reinforcement learning through multitask alignment.

GoLongRL: Capability-Oriented Long Context Reinforcement Learning with Multitask Alignment

EnvFactory Scales Tool-Use Agents with Synthetic Environments

EnvFactory pairs executable environment synthesis with robust RL to scale tool-use agents.

EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL

Rethinking Safety Measurement in Autonomous Agents

This paper reframes safety-alignment effects in autonomous security agents as a trace-level system property rather than a transcript-level refusal rate, providing a deeper lens for evaluating alignment in deployed systems.

Measuring Safety Alignment Effects in Autonomous Security Agents

May 20, 2026

Benchmarks and Reasoning Boost Video AI Control

New tools target quality and controllability in AI-generated videos.

Artifact-Bench evaluates MLLMs on detecting artifacts in video outputs
Video...

Artifact-Bench: Evaluating MLLMs on Detecting and Assessing the Artifacts of AI-Generated Videos

Dartmouth Study Exposes Amplified Biases in Agentic AI

Dartmouth researchers reveal that agentic AI systems exhibit systematically stronger biases than humans when making autonomous decisions.

Agents...

home.dartmouth.edu

Dartmouth Researchers Assess Agentic AI

ArXiv to Ban Researchers Over Hallucinated References

ArXiv is set to ban researchers who include hallucinated references in their submissions, a direct response to fake citations undermining academic...

Researchers who use hallucinated references to face ArXiv ban

May 20, 2026

Process Rewards with Learned Reliability

The paper "Process Rewards with Learned Reliability" is now available, with discussion open on its paper page.

Process Rewards with Learned Reliability

Unlearnable Hard Examples Limit RLVR in LLMs

A key subset of hard reasoning tasks remains unlearnable under RLVR, even when correct rollouts appear during training.

Gradient analysis reveals...

May 20, 2026

New Paper on Semantic Generative Tuning

The paper "Semantic Generative Tuning for Unified Multimodal Models" is now open for discussion on its paper page.

Semantic Generative Tuning for Unified Multimodal Models

KV Sharing and Compressed Attention for LLMs

KV Sharing, MHC, and Compressed Attention techniques are discussed as optimizations for LLMs, drawing 32 points on Hacker News.

KV Sharing, MHC, and Compressed Attention

May 20, 2026

Growing Neural Cellular Automata Draws Attention

The work on Growing Neural Cellular Automata has quickly gained traction, earning 120 points on Hacker News. This signals strong community interest in self-organizing AI models as a promising research direction.

Growing Neural Cellular Automata

May 20, 2026

New Paper Introduces CEPO for RLVR Self-Distillation

A new paper presents CEPO, which approaches RLVR self-distillation through Contrastive Evidence Policy Optimization. Discussion is open on its paper page.

CEPO: RLVR Self-Distillation using Contrastive Evidence Policy Optimization

AutoResearchClaw Enables Self-Reinforcing AI Research

The paper presents AutoResearchClaw, a framework for self-reinforcing autonomous research built on human-AI collaboration.

AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration

Single MRI Predicts Alzheimer's Outcomes

Deep learning enables prediction of both categorical and continuous Alzheimer's outcomes from just one MRI scan, highlighting a practical advance in medical disease forecasting.

Predicting categorical & continuous Alzheimer's disease outcomes from 1 MRI scan

May 20, 2026

Actionable Interpretability Paper Accepted to ICML

A new paper on Actionable Interpretability has been accepted to ICML, coinciding with the return of the Actionable Interpretability workshop at COLM. Researchers can connect in Korea and SF this year.

May 20, 2026

The Fuzzy Target of Human Flourishing in AI Alignment

Michael Levin argues that the goal of aligning AI toward human flourishing is poorly defined, since societies still disagree on what makes a life or community truly well-lived. This ambiguity complicates efforts to set clear goals for safe, beneficial AI.

Short thoughts on AI alignment - Michael Levin - Substack