AI Research Digest · May 29 Daily Digest
Agentic Systems and Autonomous Research
- 🔥 AutoScientists: Introduces decentralized AI agent teams that self-organize around hypotheses,...

Created by Ruban Urban
Daily AI research papers from top conferences, journals, and recent arXiv preprints
No significant updates today.
Three new papers outline a full-stack path for agent development:
Two complementary methods advance digital-twin creation from limited inputs.
Video world models are moving from generation toward interactive evaluation, and three new papers show parallel progress.
Google has launched Gemini for Science, a suite of experimental tools on Google Labs—including Hypothesis Generation with Co-Scientist, Computational...
A new arXiv audit of ChatGPT, Copilot, Gemini and Perplexity found ~16% of cited sources across 712 real-world queries were AI-generated, raising risks that users may treat synthetic content as authoritative.
Ai2's MolmoAct2 open robotics model surpasses π0.5 on real-world and simulation benchmarks, runs up to 37x faster, and ships with the largest open bimanual dataset (720 hours, 34,500 demos). All weights, code, and tokenizer are fully public.
Unified world models are emerging that turn raw data into sim-ready, interactive environments.
scpFormer introduces a transformer foundation model that unifies single-cell proteomics across technologies via amino acid sequence tokenization and...
Two fresh papers reveal a clear pattern: targeted decoupling and smart pruning tackle core VLM limits in perception, reasoning, and compute.
StepAudio 2.5 shows a single audio-language foundation model can match or exceed specialized systems across ASR, TTS, and realtime spoken interaction....
PhotoFlow introduces a Director-Reviewer-Reflector agent that combines 3D scene understanding with closed-loop camera search to execute...
Two new papers signal a shift toward treating model-generated skills as optimizable, reusable artifacts rather than ad-hoc prompts.
Three concurrent papers signal video AI's shift toward integrated perception-generation-evaluation systems.
African AI faces an evaluation crisis where model-building capacity vastly exceeds available linguistic expertise and community infrastructure.