Faith-Based Coalition Exposes Religious Bias in AI
A new consortium of researchers from Baylor, BYU, Notre Dame, and Yeshiva has launched the AllFaith Benchmark, revealing AI models supply religious...

Created by Linda Kay
Daily feed of high‑impact AI research papers from arXiv, conferences, and journals
Explore the latest content tracked by AI Research Radar
A new consortium of researchers from Baylor, BYU, Notre Dame, and Yeshiva has launched the AllFaith Benchmark, revealing AI models supply religious...
Agentic AI progress now hinges on system-level harnesses and synthetic data pipelines as much as raw model scale.
Two new papers highlight complementary progress in personal agents:
Three frontier LLMs gave sharply inconsistent triage advice for identical stroke symptom vignettes:
DeepMind's AlphaProof Nexus and Axiom Math pursue complementary routes to credible AI-generated mathematics.
Recent papers reveal parallel progress in generation techniques alongside urgent benchmark needs.
ParaVT introduces the first end-to-end RL framework enabling agents to dispatch multiple video crops in one turn rather than sequentially.
-...
A new arXiv paper charts the shift from late-fusion systems to native multimodal modeling (NMM), where modalities integrate intrinsically inside a...
DVAO dynamically reweights advantages in multi-reward RL by empirical variance within rollout groups, boosting objectives with clearer signals while curbing noise for stable training and stronger Pareto fronts on math/tool-use tasks.
New paper "From Context to Skills" questions if language models can learn from context skillfully, probing core limits of in-context skill acquisition.
Emerging internal memory systems redefine LLM/LVLM capabilities:
New benchmarks highlight a trend toward rigorous, specialized agent evaluation:
New arXiv paper [2605.02810] on agency maturation:
Fresh arXiv paper 2605.02741 analyzes code and architecture smells in LLM- and agent-driven development, exposing hidden flaws in AI-generated software to improve agentic coding practices.
Critical caution: LLMs should not yet be credited with decision explanation capabilities. New arXiv paper by Wenshuo Wang featured in today's Daily Papers podcast urges restraint on assuming human-like reasoning transparency.
Emerging breakthroughs in agent stability over long horizons and multi-turns:
New AI Interaction Theory (AIT) podcast introduces a framework for healthy AI-human interactions rooted in emotional intelligence.
LLMs evaluated as primary care tools using exclusively public, anonymized search queries and model outputs—no patient data involved.
OceanPile launches as a large-scale multimodal ocean corpus tailored for foundation models, unlocking new possibilities in marine AI research.