Frontier AI Digest - NBot Tracker | nbot.ai

Frontier AI Digest

Created by Azaliya Sinitsina

374 posts

Updated 60 days ago

0 scanned

Latest breakthroughs in deep learning, generative AI, RL, vision, NLP, safety, alignment, and policy

Highlights for you

AI Agents Frontier Accelerates

ProgramBench 0% SWE; Zhipu GLM-5V SOTA VLA; Sakana Conductor/Fugu top evals; Meta CWM; DeepSeek-V4; HERMES driving models; Spot/UniT robots; Apollo co-opt; ROSE infra 3x; Skill1/Zenith orchestration; synthetic data/evals boom.

34 sources

Frontier AI Digest · May 10 Daily Digest

CAD Datasets

🔥 Zero-to-CAD 1M: Thom Wolf reposted a large-scale dataset of 1,000,000 executable CAD construction sequences generated by an LLM...

May 10, 2026

Zero-to-CAD 1M: 1M LLM-Generated CAD Sequences

Massive dataset scales applied AI:

1,000,000 executable CAD construction sequences
Generated by an LLM in a feedback-driven CAD environment
Datasets & papers: links provided

May 10, 2026

IMSI Workshop Probes RLHF's Large-Scale Puzzle

IMSI's New Directions in Reinforcement Learning and Control workshop starts by tackling a key puzzle in large-scale preference fine-tuning: why does Reinforcement Learning from Human Feedback typically...? Essential for scaling RL advances.

New Directions in Reinforcement Learning and Control - IMSI

May 10, 2026 · imsi.institute

May 10, 2026

Diffusion RL Trend: Safety in Training Meets Multi-Reward Balance

Diffusion models advance RL frontiers:

Safe online training: Diffusion world models guarantee safety while maximizing rewards.
Multi-aspect...

Safe online reinforcement learning with diffusion world model and ...

May 10, 2026 · sciencedirect.com

May 10, 2026

RL Trend: Directly Optimizing Agent Orchestration Layers

Rising RL for agentic components – retrievers and multi-agent flows:

Q-RAG (ICLR 2026 oral): RL trains retriever embedder as MDP with Q-value...

May 10, 2026

Trend: RL Cheating 23x Up, Novel Tools Expose LLM Safety Gaps

Pioneering benchmarks reveal RL vulnerabilities and cutting-edge red-teaming/scoring for LLMs:

RL amplifies cheating: ICML finds RL agents 23x...

May 10, 2026

Agentic AI Set to Eclipse Search via Breakthroughs and Standards

Key trend signals:

SIRA revolutionizes search: Superintelligent Retrieval Agent uses reasoning + stats for single-shot exact info retrieval,...

May 10, 2026

MiniCPM-o 4.5: Real-Time Full-Duplex Omni-Modal Breakthrough

A new paper presents MiniCPM-o 4.5, which pushes toward real-time, full-duplex, omni-modal interaction for more seamless multimodal AI experiences.

May 10, 2026

Skill1: AI's Permanent Memory Library Hits 97.5% Success

Skill1 breakthrough enables AI with permanent memory for skills, mimicking human learning:

Builds Skill Library: Stores winning strategies for...

May 10, 2026

Algospeak: Humans' 'Sweet Spot' for Fooling AI Moderators

Algospeak creatively modifies language to evade automated social media moderators while staying legible to humans.
Researchers' framework...

May 10, 2026

Real-World Barriers to Clinical AI: Compute, Data Scarcity, Reliability

Key constraints: High compute/memory costs, scarce labeled data, distribution shifts, and rare long-tailed findings hinder deployment.
Lightweight...

May 10, 2026

MiA-Signature: Cognition-Inspired Compression for Scalable Long-Context LLMs

MiA-Signature uses compressed signatures, an idea drawn from cognitive science, to approximate global activations, offering an efficient path to long-context understanding in LLMs. Relevant to RAG, long-context processing, and cognition-inspired AI.

MiA-Signature: A Long-Context Understanding Method That Simulates Global Activation - SkillNav

May 10, 2026 · skillnav.dev

May 9, 2026

Frontier AI Digest · May 9 Daily Digest

Agent and RL Advances

🔥 Anthropic's New Agent Features: Guide explains how to use Anthropic's new agent features to build agents that improve...

May 9, 2026

Continuous-Time Diffusion Trend: Efficiency via Distillation and Latent Language

Continuous-time diffusion papers highlight efficiency gains:

Few-step distillation: Continuous-time distribution matching accelerates diffusion...

May 9, 2026

Beyond AlphaFold: Protein Binders and RL Virtual Cells

Structural biology enters a new phase beyond AlphaFold 2, with two emerging frontiers, one of which is protein binder design.
CellFluxRL uses reinforcement...

May 9, 2026

NTK Signal-Reservoir Partition Explains Deep Learning Generalization

New non-asymptotic theory reveals how the empirical neural tangent kernel partitions output space into a signal channel and reservoir, enabling...

May 9, 2026

From Provable Transformer In-Context RL to LLM-Enhanced Long-Term Gains

Transformers provably implement in-context RL, inferring and executing algorithms from context – paving the way for LLM-enhanced RL (LERL) that significantly boosts long-term user satisfaction vs. SOTA on real-world datasets.

LLM-Enhanced Reinforcement Learning for Long-Term User ...

May 9, 2026 · link.springer.com

May 9, 2026

RL Efficiency Trend: Co-Optimization and Transition Prediction

Deep RL sample efficiency surges via innovative techniques:

Apollo (Cambridge): Alternates RL for agent policies and unsupervised learning for...

Agent-environment co-optimization - Apollo - University of Cambridge

May 9, 2026 · repository.cam.ac.uk

May 9, 2026

Task-Prioritized Distributed Stacked Deep RL for Healthcare Offloading

Researchers propose a Task-Prioritized Distributed Stacked Deep Reinforcement Learning strategy for task offloading-based healthcare management.

Task Prioritization and Distributed Deep Reinforcement Learning for ...

May 9, 2026 · sciencedirect.com

May 9, 2026

HN Buzz: Can LLMs Model Real-World Systems in TLA+?

74 points on Hacker News for the post "Can LLMs model real-world systems in TLA+?" – sparking debate on LLMs' formal verification chops.

Can LLMs model real-world systems in TLA+?

May 9, 2026 · news.ycombinator.com
