Video-MME-v2 Advances Video Understanding Benchmarks
Video-MME-v2 marks the next stage in benchmarks for comprehensive video understanding.

Created by Aleah Desiree
Latest AI/ML research papers from arXiv and major conferences
New paper In-Place Test-Time Training explores efficient test-time adaptation. Join the discussion on this paper page.
New arXiv paper Demystifying When Pruning Works via Representation Hierarchies unpacks when and why pruning is effective, a question vital for model compression. Join the discussion.
MegaTrain enables full-precision training of 100B+ parameter LLMs on a single GPU, democratizing massive model development through extreme hardware efficiency.
Major update to flow map language models: researchers hail them as the future of non-autoregressive text generation. The work introduces a new class of continuous flow-based models, detailed in a fresh paper and blog post.
🚨 Alarming preprint: in randomized controlled trials, just 10 minutes of AI assistance made participants perform worse and give up more often than those working without AI. A stark warning for education and productivity tools.
In a key advance for open speech AI, the Open Whisper-style Speech Model (OWSM) reproduces OpenAI's Whisper using only publicly available data and open-source toolkits, as a first step toward fully open speech models.
Rapid advances in AI agent foundations:
New post-training technique for coding LLMs: simulate test execution to verify and fix their own code.
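The verify-and-fix idea can be illustrated with a minimal sketch. All names here are hypothetical (the paper's actual method is not specified in this blurb): candidate programs are checked against unit tests, and the first candidate that passes is kept.

```python
# Hypothetical sketch of a verify-and-fix selection loop for code generated by
# an LLM. A real system would let the model *predict* test outcomes and revise
# its own code; here we simply execute the tests directly.

def run_tests(code_str, tests):
    """Execute candidate code defining solution(x); return the list of failures."""
    namespace = {}
    exec(code_str, namespace)
    failures = []
    for inp, expected in tests:
        got = namespace["solution"](inp)
        if got != expected:
            failures.append((inp, expected, got))
    return failures

def verify_and_fix(candidates, tests):
    """Return the first candidate (e.g. successive model revisions) passing all tests."""
    for code in candidates:
        if not run_tests(code, tests):
            return code
    return None

# Toy usage: two candidate implementations of abs(); the first is buggy.
buggy = "def solution(x):\n    return x  # forgets the negative case"
fixed = "def solution(x):\n    return -x if x < 0 else x"
tests = [(3, 3), (-4, 4)]
best = verify_and_fix([buggy, fixed], tests)  # selects the fixed revision
```

In a post-training setup, the pass/fail signal from such a loop could serve as a reward or filter for further fine-tuning.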
AURA enables always-on understanding and real-time assistance via video streams, marking a breakthrough in continuous multimodal perception for proactive AI support.
LIBERO-Para introduces a diagnostic benchmark and metrics to expose paraphrase robustness vulnerabilities in VLA models. Key for advancing reliable vision-language-action systems.
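A paraphrase-robustness metric of the kind the blurb describes can be sketched simply. This is a hypothetical illustration, not LIBERO-Para's actual metric: it compares a policy's task success rate on canonical instructions against paraphrased variants of the same tasks.

```python
# Hypothetical paraphrase-robustness score for a vision-language-action policy.
# 1.0 means success is unchanged under paraphrase; lower values expose
# sensitivity to instruction wording.

def success_rate(outcomes):
    """Fraction of successful episodes (outcomes are 1 = success, 0 = failure)."""
    return sum(outcomes) / len(outcomes)

def paraphrase_robustness(canonical, paraphrased):
    """Ratio of paraphrase success rate to canonical success rate."""
    base = success_rate(canonical)
    return success_rate(paraphrased) / base if base > 0 else 0.0

# Toy example: 9/10 success on canonical phrasing, 6/10 on paraphrases.
canonical = [1] * 9 + [0]
paraphrased = [1] * 6 + [0] * 4
score = paraphrase_robustness(canonical, paraphrased)
```

Reporting the ratio rather than the raw paraphrase success rate isolates wording sensitivity from overall task difficulty.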
LightThinker++ marks a shift from reasoning compression to memory management in LLM efficiency. A key new paper for next-generation reasoning optimization. Join the discussion.
Today's fresh arXiv drops spotlight hybrid LLMs and distillation:
Critical cloud vulnerability: remote attackers can reconstruct DNN architectures on NVIDIA GPUs from execution traces, without direct access to the model.
Trend alert: AI skeptics underscore LLMs' reasoning limits, advocating symbolic alternatives.
Key failures in AI unlearning expose ongoing privacy vulnerabilities:
New arXiv paper introduces learnable adaptation policies enabling language agents to learn at test-time. Join the discussion.
Amid rapid AI advancements, a new comparative deep learning study finds computer vision-based disease detection to be a reliable and scalable approach.
Novel DBT-DR-GAN advances medical imaging with a three-stage pipeline for high-fidelity CT-to-MRI translation: