AI Labs Pulse - NBot Tracker | nbot.ai

AI Labs Pulse

Created by san x

1.7K posts

Updated 71 days ago

0 scanned

Breakthrough AI research, product releases, and industry news from leading labs

Product Launches

🔥 xAI Voice Agent API: xAI launched the Voice Agent API for real-time voice conversations over WebSocket, billed at a flat...

March 18, 2026

China Punishes Meta's $2B Manus Deal, Fueling AI Tech War

Meta's $2B acquisition of Chinese-rooted AI startup Manus was billed as a victory for Zuckerberg.
China responds with punitive actions against...

China Imposes Restrictions on Meta's Acquisition of AI Startup Manus

binance.com

March 18, 2026

Benchmark innovations expose AI's lingering gaps

New tools spotlight persistent AI limitations amid AGI hype:

Cognitive framework rethinks AGI measurement (71 HN pts).
Oboe pushes custom LLM...

Measuring progress toward AGI: A cognitive framework

March 18, 2026 · news.ycombinator.com

March 18, 2026

Qianfan-OCR: Unified End-to-End Model for Document Intelligence

Qianfan-OCR emerges as a unified end-to-end model for document intelligence, advancing enterprise OCR capabilities – paper and apps now available for exploration.

March 18, 2026

LLMs Bias Hiring Towards Prestige Over Qualifications

Major LLMs exhibit demographic biases in hiring evaluations:

Base models (e.g., Copilot) advance elite affiliations/connections—even skipping 3-year...

March 18, 2026

Latent Entropy-Aware Decoding to Mitigate Hallucinations in MLRMs

New paper 'Thinking in Uncertainty' introduces Latent Entropy-Aware Decoding for mitigating hallucinations in MLRMs. Join the discussion on this breakthrough.

Thinking in Uncertainty: Mitigating Hallucinations in MLRMs with Latent Entropy-Aware Decoding

arxiv.org

March 18, 2026

OpenAI's ChatGPT Pricing Shift: From 'Accidental' Unlimited to Sustainable Enterprise Tiers

Pricing evolution underway as OpenAI deems current unlimited model accidental and unsustainable amid 900M weekly users and 50M subscribers.

-...

OpenAI Rethinks ChatGPT Pricing, Calls Current Model ‘Accidental’

March 18, 2026 · winbuzzer.com

March 18, 2026

Claude Double Checker: Menu Bar Tracker for 2x Usage Windows

Handy dev tool for Claude Code power users:

Live macOS menu bar display of 2x usage window status, duration, and switches.
Ends the pain of manual...

producthunt.com

Claude Double Checker

March 18, 2026

Microsoft Hires Sequoia-Backed Cove Team to Boost AI Collaboration

Microsoft has hired the full team from Cove, a Sequoia-backed AI collaboration startup. Cove is shutting down: service ends April 1, and customer data deletion is planned. The hire is a strategic talent grab for AI tooling.

Microsoft hires the team of Sequoia-backed AI collaboration platform, Cove

March 18, 2026 · techcrunch.com

March 18, 2026

MiroThinker-1.7 & H1: Verification for Heavy-Duty Research Agents

MiroThinker-1.7 & H1 use verification techniques to enable heavy-duty research agents. A new paper details this advance in agentic capabilities.

March 18, 2026

DOD Deems Anthropic Unacceptable National Security Risk Over Safety Red Lines

US DOD labels Anthropic an 'unacceptable risk to national security', citing fears it might "attempt to disable its technology" during warfighting...

DOD says Anthropic’s ‘red lines’ make it an ‘unacceptable risk to national security’

March 18, 2026 · techcrunch.com

March 18, 2026

OpenAI's Agentic Models Launch Amid 'Fake Thinking' Revelation

OpenAI pushes agentic AI with two new models: GPT-5.4 Mini built for agentic workloads.

Joint paper from OpenAI, Anthropic, DeepMind, Meta shows AI...

OpenAI just dropped two models built for agentic workloads. GPT-5.4 mini ...

March 18, 2026 · threads.com

March 18, 2026

One-Eval: Agentic System for Automated LLM Evaluation

One-Eval launches as an agentic system for automated and traceable LLM evaluation, streamlining benchmarking with built-in traceability. Join the discussion on this paper.

One-Eval: An Agentic System for Automated and Traceable LLM Evaluation

arxiv.org

March 18, 2026

Amazon's 5-Pillar Framework for Production-Ready GenAI

Key strategies from Amazon AI leader Abhinav Kasliwal to scale GenAI prototypes 3x faster with safety:

5 Pillars: Design, Evaluation, Safety,...

March 18, 2026

Agentic Self-Improvement: PostTrainBench & MemRL Bypass Fine-Tuning

Emerging trend: LLM agents automate post-training and runtime learning, enabling faster iteration without weight updates.

PostTrainBench tests...

March 18, 2026

Treasury AI Framework Meets FinToolBench for Secure Financial Agents

US Treasury's pragmatic playbook introduces 230-point AI Risk Management Framework for financial institutions.
FinToolBench evaluates LLM agents...

The US Treasury’s New AI Playbook: Moving from Principles to Pragmatism

theglobaltreasurer.com

March 18, 2026

2-Line Tool for Sandboxed Autonomous AI Agents Goes Viral on HN

Ultra-simple tool launches autonomous AI agents with sandboxed execution in just 2 lines of code – buzzing at 48 points on Hacker News, perfect for rapid prototyping.

Launch an autonomous AI agent with sandboxed execution in 2 lines of code

March 18, 2026 · news.ycombinator.com

March 18, 2026

Mistral AI Launches Forge – 598 HN Points

Mistral AI has released Forge. The announcement drew 598 points on Hacker News, making it a product launch to watch.

Mistral AI Releases Forge

March 18, 2026 · news.ycombinator.com

March 18, 2026

Review Spotlights Controls in LLM Chatbot Studies

A new methodological review systematically identifies and categorizes control conditions, including placebos, used in interventional studies of LLM-based chatbots – essential for boosting rigor in AI experiments.

Investigating Placebos and Controls Used in Large Language Model ...

March 18, 2026 · researchprotocols.org

March 18, 2026

Gemini 3.1 Pro Signals End of Traditional AI Benchmarks, Rise of Vibe Era

Gemini 3.1 Pro is reshaping AI evaluation:

Benchmark obsolescence: Experts question if traditional scores measure real intelligence.
Vibe Era...

AI Labs Pulse · Mar 19, 2026 Daily Digest
