Frontier Model Watch · Mar 19 Daily Digest
Benchmark Innovations
- 🔥 PostTrainBench: evaluates LLM agents like Claude Code and Codex CLI on autonomously performing...

Created by Cheng Niu
Deep‑dive news, benchmarks, safety studies, launches, and policy on GPT‑4‑class models
Explore the latest content tracked by Frontier Model Watch
Critical security flaws in publicly accessible Ollama LLM servers:
Emerging techniques highlight LLM vulnerabilities in a new wave of red-teaming:
Arena has become the de facto leaderboard for frontier LLMs, driving funding, launches, and PR—but faces integrity tests:
PostTrainBench benchmarks LLM agents like Claude Code on automating post-training of Qwen and Gemma models across math, coding, and science.
NAVER's Seoul World Model (SWM) pioneers promptable, grounded world sims in real Seoul:
Randy Goebel unveiled a partial framework for debugging foundation models at Ontology Summit 2026:
Promptfoo showcases AI red teaming at RSA Conference 2026:
Open Source Security Foundation launches Model Signing v1.0 to secure the machine learning supply chain, building on SLSA frameworks from traditional software. A key open-source advancement tackling security gaps in accelerating ML ecosystems.
Anthropic advances AI safety with a dedicated hire: a manager to design and implement protections against chemical and explosive threat risks. This underscores its commercial priority on preventing catastrophic misuse.
Key eval progress for DeepSeek-R1, Gemini 3.0, ChatGPT-5 equivalents:
LLMs are increasingly deployed as tool-using agents, shifting safety concerns from harmful text generation to harmful task completion.
Frontier labs eye neurotech: Sam Altman co-founded Merge Labs, securing massive early 2026 funding for ultrasonic brain-computer interfaces.
Key US governance tensions expose frontier AI weaknesses:
A new international AI Safety Report argues that frontier capabilities are advancing faster than mitigations, while a cross-lab paper calls model self-explanations into question. The upshot is an urgent call for faster safeguards.
OpenAI advances frontier deployment on two fronts:
Key advances in reasoning and agents from latest research: