Tech Policy Science Brief

Competition in large models and AI chip hardware

Competition in large models and AI chip hardware

Model & Chip Race

The landscape of large models and AI hardware in 2026 is witnessing unprecedented innovation and strategic shifts, driven by massive investments, emerging chip technologies, and intensifying competition to lead the AI era. Central to this evolution is the recent funding success of startups like MatX, which has raised approximately $500 million in a Series B round led by prominent investors such as Jane Street and Situational Awareness. This substantial capital positions MatX as a formidable challenger to Nvidia, aiming to challenge its near-monopoly in AI hardware with next-generation processors designed for both training and inference of large models.

MatX’s push for hardware innovation is part of a broader industry trend—as companies race to develop specialized silicon that can efficiently handle the demands of increasingly sophisticated models. These developments include silicon-embedded large language models (LLMs), where startups like Taalas are pioneering the “printing” of LLMs directly onto chips. This approach creates hardware-intrinsic AI, drastically reducing inference latency, lowering power consumption, and enabling robust on-device AI in smartphones, industrial sensors, and embedded systems. Such embedded silicon models are crucial for privacy-preserving, real-time AI applications, especially as the demand for on-device inference continues to accelerate.

Beyond hardware, the model landscape is diversifying and scaling rapidly. Google’s Gemini 3.1 Pro has recently outperformed previous benchmarks, including GPT-5.2, across various assessments such as ARC-AGI-2 and MMMU-Pro, demonstrating deep reasoning and multi-tasking proficiency. OpenAI’s GPT-5.3-Spark, utilizing Cerebras hardware, now processes up to 17,000 tokens per second, enabling ultra-low latency responses vital for industrial automation, real-time gaming, and on-the-fly coding. Meanwhile, Anthropic's Sonnet 4.6 exemplifies a paradigm shift toward affordability, offering state-of-the-art performance at roughly 20% of the cost, making advanced AI more accessible to emerging markets and small enterprises.

The global AI ecosystem is also becoming more multipolar, with models like Qwen 3.5 Flash from China emphasizing regional innovation and reduced dependence on Western technology. This diversification fuels the rise of bespoke AI ecosystems tailored to specific industries and regional needs.

Hardware investments are complemented by a surge in capital flows into related sectors. In addition to MatX, companies like Boss Semiconductor secured ₩87 billion (~$70 million) to develop performance-optimized chips for autonomous vehicles, particularly targeting the Chinese market. The chip manufacturing ecosystem is expanding rapidly, with Intel partnering with SambaNova in a $350 million investment to enhance AI chip capabilities amid unsuccessful acquisition talks, signaling industry consolidation and collaboration.

Quantum computing also remains a strategic frontier, with Quantonation’s €220 million fund backing quantum processors poised to revolutionize sectors like manufacturing, logistics, and defense. Investments in advanced manufacturing processes and energy storage systems continue to bolster the physical infrastructure needed to sustain large-scale AI ecosystems.

Supply chain resilience and geopolitical considerations are at the forefront of industry strategy. Companies like Apple are reshoring manufacturing operations to the U.S. to enhance technological sovereignty, amid rising tensions and export control measures. Notably, DeepSeek, a major AI startup, has withheld its latest flagship model from U.S. chipmakers like Nvidia, citing security and provenance concerns, which underscores fears over model siphoning and national security vulnerabilities. Such moves complicate international collaboration and supply chain stability.

On the regulatory and ethical front, governments are increasingly active. The U.S. has taken steps like banning Anthropic from federal agencies due to security concerns, while Google employees demand "red lines" on military AI applications, reflecting societal apprehensions. Device-level safety features, such as AI kill switches embedded in browsers like Firefox 148, are gaining importance to build public trust and ensure ethical deployment.

The broader ecosystem is also focusing on infrastructure, safety, and governance. Significant investments include India’s commitment of over $110 billion to establish sovereign AI infrastructure and Eon’s $300 million Series D to unlock AI data goldmines through trusted, transparent data platforms like Eon. Deployment tools such as Portkey are simplifying large-scale AI deployment, while Google’s automated workflow creation in Opal exemplifies efforts to accelerate innovation cycles.

In the geopolitical arena, model provenance and security remain contentious. DeepSeek’s withholding of models and regulatory crackdowns highlight fears over model theft and misuse, prompting nations to revisit export controls and security protocols. The Pentagon’s recent AI innovation memo emphasizes military applications and security concerns, while industry leaders warn that many social media demos are still far from production readiness.

Emerging frontiers include spatial AI, exemplified by World Labs, which has raised $1 billion to develop world generation tools that could revolutionize urban planning, environmental monitoring, and disaster management. The space and robotics sectors are also gaining prominence, with companies like CesiumAstro acquiring space-focused AI firms to enhance satellite autonomy and space debris management.

In summary, 2026 is a pivotal year where hardware innovation, model diversification, and massive capital inflows are reshaping the AI landscape. While these advancements promise more powerful, efficient, and democratized AI, they also pose security, ethical, and geopolitical challenges that require careful navigation. The decisions made now will determine whether AI becomes a safe, inclusive tool for societal progress or a source of future vulnerabilities. As the industry pushes forward, the integration of next-gen chips, on-device inference, and robust governance will be essential to harness AI’s full potential responsibly.

Sources (91)
Updated Feb 28, 2026
Competition in large models and AI chip hardware - Tech Policy Science Brief | NBot | nbot.ai