What 'frontier AI' means and why it matters for governance
Frontier AI refers to the most capable AI systems at any given time — systems at or near the frontier of what is technically possible. In 2026, this means large language models with hundreds of billions of parameters, multimodal systems capable of processing text, images, audio, and code simultaneously, and increasingly, agentic systems capable of taking sequences of autonomous actions in the world. These systems are qualitatively different from the AI systems that most enterprise governance frameworks were designed to govern.
The governance gap arises because enterprise AI governance was largely developed in response to specific, narrow AI applications: credit scoring models, hiring screening tools, fraud detection systems. These applications are consequential but bounded — they take specific inputs, perform specific functions, and produce specific outputs that can be evaluated against defined criteria. Frontier AI systems are general-purpose, capable of being applied to an almost unlimited range of tasks, and produce outputs that can be difficult to evaluate against pre-specified criteria. The governance frameworks that work well for bounded AI applications need significant adaptation to address frontier AI.
The agentic AI governance challenge
The deployment of AI in agentic configurations — where the AI takes sequences of actions rather than responding to individual prompts — is the frontier deployment model that most challenges current governance frameworks. An agentic AI might: browse the web, read and write documents, send emails, execute code, and make API calls — all in service of a goal specified by a human prompt, but with minimal human oversight of the individual actions it takes. The human oversight mechanisms designed for prompt-response AI — a human reads the output and decides whether to use it — are fundamentally inadequate for agentic AI, where the AI is taking consequential actions faster than human review can keep pace.
Effective agentic AI governance requires different oversight mechanisms: pre-execution review of the AI's plan before it begins taking actions; bounded action spaces that limit what actions the AI can take without human approval; monitoring of actions in real-time with circuit-breakers that pause execution when unexpected actions are detected; and post-execution audit of what actions were taken and their consequences. Most enterprise organisations deploying agentic AI have not implemented these mechanisms, because the governance frameworks they are using were not designed for them.
Emergent capabilities and the limits of pre-deployment testing
One of the most challenging governance characteristics of frontier AI is emergent capabilities — capabilities that appear unpredictably as AI systems scale, which are not present in less capable versions of the system and were not specifically trained for. Examples include in-context learning (the ability to learn from examples in the prompt without updating model weights), chain-of-thought reasoning (complex multi-step reasoning), and various other capabilities that emerged in larger models. The governance implication is that testing a smaller or earlier version of a model does not fully characterise the governance profile of a larger or more capable version.
For enterprise governance teams, this means that the testing and validation processes adequate for narrow AI applications are not adequate for frontier AI. Comprehensive capability evaluation — testing the model for a broad range of capabilities, including potentially dangerous ones — is a component of responsible frontier AI deployment. The major frontier AI labs conduct safety evaluations before deployment; enterprises deploying these systems should understand what those evaluations covered and what they did not.