De-Risking
Frontier AI.

You asked for AI Governance.

You got Excel checklists, confused boards and a set of principles engineers can't use.

If it feels like it's all being improvised by people who don't understand the tech... you're probably right.

Stop paying for theatre. Call in the experts.

See the stack
You can't run your business on AI that's governed like a side hustle.

Most of what's sold as AI governance is theatre: an ethics board, a set of principles, a policy document, a workshop for the leadership team. The board shrugs and approves, lots of people talk about bias, HR create a community page, and it all melts on contact with the actual technology.
A business that barely knows what it has deployed gives governance no grip. So a framework full of "shoulds" meets a team that deals in specifics innovating with a technology that evolves daily. The result is foreseeable.

So what is AI governance?

AI governance is the set of controls, evidence and decision rights that make an organisation's use of AI deliberate and accountable. It is operational, and it's based on confidence and understanding. You cannot govern what you cannot evaluate, you cannot evaluate what you do not measure, and you cannot provide useful measures for a system that you don't really grasp.

87% of executives claim to have clear AI governance frameworks, but fewer than 25% have fully implemented the technical tools to manage risks.

Source: IBM Newsroom

A framework is a structure, and a structure is easy to approve. But unconnected to technical implementation it's window dressing. A governance function that only has the ability to advise means the delivery timeline decides what ships. This is why AI governance frameworks fail. Real governance is the work of making your organisation's AI practice intentional.

"Should" is not a control. It never will be.

Everyone is deploying.
Almost no one has it running.

This isn't a failure of intent. Most organisations have the framework, the principles, the committee. But it stalls when governance has to leave the document and start running inside the technology: continuous evaluation, instrumentation, controls that live in the stack.

That is a different order of difficulty and well outside the comfort zone of most AI governance programs. It is the edge of what they can currently execute, not the edge of their intent

AI doesn't wait for them to catch up. The gap widens with every model pushed to production without triage, every vendor claim taken on trust, every system running that no-one is watching, every new model release.

Address the Gap

>60%

are already using or plan to use agentic AI this year

Source: Cloud Security Alliance & Google Cloud

<28%

of respondents are confident they can secure AI used in core business operations

Source: Cloud Security Alliance & Google Cloud

<1%

of organisations have fully operationalised responsible AI

Source: World Economic Forum

What operational governance is made of.

None of this is a slide deck or a policy pack. Each is a working component that runs inside your systems, tuned to your models and your risk. Built to your environment, not lifted from a template.

Tiered AI Impact Assessment


Scrutiny scaled to consequence: heavy where it matters, light where it doesn't.


A triage that sorts every use case by what happens if it goes wrong, so a customer-facing credit model and an internal meeting-notes summariser don't carry the same paperwork. Each tier sets its own decision rights, escalation path, and evidence bar. A separate procurement track puts vendor "responsible AI" claims through real due diligence before the contract is signed, not after the incident.

AI Bill of Materials


A live inventory of every model, dataset, prompt, tool, and agent in production, each with an owner.


Most large organisations cannot name every model they have running, let alone the prompts, agents, and datasets feeding it. This is the discovery exercise that finds them, then tracks lineage, provenance, and dependencies as they change. It is the difference between an estate you manage and one you hope is behaving.

AI Risk Taxonomy


A risk model drawn from your actual systems, mapped to the standards you'll be audited against.


Generic risk registers list harms that could happen to anyone. This one is built from your own models, agents, and data flows, then mapped against ISO 42001, NIST AI RMF, the EU AI Act, and OWASP CycloneDX so every risk traces to a specific system and a specific control. It is the artefact that lets your risk committee and your engineers point at the same row and mean the same thing.

Reusable Guardrails Library


A versioned library of controls: input filters, output checks, and policy enforcers, tested and ready to drop in.


Without a shared library, every team rebuilds the same input filters and output checks slightly differently, none of them audited, none reusable. This collects them as versioned, tested controls any team can pull into a deployment, and gives the Bill of Materials something real to point at. As it grows it becomes the shared standard a community of practice forms around, rather than a policy nobody reads.

Evaluation & Observability Pipeline


Continuous evaluation, tracing, and drift detection wired into your CI/CD, running on every change.


Before a model ships, evaluation suites test it for accuracy, bias, and capability limits. Once it is live, tracing and drift detection watch its real behaviour and route a failure into your incident process the same way a Sev-1 outage would. The audit trail a regulator asks for is generated automatically, as a by-product of running the thing.

Frontier

Agentic Constraint Architecture


Decision-traceability, hard constraint layers, and red-teaming for systems that act without a human in the loop.


When an agent can place an order, send a message, or change a record on its own, a wrong step stops being a wrong answer and becomes a wrong action that has already happened. This bounds what an agent is allowed to do, traces why it did what it did, and red-teams it for failure modes a conventional risk framework never anticipated. The constraints have to hold at machine speed, because nobody is reading along.

Each of these runs continuously, and each produces a decision that lands on someone's desk: a drift alert, a model nobody had catalogued, an action just blocked in a live system. They arrive at commit speed; governance runs at committee speed. And there is nowhere for them to land.

It surfaces as pressure, not as a gap.

It almost never arrives as "our governance doesn't work." It arrives as pressure: a finding, a question the board can't answer, a headline. Underneath, two things are missing at once: an instrument that governs the real systems rather than an abstraction of them, and an organisation able to reason about what it would surface, let alone act in time.

Build the first without the second and the problem only moves: the signals arrive faster than any committee can meet, in a language the people who must act can't yet reason in. So the work runs on both. One half is instruments wired to the actual technology. The other is an organisation that can use them: the capacity to reason about risk together, decisions pushed to where the work happens, and a language for risk that holds from the engineer to the board. Where it starts depends on what's missing. Three shapes it has taken.

Scenario:

A regulatory finding had landed. The board was being asked questions it couldn't answer.

Underneath the finding, the engineering team had models in production with no agreed measures of performance, no instrumentation, and no shared vocabulary for what "working" meant. Assurance presumes a practice that can be assured, and there wasn't one yet. So the first task was not the governance framework. It was defining what good looked like in operational terms, instrumenting the systems already running, and giving the team ways to recognise failure before the regulator did.

The framework followed, with something to govern.

Scenario:

The board was asking about AI risk appetite. Nobody could answer.

The conversation kept stalling because the people being asked to set appetite had no working understanding of what they were setting it for. The exec team had been through training that left them more confident and no more capable: a panel of specialists explaining their specialisms, none of it integrated, none of it pitched at the altitude a board member actually needs. The work became rebuilding what executive AI literacy means: enough technical understanding to ask precise questions, enough strategic framing to allocate capital intelligently, enough governance fluency to set appetite without being captured by the people they are meant to be governing.

Risk appetite is downstream of comprehension. The comprehension had to be built first.

Frontier

Scenario:

The systems were already past where the field had answers.

This team was technically deep and its practice was already intentional. The questions they were asking sat at the genuine frontier of the field, and the governance specialists they brought in could not grasp the situation. The work was building safety frameworks for systems whose failure modes are not yet well understood by anyone: agentic architectures, multi-agent coordination, self-modifying behaviour. Constraint architectures, red-team methodologies, and decision-traceability for systems acting without a human in the loop, where there is no established best practice to lean on.

What gets built here is what the rest of the field eventually inherits.

Matt Newman - AI governance and safety advisor

From neural networks research to international AI standards.

Work like this only holds when several disciplines meet in one person: safety research at the frontier, a hand in writing the standards, the hard-won craft of operationalising governance inside large regulated organisations, and the change practice that decides whether any of it gets adopted. Most people have one. The engagements that fail, fail at the seams between them.

Fifteen years in AI and machine learning, a decade specifically in AI governance and safety, and more than thirty enterprise engagements across banking, insurance, energy, healthcare, and telecom.

"I help executives turn a vision into reliable assets, and engineers turn experiments into trusted systems."

That's what makes this work. A risk taxonomy that maps to your actual technology. Governance requirements that your engineers can implement without interpretation. Board assurance that is technically honest, and a board who are confident in having those conversations.

At SingularityNET I built safety frameworks for frontier AGI systems. At nib I'm operationalising ISO 42001 and NIST AI RMF through governance boards, LLM evaluation pipelines, and agentic observability infrastructure, working with engineers on exactly how guardrails land in their stack and with the risk committee on what that means in terms they can act on.

Thinking on the frontier.

All articles
Technical

Why You Can't Unit-Test an AI Agent

Read more - Why You Can't Unit-Test an AI Agent
Governance

Audit Committees Are Asking the Wrong AI Questions

Read more - Audit Committees Are Asking the Wrong AI Questions
Governance

Should Is Not a Control: How AI Ethics Built Its Own Graveyard

Read more - Should Is Not a Control: How AI Ethics Built Its Own Graveyard
Governance

AI Governance for the Board

Read more - AI Governance for the Board
Governance

Interim Policy for Generative AI

Read more - Interim Policy for Generative AI
Views

Australia's AI Action Plan – An Open Response

Read more - Australia's AI Action Plan – An Open Response
Strategy

Implementing AI Ethics: Complex architectures

Read more - Implementing AI Ethics: Complex architectures
Strategy

MLOps - Fast-tracking AI Ethics?

Read more - MLOps - Fast-tracking AI Ethics?

Let's close
the gap.

If you're a CRO, CISO, CDO, or board member trying to get ahead of AI risk, rather than catch up to it, I'd like to hear from you.

That might be building your first governance framework, hardening an enterprise-scale agentic deployment, stress-testing what your engineers have already built, or preparing your leadership team for what's coming. Bring the specific problem, and I'll tell you directly where I can help.