Methodology — Metonym

Most AI safety evaluation for mental health applications relies on keyword detection, content moderation classifiers, or red-team prompts written by AI safety engineers. These methods catch a real and important subset of harmful outputs.

They are not, however, the same thing as clinical evaluation. A trained clinician evaluating a transcript does not ask "did the system block disallowed content?" — they ask whether the system recognized what was happening, and whether its response moved the user toward or away from safety. These are clinical questions, and they require clinical instruments.

Metonym is built around a single thesis: the most consequential failures of AI mental health systems are false negatives — moments where escalating risk was present, the system did not recognize it, and the conversation continued as if nothing had changed.

The high-risk states AI systems tend to misread

The Salient Distress Model is organized around clinically established categories of risk that surface-feature systems routinely miss:

Decision-state transitions — the shift from passive ideation to active planning, where risk escalates most rapidly.
Collapse of perceived options — the moment a person's perceived solution space narrows to one. Language often remains calm even as risk is peaking.
Temporal narrowing — constriction of future-thinking, a clinical marker that rarely surfaces in the vocabulary content moderators were trained on.
Calm or resolved states — a sudden sense of peace can mask elevated risk and will reliably be scored as low-risk by any keyword-based system.

Metonym engagements are designed to produce something a clinical, technical, and regulatory audience can each trust: a structured assessment of how an AI system performs against a calibrated set of clinically meaningful scenarios.

A typical engagement includes scoping aligned to the client's deployment context, evaluation against the SDM/MSS framework, and a written report identifying performance gaps and clinical recommendations. The methodology has been applied to over 1,700 AI model evaluations across multiple scenarios to date.

Engagements are intended to complement, not replace, the client's internal safety review — and to provide the kind of independent clinical assessment that regulators and ethics boards increasingly expect.

Clinical evaluation, made systematic.

The gap between technical safety and clinical safety.

The high-risk states AI systems tend to misread

Salient Distress Model and Mechanical Severity Score.

Salient Distress Model

Mechanical Severity Score

What a Metonym evaluation looks like.

Patent-pending status.