Skip to main content
INSIGNIA.
Engage
PRODUCT0001
PG. SUB · verum
/ Product · Verum · v.1.xLIVE · IN PRODUCTION

The verdict is deterministic. The AI is everywhere else.

A clinical-inference platform shipping two production-grade surfaces on a single engine. Payer-side prior authorization and provider-side multimodal decision support, where verdicts are computed by code and LLMs handle extraction, narrative, and second-opinion texture.

Live deployment · two products · one engine

VERUM · clinical inference platformverum-health.vercel.app
§ 02 · Thesis

If extraction is correct, the verdict is correct by construction.

The architectural rule across Verum is a deliberate split between load-bearing and decorative AI. Extraction is genuinely a language task and stays LLM-bound. The verdict is a code path: rule-tree evaluation in pure TypeScript, same answer across all models, all providers, all runs. Five LLM passes run in parallel around the verdict to add narrative texture, second-opinion advisory, soft signals, and qualitative tags. None of them own the decision. The trace shows you which is which on every case.

FIG. i · agent pipeline
  1. 01
    NOTE
    Clinical note
    Free-text · structured · voice
    Probabilistic LLM extraction · per-flagship Zod schema
  2. 02
    FACTS
    Extracted facts
    Typed fact spans · source-quoted
  3. 03
    GATE
    Deterministic eval
    Rules × facts → verdict
  4. 04
    VERDICT
    Decision + audit
    APPROVE · PEND · DENY
FIG. ii · same input, two approaches
IDENTICAL · input case
Same clinical note. Same policy. Run repeatedly.

“62F with 8 weeks of PT-documented chronic low back pain. ODI 42. Conservative trial of NSAIDs and structured PT failed. No red-flag findings on exam. Requesting lumbar MRI without contrast.”

CPT
72148
ICD
M54.50
Payer
MCR
APPROACH A · pure LLM
⚠ NON-DETERMINISTIC
Run 1 of 3 — same input, different verdict each time.
VERDICT
PEND
confidence
0.68

Conservative therapy possibly insufficient. Recommend additional records review.

APPROACH B · Verum
✓ DETERMINISTIC
Same input. Every run. Forever.
VERDICT
APPROVE
confidence
0.94

“All mandatory criteria met. PT ≥ 6w (✓ 8w). ODI ≥ 30 (✓ 42). No red flags required for approval.”

Why it never changes
  • · Rules evaluated in pure code
  • · No temperature, no sampling
  • · Same facts → same answer, always

The probabilistic side flickers because that is the honest behavior of an LLM under temperature. The deterministic side stays still because it is a pure-code rule tree. Both have a place in the system. Only one is allowed to drive the verdict.

§ 03 · The platform

Two production surfaces. One engine. Both sides of the payer-provider divide.

Verum runs Decide (payer-side prior authorization) and Insight (provider-side multimodal clinical decision support) on a shared `lib/clinical-policies/` framework with the same deterministic-gate architecture, the same audit ledger pattern, and the same model-portable provider lattice. The pair is connected at the data layer via the BRIDGE drawer — a PA reviewer can pull a vision read from Insight into a Decide case mid-review without leaving the surface.

/ 01Utilization management teams at payers

Verum · Decide

Prior authorization, reviewer-native.

8production surfaces · live
  • Four procedure families encoded: lumbar MRI, cervical MRI, brain MRI, CT abdomen / pelvis
  • Three payers per family: Medicare, Commercial, Medicaid
  • Two policy versions: 2024 and 2026 LCDs, retrospectively replayable
  • Lumbar and cervical Medicare 2026 encoded verbatim from Aetna CPB 0236 and CMS L34220
  • Counterfactual panel: any DENY or PEND shows the fact that would flip the verdict
  • Cross-payer fan-out: same case, three lanes, three policy ontologies in parallel
Open Decide
/ 02Clinical teams: radiologists, pathologists, specialists

Verum · Insight

Multimodal clinical decision support.

12production surfaces · live
  • Four flagships fine-tuned to Verum Vision's specialty training distribution
  • Six-stage multimodal pipeline: extract, annotate, image, fuse, gate, synthesize
  • Verum Vision in the critical path, contributing facts but never deciding
  • Append-only audit ledger with sealed-state immutability and supersedes-chain lineage
  • PDF + FHIR R4 export per sealed record, ready for downstream audit systems
  • TrinaryVerdict shared with Decide: CLEAR / REVIEW / ESCALATE
Open Insight
+ THE BRIDGE

The two products are connected at the data layer. A PA reviewer inside Decide can pull a vision read from Insight into their case mid-review — findings stream into a right-side drawer, selected ones are promoted with audit attestation, the decision-packet picks up a Page-3 audit-detail section. Detailed in Chapter 06.

§ 04 · Decide

Same patient. Same note. Three payer verdicts.

Cross-payer policy variance is the single most expensive operational problem in UM. Verum encodes each payer's policy as data, not prompt. The same clinical extraction runs three lanes in parallel, each evaluated against its own rule tree. The divergence is visible — every cited rule traces back to verbatim policy text and a fetchable URL.

FIG. iii · same case, three payer ontologies
INPUT · shared clinical note
CPT 72148ICD M54.50

“62F with 8 weeks of PT-documented chronic low back pain. ODI 42. Conservative trial of NSAIDs and structured PT failed. No red-flag findings on exam. Requesting lumbar MRI without contrast.”

PAYER LANEMedicare
VERDICT
APPROVE
confidence · 0.94
CITED · Aetna CPB 0236 §1 item 10

Six-week conservative therapy threshold met. ODI 42 exceeds policy floor. No red flags required.

PAYER LANECommercial
VERDICT
PEND
confidence · 0.71
CITED · SC BCBS CAM 709 §3

Neurogenic claudication mapping required for payer track. Missing fact: explicit dermatomal correlation.

PAYER LANEMedicaid
VERDICT
DENY
confidence · 0.88
CITED · Carelon 2025-11-15 spine §4

EMG required before advanced imaging for radiculopathy cases. Conservative pathway not satisfied for this payer.

Each lane evaluates the same extracted facts against its own policy ontology. The divergence isn't a model artifact, it's encoded data — every cited rule traces back to verbatim policy text and a fetchable source URL.

FIG. iv · the four analytical surfaces
/ 01

Reviewer-resolve panel

On any pend, enter the missing fact value live. The verdict re-derives in <5 ms with the resolution captured on the decision's resolutionAudit.

/ 02

Counterfactual editor

Click any rule chip. The panel shows what would change the verdict — minimum required threshold, fact-by-fact.

/ 03

Policy simulator

Retrospective mode runs a 30-case eval suite against a tweaked policy in under a second. No LLM call.

/ 04

Soft-signal advisory

Five decorative LLMs surface narrative, advisor opinion, soft signals, qualitative tags, denial letter draft. None of them own the verdict.

Decide · live surfaceOpen cross-payer view
§ 05 · Insight

Vision in the critical path. Never in the verdict.

Each Insight flagship is fine-tuned on Verum Vision's specialty training distribution and anchored in peer-reviewed clinical consensus. The vision encoder contributes structured facts in the same vocabulary as the text extractor. The deterministic gate runs identical code regardless of which encoder produced the evidence. When facts disagree, the audit trail names the dispute by ID.

FIG. v · four flagship CDS pathways

CXR Critical Findings

ACR Practice Parameter, 17-fact extraction

Modality

Chest X-ray

Peer-reviewed basis

ACR Practice Parameter + 3 cross-validated sources

Verdict scale
NORMALURGENTSTAT
Extracted facts per case
17
typed Zod fields
Sample cases bundled
14
with image + narrative
Verdict trinary
3
normal · urgent · stat
Open CXR flagship

All four flagships collapse to the same TrinaryVerdict (CLEAR / REVIEW / ESCALATE) so artifact generators — SOAP notes, advisor opinions, escalation letters, audit packets — plug in unchanged across modalities via prompt-swap.

§ 06 · BRIDGE

Insight inside Decide. Cross-product integration at the data layer.

A PA reviewer in /review encounters a spine case with an MRI attached. The dictated note is borderline. They click "Read this image with Verum Vision", a right-side drawer opens, Verum Vision streams findings into the panel, high-confidence findings are pre-checked. The reviewer promotes the ones they want into the case, the Decision carries imagePromotionAudit metadata, and the PDF decision-packet gains a Page-1 banner plus a Page-3 audit-detail section. The image read never leaves the case context.

FIG. vi · cross-product flow, five stations
  1. 01

    PA reviewer opens /review with a spine case + MRI attached

  2. 02

    Reviewer clicks "Read this image with Verum · Vision"

  3. 03

    BRIDGE drawer slides in. Verum Vision streams findings live.

  4. 04

    High-confidence findings pre-checked. Reviewer selects promotions.

  5. 05

    Decision carries imagePromotionAudit. PDF gains audit banner + detail page.

+ The platform pitch

No payer-only or provider-only competitor has this. The two-product architecture is the platform pitch; the BRIDGE is what makes it non-trivial.

§ 07 · Vision

Verum Vision, in the critical path.

Verum Vision is our proprietary 4-billion-parameter clinical vision model, trained and fine-tuned on proprietary specialty corpora across four clinical domains: thoracic radiology, vitreoretinal ophthalmology, primary-care dermatology, and uropathology. The model initializes from an open-weights medical foundation and is then put through continued domain pre-training plus task-specific supervised fine-tuning per flagship. We deploy it on a customer-tenant inference endpoint behind the customer's BAA. The adapter speaks the OpenAI-compatible /v1/chat/completions wire shape, so the same protocol works against vLLM, TGI, or any compatible runtime. Each flagship runs its own specialty-register system prompt anchored in its training distribution.

Parameters
4B
proprietary fine-tune
Training stages
2
continued pre-train + SFT
Specialty domains
4
CXR · DR · Derm · Path
Inference surface
OpenAI
/v1/chat/completions
FIG. vii · per-flagship specialty register
/ 01CXR

Thoracic radiologist

Specialty-register system prompt

Training corpus

Radiology impression-line corpus

/ 02DR

Vitreoretinal specialist

Specialty-register system prompt

Training corpus

Curated retinal-grading set

/ 03Derm

UK GPwSI dermatology

Specialty-register system prompt

Training corpus

Dermoscopy + clinical-photo corpus

/ 04Path

GU pathologist

Specialty-register system prompt

Training corpus

Prostate biopsy histology corpus

Deployment posture
  • 01Verum Vision · proprietary 4B clinical vision model, trained and fine-tuned in-house, deployable in customer-tenant VPC behind the customer's BAA
  • 02OpenAI-compatible /v1/chat/completions endpoint, runtime-agnostic adapter shape (vLLM, TGI, or any compatible inference server)
  • 03Per-flagship specialty-register prompts: thoracic radiologist (CXR), vitreoretinal specialist (DR), UK GPwSI-derm (NICE), GU pathologist (Gleason)
  • 04Trained and fine-tuned on curated specialty corpora across radiology impressions, retinal grading sets, dermoscopy plus clinical-photography libraries, and prostate biopsy histology, so flagships sit inside the training distribution by construction
§ 08 · Audit

Append-only ledger. Sealed records. Two export formats.

Every action since the case opened is on an append-only ledger. Sealing requires an attestation modal with a free-text statement and an optional Accept-with-reservations flag. Once sealed, the record is immutable; any further mutation throws SealedRecordError. Re-image creates a new record with supersedes lineage. Records persist locally, LRU-capped at 20.

SOC 2 Type II · Compliant
FIG. viii · the audit posture
/ a · the ledger story
01

Append-only ledger

Every event since case open is on the ledger. No mutation of historical entries.

02

Sealed-state immutability

Sealed records throw SealedRecordError on any post-seal write. Supersedes-chain lineage on re-image.

03

Attestation modal

Free-text statement + optional Accept-with-reservations flag. Operator credentials captured per seal.

/ b · two export formats
04

PDF decision packet

react-pdf, paper-grade layout. Page 1 verdict + attestation; Page 2 facts table; Page 3 rule evaluations + disputes; Page 4 synthesis + advisor opinion.

05

FHIR R4 Bundle

Patient + Condition[] + DocumentReference + Provenance + AuditEvent[]. CRD-aligned posture via meta.tag[]; we do not claim CRD STU 2 profile conformance the IG doesn't publish.

CRD-aligned posture via meta.tag[]; Verum does not claim CRD STU 2 profile conformance the IG does not publish. The audit posture is designed for review and forensic, not for marketing literature.

§ 09 · Operator agency

Reviewer-attested override. Resolve missing facts. Counterfactual probe.

Verum is not autonomous-by-default. Three operator actions are first-class. The deterministic gate is the reproducible baseline; each operator action is captured in the audit chain so the override or resolution lives in the record alongside the verdict it changed.

FIG. ix · four first-class operator actions
/ 01

Resolve missing facts

On any PEND, enter the missing fact value. Live preview shows pre-resolution vs post-resolution verdict. "Apply to live decision" commits with resolutionAudit on the record.

SurfaceDecide + Insight
/ 02

Override with AI verdict

When the advisor LLM disagrees with the gate, an explicit override button swaps the live verdict. The deterministic outcome is preserved on decisionOverride for revert.

SurfaceDecide
/ 03

Counterfactual editor

Two-column diff: current state vs the state that would flip the verdict. Threshold edits re-run the deterministic gate in code, no LLM call.

SurfaceDecide
/ 04

Dispute resolution

When text and image facts disagree, the Disputed flag surfaces with a Resolve button. Reviewer picks the canonical fact; resolution captured in the audit ledger.

SurfaceInsight

Each operator action lives in the audit ledger next to the verdict it changed. The deterministic gate is the reproducible baseline; the override or resolution is the operator's attested deviation. Both survive a sealed-record forensic read.

§ 10 · Model lattice

Five providers. Forty-eight models. One env-flip to swap.

Verum is model-portable by construction. The TopBar selector switches provider live across every load-bearing call. Per-request BYOK key threading via AsyncLocalStorage means the customer's key flows through every provider factory in the call graph without changing call-site signatures. Adding a provider is one file edit plus a model-registry entry.

Models
48
across 5 providers
Providers
5
OpenAI-compatible adapters
Env-flip
1
to swap provider live
FIG. x · the five-provider lattice
/ 01free + paid

Groq

12models

Llama 3.3, Mixtral, gpt-oss-120b

/ 02paid

OpenRouter

14models

Claude Sonnet, GPT-4, Mistral Large, many more

/ 03free + paid

Google AI Studio

4models

Gemini 3 preview, Gemini Pro

/ 04free + paid

Mistral La Plateforme

6models

Mistral Large, Mistral Small, Voxtral

/ 05free

NVIDIA NIM

12models

Nemotron-3 Nano + Super, Qwen3-Next, Kimi K2 Thinking, GLM 4.7

// the call-site stays untouched
const facts = await extractFacts(note)

// inside the factory, the BYOK key is already in scope
function groqClient() {
  const key = byokScope.get("groq")
    ?? process.env.GROQ_API_KEY
  return new Groq({ apiKey: key })
}

// AsyncLocalStorage carries the key through the call graph
withByok({ groq: customerKey }, async () => {
  return run(case)   // every nested provider call sees it
})
Portability posture
  • 01Per-request BYOK threading via AsyncLocalStorage, customer keys never persist server-side
  • 024-tier model reliability ranking: strict-JSON-validated models surface as recommended
  • 03Per-model strict-JSON badge in the picker
  • 04Mistral Large Latest validated as the EU-sovereign load-bearing extraction path
  • 05Same wire protocol for STT: Groq Whisper Large turbo + Mistral Voxtral, switchable live
§ 11 · Onboarding

Twenty-two walkthrough tours. Roughly two hundred steps.

Operator training is built into the surface. Fourteen intro tours auto-launch one-per-session covering each major surface in 50-100 seconds. Eight deep-dive companions walk every load-bearing affordance: four input modes, voice sub-modes, suggest-from-note extraction, model selector, attestation flow, override flow, FHIR / PDF export, cross-payer chip resolution, simulator presets, validation ECE, eligibility heuristic, CDS Hooks discovery. Approximately 200 steps total. Enterprise rollout doesn't require a separate training program.

FIG. xi · the tour catalog
Total tours
22
across every surface
Steps
~200
self-paced · keyboard nav
Intro tours
14
50-100s each · auto-launch
Deep-dives
8
load-bearing affordances
Tour mechanics
  • 01Spotlight via four fixed-position scrim rectangles, target stays clickable through the tour
  • 02Keyboard nav: ESC closes, arrows navigate, Enter advances
  • 03rAF-driven repositioning on scroll, resize, content change
  • 04Accent color cascades per page so the tour visually integrates with the surface
  • 05Tours explicitly name what the surface cannot model. The simulator tour says "this can't model reviewer-behavior shifts"
§ 12 · Quality discipline

Over a thousand assertions, two evaluation harnesses, three wire logs.

The codebase ships with deep test coverage as a precondition, not a follow-up. Decide carries 1,020+ pure-code smoke assertions across 24+ test files. Insight ships 142+ always-on deterministic assertions plus 156+ conditional LLM assertions for opt-in end-to-end runs. Both products carry their own evaluation harness: Decide's reports ECE calibration; Insight's tracks per-flagship coverage. Three vendor-adapter wire logs (Groq, NIM, Verum Vision) capture full request bodies and raw responses for any failure forensic.

FIG. xii · test + eval coverage
Smoke assertions
0+
Decide pure-code path
Always-on assertions
0+
Insight deterministic
Eval cases
30 + 40
Decide + Insight harnesses
Sample cases
47 + 47
Decide + Insight, rich patient context
The operability layer
  • 01GROQ_WIRE_LOG, NIM_WIRE_LOG, VISION_WIRE_LOG, env-gated fetch wrappers with zero overhead when unset
  • 02Diagnostic helpers in lib/ai/error-diagnostics.ts, failed_generation extraction, error-shape snapshots
  • 03ECE calibration metric on the Decide /evals page
  • 04Per-flagship Insight coverage matrix on /insight/coverage
  • 05Cross-model A/B comparison: extraction-quality diff under identical deterministic gate
§ 13 · The register

What's shipped. What's in flight.

SHIPPED
11
  1. SOC 2 Type II
    Audit complete · current
  2. Live deployment
    verum-health.vercel.app · two products
  3. Deterministic-gate architecture
    Pure-code verdict · same answer across 48 models
  4. FHIR R4 export
    Bundle with Patient, Condition[], Provenance, AuditEvent[]
  5. Verum Vision encoder
    Proprietary 4B-param model · four flagships in production
  6. Append-only audit ledger
    Sealed-state immutability · supersedes chain
  7. Cross-payer policy fan-out
    Medicare · Commercial · Medicaid in parallel
  8. BYOK key threading
    Per-request key flow via AsyncLocalStorage
  9. Voice intake
    Three modes · Whisper + Voxtral · editable transcript
  10. Counterfactual probe
    Re-derive in pure code under 5 ms
  11. 22-tour onboarding catalog
    ~200 steps · self-paced
IN FLIGHT
5
  1. HIPAA BAA
    Q3 2026
    BAA template drafted · provider signatures pending
  2. Multi-tenant deployment
    Q4 2026
    Tenant isolation tested · database layer in build
  3. CDS Hooks Hook 2.0 / SMART-on-FHIR launch
    Q4 2026
    1.0 prefetch shipped; 2.0 + SMART launch in design
  4. Spine-keyed PA policy rules
    Q3 2026
    BRIDGE drawer promotes findings; rule expansion in design
  5. Bbox overlays on image findings
    Q1 2027
    Spatial grounding via SAM2 / DETR chain

The shipped column is what survives a customer's technical due-diligence today. The in-flight column carries target dates the team commits to. Everything on the page above this register is drawn from the shipped column.

§ 14 · Engage

Open the platform.

Live deployment is open for evaluation. Two products, one engine, every capability above is clickable today.

Live · two products · SOC 2 Type IIVERUM · MMXXVI
REGISTER · WORDMARKIN.↓ CH.05