Ethics of Clinical AI — Lecture 4 of 4
Data InDeed | dataindeed.org
2026-01-01
The vocabulary is not a technical detail. It is the foundation on which every ethical claim about the data rests.
Post 10 Semantic Infrastructure as Ethics
Post 11 Responsible AI — Beyond the Checklist
Post 12 DoDTR Modernization
Ontology is not optional
What local codes mean in practice:
A trauma registry built on local codes has:
"GSW_ABD" at Site A = gunshot wound, abdominal, confirmed penetrating"ABDO_PEN" at Site B = abdominal penetrating trauma (includes blast)"PENET_TORSO" at Site C = penetrating torso (may include thoracic)These are not the same thing. A federated query that pools them treats them as equivalent.
The result: a multi-site mortality analysis that compares apples, oranges, and blast injuries — labeled identically.
The ethical implication:
When a clinical practice guideline is developed using pooled multi-site data built on local codes, the guideline is based on a fiction of definitional consistency.
If the guideline is wrong because the underlying data was inconsistently defined, the harm falls on patients — not on the data architects who chose local codes for convenience.
Using local codes in a registry intended to support multi-site analysis or CPG development is an ethical failure of infrastructure design.
OMOP CDM addresses the first two layers. Value-level metadata, governance, and ontology networks require additional institutional investment — but they are the layers that make federated research actually reliable.
The semantic gap is real and consequential:
| Civilian EHR | Military / DoDTR | Semantic equivalent? |
|---|---|---|
| ICU admission | Role 3 stabilization | Partial |
| OR / surgery | Damage Control Surgery | No |
| Ambulance transport | CASEVAC / MEDEVAC | No |
| Emergency dept | Role 2 / Aid station | No |
| Transfer | Echelon transition | No |
Why this matters:
When civilian EHR data is mapped to OMOP and DoDTR data is mapped to OMOP separately — using different source value mappings — a federated query may silently compare incomparable care episodes.
The ethical requirement:
A trauma registry that serves both civilian and military populations must document:
This is not a technical footnote. It is a required section of every data use agreement and methods section in any study that crosses the civilian-military boundary.
The governance question:
Who decides what “penetrating trauma” means in the DoDTR? Who decides when that definition changes? Who is notified when it changes? How are historical records handled after a definitional change?
These are not technical questions. They are political questions with ethical stakes. Whoever controls the ontology controls what the data says — and therefore what the evidence supports.
What responsible ontology governance requires:
Without governance:
A definition change propagates silently through the registry. Historical analyses become non-reproducible. Two publications using “the same registry” with different extraction dates reach different conclusions — not because the data changed, but because the vocabulary did.
The scientists, clinicians, and policy makers who rely on those publications have no way to know.
Beyond the checklist
Five themes the consensus addresses well:
These are correct, important, and well-specified.
Why checklists are necessary but insufficient:
A checklist tells you what to do. It does not tell you when it’s hard, what to do when requirements conflict, or how to make judgment calls in the gaps.
Responsible AI in clinical settings requires institutional capacity — not just compliance with a list.
The six gaps below are the places where checklists run out and judgment begins.
The two highest-severity, least-addressed gaps: local validation requirements and PHI in LLM workflows. Both are operationally critical for military trauma AI.
The risk:
LLMs (GPT-4, Claude, Llama, Gemini) are increasingly used in clinical workflows for:
The PHI risk:
If a LLM is called via an API with patient data in the prompt — even with a BAA in place — the data may be:
This is not a hypothetical. It is happening in clinical institutions now, without systematic governance.
The ethical requirement: Every LLM integration in a clinical workflow must have a documented, approved data governance policy specifying: what PHI categories may be included, what data remains within the organizational boundary, what retention applies, and who approved the exception.
What AI literacy requires for clinicians:
What the consensus says:
“Clinicians should have appropriate literacy.” It does not specify what “appropriate” means, how it is assessed, or what happens when a clinician who lacks literacy uses the system.
The military context:
A trauma surgeon at a Role 3 facility using a model-assisted triage tool needs to know:
This is not generic AI literacy. It is this model in this context. Deployment requires training — not just access.
The status quo has a body count
The current DoDTR state:
What this costs:
The body count argument:
If tourniquet-to-amputation analysis requires 90-day data lag, and the analysis that would reveal a care deviation takes 6 months to conduct, then 8+ months of patients experience the deviation before a corrective protocol is issued.
This is not an abstract concern about data quality. It is a structural delay in the feedback loop between care delivery and care improvement — with casualties at every month of lag.
The largest gaps: model performance monitoring (zero current capability), CPG monitoring automation, and semantic standardization. These are not aspirational — they are prerequisites for defensible clinical AI in trauma.
Why the framework matters ethically:
The five-level CPG compliance automation framework provides a progression from manual to autonomous processing — with governance requirements at each level.
This is ethical scaffolding: each level increase requires:
The ethical risk of skipping levels:
An institution that jumps from L0 (manual) to L4 (autonomous) without intermediate validation has produced an autonomous system with no validated foundation — and no institutional experience with the failure modes of each intermediate level.
The commander’s report as governance:
In the DoDTR modernization vision, the Commander’s Report is not a PDF delivered monthly. It is a live governance document:
The data infrastructure to produce this document is the same infrastructure that makes clinical AI in trauma defensible. They are not separate investments — they are the same investment.
The five institutional requirements for responsible DoDTR modernization:
None of these is a technology purchase. All five are organizational decisions that must be made before the first line of production code is written.
Lectures 1 & 2
Lecture 3
Lecture 4
The series meta-lesson: Ethics in clinical AI is not a philosophical add-on — it is embedded in every decision about data collection, model training, deployment governance, and monitoring infrastructure. The work is upstream, front-loaded, and institutional.
Data InDeed · Ethics of Clinical AI · Lecture 4 | ⚡ Open App