Ethics of Clinical AI — Lecture 1 of 4
Data InDeed | dataindeed.org
2026-01-01
The question is not whether the model is interpretable. The question is whether the deployment is defensible.
Post 01 Opacity & Interpretability
Post 02 Accountability Without Interpretability
Post 03 Ethical Failure Modes in Registry Data
When black boxes save lives
The popular argument:
“Clinical AI must be interpretable because clinicians need to understand why the model made its recommendation before they can trust it.”
The assumptions embedded in this argument:
All four are contestable.
The clinical reality:
Clinicians use blood pressure readings, troponin assays, and pulse oximeters without understanding the underlying sensor physics or assay biochemistry.
What they require: validation, calibration, known failure modes, and clear decision context. Not a causal pathway through the measurement.
Opacity refers to:
Opacity does not refer to:
The conflation:
Opacity of mechanism ≠ Opacity of behavior
A black box can have:
These are the ethically relevant properties — not the ability to explain a specific weight.
If the interpretable model makes more errors, the demand for interpretability has a body count. The ethical case for a black box with strong governance is real.
The traditional ethics framing:
The alternative framing:
The second chain does not require interpretability. It requires governance.
When black boxes are ethically defensible:
The sepsis example:
A black box sepsis alert with AUC 0.87 and documented false positive rate of 12% — deployed with clear override protocol and continuous monitoring — is more ethically defensible than an interpretable logistic model with AUC 0.71 and no prospective validation.
The ethics are in the governance, not the glass box.
The critical distinction:
Explanation: “Feature X contributed weight W to this prediction.”
Justification: “This prediction is reliable enough to support this clinical decision in this patient population.”
SHAP values and LIME provide explanation. They do not provide justification. Justification requires validation evidence — not post-hoc attribution.
Why this matters:
SHAP tells you which features drove the model score for patient 12345. It does not tell you whether the model’s overall behavior is trustworthy for that patient’s clinical context.
A clinician who sees SHAP values and concludes the model is justified is reasoning from explanation to justification — a logical gap that has caused real harm.
The trauma registry implication:
A trauma triage model with good SHAP explanations but no prospective calibration data in the deployed theater is not ethically justified — regardless of how interpretable it appears.
The question to ask: Has this model been validated in conditions like these? Not: Can the model explain its decision?
Traceability, not transparency
The intuition:
If we can see inside the model, we can hold it accountable.
The problem:
Models cannot be held accountable. They have no agency, no authority, and no consequences.
Accountability is a property of people and institutions. It requires:
The dangerous comfort of “the model did it”:
When accountability is attributed to the model — through interpretability, audit logs of weights, or SHAP values — it displaces from the humans who deployed it, set the thresholds, trained the clinicians, and monitored the outcomes.
This displacement is not neutral. It is an accountability gap that protects institutions at the expense of patients.
Accountability is highest at the institutional level — the hospital or MTF that deploys the model and the governance body that sets the policy. Interpretability of the model itself is most relevant to developers — not to the accountability chain for a specific decision.
What traceability requires:
# Every prediction that enters a clinical workflow must:
prediction_log <- list(
patient_id = "12345",
model_version = "mortality_v2.3.1",
model_hash = "sha256:a3f7c1d...",
prediction = 0.31,
threshold = 0.25,
alert_fired = TRUE,
clinician_id = "MD-0412",
override = FALSE,
override_reason = NA,
timestamp = "2026-09-01 14:32:07 UTC",
data_snapshot_hash = "sha256:b2e9f..."
)This log is the accountability artifact — not the SHAP values.
DoDTR context:
Every model prediction that informs a triage decision, a resource allocation, or a clinical practice guideline recommendation must be traceable to:
Without this, a post-incident review cannot determine whether the model contributed to harm — or protected against it.
Bias before the model
The standard framing:
A biased model → biased predictions → unfair outcomes. Fix: debias the model (reweighting, adversarial training, fairness constraints).
The upstream reality:
A biased registry → a biased training set → a model that learned real patterns — in a population that was itself shaped by systemic inequities.
“Debiasing” a model trained on biased data doesn’t undo the bias — it launders it.
Where ethical failure actually begins:
The implication:
Ethical analysis of a registry model must begin at the data dictionary and the inclusion criteria — not the loss function.
If the registry underrepresents severely injured patients who died in the field, every model trained on it will underestimate severe injury mortality.
The model sees only the red distribution. Any model trained on registry data will have implicitly learned: “What predicts outcomes in patients who survived to Role 2.” Not: “What predicts outcomes in all trauma patients.”
Measurement bias:
What gets measured depends on who is being measured, who is doing the measuring, and what tools are available.
Documentation bias:
What gets documented depends on what the system rewards:
Survivorship bias:
The quietest failure mode:
Patients who die before discharge disappear from follow-up. If you analyze 30-day outcomes, you only observe patients who survived long enough to have 30 days of data.
Long-term outcomes (PTSD, TBI sequelae, limb function) are only observed in survivors — meaning every model of long-term outcomes has built-in survivorship bias.
There is no statistical fix. The fix is documentation: acknowledging who is not in the data.
Using discharge-revised ISS or post-hoc complications as predictors in an admission-time model is temporal leakage. The model will appear better than it is at deployment — where only admission-time data is available.
Opacity & Interpretability
Accountability
Ethical Failure Modes
The meta-lesson: The ethical work in clinical AI is front-loaded — in how the registry was built, who was captured, what was measured, and how the deployment is governed. Statistical methods cannot repair upstream failures.
Prediction, Human Oversight & The Ethics of Data Exclusion
Posts 04, 05 & 06:
Data InDeed · Ethics of Clinical AI · Lecture 1 | ⚡ Open App