Platform

Not all de-identification is created equal

HIPAA provides two paths to de-identification. Safe Harbor is the simple one — remove 18 identifiers and move on. Expert Determination is the powerful one — a statistically rigorous, flexible approach that preserves far more data utility while achieving stronger privacy guarantees.

expert_determination_cert.json
{
  "method": "expert_determination",
  "dataset": "claims_q1_2026",
  "privacy_model": "k-map + differential_privacy",
  "linkage_tokens": "retained",
  "geographic_fields": "preserved (3-digit zip)",
  "clinical_dates": "preserved",
  "demographic_fields": "generalized",
  "re_identification_risk": "0.04%",
  "data_utility_preserved": "94.2%",
  "certification": "signed — expert opinion attached",
  "status": "certified"
}

What Expert Determination unlocks

Safe Harbor checks a box. Expert Determination opens the door to flexible, high-utility, defensible de-identification that adapts to your use case.

Pseudo-Anonymous Identifiers

Retain linkage tokens from providers like Datavant, IQVIA, HealthVerity, and Spindle for cross-dataset matching. Safe Harbor's identifier #18 — "any other unique identifying number, characteristic, or code" — forces these out. Expert Determination lets you keep them, so long as re-identification risk remains certifiably low.

Flexible Remediation Strategies

Preserve what matters most to your use case. Need geographic precision? Keep it and suppress demographic fields instead. Need clinical dates for temporal analysis? Retain them at the cost of other entities. The remediation adapts to your analytical needs, not the other way around.

Certified Compliance Documentation

Every Expert Determination produces a signed PDF opinion authored by Integral and backed by qualified statistical experts — justifying low re-identification risk based on your specific data, use case, and privacy posture. A formal, auditable artifact designed for regulators, partners, and legal teams.

Alternative Privacy Models

Expert Determination isn't limited to simple redaction. Integral supports differential privacy, k-map analysis, and transformation strategies like generalization, truncation, and feature engineering — choosing the privacy model that best fits the data and the use case.

Synthetic Replacement — "Hiding in Plain Sight"

For clinical notes, replace real PHI entities with synthetic ones instead of redacting them. It becomes extremely difficult for an attacker to distinguish a false negative from a synthetic replacement, because everything in the output looks real. The text stays natural, the clinical context stays intact, and the privacy guarantee is stronger than redaction alone.

Higher Data Utility, Stronger Privacy

Safe Harbor is a blunt instrument — remove everything on the list, regardless of whether it matters for your use case. Expert Determination is surgical — remove only what's necessary, keep everything that's safe, and prove it with math. The result: datasets that are both more useful and more defensible.

Safe Harbor vs. Expert Determination

Two HIPAA-compliant paths. Very different outcomes for your data.

Safe Harbor Expert Determination
Approach Remove or generalize 18 fixed identifiers Statistical risk assessment by a qualified expert
Flexibility None — rigid, predefined rules Fully configurable per dataset, use case, and privacy posture
Linkage Tokens Must be removed (identifier #18) Can be retained with certified low re-identification risk
Data Utility Lower — rigid removal destroys analytical value Higher — optimized trade-offs preserve what matters
Certification Self-attestation — no expert required Expert-signed PDF with statistical justification
Remediation One-size-fits-all Customer-directed trade-offs (geographic vs. demographic, date preservation, etc.)
Privacy Models Not applicable Differential privacy, k-map, generalization, truncation, feature engineering
Unstructured Data Redaction only Synthetic replacement ("hiding in plain sight")
Regulatory Defensibility Moderate — follows the checklist Strong — backed by statistical analysis, expert opinion, and formal documentation

Why Integral for Expert Determination

Expert Determination is only as good as the team behind it. The flexibility that makes it powerful also makes it hard — there's no checklist to follow, no formula to plug in. The right remediation strategy depends on the data, the use case, and the customer's privacy posture. Getting it wrong means either destroying data utility or leaving re-identification risk on the table.

Integral has been doing this across diverse customer scenarios, data types, and regulatory environments for years. That depth of experience means we can get creative with remediation strategies in ways other providers can't. We've seen enough edge cases, tricky datasets, and unusual use cases to know where the boundaries are — and how to push right up against them without crossing them.

The result is a signed certification — a PDF opinion backed by qualified statistical experts — that justifies a sufficiently low risk of re-identification based on your specific data, use case, and privacy posture. Not a generic report. Not a black-box score. A defensible artifact that stands up to regulatory scrutiny, partner due diligence, and customer audit requests.

Every engagement starts with your data and your use case. Every certification is built to defend.

Get a certified Expert Determination

Book a demo to see how Integral delivers defensible Expert Determinations in days — with flexible remediation, signed certifications, and higher data utility.