Datavant operates the largest health data network in the United States, connecting more than 80,000 hospitals and clinics, 100% of US payers, and 350+ real-world data organizations globally. Founded in 2017, the platform enables privacy-preserving data linkage across healthcare organizations, letting hospitals, pharma companies, researchers, and payers share clinical information without exposing patient identities. Following a 2021 merger with Ciox Health, the company added record retrieval, data transformation, and AI-powered analytics to its core linking technology, extending the platform into end-to-end health data logistics.
The threat model is non-trivial: healthcare data flows across fragmented systems with inconsistent security postures, creating exposure points at every handoff. Datavant's infrastructure handles de-identification, tokenization, and controlled linkage at scale, running on AWS with Kubernetes orchestration. The stack includes Snowflake for warehousing, Kafka and Kinesis for streaming pipelines, and Spark/Databricks for transformation workloads. Monitoring runs through Datadog, Grafana, and CloudWatch. Backend services are built in Python, Go, and Java; frontend tooling uses React and Next.js with GraphQL APIs.
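To make the idea of tokenization for linkage concrete, here is a minimal Python sketch of one common approach: a keyed hash (HMAC-SHA256) over normalized identifiers. It is an illustration under stated assumptions, not a description of Datavant's actual protocol. The function names, the choice of fields, and the key value are all hypothetical.

```python
import hashlib
import hmac
import unicodedata


def normalize(value: str) -> str:
    """Strip accents, case, and punctuation so trivial variants compare equal."""
    decomposed = unicodedata.normalize("NFKD", value)
    ascii_only = decomposed.encode("ascii", "ignore").decode("ascii")
    return "".join(ch for ch in ascii_only.lower() if ch.isalnum())


def make_token(key: bytes, first: str, last: str, dob: str) -> str:
    """Derive a keyed, one-way token from normalized identifiers.

    Hypothetical helper: the field set and the HMAC construction are
    assumptions chosen for illustration only.
    """
    message = "|".join([normalize(first), normalize(last), normalize(dob)])
    return hmac.new(key, message.encode("utf-8"), hashlib.sha256).hexdigest()


# Hypothetical key; a real deployment would keep this in a managed secret store.
KEY = b"hypothetical-linkage-key"

# Two differently formatted records for the same person yield the same token,
# so they can be joined without exchanging the underlying identifiers.
token_a = make_token(KEY, "José", "O'Neil", "1980-02-03")
token_b = make_token(KEY, "jose", "ONEIL", "1980/02/03")

# A different key yields an unrelated token for the same person.
token_other_key = make_token(b"another-hypothetical-key", "José", "O'Neil", "1980-02-03")
```

Because the hash is keyed, a party without the key cannot rebuild tokens from a list of known names and birth dates, which is the main weakness of plain unkeyed hashing for this purpose.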
Security engineering here means designing cryptographic protocols for data minimization, building access controls that span hundreds of institutional boundaries, and maintaining audit trails across a network that processes sensitive health records at scale. The operational surface includes microservices handling real-time data ingestion, batch ETL pipelines built with Airflow and Glue, and customer-facing analytics tools feeding Power BI, Tableau, and Looker dashboards. This is infrastructure-level work: a misconfiguration or weak isolation boundary can expose patient data at any organization connected to the network.
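One way to make an audit trail tamper-evident is to chain entries by hash, so that altering any past record invalidates every later one. The Python sketch below illustrates that general technique only. The entry fields and function names are hypothetical and do not reflect how Datavant implements auditing.

```python
import hashlib
import json

GENESIS = "0" * 64  # placeholder "previous hash" for the first entry
FIELDS = ("actor", "action", "resource", "prev")


def _digest(body: dict) -> str:
    """Hash a canonical JSON encoding of the entry body."""
    encoded = json.dumps(body, sort_keys=True).encode("utf-8")
    return hashlib.sha256(encoded).hexdigest()


def append_entry(log: list, actor: str, action: str, resource: str) -> dict:
    """Append an entry whose hash covers its content and the previous hash."""
    prev = log[-1]["hash"] if log else GENESIS
    body = {"actor": actor, "action": action, "resource": resource, "prev": prev}
    entry = dict(body, hash=_digest(body))
    log.append(entry)
    return entry


def verify(log: list) -> bool:
    """Recompute every hash and check that each entry links to its predecessor."""
    prev = GENESIS
    for entry in log:
        body = {name: entry[name] for name in FIELDS}
        if entry["prev"] != prev or entry["hash"] != _digest(body):
            return False
        prev = entry["hash"]
    return True


# Hypothetical events, for illustration only.
audit_log = []
append_entry(audit_log, "svc-ingest", "read", "dataset/claims")
append_entry(audit_log, "analyst-1", "export", "dataset/claims")
intact = verify(audit_log)

# Editing an earlier entry after the fact breaks verification.
audit_log[0]["actor"] = "someone-else"
tampered_ok = verify(audit_log)
```

A chain like this detects modification but does not prevent it; in practice it would be paired with write-once storage or an externally anchored head hash.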