Clinical Data Warehouse - Data Overview

* The data may be out of date due to the ongoing transition of our source data to the Epic EHR system.


Last report update:

Last data refresh:

CDW Composition and Overlap


Available Data and MPI

The record systems linked by our Enterprise Master Patient Index (MPI) are Allscripts TouchWorks, GE Centricity, and axiUm Dental.

Centricity contains Houston area administrative (billing) data.

The Allscripts EHR uses Centricity for billing data, so Allscripts patients are a subset of patients in the Centricity set.

axiUm Dental, on the other hand, is a completely independent billing system, so we use the MPI to measure its patient overlap with the other systems.
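As a rough illustration of how overlap can be measured once every record carries an MPI ID, the sketch below intersects the sets of MPI IDs seen in each source system. The system keys, the ID values, and the `overlap_counts` helper are hypothetical toy examples, not the CDW's actual schema or query.

```python
from typing import Dict, Set


def overlap_counts(mpi_ids_by_system: Dict[str, Set[str]]) -> Dict[str, int]:
    """Count the patients (by MPI ID) shared by each pair of source systems."""
    systems = sorted(mpi_ids_by_system)
    counts = {}
    for i, a in enumerate(systems):
        for b in systems[i + 1:]:
            shared = mpi_ids_by_system[a] & mpi_ids_by_system[b]
            counts[f"{a} & {b}"] = len(shared)
    return counts


# Toy data (assumed): MPI IDs observed in each source system.
mpi_ids = {
    "allscripts": {"E1", "E2"},
    "centricity": {"E1", "E2", "E3"},
    "axium": {"E3", "E4"},
}

overlaps = overlap_counts(mpi_ids)

# Allscripts patients are expected to be a subset of Centricity patients.
allscripts_is_subset = mpi_ids["allscripts"] <= mpi_ids["centricity"]
```

In this toy data the Allscripts set is fully contained in the Centricity set, while axiUm shares only one patient with Centricity.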



To facilitate research across multiple data sources, we've implemented an Enterprise Master Patient Index (MPI, also written eMPI) that gives every patient a unique ID distinct from their local medical record number (MRN).

This allows us to consolidate the records of patients who have been seen in different settings. The MPI also identifies duplicate patient records by linking those with identical or nearly identical demographic data, which improves data quality for clinicians and supports a high quality of care.
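The idea of linking records with identical or nearly identical demographics can be sketched as follows. This is a minimal toy example under stated assumptions, not the matching logic of the actual MPI product: the field names, the 0.85 threshold, the exact-match rule on date of birth, and the `MPI-n` ID format are all hypothetical.

```python
from difflib import SequenceMatcher
from itertools import combinations


def normalize(value: str) -> str:
    """Lowercase and collapse whitespace so trivial differences do not matter."""
    return " ".join(value.lower().split())


def similarity(a: dict, b: dict) -> float:
    """Average name similarity; records with different birth dates never match."""
    if a["dob"] != b["dob"]:
        return 0.0
    fields = ("first_name", "last_name")
    scores = [
        SequenceMatcher(None, normalize(a[f]), normalize(b[f])).ratio()
        for f in fields
    ]
    return sum(scores) / len(scores)


def assign_mpi_ids(records: list, threshold: float = 0.85) -> list:
    """Group records whose similarity meets the threshold (union-find)."""
    parent = list(range(len(records)))

    def find(i: int) -> int:
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path compression
            i = parent[i]
        return i

    for i, j in combinations(range(len(records)), 2):
        if similarity(records[i], records[j]) >= threshold:
            parent[find(j)] = find(i)

    # Every record in a group receives the same (hypothetical) enterprise ID.
    return [f"MPI-{find(i)}" for i in range(len(records))]


# Toy records (assumed): each keeps its local MRN from its source system.
records = [
    {"source": "allscripts", "mrn": "A100", "first_name": "Maria",
     "last_name": "Gonzalez", "dob": "1980-02-14"},
    {"source": "axium", "mrn": "D7", "first_name": "maria",
     "last_name": "Gonzales", "dob": "1980-02-14"},
    {"source": "centricity", "mrn": "C55", "first_name": "John",
     "last_name": "Smith", "dob": "1975-06-01"},
]

mpi_ids = assign_mpi_ids(records)
```

Here the first two records differ only by one letter in the last name and share a birth date, so they receive the same enterprise ID, while the third record gets its own. A production MPI would typically weigh more fields and handle ambiguous matches through review.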

The MPI database is updated nightly.

Total Number of Patients in CDW

Minimum number of COVID-19 cases in the UT Physicians patient population

By looking for specific order types and test results, we can set a lower bound on the number of persons seeing a UT Physicians provider who have tested positive for COVID-19.

  • Dates reflect the date a test was ordered. "Tested Positive" is the number of positive results for tests ordered on that date. A result may be reported later than the day the test was ordered, but it is always counted on the order date here.
  • The actual numbers of ordered tests and results are likely higher. This chart counts only tests recorded in structured data fields and does not take notes or other items in a patient's chart into consideration.
Refer to the UT-HIP Global COVID-19 dashboard for an analysis using other sources of data.
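The counting rule described in the notes above can be sketched as a simple aggregation keyed on order date. The field names (`order_date`, `result_date`, `result`) and the sample rows are hypothetical; the real chart is built from structured order and result fields whose layout is not described here.

```python
from collections import defaultdict
from datetime import date


def daily_counts(tests: list) -> dict:
    """Count ordered tests and positive results, both keyed by order date."""
    counts = defaultdict(lambda: {"ordered": 0, "tested_positive": 0})
    for test in tests:
        day = counts[test["order_date"]]
        day["ordered"] += 1
        # A positive result is attributed to the order date,
        # even when it was reported on a later day.
        if test["result"] == "positive":
            day["tested_positive"] += 1
    return dict(sorted(counts.items()))


# Toy rows (assumed): one positive result arrives two days after the order.
tests = [
    {"order_date": date(2020, 3, 10), "result_date": date(2020, 3, 12),
     "result": "positive"},
    {"order_date": date(2020, 3, 10), "result_date": date(2020, 3, 10),
     "result": "negative"},
    {"order_date": date(2020, 3, 11), "result_date": None,
     "result": None},  # ordered, no structured result recorded yet
]

chart = daily_counts(tests)
```

Because only structured rows are counted, a test documented solely in a clinical note would not appear in `tests` at all, which is why the totals are a lower bound.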

Total Number of COVID-19 Vaccines administered at UT Physicians

Comparison of Moderna and Pfizer Vaccine Administration at UT Physicians

Total Number of COVID-19 Vaccine Doses administered at UT Physicians

Cumulative Number of COVID-19 Vaccine Doses administered at UT Physicians



Allscripts TouchWorks

TouchWorks by Allscripts is the health record system for UT Physicians (UTP). It contains data on patients seen in outpatient UTP clinics.

We also process hundreds of gigabytes of free-text clinical notes from medical records through a pipeline that prepares them for NLP research. Since 2004 we've collected over 30 million note events.

GE Centricity


Centricity is the UT Physicians billing system, from which we pull demographics and billing data. It contains information on patients seen in both outpatient and inpatient settings by providers who bill through the UT Physicians network.



axiUm Dental

axiUm is an EHR used by the School of Dentistry and its attached clinics. This data is loaded into the BigMouth Dental Data Repository.