Data Dictionaries and Common Categories

These are some of the most relevant tables and columns in our data warehouse and various source systems.

Aggregate counts of all fields are available through i2b2.

More detailed information from patient records can be provided upon request with appropriate IRB approval.

UTP CDW - PCORnet Common Data Model

Our main data warehouse conforms to the PCORnet Common Data Model (CDM) Specification. PCORnet Network Partners perform rigorous work upfront that enables users to ask the same question to millions of people across the United States simultaneously, with fast answers delivered in a single, standardized format.

This is a selection of tables and commonly used columns, along with category counts as of the latest CDW refresh date.

Version 6.0 of the specification, listing all tables and columns is available at pcornet.org.

Notes about some common values:
No information: A data field for an enumeration is present in the source system, but the source value is null or blank.
Unknown: A data field for an enumeration is present in the source system, but the source value explicitly denotes an unknown value.
Other: A data field for an enumeration is present in the source system, but the source value cannot be mapped to the PCORnet CDM.

DEMOGRAPHIC

Column Number of Records Unique Values Top 5 Most Common Values
Sex 6,039,174 5 Female (3303983, 54.57%)
Male (2727905, 45.06%)
Unknown (20492, 0.34%)
Other (2029, 0.03%)
Hispanic 6,039,174 6 No (5014110, 78.94%)
Yes (925441, 14.57%)
Refuse to answer (249969, 3.94%)
Unknown (114822, 1.81%)
City 6,039,174 14533 Houston (3714895, 49.05%)
Katy (390637, 5.16%)
Spring (309503, 4.09%)
Humble (168472, 2.22%)
Richmond (167050, 2.21%)
Age 6,039,174 0 Over 65 (1536652, 20.29%)
50 - 65 (1474848, 19.47%)
18 and under (1137802, 15.02%)
40 - 49 (1093106, 14.43%)
30 - 39 (1085320, 14.33%)
19 - 29 (1028071, 13.57%)

PROCEDURES

Column Number of Records Unique Values Top 5 Most Common Values
PX Type 78,513,055 2 CPT or HCPCS (54457531, 69.36%)
Other (24055524, 30.64%)
PX Source 78,513,055 2 Billing (57179330, 72.83%)
Order/EHR (21333725, 27.17%)
PX 78,513,055 39374 99213 (5415472, 6.90%)
99214 (3393159, 4.32%)
99232 (2192075, 2.79%)
93010 (1629327, 2.08%)
6370000001 (1595139, 2.03%)
PPX 78,513,055 4 Principal (44399165, 56.55%)
Other (22752112, 28.98%)
Secondary (7041964, 8.97%)
No information (4319814, 5.50%)
Encounter Type 78,513,055 7 Ambulatory Visit (37535835, 47.81%)
Inpatient (17291817, 22.02%)
Other Ambulatory (11079769, 14.11%)
Emergency Department (7301482, 9.30%)
Other (4860093, 6.19%)

VITAL

Column Number of Records Unique Values Top 5 Most Common Values
Weight 11,822,465 1914 150 - 200 (1534541, 32.16%)
100 - 150 (1130164, 23.68%)
< 100 (883544, 18.51%)
200 - 250 (835595, 17.51%)
> 250 (388405, 8.14%)
Systolic BP 11,822,465 249 115 - 139 (3899894, 47.56%)
90 - 114 (2582937, 31.50%)
140 - 165 (1303501, 15.90%)
> 165 (279948, 3.41%)
< 90 (133814, 1.63%)
BMI 11,822,465 17253 30 - 40 (2531139, 25.35%)
25 - 29 (2483050, 24.87%)
20 - 24 (2218608, 22.22%)
< 20 (1995237, 19.98%)
> 40 (756370, 7.58%)
Height 11,822,465 375 5' 0'' - 5' 4'' (1563522, 36.21%)
5' 5'' - 5' 9'' (1227076, 28.42%)
< 5' 0'' (1034003, 23.95%)
5' 10'' - 6' 3'' (455016, 10.54%)
> 6' 3'' (38100, 0.88%)
Diastolic BP 11,822,465 199 70 - 84 (4450737, 54.28%)
55 - 69 (2231637, 27.21%)
85 - 100 (1185275, 14.45%)
< 55 (198302, 2.42%)
> 100 (134129, 1.64%)

LAB RESULTS

Column Number of Records Unique Values Top 5 Most Common Values
Panel Name 118,196,135 12835 CBC (35953256, 30.42%)
CMP (29298176, 24.79%)
Urinalysis (11188542, 9.47%)
Lipid Panel (5400169, 4.57%)
BMP (4099844, 3.47%)
Component 118,196,135 24570 GLUCOSE (1398528, 1.18%)
BUN/CREATININE RATIO (1382260, 1.17%)
ALT (1304953, 1.10%)
AST (1295582, 1.10%)
GLOBULIN (1282248, 1.08%)
LOINC 118,196,135 3943 2345-7 (1879244, 1.59%)
2160-0 (1797306, 1.52%)
2075-0 (1789215, 1.51%)
2823-3 (1682604, 1.42%)
17861-6 (1681210, 1.42%)

ENCOUNTER

Column Number of Records Unique Values Top 5 Most Common Values
Encounter Type 150,737,873 8 Other Ambulatory (52593961, 34.89%)
Ambulatory Visit (43881962, 29.11%)
Other (24471114, 16.23%)
Unknown (12840053, 8.52%)
Inpatient (10755436, 7.14%)

DIAGNOSIS

Column Number of Records Unique Values Top 5 Most Common Values
PDX 114,975,505 3 Secondary (48369612, 42.07%)
Principal (41168138, 35.81%)
Other (25437755, 22.12%)
DX Type 114,975,505 4 ICD-10 (77944346, 67.79%)
ICD-9 (37031126, 32.21%)
DX Source 114,975,505 4 Final (79842739, 69.44%)
Unknown (23596924, 20.52%)
Admitting (10878011, 9.46%)
Discharge (657831, 0.57%)
DX Origin 114,975,505 2 Billing (84188047, 73.22%)
Order/EHR (30787458, 26.78%)
DX 114,975,505 60772 I10 (3125002, 2.72%)
Z23 (1166762, 1.01%)
401.9 (1098857, 0.96%)
E11.9 (1020897, 0.89%)
Z00.00 (894024, 0.78%)