<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
	<ui>1472-6947-14-51</ui>
	<ji>1472-6947</ji>
	<fm>
		<dochead>Research article</dochead>
		<bibl>
			<title>
				<p>Hidden in plain sight: bias towards sick patients when sampling patients with sufficient electronic health record data for research</p>
			</title>
			<aug>
				<au id="A1" ce="yes"><snm>Rusanov</snm><fnm>Alexander</fnm><insr iid="I1"/><email>ar2765@columbia.edu</email></au>
				<au id="A2" ce="yes"><snm>Weiskopf</snm><mi>G</mi><fnm>Nicole</fnm><insr iid="I2"/><email>ngw2105@columbia.edu</email></au>
				<au id="A3"><snm>Wang</snm><fnm>Shuang</fnm><insr iid="I3"/><email>sw2206@columbia.edu</email></au>
				<au id="A4" ca="yes"><snm>Weng</snm><fnm>Chunhua</fnm><insr iid="I2"/><email>cw2384@columbia.edu</email></au>
			</aug>
			<insg>
				<ins id="I1"><p>Department of Anesthesiology, Columbia University, New York, NY, USA</p></ins>
				<ins id="I2"><p>Department of Biomedical Informatics, Columbia University, New York, NY, USA</p></ins>
				<ins id="I3"><p>Department of Biostatistics, School of Public Health, Columbia University, New York, NY, USA</p></ins>
			</insg>
			<source>BMC Medical Informatics and Decision Making</source>
			<section><title><p>Standards, technology, and modeling</p></title></section><issn>1472-6947</issn>
			<pubdate>2014</pubdate>
			<volume>14</volume>
			<issue>1</issue>
			<fpage>51</fpage>
			<url>http://www.biomedcentral.com/1472-6947/14/51</url>
			<xrefbib><pubidlist><pubid idtype="doi">10.1186/1472-6947-14-51</pubid><pubid idtype="pmpid">24916006</pubid></pubidlist></xrefbib>
		</bibl>
		<history><rec><date><day>17</day><month>2</month><year>2014</year></date></rec><acc><date><day>2</day><month>6</month><year>2014</year></date></acc><pub><date><day>11</day><month>6</month><year>2014</year></date></pub></history>
		<cpyrt><year>2014</year><collab>Rusanov et al.; licensee BioMed Central Ltd.</collab><note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (
				<url>http://creativecommons.org/licenses/by/4.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (
				<url>http://creativecommons.org/publicdomain/zero/1.0/</url>) applies to the data made available in this article, unless otherwise stated.</note></cpyrt>
		<abs>
			<sec>
				<st>
					<p>Abstract</p>
				</st>
				<sec>
					<st>
						<p>Background</p>
					</st>
					<p>To demonstrate that subject selection based on sufficient laboratory results and medication orders in electronic health records can be biased towards sick patients.</p>
				</sec>
				<sec>
					<st>
						<p>Methods</p>
					</st>
					<p>Using electronic health record data from 10,000 patients who received anesthetic services at a major metropolitan tertiary care academic medical center, an affiliated hospital for women and children, and an affiliated urban primary care hospital, the correlation between patient health status and counts of days with laboratory results or medication orders, as indicated by the American Society of Anesthesiologists Physical Status Classification (ASA Class), was assessed with a Negative Binomial Regression model.</p>
				</sec>
				<sec>
					<st>
						<p>Results</p>
					</st>
					<p>Higher ASA Class was associated with more points of data: compared to ASA Class 1 patients, ASA Class 4 patients had 5.05 times the number of days with laboratory results and 6.85 times the number of days with medication orders, controlling for age, sex, emergency status, admission type, primary diagnosis, and procedure.</p>
				</sec>
				<sec>
					<st>
						<p>Conclusions</p>
					</st>
					<p>Imposing data sufficiency requirements for subject selection allows researchers to minimize missing data when reusing electronic health records for research, but introduces a bias towards the selection of sicker patients. We demonstrated the relationship between patient health and quantity of data, which may result in a systematic bias towards the selection of sicker patients for research studies and limit the external validity of research conducted using electronic health record data. Additionally, we discovered other variables (i.e., admission status, age, emergency classification, procedure, and diagnosis) that independently affect data sufficiency.</p>
				</sec>
			</sec>
		</abs>
	</fm>
	<bdy>
		<sec>
			<st>
				<p>Background</p>
			</st>
			<p>Since the passage of the Health Information Technology for Economic and Clinical Health (HITECH) Act in 2009 
				<abbrgrp>
					<abbr bid="B1">1</abbr>
					<abbr bid="B2">2</abbr>
				</abbrgrp>, there has been an increase in the rate of electronic health record (EHR) adoption. As of 2012, the rate of EHR adoption with at least basic functionality was 44.4% in non-federal acute care hospitals 
				<abbrgrp>
					<abbr bid="B3">3</abbr>
				</abbrgrp> and 39.6% in office-based physician practices 
				<abbrgrp>
					<abbr bid="B4">4</abbr>
				</abbrgrp>.</p>
			<p>The transition to EHRs has created new opportunities for research 
				<abbrgrp>
					<abbr bid="B5">5</abbr>
					<abbr bid="B6">6</abbr>
					<abbr bid="B7">7</abbr>
				</abbrgrp>. The secondary use of EHR data provides a more efficient and less expensive alternative to clinical trials, the current gold standard of medical research 
				<abbrgrp>
					<abbr bid="B8">8</abbr>
					<abbr bid="B9">9</abbr>
				</abbrgrp>. This is especially important in the current fiscal climate, where federal funding of medical research is becoming increasingly limited.</p>
			<p>There are, however, potential caveats to the secondary use of EHR data 
				<abbrgrp>
					<abbr bid="B10">10</abbr>
				</abbrgrp>. EHRs suffer from data quality problems 
				<abbrgrp>
					<abbr bid="B11">11</abbr>
					<abbr bid="B12">12</abbr>
					<abbr bid="B13">13</abbr>
				</abbrgrp>, which may affect the internal validity of retrospective studies. One of these data quality problems is insufficient data. Sufficiency can be conceptualized as a type of completeness, which is one of several categories of data quality that are relevant to EHR data reuse 
				<abbrgrp>
					<abbr bid="B14">14</abbr>
				</abbrgrp>. When EHR data are complete according to the requirements of a given task, those data can be considered to be sufficient for that task. Required data may be missing for different reasons: a data point was observed but not documented 
				<abbrgrp>
					<abbr bid="B12">12</abbr>
				</abbrgrp> or it was never observed in the first place, either because the observation was not clinically necessary or because it could not be performed. Regardless of the reason, missing data is very common in today&#8217;s EHR databases, leading to datasets that may not be sufficient for work relying on the secondary use of EHR data. Although it has been pointed out that the missing data may cause records to be &#8220;visually complete but intellectually insufficient,&#8221; 
				<abbrgrp>
					<abbr bid="B15">15</abbr>
				</abbrgrp> the causal effect of health status on data sufficiency is not the focus of this study. Instead, we focus on the correlation between the sufficiency of electronic health record data for clinical research and the underlying patient health status.</p>
			<p>In a clinical trial a study sample is chosen based on predefined eligibility criteria. The data necessary to answer the research question is then prospectively collected for every participant. This approach ensures that all required data are present and trustworthy, but may come at the expense of limited external generalizability due to the non-representativeness of the sample 
				<abbrgrp>
					<abbr bid="B16">16</abbr>
				</abbrgrp>. In contrast, studies relying on the use of EHR data are thought to have greater external validity, having drawn their participants from actual patients receiving regular care in actual health care settings. In such studies, however, participants must be chosen based not only on the eligibility criteria but also upon the availability of sufficient data for extraction 
				<abbrgrp>
					<abbr bid="B17">17</abbr>
					<abbr bid="B18">18</abbr>
					<abbr bid="B19">19</abbr>
				</abbrgrp>. Example sufficiency requirements include &#8220;a sub-population who have sufficient health record data at institution <it>{X}</it> frequenting the <it>{X}</it> hospital system for routine care&#8221; and &#8220;total number of individuals that have male gender and serum creatinine 1.5&#160;mg/dL or female gender and serum creatinine 1.3&#160;mg/dL. The patients need to have at least 2 values over the threshold.&#8221; In a study by Green et al., of 122,270 patients satisfying eligibility criteria, only 59.7% had sufficient data 
				<abbrgrp>
					<abbr bid="B19">19</abbr>
				</abbrgrp>. Patients without the data necessary to determine eligibility or perform the analyses of interest cannot, by definition, be included in the study sample. The addition of this frequently overlooked sufficiency requirement has the potential to lead to bias in the selection of patients for inclusion in EHR based studies, which may limit their external validity.</p>
			<p>The proportion of patients in a given population with sufficient data varies from study to study, as it depends on the research question and the necessary kinds of data required for answering that question 
				<abbrgrp>
					<abbr bid="B14">14</abbr>
					<abbr bid="B20">20</abbr>
				</abbrgrp>. We have previously demonstrated the contextual nature of EHR sufficiency, as well as the high variability of sufficient patient records in a large-scale analysis of the NewYork-Presbyterian Hospital Clinical Data Warehouse 
				<abbrgrp>
					<abbr bid="B14">14</abbr>
				</abbrgrp>. This variability is not always random; it is more likely that the pattern of data quantity is related to one or more of the variables of interest 
				<abbrgrp>
					<abbr bid="B21">21</abbr>
				</abbrgrp>. Our preliminary work indicates that the patient records containing sufficient data, i.e., those best-suited for secondary use in research, tend to belong to the sickest patients 
				<abbrgrp>
					<abbr bid="B22">22</abbr>
				</abbrgrp>.</p>
			<p>As Lee et al. point out, many studies assume that the addition of a requirement for a visit in a given time frame (visit-based sampling) produces a sample that is representative of the population from which it is derived. However, as their work demonstrates this assumption is wrong, and the imposition of just this one sufficiency requirement biases the population towards sicker and older patients 
				<abbrgrp>
					<abbr bid="B23">23</abbr>
				</abbrgrp>. Sufficient visit data is one common way patients are selected for inclusion in EHR based research. Another common sufficiency requirement is based on laboratory and/or medication data. Some studies require just the presence of a specific laboratory value or medication order while others also impose a minimum threshold for the number of each.</p>
			<p>This paper reports an in-depth exploration of the relationship between patient illness severity and quantity of available data, as well as the potential clinical confounders of this relationship. We demonstrate that, because of the data sufficiency requirements for sampling, the cohorts being identified for research may not be representative of the broader patient population, thus compromising the external validity of research conducted using EHR data.</p>
			<p>We hypothesized that the health records of sicker patients would be more likely to have sufficient data for research, and that this relationship would hold true when controlling for possible covariates. We also hypothesized that other patient- and procedure-related factors, such as age, sex, admission status, and the emergent nature of the procedure, would independently affect EHR data quantity.</p>
		</sec>
		<sec>
			<st>
				<p>Methods</p>
			</st>
			<sec>
				<st>
					<p>Identification of a health status indicator</p>
				</st>
				<p>To study the relationship between patient health status and EHR data sufficiency, we required a measure of patient health that was not affected by data missing from the EHR. Since the most common indices of patient health and comorbidity rely on information from the EHR 
					<abbrgrp>
						<abbr bid="B24">24</abbr>
					</abbrgrp>, they are influenced by missing data and are thus unsuitable for a study where data sufficiency is the dependent variable 
					<abbrgrp>
						<abbr bid="B25">25</abbr>
					</abbrgrp>. An ideal, health status assessment for our purposes would be performed prospectively via examination and testing of the patient, rather than relying upon data recorded in the EHR, but such a study would be expensive and time-consuming. The American Society of Anesthesiologists (ASA) Physical Status Classification System (Table&#160;
					<tblr tid="T1">1</tblr>) is closer to this ideal than most health status indices 
					<abbrgrp>
						<abbr bid="B26">26</abbr>
						<abbr bid="B27">27</abbr>
					</abbrgrp>. An ASA Class is a subjective assessment of illness severity determined by an anesthesia provider, using a combination of direct assessment of the patient and available information from not only the EHR but also other sources that include family members, other healthcare providers, and records from outside the institution. In cases of missing or ambiguous information, further testing may be ordered to assist in patient classification. The ASA Classification is strongly correlated with other clinical risk predictors as well as outcomes 
					<abbrgrp>
						<abbr bid="B28">28</abbr>
						<abbr bid="B29">29</abbr>
						<abbr bid="B30">30</abbr>
						<abbr bid="B31">31</abbr>
					</abbrgrp>.</p>
				<table id="T1">
					<title>
						<p>Table 1</p>
					</title>
					<caption>
						<p>
							<b>ASA classification</b>
						</p>
					</caption>
					<tgroup cols="2">
						<colspec align="center" colname="c1" colnum="1" colwidth="1*"/>
						<colspec align="left" colname="c2" colnum="2" colwidth="1*"/>
						<thead valign="top">
							<row rowsep="1">
								<entry align="center" colname="c1">
									<p>
										<b>ASA class</b>
									</p>
								</entry>
								<entry colname="c2">
									<p>
										<b>Definition</b>
									</p>
								</entry>
							</row>
						</thead>
						<tfoot>
							<p>ASA&#8201;=&#8201;American Society of Anesthesiologists.</p>
							<p>An &#8220;E&#8221; is appended to the ASA Class for emergency cases.</p>
						</tfoot>
						<tbody valign="top">
							<row>
								<entry align="center" colname="c1">
									<p>
										<b>1</b>
									</p>
								</entry>
								<entry colname="c2">
									<p>A normal healthy patient</p>
								</entry>
							</row>
							<row>
								<entry align="center" colname="c1">
									<p>
										<b>2</b>
									</p>
								</entry>
								<entry colname="c2">
									<p>A patient with mild systemic disease</p>
								</entry>
							</row>
							<row>
								<entry align="center" colname="c1">
									<p>
										<b>3</b>
									</p>
								</entry>
								<entry colname="c2">
									<p>A patient with severe systemic disease</p>
								</entry>
							</row>
							<row>
								<entry align="center" colname="c1">
									<p>
										<b>4</b>
									</p>
								</entry>
								<entry colname="c2">
									<p>A patient with severe systemic disease that is a constant threat to life</p>
								</entry>
							</row>
							<row>
								<entry align="center" colname="c1">
									<p>
										<b>5</b>
									</p>
								</entry>
								<entry colname="c2">
									<p>A moribund patient who is not expected to survive without the operation</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry align="center" colname="c1">
									<p>
										<b>6</b>
									</p>
								</entry>
								<entry colname="c2">
									<p>A declared brain-dead patient whose organs are being removed for donor purposes</p>
								</entry>
							</row>
						</tbody>
					</tgroup>
				</table>
				<sec>
					<st>
						<p>Data extraction</p>
					</st>
					<p>With approval from the Columbia University Medical Center Institutional Review Board (#AAAD1873), we queried the Department of Anesthesiology Research Database (RD) to obtain our study sample. The RD contains clinical data recorded during the provision of anesthetic services and stored in a specialized Anesthesia Information Management System (CompuRecord, Philips Healthcare, Andover, MA). The CompuRecord system is used for the documentation of all anesthetic services provided in the main operating rooms, labor and delivery floors, and ophthalmology operating suite, as well as most anesthetic services provided in the endoscopy and cardiac electrophysiology and catheterization suites, within our major metropolitan tertiary care academic medical center (Columbia University Medical Center) and two of its affiliates, a hospital for women and children (Morgan Stanley Children&#8217;s Hospital), and an urban primary care hospital (The Allen Hospital).</p>
					<p>We queried the RD for all cases containing an International Classification of Diseases, 9th Revision (ICD-9) 
						<abbrgrp>
							<abbr bid="B32">32</abbr>
						</abbrgrp> code and a Current Procedural Terminology (CPT) 
						<abbrgrp>
							<abbr bid="B33">33</abbr>
						</abbrgrp>. We looked at the quantity of available data in the year preceding the provision of anesthetic services, and therefore excluded patients younger than one year of age. We also excluded cases where the patient had another anesthetic record in the RD in the preceding year, in order to minimize bias introduced by having multiple anesthetic services. We excluded ASA 5 and 6 cases, due to their lower incidence, and randomly selected 10,000 as our study sample from the remaining 24,073 cases.</p>
					<p>Our primary variable of interest, patient health status, defined by ASA class, was extracted for each of the 10,000 patients. We also extracted the primary ICD-9 code, primary CPT code, age, sex, emergency classification, and admission status. Emergency classification consisted of two possible values, emergent or non-emergent, based on the presence or absence of the &#8220;E&#8221; modifier of the ASA classification. Admission status consisted of three possible values &#8212; inpatient, same day, and outpatient &#8212; as documented by the anesthesia provider. Inpatients were those who had been admitted to the hospital prior to provision of anesthesia, same day patients were those admitted to the hospital after provision of anesthesia for a period of more than 23&#160;hours, and outpatients were those discharged from the hospital within 23&#160;hours following completion of anesthetic services.</p>
					<p>The Clinical Data Warehouse (CDW) contains clinical care data from Allscripts&#8217; Sunrise Clinical Manager and ancillary services data from Cerner Millennium. We chose two kinds of data - laboratory results and medication orders &#8211; commonly used as sufficiency requirements. We queried the CDW to obtain the number of days with medication orders and the number of days with laboratory results for each patient for the year preceding the provision of anesthetic services. Two or more medication orders or laboratory results recorded on the same day would be counted once. In aligning with the concept of task-dependent data quality 
						<abbrgrp>
							<abbr bid="B14">14</abbr>
						</abbrgrp>, we conceptualized data sufficiency as a count variable, as opposed to a binary variable. Consequently, each patient could have a minimum of zero and a maximum of 365&#160;days for each of the two outcome variables.</p>
				</sec>
				<sec>
					<st>
						<p>Data analysis</p>
					</st>
					<p>To facilitate analysis, we grouped ICD-9 and CPT codes into 18 major categories using the Clinical Classification Software tools provided by the Healthcare Cost and Utilization Project of the Agency for Healthcare Research and Quality 
						<abbrgrp>
							<abbr bid="B34">34</abbr>
						</abbrgrp>. ICD-9 Category 15 (Certain conditions originating in the perinatal period) contained only one patient. This was deemed to be medically similar to and was thus merged into ICD-9 Category 14 (Congenital anomalies) which contained 247 patients prior to the merge. ICD-9 Categories 1, 4, 5, 12, and 18 (Infections and parasitic diseases, Diseases of the blood and blood forming organs, Mental disorders, Diseases of the skin and subcutaneous tissue, and Supplementary, respectively) contain diseases not usually treated with procedures that require anesthetic services and thus contained few patients (10, 19, 9, 83, and 112, respectively). These were merged with ICD-9 Category 16 (Symptoms, signs, and ill-defined conditions) which contained 369 patients prior to merging.</p>
					<p>Similar CPT categories were merged as follows: CPT Category 4 (Operations on the ear) containing 85 patients, was merged into CPT Category 5 (Operations on the nose, mouth and pharynx) containing 345 patients; CPT Category 8 (Operations on the hemic and lymphatic system) containing 106 patients, was merged into CPT Category 15 (Operations on the integumentary system) containing 336 patients; CPT Category 13 (Obstetrical procedures) containing 26 patients was merged into CPT Category 12 (Operations on the female genital organs) containing 557 patients; and CPT Category 17 (Other) containing 3 patients, was merged into CPT Category 18 (Anesthesia procedures) containing 1752 patients.In our sample 25.2% of patients had no laboratory results and 54.3% had no medication orders. Marginal distributions for laboratory results and medication orders grouped by ASA Class are shown in Figure&#160;
						<figr fid="F1">1</figr>.</p>
					<fig id="F1"><title><p>Figure 1</p></title><caption><p>Marginal distributions for laboratory results and medication orders</p></caption><text>
   <p><b>Marginal distributions for laboratory results and medication orders. </b>Each curve shows the number of patients (y-axis) as a function of the number of days (x-axis) with Laboratory Results (left panel) or Medication Orders (right panel) for a given ASA Class. The insets provide a closer look at the curves in the range of 0 to 10 days.</p>
</text><graphic file="1472-6947-14-51-1"/></fig>
					<p>The variations of the counts of laboratory results and medication orders are far greater than the means. To account for this over-dispersion we fit a negative binomial regression 
						<abbrgrp>
							<abbr bid="B35">35</abbr>
						</abbrgrp>. While the Poisson regression (which uses a Poisson distribution) is commonly used for analyses of count data, it does not handle over-dispersed data sets well due to the assumption that the variance of counts equals the mean. The negative binomial regression is an extension of the Poisson regression that is particularly well suited for over-dispersed count data, such as ours, where the variance is greater than the mean. In the negative binomial model, the counts Y follow a Poisson distribution (&#955;), where &#955; is a random variable with a gamma distribution. Therefore, the unconditional distribution of Y is a negative binomial.</p>
				</sec>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Results</p>
			</st>
			<p>The mean age of patients in our sample was 45.0 (SD&#8201;=&#8201;23.9) and ranged from one year to 102. Sixty-one percent of our cohort was female. Most cases were non-emergent (88.8%) with more outpatients (41.6%) than same-day admissions (32.7%) or inpatients (25.6%). The most frequently occurring diagnostic categories in our dataset were Complications of pregnancy, childbirth, and puerperium (19.3%), Diseases of the digestive system (12.3%), and Neoplasms (11.1%). The most common procedure categories were Anesthesia procedures, which includes procedures for analgesia during labor and delivery (17.5%), Operations on the digestive system (16.3%), and Operations on the musculoskeletal system (13.2%). Table&#160;
				<tblr tid="T2">2</tblr> presents descriptive statistics of counts of days with laboratory results and medication orders within subcategories.</p>
			<table id="T2">
				<title>
					<p>Table 2</p>
				</title>
				<caption>
					<p>
						<b>Model inputs with counts of days with laboratory results and medication orders (n&#8201;=&#8201;10,000)</b>
					</p>
				</caption>
				<tgroup cols="6">
					<colspec align="right" colname="c1" colnum="1" colwidth="1*"/>
					<colspec align="left" colname="c2" colnum="2" colwidth="1*"/>
					<colspec align="center" colname="c3" colnum="3" colwidth="1*"/>
					<colspec align="left" colname="c4" colnum="4" colwidth="1*"/>
					<colspec align="center" colname="c5" colnum="5" colwidth="1*"/>
					<colspec align="left" colname="c6" colnum="6" colwidth="1*"/>
					<thead valign="top">
						<row>
							<entry colname="c1"/>
							<entry colname="c2"/>
							<entry align="center" colname="c3" nameend="c4" namest="c3" rowsep="1">
								<p>
									<b>Laboratory results</b>
								</p>
							</entry>
							<entry align="center" colname="c5" nameend="c6" namest="c5" rowsep="1">
								<p>
									<b>Medication orders</b>
								</p>
							</entry>
						</row>
						<row rowsep="1">
							<entry colname="c1">
								<p>
									<b>Variable</b>
								</p>
							</entry>
							<entry align="center" colname="c2">
								<p>
									<b>n (%)</b>
								</p>
							</entry>
							<entry align="center" colname="c3">
								<p>
									<b>Max</b>
								</p>
							</entry>
							<entry align="center" colname="c4">
								<p>
									<b>Mean (SD)</b>
								</p>
							</entry>
							<entry align="center" colname="c5">
								<p>
									<b>Max</b>
								</p>
							</entry>
							<entry align="center" colname="c6">
								<p>
									<b>Mean (SD)</b>
								</p>
							</entry>
						</row>
					</thead>
					<tfoot>
						<p>ASA Class&#8201;=&#8201;American Society of Anesthesiologists Physical Status Classification; ICD-9&#8201;=&#8201;International Classification of Diseases, Ninth Revision; CPT&#8201;=&#8201;Current Procedural Terminology; Dz. = Diseases; * Denotes ICD and CPT categories that contain other similar categories: Congenital anomalies contains Certain conditions originating in the perinatal period; Symptoms, signs and ill-defined conditions contains Infections and parasitic diseases, Diseases of the blood and blood forming organs, Mental disorders, Diseases of the skin and subcutaneous tissue, &amp; Supplementary; Operations on the nose, mouth and pharynx contains Operations on the ear; Operations on the female genital organs contains Obstetrical procedures; Operations on the integumentary system contains Operations on the hemic and lymphatic system; Anesthesia procedures contains Other.</p>
					</tfoot>
					<tbody valign="top">
						<row>
							<entry colname="c1">
								<p>
									<b>ASA class</b>
								</p>
							</entry>
							<entry colname="c2"/>
							<entry colname="c3"/>
							<entry colname="c4"/>
							<entry colname="c5"/>
							<entry colname="c6"/>
						</row>
						<row>
							<entry colname="c1">
								<p>&#8195;<b>1</b>
								</p>
							</entry>
							<entry align="center" colname="c2">
								<p>2263(22.6)</p>
							</entry>
							<entry align="center" colname="c3">
								<p>20</p>
							</entry>
							<entry align="center" colname="c4">
								<p>2.9(3.4)</p>
							</entry>
							<entry align="center" colname="c5">
								<p>21</p>
							</entry>
							<entry align="center" colname="c6">
								<p>1.3(2.2)</p>
							</entry>
						</row>
						<row>
							<entry colname="c1">
								<p>&#8195;<b>2</b>
								</p>
							</entry>
							<entry align="center" colname="c2">
								<p>4779(47.8)</p>
							</entry>
							<entry align="center" colname="c3">
								<p>85</p>
							</entry>
							<entry align="center" colname="c4">
								<p>3.0(4.4)</p>
							</entry>
							<entry align="center" colname="c5">
								<p>62</p>
							</entry>
							<entry align="center" colname="c6">
								<p>1.6(3.6)</p>
							</entry>
						</row>
						<row>
							<entry colname="c1">
								<p>&#8195;<b>3</b>
								</p>
							</entry>
							<entry align="center" colname="c2">
								<p>2499(25.0)</p>
							</entry>
							<entry align="center" colname="c3">
								<p>107</p>
							</entry>
							<entry align="center" colname="c4">
								<p>5.7(9.4)</p>
							</entry>
							<entry align="center" colname="c5">
								<p>102</p>
							</entry>
							<entry align="center" colname="c6">
								<p>4.1(8.5)</p>
							</entry>
						</row>
						<row rowsep="1">
							<entry colname="c1">
								<p>&#8195;<b>4</b>
								</p>
							</entry>
							<entry align="center" colname="c2">
								<p>459(4.6)</p>
							</entry>
							<entry align="center" colname="c3">
								<p>99</p>
							</entry>
							<entry align="center" colname="c4">
								<p>9.4(13.2)</p>
							</entry>
							<entry align="center" colname="c5">
								<p>91</p>
							</entry>
							<entry align="center" colname="c6">
								<p>7.5(11.5)</p>
							</entry>
						</row>
						<row>
							<entry colname="c1">
								<p>
									<b>Sex</b>
								</p>
							</entry>
							<entry colname="c2"/>
							<entry colname="c3"/>
							<entry colname="c4"/>
							<entry colname="c5"/>
							<entry colname="c6"/>
						</row>
						<row>
							<entry colname="c1">
								<p>&#8195;<b>Male</b>
								</p>
							</entry>
							<entry align="center" colname="c2">
								<p>3943(39.4)</p>
							</entry>
							<entry align="center" colname="c3">
								<p>99</p>
							</entry>
							<entry align="center" colname="c4">
								<p>3.4(7.4)</p>
							</entry>
							<entry align="center" colname="c5">
								<p>91</p>
							</entry>
							<entry align="center" colname="c6">
								<p>2.3(6.3)</p>
							</entry>
						</row>
						<row rowsep="1">
							<entry colname="c1">
								<p>&#8195;<b>Female</b>
								</p>
							</entry>
							<entry align="center" colname="c2">
								<p>6057(60.6)</p>
							</entry>
							<entry align="center" colname="c3">
								<p>107</p>
							</entry>
							<entry align="center" colname="c4">
								<p>4.3(6.2)</p>
							</entry>
							<entry align="center" colname="c5">
								<p>102</p>
							</entry>
							<entry align="center" colname="c6">
								<p>2.5(5.5)</p>
							</entry>
						</row>
						<row>
							<entry colname="c1">
								<p>
									<b>Age (years)</b>
								</p>
							</entry>
							<entry colname="c2"/>
							<entry colname="c3"/>
							<entry colname="c4"/>
							<entry colname="c5"/>
							<entry colname="c6"/>
						</row>
						<row>
							<entry colname="c1">
								<p>&#8195;<b>1-10</b>
								</p>
							</entry>
							<entry align="center" colname="c2">
								<p>911(9.1)</p>
							</entry>
							<entry align="center" colname="c3">
								<p>52</p>
							</entry>
							<entry align="center" colname="c4">
								<p>1.4(4.0)</p>
							</entry>
							<entry align="center" colname="c5">
								<p>62</p>
							</entry>
							<entry align="center" colname="c6">
								<p>1.6(4.8)</p>
							</entry>
						</row>
						<row>
							<entry colname="c1">
								<p>&#8195;<b>11-20</b>
								</p>
							</entry>
							<entry align="center" colname="c2">
								<p>837(8.4)</p>
							</entry>
							<entry align="center" colname="c3">
								<p>49</p>
							</entry>
							<entry align="center" colname="c4">
								<p>2.7(4.5)</p>
							</entry>
							<entry align="center" colname="c5">
								<p>91</p>
							</entry>
							<entry align="center" colname="c6">
								<p>1.9(4.9)</p>
							</entry>
						</row>
						<row>
							<entry colname="c1">
								<p>&#8195;<b>21-30</b>
								</p>
							</entry>
							<entry align="center" colname="c2">
								<p>1379(13.8)</p>
							</entry>
							<entry align="center" colname="c3">
								<p>78</p>
							</entry>
							<entry align="center" colname="c4">
								<p>5.3(6.1)</p>
							</entry>
							<entry align="center" colname="c5">
								<p>71</p>
							</entry>
							<entry align="center" colname="c6">
								<p>2.8(5.0)</p>
							</entry>
						</row>
						<row>
							<entry colname="c1">
								<p>&#8195;<b>31-40</b>
								</p>
							</entry>
							<entry align="center" colname="c2">
								<p>1461(14.6)</p>
							</entry>
							<entry align="center" colname="c3">
								<p>107</p>
							</entry>
							<entry align="center" colname="c4">
								<p>5.0(6.5)</p>
							</entry>
							<entry align="center" colname="c5">
								<p>102</p>
							</entry>
							<entry align="center" colname="c6">
								<p>2.1(5.5)</p>
							</entry>
						</row>
						<row>
							<entry colname="c1">
								<p>&#8195;<b>41-50</b>
								</p>
							</entry>
							<entry align="center" colname="c2">
								<p>981(9.8)</p>
							</entry>
							<entry align="center" colname="c3">
								<p>85</p>
							</entry>
							<entry align="center" colname="c4">
								<p>3.4(6.3)</p>
							</entry>
							<entry align="center" colname="c5">
								<p>43</p>
							</entry>
							<entry align="center" colname="c6">
								<p>2.0(5.0)</p>
							</entry>
						</row>
						<row>
							<entry colname="c1">
								<p>&#8195;<b>51-60</b>
								</p>
							</entry>
							<entry align="center" colname="c2">
								<p>1182(11.8)</p>
							</entry>
							<entry align="center" colname="c3">
								<p>99</p>
							</entry>
							<entry align="center" colname="c4">
								<p>4.1(8.8)</p>
							</entry>
							<entry align="center" colname="c5">
								<p>76</p>
							</entry>
							<entry align="center" colname="c6">
								<p>2.9(7.7)</p>
							</entry>
						</row>
						<row>
							<entry colname="c1">
								<p>&#8195;<b>61-70</b>
								</p>
							</entry>
							<entry align="center" colname="c2">
								<p>1484(14.8)</p>
							</entry>
							<entry align="center" colname="c3">
								<p>99</p>
							</entry>
							<entry align="center" colname="c4">
								<p>3.7(7.2)</p>
							</entry>
							<entry align="center" colname="c5">
								<p>76</p>
							</entry>
							<entry align="center" colname="c6">
								<p>2.5(6.2)</p>
							</entry>
						</row>
						<row>
							<entry colname="c1">
								<p>&#8195;<b>71-80</b>
								</p>
							</entry>
							<entry align="center" colname="c2">
								<p>1143(11.4)</p>
							</entry>
							<entry align="center" colname="c3">
								<p>60</p>
							</entry>
							<entry align="center" colname="c4">
								<p>3.9(6.7)</p>
							</entry>
							<entry align="center" colname="c5">
								<p>63</p>
							</entry>
							<entry align="center" colname="c6">
								<p>2.6(6.0)</p>
							</entry>
						</row>
						<row>
							<entry colname="c1">
								<p>&#8195;<b>81-90</b>
								</p>
							</entry>
							<entry align="center" colname="c2">
								<p>545(5.5)</p>
							</entry>
							<entry align="center" colname="c3">
								<p>60</p>
							</entry>
							<entry align="center" colname="c4">
								<p>4.7(7.3)</p>
							</entry>
							<entry align="center" colname="c5">
								<p>55</p>
							</entry>
							<entry align="center" colname="c6">
								<p>3.2(6.1)</p>
							</entry>
						</row>
						<row rowsep="1">
							<entry colname="c1">
								<p>&#8195;<b>91-102</b>
								</p>
							</entry>
							<entry align="center" colname="c2">
								<p>77(0.8)</p>
							</entry>
							<entry align="center" colname="c3">
								<p>59</p>
							</entry>
							<entry align="center" colname="c4">
								<p>6.9(9.1)</p>
							</entry>
							<entry align="center" colname="c5">
								<p>44</p>
							</entry>
							<entry align="center" colname="c6">
								<p>5.1(7.3)</p>
							</entry>
						</row>
						<row>
							<entry colname="c1">
								<p>
									<b>Emergency status</b>
								</p>
							</entry>
							<entry colname="c2"/>
							<entry colname="c3"/>
							<entry colname="c4"/>
							<entry colname="c5"/>
							<entry colname="c6"/>
						</row>
						<row>
							<entry colname="c1">
								<p>&#8195;<b>Non-emergent</b>
								</p>
							</entry>
							<entry align="center" colname="c2">
								<p>8883(88.8)</p>
							</entry>
							<entry align="center" colname="c3">
								<p>107</p>
							</entry>
							<entry align="center" colname="c4">
								<p>3.7(6.3)</p>
							</entry>
							<entry align="center" colname="c5">
								<p>102</p>
							</entry>
							<entry align="center" colname="c6">
								<p>2.2(5.5)</p>
							</entry>
						</row>
						<row rowsep="1">
							<entry colname="c1">
								<p>&#8195;<b>Emergent</b>
								</p>
							</entry>
							<entry align="center" colname="c2">
								<p>1117(11.2)</p>
							</entry>
							<entry align="center" colname="c3">
								<p>99</p>
							</entry>
							<entry align="center" colname="c4">
								<p>5.5(9.2)</p>
							</entry>
							<entry align="center" colname="c5">
								<p>81</p>
							</entry>
							<entry align="center" colname="c6">
								<p>4.0(7.9)</p>
							</entry>
						</row>
						<row>
							<entry colname="c1">
								<p>
									<b>Admission status</b>
								</p>
							</entry>
							<entry colname="c2"/>
							<entry colname="c3"/>
							<entry colname="c4"/>
							<entry colname="c5"/>
							<entry colname="c6"/>
						</row>
						<row>
							<entry colname="c1">
								<p>&#8195;<b>Outpatient</b>
								</p>
							</entry>
							<entry align="center" colname="c2">
								<p>4162(41.6)</p>
							</entry>
							<entry align="center" colname="c3">
								<p>85</p>
							</entry>
							<entry align="center" colname="c4">
								<p>2.1(4.4)</p>
							</entry>
							<entry align="center" colname="c5">
								<p>56</p>
							</entry>
							<entry align="center" colname="c6">
								<p>1.3(3.8)</p>
							</entry>
						</row>
						<row>
							<entry colname="c1">
								<p>&#8195;<b>Same day</b>
								</p>
							</entry>
							<entry align="center" colname="c2">
								<p>3274(32.7)</p>
							</entry>
							<entry align="center" colname="c3">
								<p>69</p>
							</entry>
							<entry align="center" colname="c4">
								<p>3.7(4.8)</p>
							</entry>
							<entry align="center" colname="c5">
								<p>76</p>
							</entry>
							<entry align="center" colname="c6">
								<p>1.7(4.0)</p>
							</entry>
						</row>
						<row rowsep="1">
							<entry colname="c1">
								<p>&#8195;<b>Inpatient</b>
								</p>
							</entry>
							<entry align="center" colname="c2">
								<p>2564(25.6)</p>
							</entry>
							<entry align="center" colname="c3">
								<p>107</p>
							</entry>
							<entry align="center" colname="c4">
								<p>7.2(9.9)</p>
							</entry>
							<entry align="center" colname="c5">
								<p>102</p>
							</entry>
							<entry align="center" colname="c6">
								<p>5.1(8.8)</p>
							</entry>
						</row>
						<row>
							<entry colname="c1">
								<p>
									<b>ICD-9 category name (number)</b>
								</p>
							</entry>
							<entry colname="c2"/>
							<entry colname="c3"/>
							<entry colname="c4"/>
							<entry colname="c5"/>
							<entry colname="c6"/>
						</row>
						<row>
							<entry colname="c1">
								<p>&#8195;<b>Neoplasms (2)</b>
								</p>
							</entry>
							<entry align="center" colname="c2">
								<p>1111(11.1)</p>
							</entry>
							<entry align="center" colname="c3">
								<p>53</p>
							</entry>
							<entry align="center" colname="c4">
								<p>3.3(5.4)</p>
							</entry>
							<entry align="center" colname="c5">
								<p>62</p>
							</entry>
							<entry align="center" colname="c6">
								<p>1.8(4.8)</p>
							</entry>
						</row>
						<row>
							<entry colname="c1">
								<p>&#8195;<b>Endocrine, nutritional, and metabolic &amp; immunity disorders (3)</b>
								</p>
							</entry>
							<entry align="center" colname="c2">
								<p>211(2.1)</p>
							</entry>
							<entry align="center" colname="c3">
								<p>59</p>
							</entry>
							<entry align="center" colname="c4">
								<p>3.0(6.0)</p>
							</entry>
							<entry align="center" colname="c5">
								<p>44</p>
							</entry>
							<entry align="center" colname="c6">
								<p>1.3(4.4)</p>
							</entry>
						</row>
						<row>
							<entry colname="c1">
								<p>&#8195;<b>Dz. of the nervous system and the sense organs (6)</b>
								</p>
							</entry>
							<entry align="center" colname="c2">
								<p>942(9.4)</p>
							</entry>
							<entry align="center" colname="c3">
								<p>99</p>
							</entry>
							<entry align="center" colname="c4">
								<p>1.9(6.0)</p>
							</entry>
							<entry align="center" colname="c5">
								<p>70</p>
							</entry>
							<entry align="center" colname="c6">
								<p>1.6(4.9)</p>
							</entry>
						</row>
						<row>
							<entry colname="c1">
								<p>&#8195;<b>Dz. of the circulatory system (7)</b>
								</p>
							</entry>
							<entry align="center" colname="c2">
								<p>1005(10.0)</p>
							</entry>
							<entry align="center" colname="c3">
								<p>92</p>
							</entry>
							<entry align="center" colname="c4">
								<p>5.2(8.5)</p>
							</entry>
							<entry align="center" colname="c5">
								<p>81</p>
							</entry>
							<entry align="center" colname="c6">
								<p>3.9(7.4)</p>
							</entry>
						</row>
						<row>
							<entry colname="c1">
								<p>&#8195;<b>Dz. of the respiratory system (8)</b>
								</p>
							</entry>
							<entry align="center" colname="c2">
								<p>368(3.7)</p>
							</entry>
							<entry align="center" colname="c3">
								<p>60</p>
							</entry>
							<entry align="center" colname="c4">
								<p>3.7(8.2)</p>
							</entry>
							<entry align="center" colname="c5">
								<p>91</p>
							</entry>
							<entry align="center" colname="c6">
								<p>3.2(9.0)</p>
							</entry>
						</row>
						<row>
							<entry colname="c1">
								<p>&#8195;<b>Dz. of the digestive system (9)</b>
								</p>
							</entry>
							<entry align="center" colname="c2">
								<p>1232(12.3)</p>
							</entry>
							<entry align="center" colname="c3">
								<p>107</p>
							</entry>
							<entry align="center" colname="c4">
								<p>3.7(8.2)</p>
							</entry>
							<entry align="center" colname="c5">
								<p>102</p>
							</entry>
							<entry align="center" colname="c6">
								<p>2.7(7.3)</p>
							</entry>
						</row>
						<row>
							<entry colname="c1">
								<p>&#8195;<b>Dz. of the genitourinary system (10)</b>
								</p>
							</entry>
							<entry align="center" colname="c2">
								<p>887(8.9)</p>
							</entry>
							<entry align="center" colname="c3">
								<p>92</p>
							</entry>
							<entry align="center" colname="c4">
								<p>3.9(7.1)</p>
							</entry>
							<entry align="center" colname="c5">
								<p>65</p>
							</entry>
							<entry align="center" colname="c6">
								<p>2.3(6.4)</p>
							</entry>
						</row>
						<row>
							<entry colname="c1">
								<p>&#8195;<b>Complications of pregnancy, childbirth and the puerperium(11)</b>
								</p>
							</entry>
							<entry align="center" colname="c2">
								<p>1931(19.3)</p>
							</entry>
							<entry align="center" colname="c3">
								<p>54</p>
							</entry>
							<entry align="center" colname="c4">
								<p>6.3(4.4)</p>
							</entry>
							<entry align="center" colname="c5">
								<p>28</p>
							</entry>
							<entry align="center" colname="c6">
								<p>2.6(3.3)</p>
							</entry>
						</row>
						<row>
							<entry colname="c1">
								<p>&#8195;<b>Dz. of the musculoskeletal system and connective tissue (13)</b>
								</p>
							</entry>
							<entry align="center" colname="c2">
								<p>767(7.7)</p>
							</entry>
							<entry align="center" colname="c3">
								<p>59</p>
							</entry>
							<entry align="center" colname="c4">
								<p>1.8(4.3)</p>
							</entry>
							<entry align="center" colname="c5">
								<p>60</p>
							</entry>
							<entry align="center" colname="c6">
								<p>1.3(4.3)</p>
							</entry>
						</row>
						<row>
							<entry colname="c1">
								<p>&#8195;<b>*Congenital anomalies (14)</b>
								</p>
							</entry>
							<entry align="center" colname="c2">
								<p>248(2.5)</p>
							</entry>
							<entry align="center" colname="c3">
								<p>15</p>
							</entry>
							<entry align="center" colname="c4">
								<p>1.4(2.1)</p>
							</entry>
							<entry align="center" colname="c5">
								<p>34</p>
							</entry>
							<entry align="center" colname="c6">
								<p>1.1(3.2)</p>
							</entry>
						</row>
						<row>
							<entry colname="c1">
								<p>&#8195;<b>*Symptoms, signs and ill-defined conditions (16)</b>
								</p>
							</entry>
							<entry align="center" colname="c2">
								<p>602(6.0)</p>
							</entry>
							<entry align="center" colname="c3">
								<p>99</p>
							</entry>
							<entry align="center" colname="c4">
								<p>5.1(9.0)</p>
							</entry>
							<entry align="center" colname="c5">
								<p>62</p>
							</entry>
							<entry align="center" colname="c6">
								<p>3.4(7.6)</p>
							</entry>
						</row>
						<row rowsep="1">
							<entry colname="c1">
								<p>&#8195;<b>Injury and poisoning (17)</b>
								</p>
							</entry>
							<entry align="center" colname="c2">
								<p>696(7.0)</p>
							</entry>
							<entry align="center" colname="c3">
								<p>77</p>
							</entry>
							<entry align="center" colname="c4">
								<p>2.4(6.2)</p>
							</entry>
							<entry align="center" colname="c5">
								<p>76</p>
							</entry>
							<entry align="center" colname="c6">
								<p>2.3(5.7)</p>
							</entry>
						</row>
						<row>
							<entry colname="c1">
								<p>
									<b>CPT category name (number)</b>
								</p>
							</entry>
							<entry colname="c2"/>
							<entry colname="c3"/>
							<entry colname="c4"/>
							<entry colname="c5"/>
							<entry colname="c6"/>
						</row>
						<row>
							<entry colname="c1">
								<p>&#8195;<b>Operations on the nervous system (1)</b>
								</p>
							</entry>
							<entry align="center" colname="c2">
								<p>460(4.6)</p>
							</entry>
							<entry align="center" colname="c3">
								<p>78</p>
							</entry>
							<entry align="center" colname="c4">
								<p>2.4(5.8)</p>
							</entry>
							<entry align="center" colname="c5">
								<p>70</p>
							</entry>
							<entry align="center" colname="c6">
								<p>1.9(6.1)</p>
							</entry>
						</row>
						<row>
							<entry colname="c1">
								<p>&#8195;<b>Operations on the endocrine system (2)</b>
								</p>
							</entry>
							<entry align="center" colname="c2">
								<p>198(2.0)</p>
							</entry>
							<entry align="center" colname="c3">
								<p>21</p>
							</entry>
							<entry align="center" colname="c4">
								<p>1.7(3.2)</p>
							</entry>
							<entry align="center" colname="c5">
								<p>29</p>
							</entry>
							<entry align="center" colname="c6">
								<p>0.9(3.6)</p>
							</entry>
						</row>
						<row>
							<entry colname="c1">
								<p>&#8195;<b>Operations on the eye (3)</b>
								</p>
							</entry>
							<entry align="center" colname="c2">
								<p>664(6.6)</p>
							</entry>
							<entry align="center" colname="c3">
								<p>99</p>
							</entry>
							<entry align="center" colname="c4">
								<p>1.8(5.9)</p>
							</entry>
							<entry align="center" colname="c5">
								<p>52</p>
							</entry>
							<entry align="center" colname="c6">
								<p>1.4(4.5)</p>
							</entry>
						</row>
						<row>
							<entry colname="c1">
								<p>&#8195;<b>*Operations on the nose, mouth, and pharynx (5)</b>
								</p>
							</entry>
							<entry align="center" colname="c2">
								<p>430(4.3)</p>
							</entry>
							<entry align="center" colname="c3">
								<p>44</p>
							</entry>
							<entry align="center" colname="c4">
								<p>1.1(3.1)</p>
							</entry>
							<entry align="center" colname="c5">
								<p>36</p>
							</entry>
							<entry align="center" colname="c6">
								<p>1.1(3.1)</p>
							</entry>
						</row>
						<row>
							<entry colname="c1">
								<p>&#8195;<b>Operations on the respiratory system (6)</b>
								</p>
							</entry>
							<entry align="center" colname="c2">
								<p>198(2.0)</p>
							</entry>
							<entry align="center" colname="c3">
								<p>60</p>
							</entry>
							<entry align="center" colname="c4">
								<p>6.5(10.0)</p>
							</entry>
							<entry align="center" colname="c5">
								<p>91</p>
							</entry>
							<entry align="center" colname="c6">
								<p>5.6(12.2)</p>
							</entry>
						</row>
						<row>
							<entry colname="c1">
								<p>&#8195;<b>Operations on the cardiovascular system (7)</b>
								</p>
							</entry>
							<entry align="center" colname="c2">
								<p>1105(11.1)</p>
							</entry>
							<entry align="center" colname="c3">
								<p>92</p>
							</entry>
							<entry align="center" colname="c4">
								<p>5.9(9.2)</p>
							</entry>
							<entry align="center" colname="c5">
								<p>76</p>
							</entry>
							<entry align="center" colname="c6">
								<p>4.5(8.1)</p>
							</entry>
						</row>
						<row>
							<entry colname="c1">
								<p>&#8195;<b>Operations on the digestive system (9)</b>
								</p>
							</entry>
							<entry align="center" colname="c2">
								<p>1625(16.3)</p>
							</entry>
							<entry align="center" colname="c3">
								<p>107</p>
							</entry>
							<entry align="center" colname="c4">
								<p>4.0(8.5)</p>
							</entry>
							<entry align="center" colname="c5">
								<p>102</p>
							</entry>
							<entry align="center" colname="c6">
								<p>2.7(7.1)</p>
							</entry>
						</row>
						<row>
							<entry colname="c1">
								<p>&#8195;<b>Operations on the urinary system (10)</b>
								</p>
							</entry>
							<entry align="center" colname="c2">
								<p>533(5.3)</p>
							</entry>
							<entry align="center" colname="c3">
								<p>57</p>
							</entry>
							<entry align="center" colname="c4">
								<p>3.7(5.4)</p>
							</entry>
							<entry align="center" colname="c5">
								<p>65</p>
							</entry>
							<entry align="center" colname="c6">
								<p>1.5(4.9)</p>
							</entry>
						</row>
						<row>
							<entry colname="c1">
								<p>&#8195;<b>Operations on the male genital organs (11)</b>
								</p>
							</entry>
							<entry align="center" colname="c2">
								<p>345(3.5)</p>
							</entry>
							<entry align="center" colname="c3">
								<p>49</p>
							</entry>
							<entry align="center" colname="c4">
								<p>2.5(3.8)</p>
							</entry>
							<entry align="center" colname="c5">
								<p>33</p>
							</entry>
							<entry align="center" colname="c6">
								<p>1.1(3.4)</p>
							</entry>
						</row>
						<row>
							<entry colname="c1">
								<p>&#8195;<b>*Operations on the female genital organs (12)</b>
								</p>
							</entry>
							<entry align="center" colname="c2">
								<p>603(6.0)</p>
							</entry>
							<entry align="center" colname="c3">
								<p>28</p>
							</entry>
							<entry align="center" colname="c4">
								<p>3.0(2.9)</p>
							</entry>
							<entry align="center" colname="c5">
								<p>17</p>
							</entry>
							<entry align="center" colname="c6">
								<p>1.2(2.5)</p>
							</entry>
						</row>
						<row>
							<entry colname="c1">
								<p>&#8195;<b>Operations on the musculoskeletal system (14)</b>
								</p>
							</entry>
							<entry align="center" colname="c2">
								<p>1321(13.2)</p>
							</entry>
							<entry align="center" colname="c3">
								<p>77</p>
							</entry>
							<entry align="center" colname="c4">
								<p>2.0(4.9)</p>
							</entry>
							<entry align="center" colname="c5">
								<p>67</p>
							</entry>
							<entry align="center" colname="c6">
								<p>1.7(4.8)</p>
							</entry>
						</row>
						<row>
							<entry colname="c1">
								<p>&#8195;<b>*Operations on the integumentary system (15)</b>
								</p>
							</entry>
							<entry align="center" colname="c2">
								<p>442(4.4)</p>
							</entry>
							<entry align="center" colname="c3">
								<p>54</p>
							</entry>
							<entry align="center" colname="c4">
								<p>3.5(6.5)</p>
							</entry>
							<entry align="center" colname="c5">
								<p>53</p>
							</entry>
							<entry align="center" colname="c6">
								<p>2.4(5.6)</p>
							</entry>
						</row>
						<row>
							<entry colname="c1">
								<p>&#8195;<b>Miscellaneous diagnostic and therapeutic procedures (16)</b>
								</p>
							</entry>
							<entry align="center" colname="c2">
								<p>321(3.2)</p>
							</entry>
							<entry align="center" colname="c3">
								<p>92</p>
							</entry>
							<entry align="center" colname="c4">
								<p>4.2(9.5)</p>
							</entry>
							<entry align="center" colname="c5">
								<p>81</p>
							</entry>
							<entry align="center" colname="c6">
								<p>3.1(7.8)</p>
							</entry>
						</row>
						<row rowsep="1">
							<entry colname="c1">
								<p>&#8195;<b>*Anesthesia procedures (18)</b>
								</p>
							</entry>
							<entry align="center" colname="c2">
								<p>1755(17.6)</p>
							</entry>
							<entry align="center" colname="c3">
								<p>54</p>
							</entry>
							<entry align="center" colname="c4">
								<p>6.7(4.5)</p>
							</entry>
							<entry align="center" colname="c5">
								<p>52</p>
							</entry>
							<entry align="center" colname="c6">
								<p>2.8(3.5)</p>
							</entry>
						</row>
					</tbody>
				</tgroup>
			</table>
			<p>Table&#160;
				<tblr tid="T3">3</tblr> shows the effect of each variable as a whole and of each level of the primary outcome variable (ASA Class) on the estimated number of days with laboratory results and medication orders based on the parameter estimates from the negative binomial model. Effects, standard errors and 95% confidence intervals for individual variable levels are expressed as ratios comparing that level to the reference level for the variable, such that an effect of 2.0 for ASA 3 indicates that ASA 3 is estimated to have 2 times the number of days as ASA 1. These ratios were obtained by exponentiating the model regression coefficients. ASA class, subject sex, age, admission status, ICD-9 category, and CPT category were significantly associated with the counts of days with laboratory results and medication orders, while emergency status was associated only with laboratory results.</p>
			<table id="T3">
				<title>
					<p>Table 3</p>
				</title>
				<caption>
					<p>
						<b>Negative binomial regression</b>
					</p>
				</caption>
				<tgroup cols="10">
					<colspec align="center" colname="c1" colnum="1" colwidth="1*"/>
					<colspec align="left" colname="c2" colnum="2" colwidth="1*"/>
					<colspec align="center" colname="c3" colnum="3" colwidth="1*"/>
					<colspec align="left" colname="c4" colnum="4" colwidth="1*"/>
					<colspec align="left" colname="c5" colnum="5" colwidth="1*"/>
					<colspec align="left" colname="c6" colnum="6" colwidth="1*"/>
					<colspec align="center" colname="c7" colnum="7" colwidth="1*"/>
					<colspec align="left" colname="c8" colnum="8" colwidth="1*"/>
					<colspec align="left" colname="c9" colnum="9" colwidth="1*"/>
					<colspec align="left" colname="c10" colnum="10" colwidth="1*"/>
					<thead valign="top">
						<row>
							<entry colname="c1"/>
							<entry align="center" colname="c2" nameend="c5" namest="c2" rowsep="1">
								<p>
									<b>Laboratory results</b>
								</p>
							</entry>
							<entry align="center" colname="c6" nameend="c9" namest="c6" rowsep="1">
								<p>
									<b>Medication orders</b>
								</p>
							</entry>
						</row>
						<row rowsep="1">
							<entry colname="c1" valign="top">
								<p>
									<b>Variable</b>
								</p>
							</entry>
							<entry colname="c2" valign="top">
								<p>
									<b>Overall variable effect p-value</b>
								</p>
							</entry>
							<entry colname="c3" valign="top">
								<p>
									<b>Ratio of expected number of days(SE)</b>
									<sup>
										<b>*</b>
									</sup>
								</p>
							</entry>
							<entry colname="c4" valign="top">
								<p>
									<b>95% Confidence interval</b>
								</p>
							</entry>
							<entry colname="c5" valign="top">
								<p>
									<b>Regression p-value</b>
								</p>
							</entry>
							<entry colname="c6" valign="top">
								<p>
									<b>Overall variable effect p-value</b>
								</p>
							</entry>
							<entry colname="c7" valign="top">
								<p>
									<b>Ratio of expected number of days(SE)</b>
									<sup>
										<b>*</b>
									</sup>
								</p>
							</entry>
							<entry colname="c8" valign="top">
								<p>
									<b>95% Confidence interval</b>
								</p>
							</entry>
							<entry colname="c9" valign="top">
								<p>
									<b>Regression p-value</b>
								</p>
							</entry>
						</row>
					</thead>
					<tfoot>
						<p>ASA Class&#8201;=&#8201;American Society of Anesthesiologists Physical Status Classification; ICD-9&#8201;=&#8201;International Classification of Diseases, Ninth Revision; CPT&#8201;=&#8201;Current Procedural Terminology; SE&#8201;=&#8201;Standard Error.</p>
						<p>*The effects are ratios, obtained by exponentiating the model regression coefficients, of the expected number of days (with either laboratory results or medication orders) for each ASA Class compared to ASA 1, that is, an effect of 2.0 for ASA 3 indicates that ASA 3 is estimated to have 2 times the number of days as ASA 1.</p>
						<p>
							<sup>&#8224;</sup>Statistically significant at the 0.05 significance level.</p>
					</tfoot>
					<tbody valign="top">
						<row>
							<entry colname="c1">
								<p>
									<b>ASA class</b>
								</p>
							</entry>
							<entry colname="c2">
								<p>&lt;.001<sup>&#8224;</sup>
								</p>
							</entry>
							<entry colname="c3"/>
							<entry colname="c4"/>
							<entry colname="c5"/>
							<entry colname="c6">
								<p>&lt;.001<sup>&#8224;</sup>
								</p>
							</entry>
							<entry colname="c7"/>
							<entry colname="c8"/>
							<entry colname="c9"/>
						</row>
						<row>
							<entry colname="c1">
								<p>&#8195;<b>1</b>
								</p>
							</entry>
							<entry colname="c2"/>
							<entry colname="c3">
								<p>1.00</p>
							</entry>
							<entry colname="c5"/>
							<entry colname="c6"/>
							<entry colname="c7"/>
							<entry colname="c7">
								<p>1.00</p>
							</entry>
							<entry colname="c9"/>
							<entry colname="c10"/>
						</row>
						<row>
							<entry colname="c1">
								<p>&#8195;<b>2</b>
								</p>
							</entry>
							<entry colname="c2"/>
							<entry colname="c3">
								<p>1.47(1.03)</p>
							</entry>
							<entry colname="c4">
								<p>1.38 &#8211; 1.57</p>
							</entry>
							<entry colname="c5">
								<p>&lt;.001<sup>&#8224;</sup>
								</p>
							</entry>
							<entry colname="c7"/>
							<entry colname="c7">
								<p>1.74(1.05)</p>
							</entry>
							<entry colname="c8">
								<p>1.56 &#8211; 1.94</p>
							</entry>
							<entry colname="c9">
								<p>&lt;.001<sup>&#8224;</sup>
								</p>
							</entry>
						</row>
						<row>
							<entry colname="c1">
								<p>&#8195;<b>3</b>
								</p>
							</entry>
							<entry colname="c2"/>
							<entry colname="c3">
								<p>3.38(1.04)</p>
							</entry>
							<entry colname="c4">
								<p>3.11 &#8211; 3.67</p>
							</entry>
							<entry colname="c5">
								<p>&lt;.001<sup>&#8224;</sup>
								</p>
							</entry>
							<entry colname="c7"/>
							<entry colname="c7">
								<p>4.78(1.07)</p>
							</entry>
							<entry colname="c8">
								<p>4.18 &#8211; 5.48</p>
							</entry>
							<entry colname="c9">
								<p>&lt;.001<sup>&#8224;</sup>
								</p>
							</entry>
						</row>
						<row>
							<entry colname="c1">
								<p>&#8195;<b>4</b>
								</p>
							</entry>
							<entry colname="c2"/>
							<entry colname="c3">
								<p>5.05(1.07)</p>
							</entry>
							<entry colname="c4">
								<p>4.41 &#8211; 5.77</p>
							</entry>
							<entry colname="c5">
								<p>&lt;.001<sup>&#8224;</sup>
								</p>
							</entry>
							<entry colname="c7"/>
							<entry colname="c7">
								<p>6.85(1.12)</p>
							</entry>
							<entry colname="c8">
								<p>4.49 &#8211; 8.54</p>
							</entry>
							<entry colname="c9">
								<p>&lt;.001<sup>&#8224;</sup>
								</p>
							</entry>
						</row>
						<row>
							<entry colname="c1">
								<p>
									<b>Sex</b>
								</p>
							</entry>
							<entry colname="c2">
								<p>&lt;.001<sup>&#8224;</sup>
								</p>
							</entry>
							<entry colname="c3"/>
							<entry colname="c4"/>
							<entry colname="c6"/>
							<entry colname="c6">
								<p>&lt;.001<sup>&#8224;</sup>
								</p>
							</entry>
							<entry colname="c7"/>
							<entry colname="c8"/>
							<entry colname="c9"/>
						</row>
						<row>
							<entry colname="c1">
								<p>
									<b>Age</b>
								</p>
							</entry>
							<entry colname="c2">
								<p>&lt;.001<sup>&#8224;</sup>
								</p>
							</entry>
							<entry colname="c3"/>
							<entry colname="c4"/>
							<entry colname="c6"/>
							<entry colname="c6">
								<p>&lt;.001<sup>&#8224;</sup>
								</p>
							</entry>
							<entry colname="c7"/>
							<entry colname="c8"/>
							<entry colname="c9"/>
						</row>
						<row>
							<entry colname="c1">
								<p>
									<b>Emergency status</b>
								</p>
							</entry>
							<entry colname="c2">
								<p>&lt;.001<sup>&#8224;</sup>
								</p>
							</entry>
							<entry colname="c3"/>
							<entry colname="c4"/>
							<entry colname="c6"/>
							<entry colname="c6">
								<p>0.84</p>
							</entry>
							<entry colname="c7"/>
							<entry colname="c8"/>
							<entry colname="c9"/>
						</row>
						<row>
							<entry colname="c1">
								<p>
									<b>Admission status</b>
								</p>
							</entry>
							<entry colname="c2">
								<p>&lt;.001<sup>&#8224;</sup>
								</p>
							</entry>
							<entry colname="c3"/>
							<entry colname="c4"/>
							<entry colname="c6"/>
							<entry colname="c6">
								<p>&lt;.001<sup>&#8224;</sup>
								</p>
							</entry>
							<entry colname="c7"/>
							<entry colname="c8"/>
							<entry colname="c9"/>
						</row>
						<row>
							<entry colname="c1">
								<p>
									<b>ICD-9 category</b>
								</p>
							</entry>
							<entry colname="c2">
								<p>&lt;.001<sup>&#8224;</sup>
								</p>
							</entry>
							<entry colname="c3"/>
							<entry colname="c4"/>
							<entry colname="c6"/>
							<entry colname="c6">
								<p>&lt;.001<sup>&#8224;</sup>
								</p>
							</entry>
							<entry colname="c7"/>
							<entry colname="c8"/>
							<entry colname="c9"/>
						</row>
						<row rowsep="1">
							<entry colname="c1">
								<p>
									<b>CPT category</b>
								</p>
							</entry>
							<entry colname="c2">
								<p>&lt;.001<sup>&#8224;</sup>
								</p>
							</entry>
							<entry colname="c3"/>
							<entry colname="c4"/>
							<entry colname="c6"/>
							<entry colname="c6">
								<p>&lt;.001<sup>&#8224;</sup>
								</p>
							</entry>
							<entry colname="c7"/>
							<entry colname="c8"/>
							<entry colname="c9"/>
						</row>
					</tbody>
				</tgroup>
			</table>
			<p>Our primary variable of interest, ASA class, had a significant association with the counts of days with laboratory results and medication orders. Controlling for all other variables, the estimated count of days with laboratory results for ASA 2 was 1.47 times, for ASA 3 was 3.38 times, and for ASA 4 was 5.05 times the count of days with laboratory results for ASA 1. The pairwise differences for counts of days with laboratory results between all four ASA classes were statistically significant. Similarly, the estimated count of days with medication orders for ASA 2 was 1.74 times, for ASA 3 was 4.78 times, and for ASA 4 was 6.85 times the count of days with medication orders for ASA 1. All pairwise comparisons between the four ASA classes for counts of days with medication orders were statistically significant.</p>
		</sec>
		<sec>
			<st>
				<p>Discussion</p>
			</st>
			<p>The results of the negative binomial regression model demonstrate the relationship between patient health status and EHR data sufficiency. The less healthy the patient, as measured by ASA status, the more data that patient is likely to have, as represented by counts of days with laboratory results and medication orders, and the more likely they are to satisfy sufficiency requirements. This relationship holds true even when controlling for a number of likely confounders, including sex, age, emergent status, patient type, diagnosis, and procedure, which suggests that even within specific, well-defined cohorts, sicker patients are likely to have more data than healthier patients.</p>
			<p>These findings highlight an important but usually overlooked problem inherent to studies using EHR data: the selection of records with sufficient data, as measured by human imposed sufficiency requirements, for research may bias the sample towards patients who are sicker than the population from which the sample is drawn. The findings from this study are consistent with previous work exploring the complex relationships among data quality, bias, and health status. In one example, Wennberg et al. used insurance claims data to demonstrate bias in comorbidity measurement by showing that Charlson Comorbity Index scores are associated with the frequency of physician visits 
				<abbrgrp>
					<abbr bid="B25">25</abbr>
				</abbrgrp>, suggesting that data quality is compromised by differences in healthcare utilization. Similarly, Collins et al. identified a relationship between patient mortality and increased rates of nursing documentation, suggesting that more acutely ill patients are likely to have more thoroughly documented records 
				<abbrgrp>
					<abbr bid="B36">36</abbr>
					<abbr bid="B37">37</abbr>
				</abbrgrp>. In a study of a pneumonia severity index using EHR data, Hripcsak et al. found that the addition of cohort selection criteria that required the presence of sufficient data to make a reliable diagnosis substantially limited the sample size and significantly altered the mortality rates 
				<abbrgrp>
					<abbr bid="B38">38</abbr>
				</abbrgrp>. They note that the addition of simple sample restraints, while beneficial in their case, has the potential to significantly narrow the sample, leading to the possibility of bias.</p>
			<p>We observed a direct correlation between severity of illness and data sufficiency in spite of the presence of sub-populations in our study sample in which this correlation should not exist: living organ donors and pregnant women with uncomplicated pregnancies presenting for management of labor and delivery. These patients tend to be healthy, but have more data in their records, resulting from laboratory testing performed as part of routine prenatal care or organ donor evaluation. Our 10,000-patient sample contained 1,802(18.0%) such patients, of whom 1,746(96.9%) were classified as ASA 1 or 2 (relatively healthy). The average number of days with laboratory results for patients in this group (6.5) is nearly double that of all other patients in the study (3.4). Despite the presence of such a large number of healthy patients with a high degree of EHR sufficiency, our original hypothesis &#8212; that sicker patients have better EHR data sufficiency for research &#8212; was confirmed. (See Additional file 
				<supplr sid="S1">1</supplr>: Table S1 for results of the negative binomial model with pregnant patients and living organ donors excluded).</p>
			<suppl id="S1">
				<title>
					<p>Additional file 1: Table S1</p>
				</title>
				<text>
					<p>Negative binomial regression excluding pregnant patients and living organ donors.</p>
				</text>
				<file name="1472-6947-14-51-S1.docx">
   <p>Click here for file</p>
</file>
			</suppl>
			<p>In addition to confirming our primary hypothesis, we discovered that many other variables are independently associated with data sufficiency. These include admission status at time of assessment, age, emergency classification of the procedure, procedure type (CPT category) and primary diagnosis type (ICD-9 category). Potential biases in these other characteristics of the study population should be considered when selecting populations based on sufficiency requirements this population is studied.</p>
			<sec>
				<st>
					<p>Limitations and future directions</p>
				</st>
				<p>This study was performed primarily in a tertiary care academic medical center (though one of the included hospitals is a primary care facility) in a major metropolitan area. Consequently, many of the patients included in our analysis were likely referred from other facilities. Data might differ in a more rural, primary practice setting or in a health system where patients receive the majority of their care within that one system. A follow-up study should be performed to determine whether our results could be replicated in other clinical settings.</p>
				<p>Since our findings are based only on data primarily collected for documentation of clinical care, we cannot definitively conclude that this same bias would exist for secondary use of data primarily collected for other purposes, such as regulatory oversight and billing. Further analysis should be performed on other data sources.</p>
				<p>As a result of our decision to use ASA class as a measure of health status, our sample was limited to patients who had received anesthetic services. Though anesthetic services are generally provided to a wide range of patients, and one might therefore expect the relationship between record sufficiency and patient health to hold true more broadly, the generalizability of our results to other populations may be limited. A novel measure of health status that is independent of data quality but available for all patients in the EHR would provide a means to evaluate the correlation between health status and data sufficiency. Alternatively, a study that prospectively evaluates a representative sample of all patients in the EHR for health status could determine if the correlation exists in a more general population, though such a study would be costly. As in any retrospective study, it is possible that there exist covariates not controlled for in our model that account for the observed differences.</p>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Conclusions</p>
			</st>
			<p>In this analysis, we established the correlation between the degree of patient sickness and the sufficiency of data of their health records, for inclusion in research. This finding is important for researchers reusing EHR data. EHR-based studies sample patients based on sufficiency requirements with the aim of selecting only those records containing sufficient data to overcome the data missingness problem. This strategy turns out to introduce a hidden bias towards sick patients because sicker patient have records with a higher degree of data sufficiency and are more likely to be included in EHR-based studies; therefore, this selection process biases the study populations towards those comprised of sicker patients. The more stringent the sufficiency requirements, the sicker the resultant sample population. This is a problem unique to studies that rely on the secondary use of data initially collected for purposes other than research. Those involved in the secondary use of EHR data for research, as well as consumers of this research, should be aware of this sampling bias problem and exercise caution when applying results to real world populations.</p>
		</sec>
		<sec>
			<st>
				<p>Competing interests</p>
			</st>
			<p>The authors declared that they have no competing interests.</p>
		</sec>
		<sec>
			<st>
				<p>Authors&#8217; contributions</p>
			</st>
			<p>AR and NGW carried out data extraction and analysis and wrote the manuscript together. SW performed the statistical analyses. CW identified the research question and directed the experiment. All authors read and approved the final manuscript.</p>
		</sec>
	</bdy>
	<bm>
		<ack>
			<sec>
				<st>
					<p>Acknowledgments</p>
				</st>
				<p>We would like to thank Dr. Mathew L. Maciejewski for his advice and help in the preparation of this manuscript. We would also like to thank the reviewers for their valuable input.</p>
				<sec>
					<st>
						<p>Funding</p>
					</st>
					<p>This work was supported by grants R01LM009886, R01LM010815, and 5T15LM007079 from the National Library of Medicine, grant UL1 TR000040 from the National Center for Advancing Translational Sciences (NCATS), and grant 5T32GM008464 from the National Institute of Health.</p>
				</sec>
			</sec>
		</ack>
		<refgrp><bibl id="B1"><title><p>Stimulating the adoption of health information technology</p></title><aug><au><snm>Blumenthal</snm><fnm>D</fnm></au></aug><source>N Engl J Med</source><pubdate>2009</pubdate><volume>360</volume><issue>15</issue><fpage>1477</fpage><lpage>1479</lpage></bibl><bibl id="B2"><title><p>The &#8220;meaningful use&#8221; regulation for electronic health records</p></title><aug><au><snm>Blumenthal</snm><fnm>D</fnm></au><au><snm>Tavenner</snm><fnm>M</fnm></au></aug><source>N Engl J Med</source><pubdate>2010</pubdate><volume>363</volume><issue>6</issue><fpage>501</fpage><lpage>504</lpage></bibl><bibl id="B3"><aug><au><snm>Charles</snm><fnm>D</fnm></au><au><snm>King</snm><fnm>J</fnm></au><au><snm>Patel</snm><fnm>V</fnm></au><au><snm>Furukawa</snm><fnm>M</fnm></au></aug><source>ONC Data Brief No. 9: Adoption of Electronic Health record Systems among U.S. Non-federal Acute Care Hospitals: 2008-20012</source><series>
   <title>
      <p>The Office of the National Coordinator for Health Information Technology</p>
   </title>
</series><pubdate>2013</pubdate></bibl><bibl id="B4"><aug><au><snm>Hsiao</snm><fnm>C</fnm></au><au><snm>Hing</snm><fnm>E</fnm></au></aug><source>NCHS Data Brief No. 111: Use and characteristics of electronic health record systems among office-based physician practice: United States, 2001-2012</source><series>
   <title>
      <p>National Center for Health Statistics</p>
   </title>
</series><pubdate>2012</pubdate></bibl><bibl id="B5"><title><p>Advancing the Framework: Use of Health Data&#8212;A Report of a Working Conference of the American Medical Informatics Association</p></title><aug><au><snm>Bloomrosen</snm><fnm>M</fnm></au><au><snm>Detmer</snm><fnm>DE</fnm></au></aug><source>J Am Med Inform Assoc</source><pubdate>2008</pubdate><volume>15</volume><issue>6</issue><fpage>715</fpage><lpage>722</lpage></bibl><bibl id="B6"><title><p>Adding value to the electronic health record through secondary use of data for quality assurance, research, and surveillance</p></title><aug><au><snm>Hersh</snm><fnm>WR</fnm></au></aug><source>Am J Manage Care</source><pubdate>2007</pubdate><volume>13</volume><issue>6 Part 1</issue><fpage>277</fpage><lpage>278</lpage></bibl><bibl id="B7"><title><p>Toward a national framework for the secondary use of health data: an American Medical Informatics Association White Paper</p></title><aug><au><snm>Safran</snm><fnm>C</fnm></au><au><snm>Bloomrosen</snm><fnm>M</fnm></au><au><snm>Hammond</snm><fnm>WE</fnm></au><au><snm>Labkoff</snm><fnm>S</fnm></au><au><snm>Markel-Fox</snm><fnm>S</fnm></au><au><snm>Tang</snm><fnm>PC</fnm></au><au><snm>Detmer</snm><fnm>DE</fnm></au><au><snm>Expert</snm><fnm>P</fnm></au></aug><source>J Am Med Inform Assoc</source><pubdate>2007</pubdate><volume>14</volume><issue>1</issue><fpage>1</fpage><lpage>9</lpage></bibl><bibl id="B8"><title><p>The cost of drug development: a systematic review</p></title><aug><au><snm>Morgan</snm><fnm>S</fnm></au><au><snm>Grootendorst</snm><fnm>P</fnm></au><au><snm>Lexchin</snm><fnm>J</fnm></au><au><snm>Cunningham</snm><fnm>C</fnm></au><au><snm>Greyson</snm><fnm>D</fnm></au></aug><source>Health Policy</source><pubdate>2011</pubdate><volume>100</volume><issue>1</issue><fpage>4</fpage><lpage>17</lpage></bibl><bibl id="B9"><title><p>Central challenges facing the national clinical research enterprise</p></title><aug><au><snm>Sung</snm><fnm>NS</fnm></au><au><snm>Crowley</snm><fnm>WF</fnm><suf>Jr</suf></au><au><snm>Genel</snm><fnm>M</fnm></au><au><snm>Salber</snm><fnm>P</fnm></au><au><snm>Sandy</snm><fnm>L</fnm></au><au><snm>Sherwood</snm><fnm>LM</fnm></au><au><snm>Johnson</snm><fnm>SB</fnm></au><au><snm>Catanese</snm><fnm>V</fnm></au><au><snm>Tilson</snm><fnm>H</fnm></au><au><snm>Getz</snm><fnm>K</fnm></au><au><snm>Larson</snm><fnm>EL</fnm></au><au><snm>Scheinberg</snm><fnm>D</fnm></au><au><snm>Reece</snm><fnm>EA</fnm></au><au><snm>Slavkin</snm><fnm>H</fnm></au><au><snm>Dobs</snm><fnm>A</fnm></au><au><snm>Grebb</snm><fnm>J</fnm></au><au><snm>Martinez</snm><fnm>RA</fnm></au><au><snm>Korn</snm><fnm>A</fnm></au><au><snm>Rimoin</snm><fnm>D</fnm></au></aug><source>JAMA</source><pubdate>2003</pubdate><volume>289</volume><issue>10</issue><fpage>1278</fpage><lpage>1287</lpage></bibl><bibl id="B10"><title><p>Caveats for the use of operational electronic health record data in comparative effectiveness research</p></title><aug><au><snm>Hersh</snm><fnm>WR</fnm></au><au><snm>Weiner</snm><fnm>MG</fnm></au><au><snm>Embi</snm><fnm>PJ</fnm></au><au><snm>Logan</snm><fnm>JR</fnm></au><au><snm>Payne</snm><fnm>PR</fnm></au><au><snm>Bernstam</snm><fnm>EV</fnm></au><au><snm>Lehmann</snm><fnm>HP</fnm></au><au><snm>Hripcsak</snm><fnm>G</fnm></au><au><snm>Hartzog</snm><fnm>TH</fnm></au><au><snm>Cimino</snm><fnm>JJ</fnm></au><au><snm>Saltz</snm><fnm>JH</fnm></au></aug><source>Med Care</source><pubdate>2013</pubdate><volume>51</volume><issue>8 Suppl 3</issue><fpage>S30</fpage><lpage>S37</lpage></bibl><bibl id="B11"><title><p>Review: electronic health records and the reliability and validity of quality measures: a review of the literature</p></title><aug><au><snm>Chan</snm><fnm>KS</fnm></au><au><snm>Fowles</snm><fnm>JB</fnm></au><au><snm>Weiner</snm><fnm>JP</fnm></au></aug><source>Med Care Res Rev</source><pubdate>2010</pubdate><volume>67</volume><issue>5</issue><fpage>503</fpage><lpage>527</lpage></bibl><bibl id="B12"><title><p>Accuracy of data in computer-based patient records</p></title><aug><au><snm>Hogan</snm><fnm>WR</fnm></au><au><snm>Wagner</snm><fnm>MM</fnm></au></aug><source>J Am Med Inform Assoc</source><pubdate>1997</pubdate><volume>4</volume><issue>5</issue><fpage>342</fpage><lpage>355</lpage></bibl><bibl id="B13"><title><p>Systematic review of scope and quality of electronic patient record data in primary care</p></title><aug><au><snm>Thiru</snm><fnm>K</fnm></au><au><snm>Hassey</snm><fnm>A</fnm></au><au><snm>Sullivan</snm><fnm>F</fnm></au></aug><source>BMJ</source><pubdate>2003</pubdate><volume>326</volume><issue>7398</issue><fpage>1070</fpage></bibl><bibl id="B14"><title><p>Defining and measuring completeness of electronic health records for secondary use</p></title><aug><au><snm>Weiskopf</snm><fnm>NG</fnm></au><au><snm>Hripcsak</snm><fnm>G</fnm></au><au><snm>Swaminathan</snm><fnm>S</fnm></au><au><snm>Weng</snm><fnm>C</fnm></au></aug><source>J Biomed Inform</source><pubdate>2013</pubdate><volume>46</volume><issue>5</issue><fpage>830</fpage><lpage>836</lpage></bibl><bibl id="B15"><title><p>The clinical record: a 200-year-old 21st-century challenge</p></title><aug><au><snm>Barr</snm><fnm>MS</fnm></au></aug><source>Ann Intern Med</source><pubdate>2010</pubdate><volume>153</volume><issue>10</issue><fpage>682</fpage><lpage>683</lpage></bibl><bibl id="B16"><title><p>External validity of randomised controlled trials: &#8220;to whom do the results of this trial apply?&#8221;</p></title><aug><au><snm>Rothwell</snm><fnm>PM</fnm></au></aug><source>Lancet</source><pubdate>2005</pubdate><volume>365</volume><issue>9453</issue><fpage>82</fpage><lpage>93</lpage></bibl><bibl id="B17"><title><p>Meta-analysis of hypercoagulability genetic polymorphisms in Perthes disease</p></title><aug><au><snm>Woratanarat</snm><fnm>P</fnm></au><au><snm>Thaveeratitharm</snm><fnm>C</fnm></au><au><snm>Woratanarat</snm><fnm>T</fnm></au><au><snm>Angsanuntsukh</snm><fnm>C</fnm></au><au><snm>Attia</snm><fnm>J</fnm></au><au><snm>Thakkinstian</snm><fnm>A</fnm></au></aug><source>J Orthop Res</source><pubdate>2014</pubdate><volume>32</volume><issue>1</issue><fpage>1</fpage><lpage>7</lpage></bibl><bibl id="B18"><aug><au><snm>Hudson</snm><fnm>DL</fnm></au><au><snm>Cohen</snm><fnm>ME</fnm></au></aug><source>Merging medical informatics and automated diagnostic methods</source><series>
   <title>
      <p>Conference proceedings: Annual International Conference of the IEEE Engineering in Medicine and Biology Society IEEE Engineering in Medicine and Biology Society Conference 2013</p>
   </title>
</series><pubdate>2013</pubdate><fpage>4783</fpage><lpage>4786</lpage></bibl><bibl id="B19"><title><p>Using body mass index data in the electronic health record to calculate cardiovascular risk</p></title><aug><au><snm>Green</snm><fnm>BB</fnm></au><au><snm>Anderson</snm><fnm>ML</fnm></au><au><snm>Cook</snm><fnm>AJ</fnm></au><au><snm>Catz</snm><fnm>S</fnm></au><au><snm>Fishman</snm><fnm>PA</fnm></au><au><snm>McClure</snm><fnm>JB</fnm></au><au><snm>Reid</snm><fnm>R</fnm></au></aug><source>Am J Prev Med</source><pubdate>2012</pubdate><volume>42</volume><issue>4</issue><fpage>342</fpage><lpage>347</lpage></bibl><bibl id="B20"><title><p>Beyond accuracy: what data quality means to data consumers</p></title><aug><au><snm>Wang</snm><fnm>RY</fnm></au><au><snm>Strong</snm><fnm>DM</fnm></au></aug><source>J Manage Inf Syst</source><pubdate>1996</pubdate><volume>12</volume><issue>4</issue><fpage>5</fpage><lpage>34</lpage></bibl><bibl id="B21"><title><p>Inference and missing data</p></title><aug><au><snm>Rubin</snm><fnm>D</fnm></au></aug><source>Biometrika</source><pubdate>1976</pubdate><volume>63</volume><issue>3</issue><fpage>581</fpage><lpage>592</lpage></bibl><bibl id="B22"><title><p>Sick patients have more data: the non-random completeness of electronic health records</p></title><aug><au><snm>Weiskopf</snm><fnm>NG</fnm></au><au><snm>Rusanov</snm><fnm>A</fnm></au><au><snm>Weng</snm><fnm>C</fnm></au></aug><source>AMIA Annu Symp Proc</source><pubdate>2013</pubdate><volume>2013</volume><fpage>1472</fpage><lpage>1477</lpage></bibl><bibl id="B23"><title><p>What patient population does visit-based sampling in primary care settings represent?</p></title><aug><au><snm>Lee</snm><fnm>ML</fnm></au><au><snm>Yano</snm><fnm>EM</fnm></au><au><snm>Wang</snm><fnm>M</fnm></au><au><snm>Simon</snm><fnm>BF</fnm></au><au><snm>Rubenstein</snm><fnm>LV</fnm></au></aug><source>Med Care</source><pubdate>2002</pubdate><volume>40</volume><issue>9</issue><fpage>761</fpage><lpage>770</lpage></bibl><bibl id="B24"><title><p>How to measure comorbidity. a critical review of available methods</p></title><aug><au><snm>de Groot</snm><fnm>V</fnm></au><au><snm>Beckerman</snm><fnm>H</fnm></au><au><snm>Lankhorst</snm><fnm>GJ</fnm></au><au><snm>Bouter</snm><fnm>LM</fnm></au></aug><source>J Clin Epidemiol</source><pubdate>2003</pubdate><volume>56</volume><issue>3</issue><fpage>221</fpage><lpage>229</lpage></bibl><bibl id="B25"><title><p>Observational intensity bias associated with illness adjustment: cross sectional analysis of insurance claims</p></title><aug><au><snm>Wennberg</snm><fnm>JE</fnm></au><au><snm>Staiger</snm><fnm>DO</fnm></au><au><snm>Sharp</snm><fnm>SM</fnm></au><au><snm>Gottlieb</snm><fnm>DJ</fnm></au><au><snm>Bevan</snm><fnm>G</fnm></au><au><snm>McPherson</snm><fnm>K</fnm></au><au><snm>Welch</snm><fnm>HG</fnm></au></aug><source>BMJ</source><pubdate>2013</pubdate><volume>346</volume><fpage>f549</fpage></bibl><bibl id="B26"><title><p>Grading of patients for surgical procedures</p></title><aug><au><snm>Saklad</snm><fnm>M</fnm></au></aug><source>Anesthesiology</source><pubdate>1941</pubdate><volume>2</volume><issue>3</issue><fpage>281</fpage><lpage>284</lpage></bibl><bibl id="B27"><title><p>ASA Physical Status Classification System</p></title><note>[
					<url>http://www.asahq.org/Home/For-Members/Clinical-Information/ASA-Physical-Status-Classification-System</url>]</note></bibl><bibl id="B28"><title><p>National Surgical Quality Improvement Program (NSQIP) risk factors can be used to validate American Society of Anesthesiologists Physical Status Classification (ASA PS) levels</p></title><aug><au><snm>Davenport</snm><fnm>DL</fnm></au><au><snm>Bowe</snm><fnm>EA</fnm></au><au><snm>Henderson</snm><fnm>WG</fnm></au><au><snm>Khuri</snm><fnm>SF</fnm></au><au><snm>Mentzer</snm><fnm>RM</fnm><suf>Jr</suf></au></aug><source>Ann Surg</source><pubdate>2006</pubdate><volume>243</volume><issue>5</issue><fpage>636</fpage><lpage>641</lpage><note>discussion 641-634</note></bibl><bibl id="B29"><title><p>The role of anesthesia in surgical mortality</p></title><aug><au><snm>Dripps</snm><fnm>RD</fnm></au><au><snm>Lamont</snm><fnm>A</fnm></au><au><snm>Eckenhoff</snm><fnm>JE</fnm></au></aug><source>JAMA</source><pubdate>1961</pubdate><volume>178</volume><fpage>261</fpage><lpage>266</lpage></bibl><bibl id="B30"><title><p>A statistical analysis of the relationship of physical status to postoperative mortality in 68,388 cases</p></title><aug><au><snm>Vacanti</snm><fnm>CJ</fnm></au><au><snm>VanHouten</snm><fnm>RJ</fnm></au><au><snm>Hill</snm><fnm>RC</fnm></au></aug><source>Anesth Analg</source><pubdate>1970</pubdate><volume>49</volume><issue>4</issue><fpage>564</fpage><lpage>566</lpage></bibl><bibl id="B31"><title><p>ASA classification and perioperative variables as predictors of postoperative outcome</p></title><aug><au><snm>Wolters</snm><fnm>U</fnm></au><au><snm>Wolf</snm><fnm>T</fnm></au><au><snm>Stutzer</snm><fnm>H</fnm></au><au><snm>Schroder</snm><fnm>T</fnm></au></aug><source>Br J Anaesth</source><pubdate>1996</pubdate><volume>77</volume><issue>2</issue><fpage>217</fpage><lpage>222</lpage></bibl><bibl id="B32"><title><p>International Classification of Diseases, Ninth Revision, Clinical Modification (ICD-9-CM)</p></title><note>[
					<url>http://www.cdc.gov/nchs/icd/icd9cm.htm</url>]</note></bibl><bibl id="B33"><title><p>Current Procedural Terminology</p></title><note>[
					<url>http://www.ama-assn.org/ama/pub/physician-resources/solutions-managing-your-practice/coding-billing-insurance/cpt.page</url>]</note></bibl><bibl id="B34"><title><p>HCUP Tools and Software</p></title><note>[
					<url>http://www.hcup-us.ahrq.gov/tools_software.jsp</url>]</note></bibl><bibl id="B35"><aug><au><snm>Hilbe</snm><fnm>JM</fnm></au></aug><source>Negative Binomial Regression</source><publisher>New York: Cambridge University Press</publisher><edition>2</edition><pubdate>2011</pubdate></bibl><bibl id="B36"><title><p>Relationship between nursing documentation and patients&#8217; mortality</p></title><aug><au><snm>Collins</snm><fnm>SA</fnm></au><au><snm>Cato</snm><fnm>K</fnm></au><au><snm>Albers</snm><fnm>D</fnm></au><au><snm>Scott</snm><fnm>K</fnm></au><au><snm>Stetson</snm><fnm>PD</fnm></au><au><snm>Bakken</snm><fnm>S</fnm></au><au><snm>Vawdrey</snm><fnm>DK</fnm></au></aug><source>Am J Crit Care</source><pubdate>2013</pubdate><volume>22</volume><issue>4</issue><fpage>306</fpage><lpage>313</lpage></bibl><bibl id="B37"><title><p>&#8220;Reading between the lines&#8221; of flow sheet data: nurses&#8217; optional documentation associated with cardiac arrest outcomes</p></title><aug><au><snm>Collins</snm><fnm>SA</fnm></au><au><snm>Vawdrey</snm><fnm>DK</fnm></au></aug><source>Appl Nurs Res</source><pubdate>2012</pubdate><volume>25</volume><issue>4</issue><fpage>251</fpage><lpage>257</lpage></bibl><bibl id="B38"><title><p>Bias associated with mining electronic health records</p></title><aug><au><snm>Hripcsak</snm><fnm>G</fnm></au><au><snm>Knirsch</snm><fnm>C</fnm></au><au><snm>Zhou</snm><fnm>L</fnm></au><au><snm>Wilcox</snm><fnm>A</fnm></au><au><snm>Melton</snm><fnm>G</fnm></au></aug><source>J Biomed Discov Collab</source><pubdate>2011</pubdate><volume>6</volume><fpage>48</fpage><lpage>52</lpage></bibl></refgrp>
	<sec><st><p>Pre-publication history</p></st><p>The pre-publication history for this paper can be accessed here:</p><p><url>http://www.biomedcentral.com/1472-6947/14/51/prepub</url></p></sec></bm>
</art>