2009 Articles
Characterizing environmental and phenotypic associations using information theory and electronic health records
The availability of up-to-date, executable, evidence-based medical knowledge is essential for many clinical applications, such as pharmacovigilance, but executable knowledge is costly to obtain and update. Automated acquisition of environmental and phenotypic associations in biomedical and clinical documents using text mining has showed some success. The usefulness of the association knowledge is limited, however, due to the fact that the specific relationships between clinical entities remain unknown. In particular, some associations are indirect relations due to interdependencies among the data. In this work, we develop methods using mutual information (MI) and its property, the data processing inequality (DPI), to help characterize associations that were generated based on use of natural language processing to encode clinical information in narrative patient records followed by statistical methods. Evaluation based on a random sample consisting of two drugs and two diseases indicates an overall precision of 81%. This preliminary study demonstrates that the proposed method is effective for helping to characterize phenotypic and environmental associations obtained from clinical reports.
Subjects
Files
- f1d0416bf601e177e722d44fdec1cb11.zip application/zip 230 KB Download File
- 1471-2105-10-S9-S13.pdf application/pdf 251 KB Download File
- 1471-2105-10-S9-S13.xml application/xml 67.5 KB Download File
Also Published In
- Title
- BMC Bioinformatics
- DOI
- https://doi.org/10.1186/1471-2105-10-S9-S13
More About This Work
- Academic Units
- Biomedical Informatics
- Publisher
- BioMed Central
- Published Here
- September 8, 2014