Articles

Copy number variation genotyping using family information

Darvishi, Katayoon; Mills, Ryan E.; Lee, Charles; Raby, Benjamin A.; Chu, Jen-hwa; Rogers, Angela; Ionita-Laza, Iuliana

Background: In recent years there has been a growing interest in the role of copy number variations (CNV) in genetic diseases. Though there has been rapid development of technologies and statistical methods devoted to detection in CNVs from array data, the inherent challenges in data quality associated with most hybridization techniques remains a challenging problem in CNV association studies. Results: To help address these data quality issues in the context of family-based association studies, we introduce a statistical framework for the intensity-based array data that takes into account the family information for copy-number assignment. The method is an adaptation of traditional methods for modeling SNP genotype data that assume Gaussian mixture model, whereby CNV calling is performed for all family members simultaneously and leveraging within family-data to reduce CNV calls that are incompatible with Mendelian inheritance while still allowing de-novo CNVs. Applying this method to simulation studies and a genome-wide association study in asthma, we find that our approach significantly improves CNV calls accuracy, and reduces the Mendelian inconsistency rates and false positive genotype calls. The results were validated using qPCR experiments. Conclusions: In conclusion, we have demonstrated that the use of family information can improve the quality of CNV calling and hopefully give more powerful association test of CNVs.

Subjects

Files

  • thumnail for 3726ddcb24b37c8e3ee574a0e9ec7f4a.zip 3726ddcb24b37c8e3ee574a0e9ec7f4a.zip application/zip 3.18 MB Download File

Also Published In

Title
BMC Bioinformatics
DOI
https://doi.org/10.1186/1471-2105-14-157

More About This Work

Academic Units
Mailman School of Public Health
Publisher
BioMed Central
Published Here
September 8, 2014