Allelic mapping bias in RNA-sequencing is not a major confounder in eQTL studies

Panousis, Nikolaos; Gutierrez-Arcelus, Maria; Dermitzakis, Emmanouil; Lappalainen, Tuuli E.

RNA sequencing (RNA-seq) is the current gold-standard method to quantify gene expression for expression quantitative trait locus (eQTL) studies. However, a potential caveat in these studies is that RNA-seq reads carrying the non-reference allele of variant loci can have lower probability to map correctly to the reference genome, which could bias gene quantifications and cause false positive eQTL associations. In this study, we analyze the effect of this allelic mapping bias in eQTL discovery. We simulate RNA-seq read mapping over 9.5 M common SNPs and indels, with 15.6% of variants showing biased mapping rate for reference versus non-reference reads. However, removing potentially biased RNA-seq reads from an eQTL dataset of 185 individuals has a very small effect on gene and exon quantifications and eQTL discovery. We detect only a handful of likely false positive eQTLs, and overall eQTL SNPs show no significant enrichment for high mapping bias. Our results suggest that RNA-seq quantifications are generally robust against allelic mapping bias, and that this does not have a severe effect on eQTL discovery. Nevertheless, we provide our catalog of putatively biased loci to allow better controlling for mapping bias to obtain more accurate results in future RNA-seq studies.



  • thumnail for binary/octet-stream 2.47 MB Download File
  • thumnail for s13059-014-0467-2.pdf s13059-014-0467-2.pdf binary/octet-stream 2.05 MB Download File
  • thumnail for s13059-014-0467-2.xml s13059-014-0467-2.xml binary/octet-stream 96.8 KB Download File
  • thumnail for s13059-014-0467-2-S1.pdf s13059-014-0467-2-S1.pdf binary/octet-stream 557 KB Download File

Also Published In

More About This Work

Academic Units
Systems Biology
BioMed Central
Published Here
October 30, 2014