2022 Articles
Grace-AKO: a novel and stable knockoff filter for variable selection incorporating gene network structures
Motivation
Variable selection is a common statistical approach to identifying genes associated with clinical outcomes of scientific interest. There are thousands of genes in genomic studies, while only a limited number of individual samples are available. Therefore, it is important to develop a method to identify genes associated with outcomes of interest that can control finite-sample false discovery rate (FDR) in high-dimensional data settings.
Results
This article proposes a novel method named Grace-AKO for graph-constrained estimation (Grace), which incorporates aggregation of multiple knockoffs (AKO) with the network-constrained penalty. Grace-AKO can control FDR in finite-sample settings and improve model stability simultaneously. Simulation studies show that Grace-AKO has better performance in finite-sample FDR control than the original Grace model. We apply Grace-AKO to the prostate cancer data in The Cancer Genome Atlas program by incorporating prostate-specific antigen (PSA) pathways in the Kyoto Encyclopedia of Genes and Genomes as the prior information. Grace-AKO finally identifies 47 candidate genes associated with PSA level, and more than 75% of the detected genes can be validated.
Subjects
Files
- 12859_2022_Article_5016.pdf application/pdf 926 KB Download File
Also Published In
- Title
- BMC Bioinformatics
- DOI
- https://doi.org/10.1186/s12859-022-05016-y
More About This Work
- Published Here
- July 22, 2024
Related Items
Notes
False discovery rate, High-dimensional data, Variable selection, Graph-constrained