Articles

Grace-AKO: a novel and stable knockoff filter for variable selection incorporating gene network structures

Tian, Peixin; Hu, Yiqian; Liu, Zhonghua; Zhang, Yan D.

Motivation
Variable selection is a common statistical approach to identifying genes associated with clinical outcomes of scientific interest. There are thousands of genes in genomic studies, while only a limited number of individual samples are available. Therefore, it is important to develop a method to identify genes associated with outcomes of interest that can control finite-sample false discovery rate (FDR) in high-dimensional data settings.
Results
This article proposes a novel method named Grace-AKO for graph-constrained estimation (Grace), which incorporates aggregation of multiple knockoffs (AKO) with the network-constrained penalty. Grace-AKO can control FDR in finite-sample settings and improve model stability simultaneously. Simulation studies show that Grace-AKO has better performance in finite-sample FDR control than the original Grace model. We apply Grace-AKO to the prostate cancer data in The Cancer Genome Atlas program by incorporating prostate-specific antigen (PSA) pathways in the Kyoto Encyclopedia of Genes and Genomes as the prior information. Grace-AKO finally identifies 47 candidate genes associated with PSA level, and more than 75% of the detected genes can be validated.

Files

  • thumnail for 12859_2022_Article_5016.pdf 12859_2022_Article_5016.pdf application/pdf 926 KB Download File

Also Published In

Title
BMC Bioinformatics
DOI
https://doi.org/10.1186/s12859-022-05016-y

More About This Work

Published Here
July 22, 2024

Notes

False discovery rate, High-dimensional data, Variable selection, Graph-constrained