2013 Articles
Hierarchical Dirichlet process model for gene expression clustering
Clustering is an important data processing tool for interpreting microarray data and genomic network inference. In this article, we propose a clustering algorithm based on the hierarchical Dirichlet processes (HDP). The HDP clustering introduces a hierarchical structure in the statistical model which captures the hierarchical features prevalent in biological data such as the gene express data. We develop a Gibbs sampling algorithm based on the Chinese restaurant metaphor for the HDP clustering. We apply the proposed HDP algorithm to both regulatory network segmentation and gene expression clustering. The HDP algorithm is shown to outperform several popular clustering algorithms by revealing the underlying hierarchical structure of the data. For the yeast cell cycle data, we compare the HDP result to the standard result and show that the HDP algorithm provides more information and reduces the unnecessary clustering fragments.
Subjects
Files
-
5d67fb7d8f972e21fe74cbecd2e8dec0.zip application/zip 557 KB Download File
-
1687-4153-2013-5.pdf application/pdf 629 KB Download File
-
1687-4153-2013-5.xml application/xml 319 KB Download File
Also Published In
- Title
- EURASIP Journal on Bioinformatics and Systems Biology
- DOI
- https://doi.org/10.1186/1687-4153-2013-5
More About This Work
- Academic Units
- Electrical Engineering
- Publisher
- Springer
- Published Here
- September 8, 2014