TY - JOUR
T1 - Maximal subspace co-regulated gene clustering
AU - Zhao, Yuhai
AU - Xu, Jeffrey Yu
AU - Wang, Guoren
AU - Chen, Lei
AU - Wang, Bin
AU - Yu, Ge
PY - 2008/1
Y1 - 2008/1
N2 - Clustering is a popular technique for analyzing microarray datasets, with n genes and m experimental conditions. As explored by biologists, there is a real need to identify co-regulated gene clusters, which include both positive/negative regulated gene clusters. The existing pattern-based and tendency-based clustering approaches cannot be directly applied to find such coregulated gene clusters, because they are designed for finding positive regulated gene clusters. In this paper, in order to cluster co-regulated genes, we propose a coding scheme which allows us to cluster two genes into the same cluster if they have the same code, where two genes that have the same code can be either positive or negative regulated. Based on the coding scheme, we propose a new algorithm to find maximal subspace coregulated gene clusters with new pruning techniques. A maximal subspace co-regulated gene cluster clusters a set of genes on a condition sequence such that the cluster is not included in any other subspace co-regulated gene clusters. We conduct extensive experimental studies. Our approach can effectively and efficiently find maximal subspace co-regulated gene clusters. In addition, our approach outperforms the existing approaches for finding positive regulated gene clusters.
AB - Clustering is a popular technique for analyzing microarray datasets, with n genes and m experimental conditions. As explored by biologists, there is a real need to identify co-regulated gene clusters, which include both positive/negative regulated gene clusters. The existing pattern-based and tendency-based clustering approaches cannot be directly applied to find such coregulated gene clusters, because they are designed for finding positive regulated gene clusters. In this paper, in order to cluster co-regulated genes, we propose a coding scheme which allows us to cluster two genes into the same cluster if they have the same code, where two genes that have the same code can be either positive or negative regulated. Based on the coding scheme, we propose a new algorithm to find maximal subspace coregulated gene clusters with new pruning techniques. A maximal subspace co-regulated gene cluster clusters a set of genes on a condition sequence such that the cluster is not included in any other subspace co-regulated gene clusters. We conduct extensive experimental studies. Our approach can effectively and efficiently find maximal subspace co-regulated gene clusters. In addition, our approach outperforms the existing approaches for finding positive regulated gene clusters.
KW - Clustering
KW - Coregulated gene clustering
KW - Gene expression data
UR - https://www.webofscience.com/wos/woscc/full-record/WOS:000251003300007
UR - https://openalex.org/W2121246530
UR - https://www.scopus.com/pages/publications/36649005562
U2 - 10.1109/TKDE.2007.190670
DO - 10.1109/TKDE.2007.190670
M3 - Journal Article
SN - 1041-4347
VL - 20
SP - 83
EP - 98
JO - IEEE Transactions on Knowledge and Data Engineering
JF - IEEE Transactions on Knowledge and Data Engineering
IS - 1
ER -