Maximal subspace co-regulated gene clustering

Yuhai Zhao*, Jeffrey Yu Xu, Guoren Wang, Lei Chen, Bin Wang, Ge Yu

*Corresponding author for this work

Research output: Contribution to journal, Journal Article, peer-review

51 Citations (Scopus)

Abstract

Clustering is a popular technique for analyzing microarray datasets of n genes and m experimental conditions. As biologists have observed, there is a real need to identify co-regulated gene clusters, which include both positively and negatively regulated gene clusters. The existing pattern-based and tendency-based clustering approaches cannot be directly applied to find such co-regulated gene clusters, because they are designed to find positively regulated gene clusters. In this paper, to cluster co-regulated genes, we propose a coding scheme that places two genes in the same cluster if they have the same code, where two genes with the same code can be either positively or negatively regulated. Based on this coding scheme, we propose a new algorithm with new pruning techniques to find maximal subspace co-regulated gene clusters. A maximal subspace co-regulated gene cluster groups a set of genes on a condition sequence such that the cluster is not included in any other subspace co-regulated gene cluster. Extensive experimental studies show that our approach finds maximal subspace co-regulated gene clusters effectively and efficiently. In addition, our approach outperforms the existing approaches for finding positively regulated gene clusters.
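The abstract does not spell out the coding scheme, so the following is only an illustrative sketch of the general idea, not the paper's algorithm. It assumes (hypothetically) that a gene's code is the sign of its expression change between adjacent conditions, and that a code and its sign-flipped mirror are treated as the same code so that positively and negatively regulated genes land in one group. The function names `tendency_code`, `canonical_code`, and `group_coregulated` are invented for this example.

```python
from collections import defaultdict

def tendency_code(expr):
    """Encode a profile as the sign (+1, 0, -1) of each change
    between adjacent conditions (an assumed, simplified coding)."""
    return tuple((b > a) - (b < a) for a, b in zip(expr, expr[1:]))

def canonical_code(code):
    """Map a code and its sign-flipped mirror to one representative,
    so that inversely regulated genes share the same code."""
    flipped = tuple(-s for s in code)
    return max(code, flipped)

def group_coregulated(profiles):
    """Group gene names by canonical code over the full condition sequence."""
    groups = defaultdict(list)
    for gene, expr in profiles.items():
        groups[canonical_code(tendency_code(expr))].append(gene)
    return dict(groups)

# Hypothetical toy data: g2 moves opposite to g1 at every step.
profiles = {
    "g1": [1, 3, 2, 5],
    "g2": [9, 4, 6, 1],
    "g3": [1, 2, 3, 4],
}
print(group_coregulated(profiles))
# {(1, -1, 1): ['g1', 'g2'], (1, 1, 1): ['g3']}
```

In this toy run, g1 and g2 fall into one group even though their profiles are mirror images, while g3 is kept apart. The sketch works on the full condition sequence only; searching subspaces of conditions, pruning, and the maximality check described in the abstract are not covered here.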

Original language: English
Pages (from-to): 83-98
Number of pages: 16
Journal: IEEE Transactions on Knowledge and Data Engineering
Volume: 20
Issue number: 1
Publication status: Published - Jan 2008

Keywords

  • Clustering
  • Coregulated gene clustering
  • Gene expression data
