A Discriminative Model for Identifying Spatial cis-Regulatory Modules

TitleA Discriminative Model for Identifying Spatial cis-Regulatory Modules
Publication TypeTechnical Report
Year of Publication2003
AuthorsSegal, E., & Sharan R.
Other Numbers1209
Abstract

Transcriptional regulation is mediated by the coordinated binding of transcription factors to the upstream region of genes. In higher eukaryotes, the binding sites of cooperating transcription factors are organized into short sequence units, called cis-regulatory modules. In this paper we propose a method for identifying modules of transcription factor binding sites in a set of co-regulated genes, using only the raw sequence data as input. Our method is based on a novel probabilistic model that describes the mechanism of cis-regulation, including the binding sites of cooperating transcription factors, the organization of these binding sites into short sequence modules, and the regulation of a gene by its modules. We show that our method is successful in discovering planted modules in simulated data and known modules in yeast. More importantly, we applied our method to a large collection of human gene sets, and found 83 significant cis-regulatory modules, which included 36 known motifs and many novel ones. Thus, our results provide one of the first comprehensive compendiums of putative cis-regulatory modules in human.

URLhttp://www.icsi.berkeley.edu/ftp/global/pub/techreports/2003/tr-03-004.pdf
Bibliographic Notes

ICSI Technical Report TR-03-004

Abbreviated Authors

E. Segal and R. Sharan

ICSI Research Group

Algorithms

ICSI Publication Type

Technical Report