Abstract

Microarray experiments can reveal important information about transcriptional regulation. In our case, we look for potential promoter regulatory elements in the upstream region of coexpressed genes. Here we present two modifications of the original Gibbs sampling algorithm for motif finding (Lawrence et al., 1993). First, we introduce the use of a probability distribution to estimate the number of copies of the motif in a sequence. Second, we describe the technical aspects of the incorporation of a higher-order background model whose application we discussed in Thijs et al. (2001). Our implementation is referred to as the Motif Sampler. We successfully validate our algorithm on several data sets. First, we show results for three sets of upstream sequences containing known motifs: 1) the G-box light-response element in plants, 2) elements involved in methionine response in Saccharomyces cerevisiae, and 3) the FNR O(2)-responsive element in bacteria. We use these data sets to explain the influence of the parameters on the performance of our algorithm. Second, we show results for upstream sequences from four clusters of coexpressed genes identified in a microarray experiment on wounding in Arabidopsis thaliana. Several motifs could be matched to regulatory elements from plant defence pathways in our database of plant cis-acting regulatory elements (PlantCARE). Some other strong motifs do not have corresponding motifs in PlantCARE but are promising candidates for further analysis.

Keywords

GeneComputational biologyArabidopsisSequence motifUpstream (networking)BiologyRegulatory sequencePromoterGeneticsMicroarray analysis techniquesSaccharomyces cerevisiaeRegulation of gene expressionComputer scienceGene expressionMutant

Affiliated Institutions

Related Publications

Publication Info

Year
2002
Type
article
Volume
9
Issue
2
Pages
447-464
Citations
344
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

344
OpenAlex

Cite This

Gert Thijs, Kathleen Marchal, Magali Lescot et al. (2002). A Gibbs Sampling Method to Detect Overrepresented Motifs in the Upstream Regions of Coexpressed Genes. Journal of Computational Biology , 9 (2) , 447-464. https://doi.org/10.1089/10665270252935566

Identifiers

DOI
10.1089/10665270252935566