Abstract
Reconstructing and modeling regulatory networks is an active area of research in bioinformatics and systems biology. Hence, various computational methods have been published, often successfully modeling one aspect of regulatory control. Gene regulation, however, is a process that depends on many different components such as transcription factors (TFs), cis-regulatory motifs and their temporal and spatial coordination. Accordingly, a promising new direction for computational analysis is the incorporation of multiple data types to discover, for instance, cluster membership, the spatial organization of cis-regulatory motifs and TFs that bind to these motifs. Here, we present such a data-driven framework, comprising four stages, to infer gene regulatory networks (GRNs) by modeling: 1. motif presence in the promoter, 2. spatial motif arrangement in co-regulated genes, 3. TFs that bind the respective motifs, and 4. dynamic properties of the GRN. A novel method is presented in stage 2, where we optimize for the spatial motif properties: orientation, occurrence of multiple motifs, relative distance between two motifs and distance to the Transcription Start Site (TSS). To find optimal distance based properties in efficient time we describe a dynamic programming approach. To combine multiple motif properties that are shared by genes with similar expression profiles a Hill-climber is employed. Subsequently, in stage 3 and 4, we infer GRNs by assigning TFs to the derived motifs and model time-dependent regulatory relationships between them with the Inferelator approach. None of the stages require the user to manually adjust any parameter, and thus derived properties can be analyzed without the bias introduced by parametrization. We applied this approach to 5. cerevisiae data and obtained insight into individual and general properties of the spatial assembly of regulatory elements and inferred the corresponding GRN.
Original language | English (US) |
---|---|
Title of host publication | 8th IEEE International Conference on BioInformatics and BioEngineering, BIBE 2008 |
DOIs | |
State | Published - 2008 |
Event | 8th IEEE International Conference on BioInformatics and BioEngineering, BIBE 2008 - Athens, Greece Duration: Oct 8 2008 → Oct 10 2008 |
Other
Other | 8th IEEE International Conference on BioInformatics and BioEngineering, BIBE 2008 |
---|---|
Country | Greece |
City | Athens |
Period | 10/8/08 → 10/10/08 |
Fingerprint
ASJC Scopus subject areas
- Biotechnology
- Bioengineering
Cite this
Modeling gene regulation and spatial organization of sequence based motifs. / Supper, Jochen; Kampe, Claas Aufm; Wanke, Dierk; Berendzen, Kenneth W.; Harter, Klaus; Bonneau, Richard; Zell, Andreas.
8th IEEE International Conference on BioInformatics and BioEngineering, BIBE 2008. 2008. 4696696.Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
}
TY - GEN
T1 - Modeling gene regulation and spatial organization of sequence based motifs
AU - Supper, Jochen
AU - Kampe, Claas Aufm
AU - Wanke, Dierk
AU - Berendzen, Kenneth W.
AU - Harter, Klaus
AU - Bonneau, Richard
AU - Zell, Andreas
PY - 2008
Y1 - 2008
N2 - Reconstructing and modeling regulatory networks is an active area of research in bioinformatics and systems biology. Hence, various computational methods have been published, often successfully modeling one aspect of regulatory control. Gene regulation, however, is a process that depends on many different components such as transcription factors (TFs), cis-regulatory motifs and their temporal and spatial coordination. Accordingly, a promising new direction for computational analysis is the incorporation of multiple data types to discover, for instance, cluster membership, the spatial organization of cis-regulatory motifs and TFs that bind to these motifs. Here, we present such a data-driven framework, comprising four stages, to infer gene regulatory networks (GRNs) by modeling: 1. motif presence in the promoter, 2. spatial motif arrangement in co-regulated genes, 3. TFs that bind the respective motifs, and 4. dynamic properties of the GRN. A novel method is presented in stage 2, where we optimize for the spatial motif properties: orientation, occurrence of multiple motifs, relative distance between two motifs and distance to the Transcription Start Site (TSS). To find optimal distance based properties in efficient time we describe a dynamic programming approach. To combine multiple motif properties that are shared by genes with similar expression profiles a Hill-climber is employed. Subsequently, in stage 3 and 4, we infer GRNs by assigning TFs to the derived motifs and model time-dependent regulatory relationships between them with the Inferelator approach. None of the stages require the user to manually adjust any parameter, and thus derived properties can be analyzed without the bias introduced by parametrization. We applied this approach to 5. cerevisiae data and obtained insight into individual and general properties of the spatial assembly of regulatory elements and inferred the corresponding GRN.
AB - Reconstructing and modeling regulatory networks is an active area of research in bioinformatics and systems biology. Hence, various computational methods have been published, often successfully modeling one aspect of regulatory control. Gene regulation, however, is a process that depends on many different components such as transcription factors (TFs), cis-regulatory motifs and their temporal and spatial coordination. Accordingly, a promising new direction for computational analysis is the incorporation of multiple data types to discover, for instance, cluster membership, the spatial organization of cis-regulatory motifs and TFs that bind to these motifs. Here, we present such a data-driven framework, comprising four stages, to infer gene regulatory networks (GRNs) by modeling: 1. motif presence in the promoter, 2. spatial motif arrangement in co-regulated genes, 3. TFs that bind the respective motifs, and 4. dynamic properties of the GRN. A novel method is presented in stage 2, where we optimize for the spatial motif properties: orientation, occurrence of multiple motifs, relative distance between two motifs and distance to the Transcription Start Site (TSS). To find optimal distance based properties in efficient time we describe a dynamic programming approach. To combine multiple motif properties that are shared by genes with similar expression profiles a Hill-climber is employed. Subsequently, in stage 3 and 4, we infer GRNs by assigning TFs to the derived motifs and model time-dependent regulatory relationships between them with the Inferelator approach. None of the stages require the user to manually adjust any parameter, and thus derived properties can be analyzed without the bias introduced by parametrization. We applied this approach to 5. cerevisiae data and obtained insight into individual and general properties of the spatial assembly of regulatory elements and inferred the corresponding GRN.
UR - http://www.scopus.com/inward/record.url?scp=67549125551&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=67549125551&partnerID=8YFLogxK
U2 - 10.1109/BIBE.2008.4696696
DO - 10.1109/BIBE.2008.4696696
M3 - Conference contribution
AN - SCOPUS:67549125551
SN - 9781424428458
BT - 8th IEEE International Conference on BioInformatics and BioEngineering, BIBE 2008
ER -