Functional and evolutionary inference in gene networks: Does topology matter?

Mark L. Siegal, Daniel E L Promislow, Aviv Bergman

Research output: Contribution to journal (Article)

Abstract

The relationship between the topology of a biological network and its functional or evolutionary properties has attracted much recent interest. It has been suggested that most, if not all, biological networks are 'scale free.' That is, their connections follow power-law distributions, such that there are very few nodes with very many connections and vice versa. The number of target genes of known transcriptional regulators in the yeast, Saccharomyces cerevisiae, appears to follow such a distribution, as do other networks, such as the yeast network of protein-protein interactions. These findings have inspired attempts to draw biological inferences from general properties associated with scale-free network topology. One often cited general property is that, when compromised, highly connected nodes will tend to have a larger effect on network function than sparsely connected nodes. For example, more highly connected proteins are more likely to be lethal when knocked out. However, the correlation between lethality and connectivity is relatively weak, and some highly connected proteins can be removed without noticeable phenotypic effect. Similarly, network topology only weakly predicts the response of gene expression to environmental perturbations. Evolutionary simulations of gene-regulatory networks, presented here, suggest that such weak or non-existent correlations are to be expected, and are likely not due to inadequacy of experimental data. We argue that 'top-down' inferences of biological properties based on simple measures of network topology are of limited utility, and we present simulation results suggesting that much more detailed information about a gene's location in a regulatory network, as well as dynamic gene-expression data, are needed to make more meaningful functional and evolutionary predictions. 
Specifically, we find in our simulations that: (1) the relationship between a gene's connectivity and its fitness effect upon knockout depends on its equilibrium expression level; (2) correlation between connectivity and genetic variation is virtually non-existent, yet upon independent evolution of networks with identical topologies, some nodes exhibit consistently low or high polymorphism; and (3) certain genes show low polymorphism yet high divergence among independent evolutionary runs. This latter pattern is generally taken as a signature of positive selection, but in our simulations its cause is often neutral coevolution of regulatory inputs to the same gene.
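As an illustration of the 'scale free' property described in the abstract, the following is a minimal sketch, not the authors' simulation model, that draws node degrees from a truncated power law P(k) proportional to k^(-gamma) and summarizes how rare highly connected nodes are. The function names, the exponent `gamma=2.5`, the cutoff `k_max=100`, and the hub threshold of 10 connections are all assumptions chosen for illustration; none of them come from the paper.

```python
import random
from collections import Counter


def sample_power_law_degrees(n_nodes, gamma=2.5, k_max=100, seed=0):
    """Draw n_nodes degrees k in 1..k_max with P(k) proportional to k**(-gamma).

    gamma, k_max and seed are illustrative assumptions, not values from the paper.
    """
    rng = random.Random(seed)
    ks = list(range(1, k_max + 1))
    weights = [k ** -gamma for k in ks]  # unnormalized power-law weights
    return rng.choices(ks, weights=weights, k=n_nodes)


def degree_summary(degrees, hub_threshold=10):
    """Report the share of singly connected nodes and of 'hubs' (hypothetical cutoff)."""
    counts = Counter(degrees)
    n = len(degrees)
    hubs = sum(c for k, c in counts.items() if k >= hub_threshold)
    return {
        "fraction_degree_1": counts[1] / n,
        "fraction_hubs": hubs / n,
        "max_degree": max(degrees),
    }


degrees = sample_power_law_degrees(5000)
summary = degree_summary(degrees)
print(summary)
```

Under these assumed parameters, most sampled nodes have a single connection and only a small fraction qualify as hubs, which is the qualitative pattern the abstract refers to. The sketch says nothing about knockout effects or polymorphism; those depend on network dynamics that a degree distribution alone does not capture, which is the abstract's central argument.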

Original language: English (US)
Pages (from-to): 83-103
Number of pages: 21
Journal: Genetica
Volume: 129
Issue number: 1
DOIs: 10.1007/s10709-006-0035-0
State: Published - Jan 2007

Fingerprint

Gene Regulatory Networks
topology
gene
Genes
connectivity
general property
protein
genes
gene expression
simulation
yeast
genetic polymorphism
yeasts
polymorphism
Gene Expression
Proteins
Fungal Proteins
protein-protein interactions
coevolution
power law distribution

Keywords

  • Developmental systems drift
  • Evolutionary systems biology
  • Gene network

ASJC Scopus subject areas

  • Genetics
  • Ecology, Evolution, Behavior and Systematics

Cite this

Functional and evolutionary inference in gene networks: Does topology matter? / Siegal, Mark L.; Promislow, Daniel E L; Bergman, Aviv.

In: Genetica, Vol. 129, No. 1, 01.2007, p. 83-103.

Siegal, Mark L.; Promislow, Daniel E L; Bergman, Aviv. / Functional and evolutionary inference in gene networks: Does topology matter? In: Genetica. 2007; Vol. 129, No. 1. pp. 83-103.
@article{83abdd5bcaa24d6ea08df1b2590affb7,
title = "Functional and evolutionary inference in gene networks: Does topology matter?",
abstract = "The relationship between the topology of a biological network and its functional or evolutionary properties has attracted much recent interest. It has been suggested that most, if not all, biological networks are 'scale free.' That is, their connections follow power-law distributions, such that there are very few nodes with very many connections and vice versa. The number of target genes of known transcriptional regulators in the yeast, Saccharomyces cerevisiae, appears to follow such a distribution, as do other networks, such as the yeast network of protein-protein interactions. These findings have inspired attempts to draw biological inferences from general properties associated with scale-free network topology. One often cited general property is that, when compromised, highly connected nodes will tend to have a larger effect on network function than sparsely connected nodes. For example, more highly connected proteins are more likely to be lethal when knocked out. However, the correlation between lethality and connectivity is relatively weak, and some highly connected proteins can be removed without noticeable phenotypic effect. Similarly, network topology only weakly predicts the response of gene expression to environmental perturbations. Evolutionary simulations of gene-regulatory networks, presented here, suggest that such weak or non-existent correlations are to be expected, and are likely not due to inadequacy of experimental data. We argue that 'top-down' inferences of biological properties based on simple measures of network topology are of limited utility, and we present simulation results suggesting that much more detailed information about a gene's location in a regulatory network, as well as dynamic gene-expression data, are needed to make more meaningful functional and evolutionary predictions. 
Specifically, we find in our simulations that: (1) the relationship between a gene's connectivity and its fitness effect upon knockout depends on its equilibrium expression level; (2) correlation between connectivity and genetic variation is virtually non-existent, yet upon independent evolution of networks with identical topologies, some nodes exhibit consistently low or high polymorphism; and (3) certain genes show low polymorphism yet high divergence among independent evolutionary runs. This latter pattern is generally taken as a signature of positive selection, but in our simulations its cause is often neutral coevolution of regulatory inputs to the same gene.",
keywords = "Developmental systems drift, Evolutionary systems biology, Gene network",
author = "Siegal, {Mark L.} and Promislow, {Daniel E L} and Aviv Bergman",
year = "2007",
month = "1",
doi = "10.1007/s10709-006-0035-0",
language = "English (US)",
volume = "129",
pages = "83--103",
journal = "Genetica",
issn = "0016-6707",
publisher = "Springer Netherlands",
number = "1",

}

TY - JOUR
T1 - Functional and evolutionary inference in gene networks
T2 - Does topology matter?
AU - Siegal, Mark L.
AU - Promislow, Daniel E L
AU - Bergman, Aviv
PY - 2007/1
Y1 - 2007/1

N2 - The relationship between the topology of a biological network and its functional or evolutionary properties has attracted much recent interest. It has been suggested that most, if not all, biological networks are 'scale free.' That is, their connections follow power-law distributions, such that there are very few nodes with very many connections and vice versa. The number of target genes of known transcriptional regulators in the yeast, Saccharomyces cerevisiae, appears to follow such a distribution, as do other networks, such as the yeast network of protein-protein interactions. These findings have inspired attempts to draw biological inferences from general properties associated with scale-free network topology. One often cited general property is that, when compromised, highly connected nodes will tend to have a larger effect on network function than sparsely connected nodes. For example, more highly connected proteins are more likely to be lethal when knocked out. However, the correlation between lethality and connectivity is relatively weak, and some highly connected proteins can be removed without noticeable phenotypic effect. Similarly, network topology only weakly predicts the response of gene expression to environmental perturbations. Evolutionary simulations of gene-regulatory networks, presented here, suggest that such weak or non-existent correlations are to be expected, and are likely not due to inadequacy of experimental data. We argue that 'top-down' inferences of biological properties based on simple measures of network topology are of limited utility, and we present simulation results suggesting that much more detailed information about a gene's location in a regulatory network, as well as dynamic gene-expression data, are needed to make more meaningful functional and evolutionary predictions. 
Specifically, we find in our simulations that: (1) the relationship between a gene's connectivity and its fitness effect upon knockout depends on its equilibrium expression level; (2) correlation between connectivity and genetic variation is virtually non-existent, yet upon independent evolution of networks with identical topologies, some nodes exhibit consistently low or high polymorphism; and (3) certain genes show low polymorphism yet high divergence among independent evolutionary runs. This latter pattern is generally taken as a signature of positive selection, but in our simulations its cause is often neutral coevolution of regulatory inputs to the same gene.

KW - Developmental systems drift
KW - Evolutionary systems biology
KW - Gene network
UR - http://www.scopus.com/inward/record.url?scp=33845637127&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33845637127&partnerID=8YFLogxK
U2 - 10.1007/s10709-006-0035-0
DO - 10.1007/s10709-006-0035-0
M3 - Article
VL - 129
SP - 83
EP - 103
JO - Genetica
JF - Genetica
SN - 0016-6707
IS - 1
ER -