Design and analysis of bar-seq experiments

David G. Robinson, Wei Chen, John D. Storey, David Gresham

Research output: Contribution to journalArticle

Abstract

High-throughput quantitative DNA sequencing enables the parallel phenotyping of pools of thousands of mutants. However, the appropriate analytical methods and experimental design that maximize the efficiency of these methods while maintaining statistical power are currently unknown. Here, we have used Bar-seq analysis of the Saccharomyces cerevisiae yeast deletion library to systematically test the effect of experimental design parameters and sequence read depth on experimental results. We present computational methods that efficiently and accurately estimate effect sizes and their statistical significance by adapting existing methods for RNA-seq analysis. Using simulated variation of experimental designs, we found that biological replicates are critical for statistical analysis of Bar-seq data, whereas technical replicates are of less value. By subsampling sequence reads, we found that when using four-fold biological replication, 6 million reads per condition achieved 96% power to detect a two-fold change (or more) at a 5% false discovery rate. Our guidelines for experimental design and computational analysis enables the study of the yeast deletion collection in up to 30 different conditions in a single sequencing lane. These findings are relevant to a variety of pooled genetic screening methods that use high-throughput quantitative DNA sequencing, including Tn-seq.

Original languageEnglish (US)
Pages (from-to)11-18
Number of pages8
JournalG3: Genes, Genomes, Genetics
Volume4
Issue number1
DOIs
StatePublished - 2014

Fingerprint

Research Design
High-Throughput Nucleotide Sequencing
Yeasts
Genetic Testing
Libraries
Saccharomyces cerevisiae
Guidelines
RNA

Keywords

  • Bar-seq
  • Cerevisiae
  • Functional
  • Galactose
  • Genomics
  • Sacchromyces
  • Yeast

ASJC Scopus subject areas

  • Genetics
  • Molecular Biology
  • Genetics(clinical)

Cite this

Design and analysis of bar-seq experiments. / Robinson, David G.; Chen, Wei; Storey, John D.; Gresham, David.

In: G3: Genes, Genomes, Genetics, Vol. 4, No. 1, 2014, p. 11-18.

Research output: Contribution to journalArticle

Robinson, David G. ; Chen, Wei ; Storey, John D. ; Gresham, David. / Design and analysis of bar-seq experiments. In: G3: Genes, Genomes, Genetics. 2014 ; Vol. 4, No. 1. pp. 11-18.
@article{caec933a3e4e431fa1ca32cfc090e69c,
title = "Design and analysis of bar-seq experiments",
abstract = "High-throughput quantitative DNA sequencing enables the parallel phenotyping of pools of thousands of mutants. However, the appropriate analytical methods and experimental design that maximize the efficiency of these methods while maintaining statistical power are currently unknown. Here, we have used Bar-seq analysis of the Saccharomyces cerevisiae yeast deletion library to systematically test the effect of experimental design parameters and sequence read depth on experimental results. We present computational methods that efficiently and accurately estimate effect sizes and their statistical significance by adapting existing methods for RNA-seq analysis. Using simulated variation of experimental designs, we found that biological replicates are critical for statistical analysis of Bar-seq data, whereas technical replicates are of less value. By subsampling sequence reads, we found that when using four-fold biological replication, 6 million reads per condition achieved 96{\%} power to detect a two-fold change (or more) at a 5{\%} false discovery rate. Our guidelines for experimental design and computational analysis enables the study of the yeast deletion collection in up to 30 different conditions in a single sequencing lane. These findings are relevant to a variety of pooled genetic screening methods that use high-throughput quantitative DNA sequencing, including Tn-seq.",
keywords = "Bar-seq, Cerevisiae, Functional, Galactose, Genomics, Sacchromyces, Yeast",
author = "Robinson, {David G.} and Wei Chen and Storey, {John D.} and David Gresham",
year = "2014",
doi = "10.1534/g3.113.008565",
language = "English (US)",
volume = "4",
pages = "11--18",
journal = "G3 (Bethesda, Md.)",
issn = "2160-1836",
publisher = "Genetics Society of America",
number = "1",

}

TY - JOUR

T1 - Design and analysis of bar-seq experiments

AU - Robinson, David G.

AU - Chen, Wei

AU - Storey, John D.

AU - Gresham, David

PY - 2014

Y1 - 2014

N2 - High-throughput quantitative DNA sequencing enables the parallel phenotyping of pools of thousands of mutants. However, the appropriate analytical methods and experimental design that maximize the efficiency of these methods while maintaining statistical power are currently unknown. Here, we have used Bar-seq analysis of the Saccharomyces cerevisiae yeast deletion library to systematically test the effect of experimental design parameters and sequence read depth on experimental results. We present computational methods that efficiently and accurately estimate effect sizes and their statistical significance by adapting existing methods for RNA-seq analysis. Using simulated variation of experimental designs, we found that biological replicates are critical for statistical analysis of Bar-seq data, whereas technical replicates are of less value. By subsampling sequence reads, we found that when using four-fold biological replication, 6 million reads per condition achieved 96% power to detect a two-fold change (or more) at a 5% false discovery rate. Our guidelines for experimental design and computational analysis enables the study of the yeast deletion collection in up to 30 different conditions in a single sequencing lane. These findings are relevant to a variety of pooled genetic screening methods that use high-throughput quantitative DNA sequencing, including Tn-seq.

AB - High-throughput quantitative DNA sequencing enables the parallel phenotyping of pools of thousands of mutants. However, the appropriate analytical methods and experimental design that maximize the efficiency of these methods while maintaining statistical power are currently unknown. Here, we have used Bar-seq analysis of the Saccharomyces cerevisiae yeast deletion library to systematically test the effect of experimental design parameters and sequence read depth on experimental results. We present computational methods that efficiently and accurately estimate effect sizes and their statistical significance by adapting existing methods for RNA-seq analysis. Using simulated variation of experimental designs, we found that biological replicates are critical for statistical analysis of Bar-seq data, whereas technical replicates are of less value. By subsampling sequence reads, we found that when using four-fold biological replication, 6 million reads per condition achieved 96% power to detect a two-fold change (or more) at a 5% false discovery rate. Our guidelines for experimental design and computational analysis enables the study of the yeast deletion collection in up to 30 different conditions in a single sequencing lane. These findings are relevant to a variety of pooled genetic screening methods that use high-throughput quantitative DNA sequencing, including Tn-seq.

KW - Bar-seq

KW - Cerevisiae

KW - Functional

KW - Galactose

KW - Genomics

KW - Sacchromyces

KW - Yeast

UR - http://www.scopus.com/inward/record.url?scp=84892705629&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84892705629&partnerID=8YFLogxK

U2 - 10.1534/g3.113.008565

DO - 10.1534/g3.113.008565

M3 - Article

VL - 4

SP - 11

EP - 18

JO - G3 (Bethesda, Md.)

JF - G3 (Bethesda, Md.)

SN - 2160-1836

IS - 1

ER -