A catalog of stream processing optimizations

Martin Hirzel, Robert Soulé, Scott Schneider, Bugra Gedik, Robert Grimm

Research output: Contribution to journalArticle

Abstract

Various research communities have independently arrived at stream processing as a programming model for efficient and parallel computing. These communities include digital signal processing, databases, operating systems, and complex event processing. Since each community faces applications with challenging performance requirements, each of them has developed some of the same optimizations, but often with conflicting terminology and unstated assumptions. This article presents a survey of optimizations for stream processing. It is aimed both at users who need to understand and guide the system's optimizer and at implementers who need to make engineering tradeoffs. To consolidate terminology, this article is organized as a catalog, in a style similar to catalogs of design patterns or refactorings. To make assumptions explicit and help understand tradeoffs, each optimization is presented with its safety constraints (when does it preserve correctness?) and a profitability experiment (when does it improve performance?). We hope that this survey will help future streaming system builders to stand on the shoulders of giants from not just their own community.

Original languageEnglish (US)
Article number46
JournalACM Computing Surveys
Volume46
Issue number4
DOIs
Publication statusPublished - 2014

    Fingerprint

Keywords

  • Optimizations
  • Stream processing

ASJC Scopus subject areas

  • Computer Science(all)
  • Theoretical Computer Science

Cite this

Hirzel, M., Soulé, R., Schneider, S., Gedik, B., & Grimm, R. (2014). A catalog of stream processing optimizations. ACM Computing Surveys, 46(4), [46]. https://doi.org/10.1145/2528412