HadoopDB in action: Building real world applications

Azza Abouzied, Kamil Bajda-Pawlikowski, Jiewen Huang, Daniel J. Abadi, Avi Silberschatz

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

HadoopDB is a hybrid of MapReduce and DBMS technologies, designed to meet the growing demand of analyzing massive datasets on very large clusters of machines. Our previous work has shown that HadoopDB approaches parallel databases in performance and still yields the scalability and fault tolerance of MapReduce-based systems. In this demonstration, we focus on HadoopDB's flexible architecture and versatility with two real world application scenarios: a semantic web data application for protein sequence analysis and a business data warehousing application based on TPC-H. The demonstration offers a thorough walk-through of how to easily build applications on top of HadoopDB.

Original languageEnglish (US)
Title of host publicationProceedings of the 2010 International Conference on Management of Data, SIGMOD '10
Pages1111-1113
Number of pages3
DOIs
Publication statusPublished - Jul 23 2010
Event2010 International Conference on Management of Data, SIGMOD '10 - Indianapolis, IN, United States
Duration: Jun 6 2010Jun 11 2010

Other

Other2010 International Conference on Management of Data, SIGMOD '10
CountryUnited States
CityIndianapolis, IN
Period6/6/106/11/10

    Fingerprint

Keywords

  • hadoop
  • hadoopdb
  • hive
  • mapreduce
  • parallel database
  • semantic web
  • tpc-h
  • uniprot

ASJC Scopus subject areas

  • Software
  • Information Systems

Cite this

Abouzied, A., Bajda-Pawlikowski, K., Huang, J., Abadi, D. J., & Silberschatz, A. (2010). HadoopDB in action: Building real world applications. In Proceedings of the 2010 International Conference on Management of Data, SIGMOD '10 (pp. 1111-1113) https://doi.org/10.1145/1807167.1807294