Diversity in Big Data: A Review

Marina Drosou, H. V. Jagadish, Evaggelia Pitoura, Julia Stoyanovich

    Research output: Contribution to journalReview article

    Abstract

    Big data technology offers unprecedented opportunities to society as a whole and also to its individual members. At the same time, this technology poses significant risks to those it overlooks. In this article, we give an overview of recent technical work on diversity, particularly in selection tasks, discuss connections between diversity and fairness, and identify promising directions for future work that will position diversity as an important component of a data-responsible society. We argue that diversity should come to the forefront of our discourse, for reasons that are both ethical - to mitigate the risks of exclusion - and utilitarian, to enable more powerful, accurate, and engaging data analysis and use.

    Original languageEnglish (US)
    Pages (from-to)73-84
    Number of pages12
    JournalBig Data
    Volume5
    Issue number2
    DOIs
    StatePublished - Jun 1 2017

    Fingerprint

    Big data
    Discourse
    Fairness
    Exclusion

    Keywords

    • data
    • diversity
    • empirical studies
    • models and algorithms
    • responsibly

    ASJC Scopus subject areas

    • Information Systems
    • Computer Science Applications
    • Information Systems and Management

    Cite this

    Drosou, M., Jagadish, H. V., Pitoura, E., & Stoyanovich, J. (2017). Diversity in Big Data: A Review. Big Data, 5(2), 73-84. https://doi.org/10.1089/big.2016.0054

    Diversity in Big Data : A Review. / Drosou, Marina; Jagadish, H. V.; Pitoura, Evaggelia; Stoyanovich, Julia.

    In: Big Data, Vol. 5, No. 2, 01.06.2017, p. 73-84.

    Research output: Contribution to journalReview article

    Drosou, M, Jagadish, HV, Pitoura, E & Stoyanovich, J 2017, 'Diversity in Big Data: A Review', Big Data, vol. 5, no. 2, pp. 73-84. https://doi.org/10.1089/big.2016.0054
    Drosou M, Jagadish HV, Pitoura E, Stoyanovich J. Diversity in Big Data: A Review. Big Data. 2017 Jun 1;5(2):73-84. https://doi.org/10.1089/big.2016.0054
    Drosou, Marina ; Jagadish, H. V. ; Pitoura, Evaggelia ; Stoyanovich, Julia. / Diversity in Big Data : A Review. In: Big Data. 2017 ; Vol. 5, No. 2. pp. 73-84.
    @article{9419c8c154144bcd9092907855f34ff2,
    title = "Diversity in Big Data: A Review",
    abstract = "Big data technology offers unprecedented opportunities to society as a whole and also to its individual members. At the same time, this technology poses significant risks to those it overlooks. In this article, we give an overview of recent technical work on diversity, particularly in selection tasks, discuss connections between diversity and fairness, and identify promising directions for future work that will position diversity as an important component of a data-responsible society. We argue that diversity should come to the forefront of our discourse, for reasons that are both ethical - to mitigate the risks of exclusion - and utilitarian, to enable more powerful, accurate, and engaging data analysis and use.",
    keywords = "data, diversity, empirical studies, models and algorithms, responsibly",
    author = "Marina Drosou and Jagadish, {H. V.} and Evaggelia Pitoura and Julia Stoyanovich",
    year = "2017",
    month = "6",
    day = "1",
    doi = "10.1089/big.2016.0054",
    language = "English (US)",
    volume = "5",
    pages = "73--84",
    journal = "Big Data",
    issn = "2167-6461",
    publisher = "Mary Ann Liebert Inc.",
    number = "2",

    }

    TY - JOUR

    T1 - Diversity in Big Data

    T2 - A Review

    AU - Drosou, Marina

    AU - Jagadish, H. V.

    AU - Pitoura, Evaggelia

    AU - Stoyanovich, Julia

    PY - 2017/6/1

    Y1 - 2017/6/1

    N2 - Big data technology offers unprecedented opportunities to society as a whole and also to its individual members. At the same time, this technology poses significant risks to those it overlooks. In this article, we give an overview of recent technical work on diversity, particularly in selection tasks, discuss connections between diversity and fairness, and identify promising directions for future work that will position diversity as an important component of a data-responsible society. We argue that diversity should come to the forefront of our discourse, for reasons that are both ethical - to mitigate the risks of exclusion - and utilitarian, to enable more powerful, accurate, and engaging data analysis and use.

    AB - Big data technology offers unprecedented opportunities to society as a whole and also to its individual members. At the same time, this technology poses significant risks to those it overlooks. In this article, we give an overview of recent technical work on diversity, particularly in selection tasks, discuss connections between diversity and fairness, and identify promising directions for future work that will position diversity as an important component of a data-responsible society. We argue that diversity should come to the forefront of our discourse, for reasons that are both ethical - to mitigate the risks of exclusion - and utilitarian, to enable more powerful, accurate, and engaging data analysis and use.

    KW - data

    KW - diversity

    KW - empirical studies

    KW - models and algorithms

    KW - responsibly

    UR - http://www.scopus.com/inward/record.url?scp=85021157002&partnerID=8YFLogxK

    UR - http://www.scopus.com/inward/citedby.url?scp=85021157002&partnerID=8YFLogxK

    U2 - 10.1089/big.2016.0054

    DO - 10.1089/big.2016.0054

    M3 - Review article

    C2 - 28632443

    AN - SCOPUS:85021157002

    VL - 5

    SP - 73

    EP - 84

    JO - Big Data

    JF - Big Data

    SN - 2167-6461

    IS - 2

    ER -