Race, ethnicity and national origin-based discrimination in social media and hate crimes across 100 U.S. cities

Kunal Relia, Zhengyi Li, Stephanie Cook, Rumi Chunara

Research output: Contribution to conferencePaper

Abstract

We study malicious online content via a specific type of hate speech: race, ethnicity and national-origin based discrimination in social media, alongside hate crimes motivated by those characteristics, in 100 cities across the United States. We develop a spatially-diverse training dataset and classification pipeline to delineate targeted and self-narration of discrimination on social media, accounting for language across geographies. Controlling for census parameters, we find that the proportion of discrimination that is targeted is associated with the number of hate crimes. Finally, we explore the linguistic features of discrimination Tweets in relation to hate crimes by city, features used by users who Tweet different amounts of discrimination, and features of discrimination compared to non-discrimination Tweets. Findings from this spatial study can inform future studies of how discrimination in physical and virtual worlds vary by place, or how physical and virtual world discrimination may synergize.

Original languageEnglish (US)
Pages417-427
Number of pages11
StatePublished - Jan 1 2019
Event13th International Conference on Web and Social Media, ICWSM 2019 - Munich, Germany
Duration: Jun 11 2019Jun 14 2019

Conference

Conference13th International Conference on Web and Social Media, ICWSM 2019
CountryGermany
CityMunich
Period6/11/196/14/19

Fingerprint

Crime
Linguistics
Pipelines

ASJC Scopus subject areas

  • Computer Networks and Communications

Cite this

Relia, K., Li, Z., Cook, S., & Chunara, R. (2019). Race, ethnicity and national origin-based discrimination in social media and hate crimes across 100 U.S. cities. 417-427. Paper presented at 13th International Conference on Web and Social Media, ICWSM 2019, Munich, Germany.

Race, ethnicity and national origin-based discrimination in social media and hate crimes across 100 U.S. cities. / Relia, Kunal; Li, Zhengyi; Cook, Stephanie; Chunara, Rumi.

2019. 417-427 Paper presented at 13th International Conference on Web and Social Media, ICWSM 2019, Munich, Germany.

Research output: Contribution to conferencePaper

Relia, K, Li, Z, Cook, S & Chunara, R 2019, 'Race, ethnicity and national origin-based discrimination in social media and hate crimes across 100 U.S. cities', Paper presented at 13th International Conference on Web and Social Media, ICWSM 2019, Munich, Germany, 6/11/19 - 6/14/19 pp. 417-427.
Relia K, Li Z, Cook S, Chunara R. Race, ethnicity and national origin-based discrimination in social media and hate crimes across 100 U.S. cities. 2019. Paper presented at 13th International Conference on Web and Social Media, ICWSM 2019, Munich, Germany.
Relia, Kunal ; Li, Zhengyi ; Cook, Stephanie ; Chunara, Rumi. / Race, ethnicity and national origin-based discrimination in social media and hate crimes across 100 U.S. cities. Paper presented at 13th International Conference on Web and Social Media, ICWSM 2019, Munich, Germany.11 p.
@conference{c926644e6a604fa6a34289a335eab3f7,
title = "Race, ethnicity and national origin-based discrimination in social media and hate crimes across 100 U.S. cities",
abstract = "We study malicious online content via a specific type of hate speech: race, ethnicity and national-origin based discrimination in social media, alongside hate crimes motivated by those characteristics, in 100 cities across the United States. We develop a spatially-diverse training dataset and classification pipeline to delineate targeted and self-narration of discrimination on social media, accounting for language across geographies. Controlling for census parameters, we find that the proportion of discrimination that is targeted is associated with the number of hate crimes. Finally, we explore the linguistic features of discrimination Tweets in relation to hate crimes by city, features used by users who Tweet different amounts of discrimination, and features of discrimination compared to non-discrimination Tweets. Findings from this spatial study can inform future studies of how discrimination in physical and virtual worlds vary by place, or how physical and virtual world discrimination may synergize.",
author = "Kunal Relia and Zhengyi Li and Stephanie Cook and Rumi Chunara",
year = "2019",
month = "1",
day = "1",
language = "English (US)",
pages = "417--427",
note = "13th International Conference on Web and Social Media, ICWSM 2019 ; Conference date: 11-06-2019 Through 14-06-2019",

}

TY - CONF

T1 - Race, ethnicity and national origin-based discrimination in social media and hate crimes across 100 U.S. cities

AU - Relia, Kunal

AU - Li, Zhengyi

AU - Cook, Stephanie

AU - Chunara, Rumi

PY - 2019/1/1

Y1 - 2019/1/1

N2 - We study malicious online content via a specific type of hate speech: race, ethnicity and national-origin based discrimination in social media, alongside hate crimes motivated by those characteristics, in 100 cities across the United States. We develop a spatially-diverse training dataset and classification pipeline to delineate targeted and self-narration of discrimination on social media, accounting for language across geographies. Controlling for census parameters, we find that the proportion of discrimination that is targeted is associated with the number of hate crimes. Finally, we explore the linguistic features of discrimination Tweets in relation to hate crimes by city, features used by users who Tweet different amounts of discrimination, and features of discrimination compared to non-discrimination Tweets. Findings from this spatial study can inform future studies of how discrimination in physical and virtual worlds vary by place, or how physical and virtual world discrimination may synergize.

AB - We study malicious online content via a specific type of hate speech: race, ethnicity and national-origin based discrimination in social media, alongside hate crimes motivated by those characteristics, in 100 cities across the United States. We develop a spatially-diverse training dataset and classification pipeline to delineate targeted and self-narration of discrimination on social media, accounting for language across geographies. Controlling for census parameters, we find that the proportion of discrimination that is targeted is associated with the number of hate crimes. Finally, we explore the linguistic features of discrimination Tweets in relation to hate crimes by city, features used by users who Tweet different amounts of discrimination, and features of discrimination compared to non-discrimination Tweets. Findings from this spatial study can inform future studies of how discrimination in physical and virtual worlds vary by place, or how physical and virtual world discrimination may synergize.

UR - http://www.scopus.com/inward/record.url?scp=85070382250&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85070382250&partnerID=8YFLogxK

M3 - Paper

AN - SCOPUS:85070382250

SP - 417

EP - 427

ER -