Authorship analysis of Inspire magazine through stylometric and psychological features

Jennifer Sikos, Peter David, Nizar Habash, Reem Faraj

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution

Abstract

When we read a piece of writing, the meaning we derive from that text often includes information about the authors themselves. Clues to their identity, worldview, and even psychological states are encoded in features such as word choice and sentence structure. This work describes how writing style features can be used to analyze the authorship of extreme jihadist writing. Inspire magazine is an online, English-language magazine published by Al-Qaeda in the Arabian Peninsula. Our work has revealed similarities and disparities in the writing styles of Inspire authors using features such as word choice and sentence structure, as well as semantic and psychological features. The Linguistic Inquiry and Word Count (LIWC) resource is a lexicon that identifies words and phrases associated with a set of cognitive processes and psychological states [1]. LIWC was originally developed to determine the psychological properties of English text but has since been expanded to other languages, including Arabic. Prior to this work, the Arabic-language version of LIWC was limited to a small category of function words and did not have the full analytical power of the English-language version. We show how a method of lexicon expansion, translation, and assessment by a native Arabic speaker was used to produce a more robust Arabic-language version of the resource, and how that version is applied to the psychological analysis of Inspire content in both English and Arabic.
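The kind of lexicon-based and stylometric feature extraction the abstract describes can be sketched roughly as follows. This is a minimal illustration, not the authors' pipeline: the lexicon `TOY_LEXICON`, its category names, and the functions `category_rates` and `style_features` are hypothetical stand-ins, and the real LIWC dictionary is not reproduced here. The sketch assumes that a trailing `*` in a lexicon entry marks a prefix match and that category scores are reported as a percentage of all tokens.

```python
import re
from collections import Counter

# Hypothetical toy lexicon for illustration only; NOT the real LIWC dictionary.
# A trailing "*" marks a prefix match (e.g. "happ*" matches "happy").
TOY_LEXICON = {
    "posemo": ["happ*", "good", "love*"],
    "negemo": ["hate*", "fear*", "bad"],
    "cogmech": ["because", "think*", "know*"],
}


def tokenize(text):
    """Lowercase the text and return its alphabetic word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def matches(token, pattern):
    """Return True if the token matches an exact or prefix ("*") pattern."""
    if pattern.endswith("*"):
        return token.startswith(pattern[:-1])
    return token == pattern


def category_rates(text, lexicon=TOY_LEXICON):
    """Return, per category, the percentage of tokens matching that category."""
    tokens = tokenize(text)
    counts = Counter()
    for token in tokens:
        for category, patterns in lexicon.items():
            if any(matches(token, p) for p in patterns):
                counts[category] += 1
    total = len(tokens) or 1  # avoid division by zero on empty input
    return {c: 100.0 * counts[c] / total for c in lexicon}


def style_features(text):
    """Compute two simple stylometric features: sentence length and vocabulary richness."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    tokens = tokenize(text)
    return {
        "avg_sentence_length": len(tokens) / max(len(sentences), 1),
        "type_token_ratio": len(set(tokens)) / max(len(tokens), 1),
    }


if __name__ == "__main__":
    sample = "I think we know fear. We love good things because we think."
    print(category_rates(sample))
    print(style_features(sample))
```

Feature vectors of this kind, computed per article, could then be compared across authors; how the paper itself combines or weights such features is not stated in the abstract.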

Original language: English (US)
Title of host publication: Proceedings - 2014 IEEE Joint Intelligence and Security Informatics Conference, JISIC 2014
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 33-40
Number of pages: 8
ISBN (Electronic): 9781479963645
DOI: https://doi.org/10.1109/JISIC.2014.15
State: Published - Jan 1 2014
Event: 2014 IEEE Joint Intelligence and Security Informatics Conference, JISIC 2014 - The Hague, Netherlands
Duration: Sep 24 2014 - Sep 26 2014

Other

Other: 2014 IEEE Joint Intelligence and Security Informatics Conference, JISIC 2014
Country: Netherlands
City: The Hague
Period: 9/24/14 - 9/26/14

Fingerprint

Linguistics
Semantics

Keywords

  • Authorship Analysis
  • Linguistic Inquiry and Word Count
  • Natural Language Processing
  • Stylometry

ASJC Scopus subject areas

  • Artificial Intelligence
  • Information Systems
  • Software

Cite this

Sikos, J., David, P., Habash, N., & Faraj, R. (2014). Authorship analysis of Inspire magazine through stylometric and psychological features. In Proceedings - 2014 IEEE Joint Intelligence and Security Informatics Conference, JISIC 2014 (pp. 33-40). [6975551] Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/JISIC.2014.15
