Creating resources for dialectal Arabic from a single annotation: A case study on egyptian and levantine

Ramy Eskander, Nizar Habash, Owen Rambow, Arfath Pasha

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Arabic dialects present a special problem for natural language processing because there are few Arabic dialect resources, they have no standard orthography, and they have not been studied much. However, as more and more written dialectal Arabic is found on social media, natural language processing for Arabic dialects has become an important goal. We present a methodology for creating a morphological analyzer and a morphological tagger for dialectal Arabic, and we illustrate it on Egyptian and Levantine Arabic. To our knowledge, these are the first analyzer and tagger for Levantine.

Original languageEnglish (US)
Title of host publicationCOLING 2016 - 26th International Conference on Computational Linguistics, Proceedings of COLING 2016
Subtitle of host publicationTechnical Papers
PublisherAssociation for Computational Linguistics, ACL Anthology
Pages3455-3465
Number of pages11
ISBN (Print)9784879747020
StatePublished - Jan 1 2016
Event26th International Conference on Computational Linguistics, COLING 2016 - Osaka, Japan
Duration: Dec 11 2016Dec 16 2016

Other

Other26th International Conference on Computational Linguistics, COLING 2016
CountryJapan
CityOsaka
Period12/11/1612/16/16

Fingerprint

dialect
Processing
resources
orthography
language
social media
methodology
Annotation
Egyptians
Arabic Dialects
Resources
Tag
Natural Language Processing

ASJC Scopus subject areas

  • Computational Theory and Mathematics
  • Language and Linguistics
  • Linguistics and Language

Cite this

Eskander, R., Habash, N., Rambow, O., & Pasha, A. (2016). Creating resources for dialectal Arabic from a single annotation: A case study on egyptian and levantine. In COLING 2016 - 26th International Conference on Computational Linguistics, Proceedings of COLING 2016: Technical Papers (pp. 3455-3465). Association for Computational Linguistics, ACL Anthology.

Creating resources for dialectal Arabic from a single annotation : A case study on egyptian and levantine. / Eskander, Ramy; Habash, Nizar; Rambow, Owen; Pasha, Arfath.

COLING 2016 - 26th International Conference on Computational Linguistics, Proceedings of COLING 2016: Technical Papers. Association for Computational Linguistics, ACL Anthology, 2016. p. 3455-3465.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Eskander, R, Habash, N, Rambow, O & Pasha, A 2016, Creating resources for dialectal Arabic from a single annotation: A case study on egyptian and levantine. in COLING 2016 - 26th International Conference on Computational Linguistics, Proceedings of COLING 2016: Technical Papers. Association for Computational Linguistics, ACL Anthology, pp. 3455-3465, 26th International Conference on Computational Linguistics, COLING 2016, Osaka, Japan, 12/11/16.
Eskander R, Habash N, Rambow O, Pasha A. Creating resources for dialectal Arabic from a single annotation: A case study on egyptian and levantine. In COLING 2016 - 26th International Conference on Computational Linguistics, Proceedings of COLING 2016: Technical Papers. Association for Computational Linguistics, ACL Anthology. 2016. p. 3455-3465
Eskander, Ramy ; Habash, Nizar ; Rambow, Owen ; Pasha, Arfath. / Creating resources for dialectal Arabic from a single annotation : A case study on egyptian and levantine. COLING 2016 - 26th International Conference on Computational Linguistics, Proceedings of COLING 2016: Technical Papers. Association for Computational Linguistics, ACL Anthology, 2016. pp. 3455-3465
@inproceedings{c5dd7d4fb2644135834a3ecf6f65738b,
title = "Creating resources for dialectal Arabic from a single annotation: A case study on egyptian and levantine",
abstract = "Arabic dialects present a special problem for natural language processing because there are few Arabic dialect resources, they have no standard orthography, and they have not been studied much. However, as more and more written dialectal Arabic is found on social media, natural language processing for Arabic dialects has become an important goal. We present a methodology for creating a morphological analyzer and a morphological tagger for dialectal Arabic, and we illustrate it on Egyptian and Levantine Arabic. To our knowledge, these are the first analyzer and tagger for Levantine.",
author = "Ramy Eskander and Nizar Habash and Owen Rambow and Arfath Pasha",
year = "2016",
month = "1",
day = "1",
language = "English (US)",
isbn = "9784879747020",
pages = "3455--3465",
booktitle = "COLING 2016 - 26th International Conference on Computational Linguistics, Proceedings of COLING 2016",
publisher = "Association for Computational Linguistics, ACL Anthology",

}

TY - GEN

T1 - Creating resources for dialectal Arabic from a single annotation

T2 - A case study on egyptian and levantine

AU - Eskander, Ramy

AU - Habash, Nizar

AU - Rambow, Owen

AU - Pasha, Arfath

PY - 2016/1/1

Y1 - 2016/1/1

N2 - Arabic dialects present a special problem for natural language processing because there are few Arabic dialect resources, they have no standard orthography, and they have not been studied much. However, as more and more written dialectal Arabic is found on social media, natural language processing for Arabic dialects has become an important goal. We present a methodology for creating a morphological analyzer and a morphological tagger for dialectal Arabic, and we illustrate it on Egyptian and Levantine Arabic. To our knowledge, these are the first analyzer and tagger for Levantine.

AB - Arabic dialects present a special problem for natural language processing because there are few Arabic dialect resources, they have no standard orthography, and they have not been studied much. However, as more and more written dialectal Arabic is found on social media, natural language processing for Arabic dialects has become an important goal. We present a methodology for creating a morphological analyzer and a morphological tagger for dialectal Arabic, and we illustrate it on Egyptian and Levantine Arabic. To our knowledge, these are the first analyzer and tagger for Levantine.

UR - http://www.scopus.com/inward/record.url?scp=85055005100&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85055005100&partnerID=8YFLogxK

M3 - Conference contribution

SN - 9784879747020

SP - 3455

EP - 3465

BT - COLING 2016 - 26th International Conference on Computational Linguistics, Proceedings of COLING 2016

PB - Association for Computational Linguistics, ACL Anthology

ER -