Creating resources for dialectal Arabic from a single annotation

A case study on egyptian and levantine

Ramy Eskander, Nizar Habash, Owen Rambow, Arfath Pasha

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Abstract

    Arabic dialects present a special problem for natural language processing because there are few Arabic dialect resources, they have no standard orthography, and they have not been studied much. However, as more and more written dialectal Arabic is found on social media, natural language processing for Arabic dialects has become an important goal. We present a methodology for creating a morphological analyzer and a morphological tagger for dialectal Arabic, and we illustrate it on Egyptian and Levantine Arabic. To our knowledge, these are the first analyzer and tagger for Levantine.

    Original languageEnglish (US)
    Title of host publicationCOLING 2016 - 26th International Conference on Computational Linguistics, Proceedings of COLING 2016
    Subtitle of host publicationTechnical Papers
    PublisherAssociation for Computational Linguistics, ACL Anthology
    Pages3455-3465
    Number of pages11
    ISBN (Print)9784879747020
    StatePublished - Jan 1 2016
    Event26th International Conference on Computational Linguistics, COLING 2016 - Osaka, Japan
    Duration: Dec 11 2016Dec 16 2016

    Other

    Other26th International Conference on Computational Linguistics, COLING 2016
    CountryJapan
    CityOsaka
    Period12/11/1612/16/16

    Fingerprint

    dialect
    Processing
    resources
    orthography
    language
    social media
    methodology
    Egyptians
    Annotation
    Resources
    Arabic Dialects
    Tag
    Natural Language Processing

    ASJC Scopus subject areas

    • Computational Theory and Mathematics
    • Language and Linguistics
    • Linguistics and Language

    Cite this

    Eskander, R., Habash, N., Rambow, O., & Pasha, A. (2016). Creating resources for dialectal Arabic from a single annotation: A case study on egyptian and levantine. In COLING 2016 - 26th International Conference on Computational Linguistics, Proceedings of COLING 2016: Technical Papers (pp. 3455-3465). Association for Computational Linguistics, ACL Anthology.

    Creating resources for dialectal Arabic from a single annotation : A case study on egyptian and levantine. / Eskander, Ramy; Habash, Nizar; Rambow, Owen; Pasha, Arfath.

    COLING 2016 - 26th International Conference on Computational Linguistics, Proceedings of COLING 2016: Technical Papers. Association for Computational Linguistics, ACL Anthology, 2016. p. 3455-3465.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Eskander, R, Habash, N, Rambow, O & Pasha, A 2016, Creating resources for dialectal Arabic from a single annotation: A case study on egyptian and levantine. in COLING 2016 - 26th International Conference on Computational Linguistics, Proceedings of COLING 2016: Technical Papers. Association for Computational Linguistics, ACL Anthology, pp. 3455-3465, 26th International Conference on Computational Linguistics, COLING 2016, Osaka, Japan, 12/11/16.
    Eskander R, Habash N, Rambow O, Pasha A. Creating resources for dialectal Arabic from a single annotation: A case study on egyptian and levantine. In COLING 2016 - 26th International Conference on Computational Linguistics, Proceedings of COLING 2016: Technical Papers. Association for Computational Linguistics, ACL Anthology. 2016. p. 3455-3465
    Eskander, Ramy ; Habash, Nizar ; Rambow, Owen ; Pasha, Arfath. / Creating resources for dialectal Arabic from a single annotation : A case study on egyptian and levantine. COLING 2016 - 26th International Conference on Computational Linguistics, Proceedings of COLING 2016: Technical Papers. Association for Computational Linguistics, ACL Anthology, 2016. pp. 3455-3465
    @inproceedings{7e724e6ca1c549b2935aadab9aa7af59,
    title = "Creating resources for dialectal Arabic from a single annotation: A case study on egyptian and levantine",
    abstract = "Arabic dialects present a special problem for natural language processing because there are few Arabic dialect resources, they have no standard orthography, and they have not been studied much. However, as more and more written dialectal Arabic is found on social media, natural language processing for Arabic dialects has become an important goal. We present a methodology for creating a morphological analyzer and a morphological tagger for dialectal Arabic, and we illustrate it on Egyptian and Levantine Arabic. To our knowledge, these are the first analyzer and tagger for Levantine.",
    author = "Ramy Eskander and Nizar Habash and Owen Rambow and Arfath Pasha",
    year = "2016",
    month = "1",
    day = "1",
    language = "English (US)",
    isbn = "9784879747020",
    pages = "3455--3465",
    booktitle = "COLING 2016 - 26th International Conference on Computational Linguistics, Proceedings of COLING 2016",
    publisher = "Association for Computational Linguistics, ACL Anthology",

    }

    TY - GEN

    T1 - Creating resources for dialectal Arabic from a single annotation

    T2 - A case study on egyptian and levantine

    AU - Eskander, Ramy

    AU - Habash, Nizar

    AU - Rambow, Owen

    AU - Pasha, Arfath

    PY - 2016/1/1

    Y1 - 2016/1/1

    N2 - Arabic dialects present a special problem for natural language processing because there are few Arabic dialect resources, they have no standard orthography, and they have not been studied much. However, as more and more written dialectal Arabic is found on social media, natural language processing for Arabic dialects has become an important goal. We present a methodology for creating a morphological analyzer and a morphological tagger for dialectal Arabic, and we illustrate it on Egyptian and Levantine Arabic. To our knowledge, these are the first analyzer and tagger for Levantine.

    AB - Arabic dialects present a special problem for natural language processing because there are few Arabic dialect resources, they have no standard orthography, and they have not been studied much. However, as more and more written dialectal Arabic is found on social media, natural language processing for Arabic dialects has become an important goal. We present a methodology for creating a morphological analyzer and a morphological tagger for dialectal Arabic, and we illustrate it on Egyptian and Levantine Arabic. To our knowledge, these are the first analyzer and tagger for Levantine.

    UR - http://www.scopus.com/inward/record.url?scp=85055005100&partnerID=8YFLogxK

    UR - http://www.scopus.com/inward/citedby.url?scp=85055005100&partnerID=8YFLogxK

    M3 - Conference contribution

    SN - 9784879747020

    SP - 3455

    EP - 3465

    BT - COLING 2016 - 26th International Conference on Computational Linguistics, Proceedings of COLING 2016

    PB - Association for Computational Linguistics, ACL Anthology

    ER -