Conventional orthography for dialectal Arabic

Nizar Habash, Mona Diab, Owen Rambow

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

    Abstract

    Dialectal Arabic (DA) refers to the day-to-day vernaculars spoken in the Arab world. DA lives side-by-side with the official language, Modern Standard Arabic (MSA). DA differs from MSA on all levels of linguistic representation, from phonology and morphology to lexicon and syntax. Unlike MSA, DA has no standard orthography since there are no Arabic dialect academies, nor is there a large edited body of dialectal literature that follows the same spelling standard. In this paper, we present CODA, a conventional orthography for dialectal Arabic; it is designed primarily for the purpose of developing computational models of Arabic dialects. We explain the design principles of CODA and provide a detailed description of its guidelines as applied to Egyptian Arabic.

    Original language: English (US)
    Title of host publication: Proceedings of the 8th International Conference on Language Resources and Evaluation, LREC 2012
    Publisher: European Language Resources Association (ELRA)
    Pages: 711-718
    Number of pages: 8
    ISBN (Electronic): 9782951740877
    State: Published - Jan 1 2012
    Event: 8th International Conference on Language Resources and Evaluation, LREC 2012 - Istanbul, Turkey
    Duration: May 21 2012 to May 27 2012

    Other

    Other: 8th International Conference on Language Resources and Evaluation, LREC 2012
    Country: Turkey
    City: Istanbul
    Period: 5/21/12 to 5/27/12

    Keywords

    • Arabic
    • Dialects
    • Orthography

    ASJC Scopus subject areas

    • Linguistics and Language
    • Language and Linguistics
    • Education
    • Library and Information Sciences

    Cite this

    Habash, N., Diab, M., & Rambow, O. (2012). Conventional orthography for dialectal Arabic. In Proceedings of the 8th International Conference on Language Resources and Evaluation, LREC 2012 (pp. 711-718). European Language Resources Association (ELRA).

    @inproceedings{ae64487961a94e09ab0e45f81d3333e9,
    title = "Conventional orthography for dialectal Arabic",
    keywords = "Arabic, Dialects, Orthography",
    author = "Nizar Habash and Mona Diab and Owen Rambow",
    year = "2012",
    month = "1",
    day = "1",
    language = "English (US)",
    pages = "711--718",
    booktitle = "Proceedings of the 8th International Conference on Language Resources and Evaluation, LREC 2012",
    publisher = "European Language Resources Association (ELRA)",

    }

    TY - GEN
    T1 - Conventional orthography for dialectal Arabic
    AU - Habash, Nizar
    AU - Diab, Mona
    AU - Rambow, Owen
    PY - 2012/1/1
    Y1 - 2012/1/1
    KW - Arabic
    KW - Dialects
    KW - Orthography
    UR - http://www.scopus.com/inward/record.url?scp=84964544190&partnerID=8YFLogxK
    UR - http://www.scopus.com/inward/citedby.url?scp=84964544190&partnerID=8YFLogxK
    M3 - Conference contribution
    SP - 711
    EP - 718
    BT - Proceedings of the 8th International Conference on Language Resources and Evaluation, LREC 2012
    PB - European Language Resources Association (ELRA)
    ER -