Arabic tokenization, part-of-speech tagging and morphological disambiguation in one fell swoop

Nizar Habash, Owen Rambow

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

We present an approach to using a morphological analyzer for tokenizing andmorphologically tagging (including partof- speech tagging) Arabic words in one process. We learn classifiers for individual morphological features, as well as ways of using these classifiers to choose among entries from the output of the analyzer. We obtain accuracy rates on all tasks in the high nineties.

Original languageEnglish (US)
Title of host publicationACL-05 - 43rd Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference
Pages573-580
Number of pages8
StatePublished - Dec 1 2005
Event43rd Annual Meeting of the Association for Computational Linguistics, ACL-05 - Ann Arbor, MI, United States
Duration: Jun 25 2005Jun 30 2005

Other

Other43rd Annual Meeting of the Association for Computational Linguistics, ACL-05
CountryUnited States
CityAnn Arbor, MI
Period6/25/056/30/05

Fingerprint

Part-of-speech Tagging
Disambiguation
Classifier
Tagging

ASJC Scopus subject areas

  • Language and Linguistics
  • Linguistics and Language

Cite this

Habash, N., & Rambow, O. (2005). Arabic tokenization, part-of-speech tagging and morphological disambiguation in one fell swoop. In ACL-05 - 43rd Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (pp. 573-580)

Arabic tokenization, part-of-speech tagging and morphological disambiguation in one fell swoop. / Habash, Nizar; Rambow, Owen.

ACL-05 - 43rd Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference. 2005. p. 573-580.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Habash, N & Rambow, O 2005, Arabic tokenization, part-of-speech tagging and morphological disambiguation in one fell swoop. in ACL-05 - 43rd Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference. pp. 573-580, 43rd Annual Meeting of the Association for Computational Linguistics, ACL-05, Ann Arbor, MI, United States, 6/25/05.
Habash N, Rambow O. Arabic tokenization, part-of-speech tagging and morphological disambiguation in one fell swoop. In ACL-05 - 43rd Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference. 2005. p. 573-580
Habash, Nizar ; Rambow, Owen. / Arabic tokenization, part-of-speech tagging and morphological disambiguation in one fell swoop. ACL-05 - 43rd Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference. 2005. pp. 573-580
@inproceedings{bbe8c2e069494857bded1abf5c792c74,
title = "Arabic tokenization, part-of-speech tagging and morphological disambiguation in one fell swoop",
abstract = "We present an approach to using a morphological analyzer for tokenizing andmorphologically tagging (including partof- speech tagging) Arabic words in one process. We learn classifiers for individual morphological features, as well as ways of using these classifiers to choose among entries from the output of the analyzer. We obtain accuracy rates on all tasks in the high nineties.",
author = "Nizar Habash and Owen Rambow",
year = "2005",
month = "12",
day = "1",
language = "English (US)",
isbn = "1932432515",
pages = "573--580",
booktitle = "ACL-05 - 43rd Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference",

}

TY - GEN

T1 - Arabic tokenization, part-of-speech tagging and morphological disambiguation in one fell swoop

AU - Habash, Nizar

AU - Rambow, Owen

PY - 2005/12/1

Y1 - 2005/12/1

N2 - We present an approach to using a morphological analyzer for tokenizing andmorphologically tagging (including partof- speech tagging) Arabic words in one process. We learn classifiers for individual morphological features, as well as ways of using these classifiers to choose among entries from the output of the analyzer. We obtain accuracy rates on all tasks in the high nineties.

AB - We present an approach to using a morphological analyzer for tokenizing andmorphologically tagging (including partof- speech tagging) Arabic words in one process. We learn classifiers for individual morphological features, as well as ways of using these classifiers to choose among entries from the output of the analyzer. We obtain accuracy rates on all tasks in the high nineties.

UR - http://www.scopus.com/inward/record.url?scp=84859910518&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84859910518&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:84859910518

SN - 1932432515

SN - 9781932432510

SP - 573

EP - 580

BT - ACL-05 - 43rd Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference

ER -