CATiB: The Columbia Arabic Treebank

Nizar Habash, Ryan M. Roth

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The Columbia Arabic Treebank (CATiB) is a database of syntactic analyses of Arabic sentences. CATiB contrasts with previous approaches to Arabic treebanking in its emphasis on speed with some constraints on linguistic richness. Two basic ideas inspire the CATiB approach: no annotation of redundant information and using representations and terminology inspired by traditional Arabic syntax. We describe CATiB's representation and annotation procedure, and report on inter-annotator agreement and speed.

Original languageEnglish (US)
Title of host publicationACL-IJCNLP 2009 - Joint Conf. of the 47th Annual Meeting of the Association for Computational Linguistics and 4th Int. Joint Conf. on Natural Language Processing of the AFNLP, Proceedings of the Conf.
Pages221-224
Number of pages4
StatePublished - Dec 1 2009
EventJoint Conference of the 47th Annual Meeting of the Association for Computational Linguistics and 4th International Joint Conference on Natural Language Processing of the AFNLP, ACL-IJCNLP 2009 - Suntec, Singapore
Duration: Aug 2 2009Aug 7 2009

Other

OtherJoint Conference of the 47th Annual Meeting of the Association for Computational Linguistics and 4th International Joint Conference on Natural Language Processing of the AFNLP, ACL-IJCNLP 2009
CountrySingapore
CitySuntec
Period8/2/098/7/09

Fingerprint

syntax
technical language
linguistics
Treebank
Syntax
Annotation
Data Base

ASJC Scopus subject areas

  • Language and Linguistics
  • Linguistics and Language

Cite this

Habash, N., & Roth, R. M. (2009). CATiB: The Columbia Arabic Treebank. In ACL-IJCNLP 2009 - Joint Conf. of the 47th Annual Meeting of the Association for Computational Linguistics and 4th Int. Joint Conf. on Natural Language Processing of the AFNLP, Proceedings of the Conf. (pp. 221-224)

CATiB : The Columbia Arabic Treebank. / Habash, Nizar; Roth, Ryan M.

ACL-IJCNLP 2009 - Joint Conf. of the 47th Annual Meeting of the Association for Computational Linguistics and 4th Int. Joint Conf. on Natural Language Processing of the AFNLP, Proceedings of the Conf.. 2009. p. 221-224.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Habash, N & Roth, RM 2009, CATiB: The Columbia Arabic Treebank. in ACL-IJCNLP 2009 - Joint Conf. of the 47th Annual Meeting of the Association for Computational Linguistics and 4th Int. Joint Conf. on Natural Language Processing of the AFNLP, Proceedings of the Conf.. pp. 221-224, Joint Conference of the 47th Annual Meeting of the Association for Computational Linguistics and 4th International Joint Conference on Natural Language Processing of the AFNLP, ACL-IJCNLP 2009, Suntec, Singapore, 8/2/09.
Habash N, Roth RM. CATiB: The Columbia Arabic Treebank. In ACL-IJCNLP 2009 - Joint Conf. of the 47th Annual Meeting of the Association for Computational Linguistics and 4th Int. Joint Conf. on Natural Language Processing of the AFNLP, Proceedings of the Conf.. 2009. p. 221-224
Habash, Nizar ; Roth, Ryan M. / CATiB : The Columbia Arabic Treebank. ACL-IJCNLP 2009 - Joint Conf. of the 47th Annual Meeting of the Association for Computational Linguistics and 4th Int. Joint Conf. on Natural Language Processing of the AFNLP, Proceedings of the Conf.. 2009. pp. 221-224
@inproceedings{945f0b30619642b88094986a3c565add,
title = "CATiB: The Columbia Arabic Treebank",
abstract = "The Columbia Arabic Treebank (CATiB) is a database of syntactic analyses of Arabic sentences. CATiB contrasts with previous approaches to Arabic treebanking in its emphasis on speed with some constraints on linguistic richness. Two basic ideas inspire the CATiB approach: no annotation of redundant information and using representations and terminology inspired by traditional Arabic syntax. We describe CATiB's representation and annotation procedure, and report on inter-annotator agreement and speed.",
author = "Nizar Habash and Roth, {Ryan M.}",
year = "2009",
month = "12",
day = "1",
language = "English (US)",
isbn = "9781617382581",
pages = "221--224",
booktitle = "ACL-IJCNLP 2009 - Joint Conf. of the 47th Annual Meeting of the Association for Computational Linguistics and 4th Int. Joint Conf. on Natural Language Processing of the AFNLP, Proceedings of the Conf.",

}

TY - GEN

T1 - CATiB

T2 - The Columbia Arabic Treebank

AU - Habash, Nizar

AU - Roth, Ryan M.

PY - 2009/12/1

Y1 - 2009/12/1

N2 - The Columbia Arabic Treebank (CATiB) is a database of syntactic analyses of Arabic sentences. CATiB contrasts with previous approaches to Arabic treebanking in its emphasis on speed with some constraints on linguistic richness. Two basic ideas inspire the CATiB approach: no annotation of redundant information and using representations and terminology inspired by traditional Arabic syntax. We describe CATiB's representation and annotation procedure, and report on inter-annotator agreement and speed.

AB - The Columbia Arabic Treebank (CATiB) is a database of syntactic analyses of Arabic sentences. CATiB contrasts with previous approaches to Arabic treebanking in its emphasis on speed with some constraints on linguistic richness. Two basic ideas inspire the CATiB approach: no annotation of redundant information and using representations and terminology inspired by traditional Arabic syntax. We describe CATiB's representation and annotation procedure, and report on inter-annotator agreement and speed.

UR - http://www.scopus.com/inward/record.url?scp=79951865015&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=79951865015&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:79951865015

SN - 9781617382581

SP - 221

EP - 224

BT - ACL-IJCNLP 2009 - Joint Conf. of the 47th Annual Meeting of the Association for Computational Linguistics and 4th Int. Joint Conf. on Natural Language Processing of the AFNLP, Proceedings of the Conf.

ER -