Interlingual annotation for MT development

Florence Reeder, Bonnie Dorr, David Farwell, Nizar Habash, Stephen Helmreich, Eduard Hovy, Lori Levin, Teruko Mitamura, Keith Miller, Owen Rambow, Advaith Siddharthan

    Research output: Contribution to journalArticle

    Abstract

    MT systems that use only superficial representations, including the current generation of statistical MT systems, have been successful and useful. However, they will experience a plateau in quality, much like other "silver bullet" approaches to MT. We pursue work on the development of interlingual representations for use in symbolic or hybrid MT systems. In this paper, we describe the creation of an interlingua and the development of a corpus of semantically annotated text, to be validated in six languages and evaluated in several ways. We have established a distributed, well-functioning research methodology, designed a preliminary interlingua notation, created annotation manuals and tools, developed a test collection in six languages with associated English translations, annotated some 150 translations, and designed and applied various annotation metrics. We describe the data sets being annotated and the interlingual (IL) representation language which uses two ontologies and a systematic theta-role list. We present the annotation tools built and outline the annotation process. Following this, we describe our evaluation methodology and conclude with a summary of issues that have arisen.

    Original languageEnglish (US)
    Pages (from-to)236-245
    Number of pages10
    JournalLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
    Volume3265
    StatePublished - Dec 1 2004

    Fingerprint

    Annotation
    Hybrid systems
    Ontology
    Methodology
    Silver
    Hybrid Systems
    Notation
    Metric
    Evaluation
    Language

    ASJC Scopus subject areas

    • Theoretical Computer Science
    • Computer Science(all)

    Cite this

    Reeder, F., Dorr, B., Farwell, D., Habash, N., Helmreich, S., Hovy, E., ... Siddharthan, A. (2004). Interlingual annotation for MT development. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 3265, 236-245.

    Interlingual annotation for MT development. / Reeder, Florence; Dorr, Bonnie; Farwell, David; Habash, Nizar; Helmreich, Stephen; Hovy, Eduard; Levin, Lori; Mitamura, Teruko; Miller, Keith; Rambow, Owen; Siddharthan, Advaith.

    In: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Vol. 3265, 01.12.2004, p. 236-245.

    Research output: Contribution to journalArticle

    Reeder, F, Dorr, B, Farwell, D, Habash, N, Helmreich, S, Hovy, E, Levin, L, Mitamura, T, Miller, K, Rambow, O & Siddharthan, A 2004, 'Interlingual annotation for MT development', Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 3265, pp. 236-245.
    Reeder, Florence ; Dorr, Bonnie ; Farwell, David ; Habash, Nizar ; Helmreich, Stephen ; Hovy, Eduard ; Levin, Lori ; Mitamura, Teruko ; Miller, Keith ; Rambow, Owen ; Siddharthan, Advaith. / Interlingual annotation for MT development. In: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2004 ; Vol. 3265. pp. 236-245.
    @article{565c9846673e4a7e967c55e5a86d65d3,
    title = "Interlingual annotation for MT development",
    abstract = "MT systems that use only superficial representations, including the current generation of statistical MT systems, have been successful and useful. However, they will experience a plateau in quality, much like other {"}silver bullet{"} approaches to MT. We pursue work on the development of interlingual representations for use in symbolic or hybrid MT systems. In this paper, we describe the creation of an interlingua and the development of a corpus of semantically annotated text, to be validated in six languages and evaluated in several ways. We have established a distributed, well-functioning research methodology, designed a preliminary interlingua notation, created annotation manuals and tools, developed a test collection in six languages with associated English translations, annotated some 150 translations, and designed and applied various annotation metrics. We describe the data sets being annotated and the interlingual (IL) representation language which uses two ontologies and a systematic theta-role list. We present the annotation tools built and outline the annotation process. Following this, we describe our evaluation methodology and conclude with a summary of issues that have arisen.",
    author = "Florence Reeder and Bonnie Dorr and David Farwell and Nizar Habash and Stephen Helmreich and Eduard Hovy and Lori Levin and Teruko Mitamura and Keith Miller and Owen Rambow and Advaith Siddharthan",
    year = "2004",
    month = "12",
    day = "1",
    language = "English (US)",
    volume = "3265",
    pages = "236--245",
    journal = "Lecture Notes in Computer Science",
    issn = "0302-9743",
    publisher = "Springer Verlag",

    }

    TY - JOUR

    T1 - Interlingual annotation for MT development

    AU - Reeder, Florence

    AU - Dorr, Bonnie

    AU - Farwell, David

    AU - Habash, Nizar

    AU - Helmreich, Stephen

    AU - Hovy, Eduard

    AU - Levin, Lori

    AU - Mitamura, Teruko

    AU - Miller, Keith

    AU - Rambow, Owen

    AU - Siddharthan, Advaith

    PY - 2004/12/1

    Y1 - 2004/12/1

    N2 - MT systems that use only superficial representations, including the current generation of statistical MT systems, have been successful and useful. However, they will experience a plateau in quality, much like other "silver bullet" approaches to MT. We pursue work on the development of interlingual representations for use in symbolic or hybrid MT systems. In this paper, we describe the creation of an interlingua and the development of a corpus of semantically annotated text, to be validated in six languages and evaluated in several ways. We have established a distributed, well-functioning research methodology, designed a preliminary interlingua notation, created annotation manuals and tools, developed a test collection in six languages with associated English translations, annotated some 150 translations, and designed and applied various annotation metrics. We describe the data sets being annotated and the interlingual (IL) representation language which uses two ontologies and a systematic theta-role list. We present the annotation tools built and outline the annotation process. Following this, we describe our evaluation methodology and conclude with a summary of issues that have arisen.

    AB - MT systems that use only superficial representations, including the current generation of statistical MT systems, have been successful and useful. However, they will experience a plateau in quality, much like other "silver bullet" approaches to MT. We pursue work on the development of interlingual representations for use in symbolic or hybrid MT systems. In this paper, we describe the creation of an interlingua and the development of a corpus of semantically annotated text, to be validated in six languages and evaluated in several ways. We have established a distributed, well-functioning research methodology, designed a preliminary interlingua notation, created annotation manuals and tools, developed a test collection in six languages with associated English translations, annotated some 150 translations, and designed and applied various annotation metrics. We describe the data sets being annotated and the interlingual (IL) representation language which uses two ontologies and a systematic theta-role list. We present the annotation tools built and outline the annotation process. Following this, we describe our evaluation methodology and conclude with a summary of issues that have arisen.

    UR - http://www.scopus.com/inward/record.url?scp=35048825346&partnerID=8YFLogxK

    UR - http://www.scopus.com/inward/citedby.url?scp=35048825346&partnerID=8YFLogxK

    M3 - Article

    VL - 3265

    SP - 236

    EP - 245

    JO - Lecture Notes in Computer Science

    JF - Lecture Notes in Computer Science

    SN - 0302-9743

    ER -