The design principles and algorithms of a weighted grammar library

Cyril Allauzen, Mehryar Mohri, Brian Roark

Research output: Contribution to journalArticle

Abstract

We present the software design principles, algorithms, and utilities of a general weighted grammar library, the GRM Library, that can be used in a variety of applications in text, speech, and biosequence processing. Several of the algorithms and utilities of this library are described, including in some cases their pseudocodes and pointers to their use in applications. The algorithms and the utilities were designed to support a wide variety of semirings and the representation and use of large grammars and automata of several hundred million rules or transitions.

Original languageEnglish (US)
Pages (from-to)403-421
Number of pages19
JournalInternational Journal of Foundations of Computer Science
Volume16
Issue number3
DOIs
StatePublished - Jun 2005

Fingerprint

Software design
Processing

ASJC Scopus subject areas

  • Computer Science (miscellaneous)

Cite this

The design principles and algorithms of a weighted grammar library. / Allauzen, Cyril; Mohri, Mehryar; Roark, Brian.

In: International Journal of Foundations of Computer Science, Vol. 16, No. 3, 06.2005, p. 403-421.

Research output: Contribution to journalArticle

@article{6dc2c449756541ab9a1c22f241e90a1b,
title = "The design principles and algorithms of a weighted grammar library",
abstract = "We present the software design principles, algorithms, and utilities of a general weighted grammar library, the GRM Library, that can be used in a variety of applications in text, speech, and biosequence processing. Several of the algorithms and utilities of this library are described, including in some cases their pseudocodes and pointers to their use in applications. The algorithms and the utilities were designed to support a wide variety of semirings and the representation and use of large grammars and automata of several hundred million rules or transitions.",
author = "Cyril Allauzen and Mehryar Mohri and Brian Roark",
year = "2005",
month = "6",
doi = "10.1142/S0129054105003066",
language = "English (US)",
volume = "16",
pages = "403--421",
journal = "International Journal of Foundations of Computer Science",
issn = "0129-0541",
publisher = "World Scientific Publishing Co. Pte Ltd",
number = "3",

}

TY - JOUR

T1 - The design principles and algorithms of a weighted grammar library

AU - Allauzen, Cyril

AU - Mohri, Mehryar

AU - Roark, Brian

PY - 2005/6

Y1 - 2005/6

N2 - We present the software design principles, algorithms, and utilities of a general weighted grammar library, the GRM Library, that can be used in a variety of applications in text, speech, and biosequence processing. Several of the algorithms and utilities of this library are described, including in some cases their pseudocodes and pointers to their use in applications. The algorithms and the utilities were designed to support a wide variety of semirings and the representation and use of large grammars and automata of several hundred million rules or transitions.

AB - We present the software design principles, algorithms, and utilities of a general weighted grammar library, the GRM Library, that can be used in a variety of applications in text, speech, and biosequence processing. Several of the algorithms and utilities of this library are described, including in some cases their pseudocodes and pointers to their use in applications. The algorithms and the utilities were designed to support a wide variety of semirings and the representation and use of large grammars and automata of several hundred million rules or transitions.

UR - http://www.scopus.com/inward/record.url?scp=33746228466&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=33746228466&partnerID=8YFLogxK

U2 - 10.1142/S0129054105003066

DO - 10.1142/S0129054105003066

M3 - Article

VL - 16

SP - 403

EP - 421

JO - International Journal of Foundations of Computer Science

JF - International Journal of Foundations of Computer Science

SN - 0129-0541

IS - 3

ER -