Information Extraction

Research output: Chapter in Book/Report/Conference proceedingChapter

Abstract

Information extraction (IE) is the automatic identification of selected types of entities, relations, or events in free text. This article appraises two specific strands of IE - name identification and classification, and event extraction. Conventional treatment of languages pays little attention to proper names, addresses etc. Presentations of language analysis generally look up words in a dictionary and identify them as nouns etc. The incessant presence of names in a text, makes linguistic analysis of the same difficult, in the absence of the names being identified by their types and as linguistic units. Name tagging involves creating, several finite-state patterns, each corresponding to some noun subset. Elements of the patterns would match specific/classes of tokens with particular features. Event extraction typically works by creating a series of regular expressions, customized to capture the relevant events. Enhancement of each expression is corresponded by a relevant, suitable enhancement in the event patterns.

Original languageEnglish (US)
Title of host publicationThe Oxford Handbook of Computational Linguistics
PublisherOxford University Press
ISBN (Print)9780191743573, 9780199276349
DOIs
StatePublished - Sep 18 2012

Fingerprint

event
language analysis
linguistics
dictionary
Information Extraction
Names
language
Enhancement
Language
Nouns
Linguistic Analysis
Dictionary
Tagging
Regular
Proper Names
Conventional
Entity

Keywords

  • Automatic
  • Event
  • Linguistic analysis
  • Name
  • Patterns
  • Tagging

ASJC Scopus subject areas

  • Arts and Humanities(all)
  • Social Sciences(all)

Cite this

Grishman, R. (2012). Information Extraction. In The Oxford Handbook of Computational Linguistics Oxford University Press. https://doi.org/10.1093/oxfordhb/9780199276349.013.0030

Information Extraction. / Grishman, Ralph.

The Oxford Handbook of Computational Linguistics. Oxford University Press, 2012.

Research output: Chapter in Book/Report/Conference proceedingChapter

Grishman, R 2012, Information Extraction. in The Oxford Handbook of Computational Linguistics. Oxford University Press. https://doi.org/10.1093/oxfordhb/9780199276349.013.0030
Grishman R. Information Extraction. In The Oxford Handbook of Computational Linguistics. Oxford University Press. 2012 https://doi.org/10.1093/oxfordhb/9780199276349.013.0030
Grishman, Ralph. / Information Extraction. The Oxford Handbook of Computational Linguistics. Oxford University Press, 2012.
@inbook{063298231f89465cb5693c012ac2623a,
title = "Information Extraction",
abstract = "Information extraction (IE) is the automatic identification of selected types of entities, relations, or events in free text. This article appraises two specific strands of IE - name identification and classification, and event extraction. Conventional treatment of languages pays little attention to proper names, addresses etc. Presentations of language analysis generally look up words in a dictionary and identify them as nouns etc. The incessant presence of names in a text, makes linguistic analysis of the same difficult, in the absence of the names being identified by their types and as linguistic units. Name tagging involves creating, several finite-state patterns, each corresponding to some noun subset. Elements of the patterns would match specific/classes of tokens with particular features. Event extraction typically works by creating a series of regular expressions, customized to capture the relevant events. Enhancement of each expression is corresponded by a relevant, suitable enhancement in the event patterns.",
keywords = "Automatic, Event, Linguistic analysis, Name, Patterns, Tagging",
author = "Ralph Grishman",
year = "2012",
month = "9",
day = "18",
doi = "10.1093/oxfordhb/9780199276349.013.0030",
language = "English (US)",
isbn = "9780191743573",
booktitle = "The Oxford Handbook of Computational Linguistics",
publisher = "Oxford University Press",

}

TY - CHAP

T1 - Information Extraction

AU - Grishman, Ralph

PY - 2012/9/18

Y1 - 2012/9/18

N2 - Information extraction (IE) is the automatic identification of selected types of entities, relations, or events in free text. This article appraises two specific strands of IE - name identification and classification, and event extraction. Conventional treatment of languages pays little attention to proper names, addresses etc. Presentations of language analysis generally look up words in a dictionary and identify them as nouns etc. The incessant presence of names in a text, makes linguistic analysis of the same difficult, in the absence of the names being identified by their types and as linguistic units. Name tagging involves creating, several finite-state patterns, each corresponding to some noun subset. Elements of the patterns would match specific/classes of tokens with particular features. Event extraction typically works by creating a series of regular expressions, customized to capture the relevant events. Enhancement of each expression is corresponded by a relevant, suitable enhancement in the event patterns.

AB - Information extraction (IE) is the automatic identification of selected types of entities, relations, or events in free text. This article appraises two specific strands of IE - name identification and classification, and event extraction. Conventional treatment of languages pays little attention to proper names, addresses etc. Presentations of language analysis generally look up words in a dictionary and identify them as nouns etc. The incessant presence of names in a text, makes linguistic analysis of the same difficult, in the absence of the names being identified by their types and as linguistic units. Name tagging involves creating, several finite-state patterns, each corresponding to some noun subset. Elements of the patterns would match specific/classes of tokens with particular features. Event extraction typically works by creating a series of regular expressions, customized to capture the relevant events. Enhancement of each expression is corresponded by a relevant, suitable enhancement in the event patterns.

KW - Automatic

KW - Event

KW - Linguistic analysis

KW - Name

KW - Patterns

KW - Tagging

UR - http://www.scopus.com/inward/record.url?scp=84924081990&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84924081990&partnerID=8YFLogxK

U2 - 10.1093/oxfordhb/9780199276349.013.0030

DO - 10.1093/oxfordhb/9780199276349.013.0030

M3 - Chapter

SN - 9780191743573

SN - 9780199276349

BT - The Oxford Handbook of Computational Linguistics

PB - Oxford University Press

ER -