Cross-document event extraction and tracking

Task, evaluation, techniques and challenges

Heng Ji, Ralph Grishman, Zheng Chen, Prashant Gupta

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

This paper proposes a new task of cross-document event extraction and tracking and its evaluation metrics. We identify important person entities which are frequently involved in events as 'centroid entities'. Then we link the events involving the same centroid entity along a time line. We also present a system performing this task and our current approaches to address the main research challenges. We demonstrate that global inference from background knowledge and cross-document event aggregation are crucial to enhance the performance. This new task defines several extensions to the traditional single-document Information Extraction paradigm beyond 'slot filling'.
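
As a rough illustration of the task described in the abstract (not the authors' system), the two steps can be sketched in Python: pick the person entities that take part in the most events as centroid entities, then order each centroid's events from all documents by date. The `Event` fields, the function names, the `top_k` cutoff, and the sample data are all hypothetical, and the sketch assumes events have already been extracted and their dates normalized. It does not attempt the global inference or cross-document aggregation that the abstract reports as crucial.

```python
from collections import Counter, defaultdict
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class Event:
    doc_id: str      # source document (hypothetical field)
    event_type: str  # e.g. "Meet", "Travel"
    when: date       # normalized event date (assumed already resolved)
    persons: tuple   # person entities taking part in the event


def centroid_entities(events, top_k=1):
    """Return the top_k persons that take part in the most events."""
    counts = Counter(p for e in events for p in set(e.persons))
    return [name for name, _ in counts.most_common(top_k)]


def build_timelines(events, centroids):
    """Map each centroid to its events from all documents, in date order."""
    timelines = defaultdict(list)
    for e in events:
        for p in set(e.persons):
            if p in centroids:
                timelines[p].append(e)
    for p in timelines:
        timelines[p].sort(key=lambda e: e.when)
    return dict(timelines)


# Made-up events spread over three documents.
events = [
    Event("doc2", "Meet", date(2009, 3, 5), ("Alice", "Bob")),
    Event("doc1", "Travel", date(2009, 1, 10), ("Alice",)),
    Event("doc3", "Elect", date(2009, 2, 1), ("Bob",)),
    Event("doc3", "Speak", date(2009, 2, 20), ("Alice",)),
]
centroids = centroid_entities(events, top_k=1)
timelines = build_timelines(events, centroids)
for e in timelines["Alice"]:
    print(e.when.isoformat(), e.event_type, e.doc_id)
```

A frequency count is only one plausible reading of "frequently involved in events"; a real system would also need to merge coreferent names and duplicate event mentions across documents before counting.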

Original language: English (US)
Title of host publication: International Conference Recent Advances in Natural Language Processing, RANLP
Pages: 166-172
Number of pages: 7
State: Published - 2009
Event: International Conference on Recent Advances in Natural Language Processing, RANLP-2009 - Borovets, Bulgaria
Duration: Sep 14 2009 → Sep 16 2009

Other

Other: International Conference on Recent Advances in Natural Language Processing, RANLP-2009
Country: Bulgaria
City: Borovets
Period: 9/14/09 → 9/16/09

Keywords

  • Cross-document extraction
  • Event
  • Information extraction

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Science Applications
  • Software
  • Electrical and Electronic Engineering

Cite this

Ji, H., Grishman, R., Chen, Z., & Gupta, P. (2009). Cross-document event extraction and tracking: Task, evaluation, techniques and challenges. In International Conference Recent Advances in Natural Language Processing, RANLP (pp. 166-172).

@inproceedings{7f6cb9a04be642da8f854b19e257595c,
title = "Cross-document event extraction and tracking: Task, evaluation, techniques and challenges",
abstract = "This paper proposes a new task of cross-document event extraction and tracking and its evaluation metrics. We identify important person entities which are frequently involved in events as 'centroid entities'. Then we link the events involving the same centroid entity along a time line. We also present a system performing this task and our current approaches to address the main research challenges. We demonstrate that global inference from background knowledge and cross-document event aggregation are crucial to enhance the performance. This new task defines several extensions to the traditional single-document Information Extraction paradigm beyond 'slot filling'.",
keywords = "Cross-document extraction, Event, Information extraction",
author = "Heng Ji and Ralph Grishman and Zheng Chen and Prashant Gupta",
year = "2009",
language = "English (US)",
pages = "166--172",
booktitle = "International Conference Recent Advances in Natural Language Processing, RANLP",

}

TY - GEN
T1 - Cross-document event extraction and tracking
T2 - Task, evaluation, techniques and challenges
AU - Ji, Heng
AU - Grishman, Ralph
AU - Chen, Zheng
AU - Gupta, Prashant
PY - 2009
Y1 - 2009
N2 - This paper proposes a new task of cross-document event extraction and tracking and its evaluation metrics. We identify important person entities which are frequently involved in events as 'centroid entities'. Then we link the events involving the same centroid entity along a time line. We also present a system performing this task and our current approaches to address the main research challenges. We demonstrate that global inference from background knowledge and cross-document event aggregation are crucial to enhance the performance. This new task defines several extensions to the traditional single-document Information Extraction paradigm beyond 'slot filling'.

KW - Cross-document extraction
KW - Event
KW - Information extraction
UR - http://www.scopus.com/inward/record.url?scp=84866851168&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84866851168&partnerID=8YFLogxK
M3 - Conference contribution
SP - 166
EP - 172
BT - International Conference Recent Advances in Natural Language Processing, RANLP
ER -