From text to structured information-automatic processing of medical reports

Lynette Hirsckman, Ralph Grishman, Naomi Sager

Research output: Contribution to conferencePaper

Abstract

This paper describes the analysis and processing programs for a set of natural language texts in a medical area (x-ray reports on patients with breast cancer). The programs convert the information in the text into a tabular form suitable for further automatic information processing (e.g., editing of records, question answering on the data collected, or statistical summaries of the data). To set up a tabular form appropriate for the data, we first perform a manual linguistic analysis on a sample of the texts. From this we obtain the word classes and the form of the table (called an information format) for this type of material. We then apply the series of processing programs to the sentences of the texts. Each sentence is parsed with the Linguistic String Parser English grammar in order to obtain its grammatical structure; certain standard English transformations are then applied to regularize the grammatical form of the sentence; and finally a set of "formatting transformations" map the words of the sentence into the slots of the format or table, in such a way that the sentence is reconstructible (up to paraphrase) from its representation in the table. The results of applying these programs to a corpus are described. This procedure enables us to convert a natural language corpus into a structured data base.

Original languageEnglish (US)
Pages267-275
Number of pages9
DOIs
StatePublished - Jun 7 1976
Event1976 American Federation of Information Processing Societies National Computer Conference, AFIPS 1976 - New York City, United States
Duration: Jun 7 1976Jun 10 1976

Other

Other1976 American Federation of Information Processing Societies National Computer Conference, AFIPS 1976
CountryUnited States
CityNew York City
Period6/7/766/10/76

Fingerprint

information processing
Linguistics
Processing
formatting
linguistics
X rays
language
grammar
cancer

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Information Systems
  • Software
  • Education

Cite this

Hirsckman, L., Grishman, R., & Sager, N. (1976). From text to structured information-automatic processing of medical reports. 267-275. Paper presented at 1976 American Federation of Information Processing Societies National Computer Conference, AFIPS 1976, New York City, United States. https://doi.org/10.1145/1499799.1499842

From text to structured information-automatic processing of medical reports. / Hirsckman, Lynette; Grishman, Ralph; Sager, Naomi.

1976. 267-275 Paper presented at 1976 American Federation of Information Processing Societies National Computer Conference, AFIPS 1976, New York City, United States.

Research output: Contribution to conferencePaper

Hirsckman, L, Grishman, R & Sager, N 1976, 'From text to structured information-automatic processing of medical reports' Paper presented at 1976 American Federation of Information Processing Societies National Computer Conference, AFIPS 1976, New York City, United States, 6/7/76 - 6/10/76, pp. 267-275. https://doi.org/10.1145/1499799.1499842
Hirsckman L, Grishman R, Sager N. From text to structured information-automatic processing of medical reports. 1976. Paper presented at 1976 American Federation of Information Processing Societies National Computer Conference, AFIPS 1976, New York City, United States. https://doi.org/10.1145/1499799.1499842
Hirsckman, Lynette ; Grishman, Ralph ; Sager, Naomi. / From text to structured information-automatic processing of medical reports. Paper presented at 1976 American Federation of Information Processing Societies National Computer Conference, AFIPS 1976, New York City, United States.9 p.
@conference{f59ab68c86ff48c59fe365c501faa323,
title = "From text to structured information-automatic processing of medical reports",
abstract = "This paper describes the analysis and processing programs for a set of natural language texts in a medical area (x-ray reports on patients with breast cancer). The programs convert the information in the text into a tabular form suitable for further automatic information processing (e.g., editing of records, question answering on the data collected, or statistical summaries of the data). To set up a tabular form appropriate for the data, we first perform a manual linguistic analysis on a sample of the texts. From this we obtain the word classes and the form of the table (called an information format) for this type of material. We then apply the series of processing programs to the sentences of the texts. Each sentence is parsed with the Linguistic String Parser English grammar in order to obtain its grammatical structure; certain standard English transformations are then applied to regularize the grammatical form of the sentence; and finally a set of {"}formatting transformations{"} map the words of the sentence into the slots of the format or table, in such a way that the sentence is reconstructible (up to paraphrase) from its representation in the table. The results of applying these programs to a corpus are described. This procedure enables us to convert a natural language corpus into a structured data base.",
author = "Lynette Hirsckman and Ralph Grishman and Naomi Sager",
year = "1976",
month = "6",
day = "7",
doi = "10.1145/1499799.1499842",
language = "English (US)",
pages = "267--275",
note = "1976 American Federation of Information Processing Societies National Computer Conference, AFIPS 1976 ; Conference date: 07-06-1976 Through 10-06-1976",

}

TY - CONF

T1 - From text to structured information-automatic processing of medical reports

AU - Hirsckman, Lynette

AU - Grishman, Ralph

AU - Sager, Naomi

PY - 1976/6/7

Y1 - 1976/6/7

N2 - This paper describes the analysis and processing programs for a set of natural language texts in a medical area (x-ray reports on patients with breast cancer). The programs convert the information in the text into a tabular form suitable for further automatic information processing (e.g., editing of records, question answering on the data collected, or statistical summaries of the data). To set up a tabular form appropriate for the data, we first perform a manual linguistic analysis on a sample of the texts. From this we obtain the word classes and the form of the table (called an information format) for this type of material. We then apply the series of processing programs to the sentences of the texts. Each sentence is parsed with the Linguistic String Parser English grammar in order to obtain its grammatical structure; certain standard English transformations are then applied to regularize the grammatical form of the sentence; and finally a set of "formatting transformations" map the words of the sentence into the slots of the format or table, in such a way that the sentence is reconstructible (up to paraphrase) from its representation in the table. The results of applying these programs to a corpus are described. This procedure enables us to convert a natural language corpus into a structured data base.

AB - This paper describes the analysis and processing programs for a set of natural language texts in a medical area (x-ray reports on patients with breast cancer). The programs convert the information in the text into a tabular form suitable for further automatic information processing (e.g., editing of records, question answering on the data collected, or statistical summaries of the data). To set up a tabular form appropriate for the data, we first perform a manual linguistic analysis on a sample of the texts. From this we obtain the word classes and the form of the table (called an information format) for this type of material. We then apply the series of processing programs to the sentences of the texts. Each sentence is parsed with the Linguistic String Parser English grammar in order to obtain its grammatical structure; certain standard English transformations are then applied to regularize the grammatical form of the sentence; and finally a set of "formatting transformations" map the words of the sentence into the slots of the format or table, in such a way that the sentence is reconstructible (up to paraphrase) from its representation in the table. The results of applying these programs to a corpus are described. This procedure enables us to convert a natural language corpus into a structured data base.

UR - http://www.scopus.com/inward/record.url?scp=85026967968&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85026967968&partnerID=8YFLogxK

U2 - 10.1145/1499799.1499842

DO - 10.1145/1499799.1499842

M3 - Paper

SP - 267

EP - 275

ER -