Active learning based named entity recognition and its application in natural language coverless information hiding

Huiyu Sun, Ralph Grishman, Yingchao Wang

Research output: Contribution to journalArticle

Abstract

Named entity recognition systems trained on one domain usually have a substantial drop in performance when applied to a different domain. In this paper, we apply active learning to domain adaptation for named entity recognition systems, propose various sampling optimizations, and show that the labeling effort can be reduced by over 92% while achieving the same performance as supervised method. Named entity recognition can be effectively applied to information extraction, machine translation, text classification and many other areas. We propose a new application area for named entity recognition, namely in natural language information hiding: A novel coverless information hiding method based on text big data is proposed, utilizing named entities to mark the locations of the hidden information. Coverless information hiding is a brand new area of information hiding that achieves the transmission of hidden information without any modification in the carrier text. Furthermore, active learning allows our information hiding method to be applied to text from new domains without substantial labeling effort.

Original languageEnglish (US)
Pages (from-to)443-451
Number of pages9
JournalJournal of Internet Technology
Volume18
Issue number2
StatePublished - 2017

    Fingerprint

Keywords

  • Active learning
  • Coverless information hiding
  • Named entity recognition
  • Natural language information hiding

ASJC Scopus subject areas

  • Software
  • Computer Networks and Communications

Cite this