Category Multi-Representation: A Unified Solution for Named Entity Recognition in Clinical Texts

Page view(s)

Checked on Aug 04, 2025

Please use this identifier to cite or link to this item: https://oar.a-star.edu.sg/communities-collections/articles/14002

Title:

Category Multi-Representation: A Unified Solution for Named Entity Recognition in Clinical Texts

Journal Title:

The Pacific-Asia Conference on Knowledge Discovery and Data Mining 2018

DOI:

Publication URL:

Authors:

Lei Hou, Xiao-Li Li, Jiangtao Zhang, Juanzi Li, Shuai Wang, Yan Zhang, Yixin Cao

Keywords:

Publication Date:

03 June 2018

Citation:

Abstract:

Clinical Named Entity Recognition (CNER), the task of identifying the entity boundaries in clinical texts, is essential for many applications. Previous methods usually follow the traditional NER methods that heavily rely on language specific features (i.e. linguistics and lexicons) and high quality annotated data. However, due to the problem of Limited Availability of Annotated Data and Informal Clinical Texts, CNER becomes more challenging. In this paper, we propose a novel method that learn multiple representations for each category, namely category-multi-representation (CMR) that captures the semantic relatedness between words and clinical categories from different perspectives. CMR is learned based on a large scale unannotated corpus and a small set of annotated data, which greatly alleviates the burden of human effort. Instead of the language specific features, our proposed method uses more evidential features without any additional NLP tools, and enjoys a lightweight adaption among languages. We conduct a series of experiments to verify our new CMR features can further improve the performance of NER significantly without leveraging any external lexicons.

License type:

PublisherCopyrights

Funding Info:

Description:

URI:

https://oar.a-star.edu.sg/communities-collections/articles/14002

ISBN:

Collections:

Institute for Infocomm Research

Files uploaded:

Manuscripts in This Item:

File	Size	Format	Action
78.pdf	427.32 KB	PDF	Open