Improving LSTM CRFs using character-based compositions for Korean named entity recognition

Scholarworks@UNIST

Related Researcher

나승훈

Fulltext: https://www.sciencedirect.com/science/article/pii/S0885230817300852?pes=vor&utm_source=clarivate&getft_integrator=clarivate

Abstract: Standard approaches to named entity recognition (NER) are based on sequential labeling methods, such as conditional random fields (CRFs), which label each word in a sentence and extract entities from them that correspond to named entities. With the extensive deployment of deep learning methods for sequential labeling tasks, state-of-the-art NER performance has been achieved on long short-term memory (LSTM) architectures using only basic features. In this paper, we address Korean NER tasks and propose an extension of a bidirectional LSTM CRF by investigating character-based representation. Our extension involves deploying a hybrid representation using ConvNet and LSTM for the sequential modeling of characters, namely a character-based LSTM-ConvNet hybrid representation. Using morphemes as processing units for bidirectional LSTM, we apply a proposed hybrid representation composed of morpheme vectors. Experimental results showed that the proposed LSTM-ConvNet hybrid representation yielded improvements over each single representation on standard Korean NER tasks. © 2018 Elsevier Ltd

Keyword (Author): Character-based composition, Convolutional neural networks, Long short term memory, Named entity recognition
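
The abstract describes NER as sequential labeling: each unit in a sentence is labeled, and entities are then extracted from the labels. Below is a minimal sketch of that extraction step only, assuming the common BIO tagging scheme (B-X begins an entity of type X, I-X continues it, O is outside any entity). The scheme, the function name extract_entities, and the sample sentence are illustrative assumptions, not details taken from the paper. The paper uses morphemes as processing units; plain words are used here for simplicity.

```python
from typing import List, Optional, Tuple


def extract_entities(tokens: List[str], tags: List[str]) -> List[Tuple[str, str]]:
    """Collect (entity_text, entity_type) pairs from BIO tags.

    Hypothetical helper for illustration; assumes one tag per token.
    """
    entities: List[Tuple[str, str]] = []
    current_tokens: List[str] = []
    current_type: Optional[str] = None

    def flush() -> None:
        # Close the entity being built, if any, and reset the buffer.
        nonlocal current_tokens, current_type
        if current_tokens and current_type is not None:
            entities.append((" ".join(current_tokens), current_type))
        current_tokens, current_type = [], None

    for token, tag in zip(tokens, tags):
        if tag.startswith("B-"):
            # A B- tag always starts a new entity.
            flush()
            current_tokens, current_type = [token], tag[2:]
        elif tag.startswith("I-") and current_type == tag[2:]:
            # An I- tag of the same type extends the open entity.
            current_tokens.append(token)
        else:
            # "O", or an I- tag that does not continue the open entity:
            # close the open entity and skip this token.
            flush()
    flush()
    return entities


tokens = ["Kim", "visited", "Seoul", "National", "University"]
tags = ["B-PER", "O", "B-ORG", "I-ORG", "I-ORG"]
print(extract_entities(tokens, tags))
```

In a model such as the bidirectional LSTM CRF mentioned in the abstract, the tag sequence would come from the model's decoder; here it is supplied by hand so that the extraction logic can be read on its own.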
