File Download

There are no files associated with this item.

  • Find it @ UNIST can give you direct access to the published full text of this article. (UNISTARs only)
Related Researcher

나승훈

Na, Seung-Hoon
Natural Language Processing Lab
Read More

Views & Downloads

Detailed Information

Cited time in webofscience Cited time in scopus
Metadata Downloads

Improving LSTM CRFs using character-based compositions for Korean named entity recognition

Author(s)
Na, Seung-HoonKim, H.Min, J.Kim, K.
Issued Date
2019-03
DOI
10.1016/j.csl.2018.09.005
URI
https://scholarworks.unist.ac.kr/handle/201301/86803
Citation
COMPUTER SPEECH AND LANGUAGE, v.54, pp.106 - 121
Abstract
Standard approaches to named entity recognition (NER) are based on sequential labeling methods, such as conditional random fields (CRFs), which label each word in a sentence and extract entities from them that correspond to named entities. With the extensive deployment of deep learning methods for sequential labeling tasks, state-of-the-art NER performance has been achieved on long short-term memory (LSTM) architectures using only basic features. In this paper, we address Korean NER tasks and propose an extension of a bidirectional LSTM CRF by investigating character-based representation. Our extension involves deploying a hybrid representation using ConvNet and LSTM for the sequential modeling of characters, namely a character-based LSTM-ConvNet hybrid representation. Using morphemes as processing units for bidirectional LSTM, we apply a proposed hybrid representation composed of morpheme vectors. Experimental results showed that the proposed LSTM-ConvNet hybrid representation yielded improvements over each single representation on standard Korean NER tasks. © 2018 Elsevier Ltd
Publisher
Academic Press
ISSN
0885-2308
Keyword (Author)
Character-based compositionConvolutional neural networksLong short term memoryNamed entity recognition

qrcode

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.