File Download

There are no files associated with this item.

  • Find it @ UNIST can give you direct access to the published full text of this article. (UNISTARs only)
Related Researcher

나승훈

Na, Seung-Hoon
Natural Language Processing Lab
Read More

Views & Downloads

Detailed Information

Cited time in webofscience Cited time in scopus
Metadata Downloads

Conditional Random Fields for Korean Morpheme Segmentation and POS Tagging

Author(s)
Na, Seung-Hoon
Issued Date
2015-06
DOI
10.1145/2700051
URI
https://scholarworks.unist.ac.kr/handle/201301/86824
Citation
ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, v.14, no.3, pp.10
Abstract
There has been recent interest in statistical approaches to Korean morphological analysis. However, previous studies have been based mostly on generative models, including a hidden Markov model (HMM), without utilizing discriminative models such as a conditional random field (CRF). We present a two-stage discriminative approach based on CRFs for Korean morphological analysis. Similar to methods used for Chinese, we perform two disambiguation procedures based on CRFs: (1) morpheme segmentation and (2) POS tagging. In morpheme segmentation, an input sentence is segmented into sequences of morphemes, where a morpheme unit is either atomic or compound. In the POS tagging procedure, each morpheme (atomic or compound) is assigned a POS tag. Once POS tagging is complete, we carry out a post-processing of the compound morphemes, where each compound morpheme is further decomposed into atomic morphemes, which is based on pre-analyzed patterns and generalized HMMs obtained from the given tagged corpus. Experimental results show the promise of our proposed method.
Publisher
ASSOC COMPUTING MACHINERY
ISSN
2375-4699
Keyword (Author)
POS taggingKorean morphological analysisAlgorithmsExperimentationConditional random fieldsmorpheme segmentation

qrcode

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.