• KSII Transactions on Internet and Information Systems
    Monthly Online Journal (eISSN: 1976-7277)

Lightweight Named Entity Extraction for Korean Short Message Service Text

Vol. 5, No. 3, March 30, 2011
10.3837/tiis.2011.03.006, Download Paper (Free):

Abstract

In this paper, we propose a hybrid method of Machine Learning (ML) algorithm and a rule-based algorithm to implement a lightweight Named Entity (NE) extraction system for Korean SMS text. NE extraction from Korean SMS text is a challenging theme due to the resource limitation on a mobile phone, corruptions in input text, need for extension to include personal information stored in a mobile phone, and sparsity of training data. The proposed hybrid method retaining the advantages of statistical ML and rule-based algorithms provides fully-automated procedures for the combination of ML approaches and their correction rules using a threshold-based soft decision function. The proposed method is applied to Korean SMS texts to extract person’s names as well as location names which are key information in personal appointment management system. Our proposed system achieved 80.53% in F-measure in this domain, superior to those of the conventional ML approaches.


Statistics

Show / Hide Statistics

Statistics (Cumulative Counts from December 1st, 2015)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.


Cite this article

[IEEE Style]
C. Seon, J. Yoo, H. Kim, J. Kim1, J. Seo, "Lightweight Named Entity Extraction for Korean Short Message Service Text," KSII Transactions on Internet and Information Systems, vol. 5, no. 3, pp. 560-574, 2011. DOI: 10.3837/tiis.2011.03.006.

[ACM Style]
Choong-Nyoung Seon, JinHwan Yoo, Harksoo Kim, Ji-Hwan Kim1, and Jungyun Seo. 2011. Lightweight Named Entity Extraction for Korean Short Message Service Text. KSII Transactions on Internet and Information Systems, 5, 3, (2011), 560-574. DOI: 10.3837/tiis.2011.03.006.

[BibTeX Style]
@article{tiis:19946, title="Lightweight Named Entity Extraction for Korean Short Message Service Text", author="Choong-Nyoung Seon and JinHwan Yoo and Harksoo Kim and Ji-Hwan Kim1 and Jungyun Seo and ", journal="KSII Transactions on Internet and Information Systems", DOI={10.3837/tiis.2011.03.006}, volume={5}, number={3}, year="2011", month={March}, pages={560-574}}