ZHANG Su-zhi and LIU Jing-jiao. A short text KNN classification algorithm based on semantic[J]. Journal of Light Industry, 2012, 27(6): 1-4. doi: 10.3969/j.issn.2095-476X.2012.06.001
Citation:
ZHANG Su-zhi and LIU Jing-jiao. A short text KNN classification algorithm based on semantic[J]. Journal of Light Industry, 2012, 27(6): 1-4.
doi:
10.3969/j.issn.2095-476X.2012.06.001
A short text KNN classification algorithm based on semantic
College of Computer and Communication Engineering, Zhengzhou University of Light Industry, Zhengzhou 450002, China
Received Date:
2012-02-26 Available Online:
2012-09-16
Abstract
Aiming at the problems of key words sparse features,sample quantity of the short text classification and differcult dealing with,a method based on semantic KNN short text classification algorithm was presented.The algorithm extracts short text feature words based on the word segmentation strategy, combining CNKI to key for concept mapping to improve the short text semantic expression,KNN classification algorithm was improved according to the characteristics of short text through application of LSA dimensionality reduction. The experiment results showed that the algorithm can effectively improve the short text classification performance.
Yang Y M,Liu X.A re-examination of text categorization methods[C]//Proceedings 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval(SIGIR'99),Berkeley:ACM Press,1999:42-49.
Xue N W,Shen L B.Chinese word segmentation as LMR tagging[C]//Proceedings of the Second SIGHAN Workshop on Chinese Language Processing,Stroudsburg:ACL,2003:176-179.
ZHANG Su-zhi and LIU Jing-jiao. A short text KNN classification algorithm based on semantic[J]. Journal of Light Industry, 2012, 27(6): 1-4.
doi: 10.3969/j.issn.2095-476X.2012.06.001