Incorporating New Words Detection with Chinese Word Segmentation – NLPIR自然语言处理与信息检索共享平台

自然语言处理与信息检索共享平台 自然语言处理与信息检索共享平台

Incorporating New Words Detection with Chinese Word Segmentation







Hua-Ping ZHANG,Jian GAO,Qian MO,He-Yan HUANG.Incorporating New Words Detection with Chinese Word Segmentation.In Proceedings of CIPS-SIGHAN Joint Conference on Chinese Language Processing (CLP 2010).Beijing, China.2010.8 .p249-251.


Abstract


With development in Chinese words segmentation, in-vocabulary word segmentation and named entity recognition achieves state-of-art performance. However, new words become bottleneck to Chinese word segmentation. This paper presents the result from Beijing Institute of Technology (BIT) in the Sixth International Chinese Word Segmentation Bakeoff in 2010. Firstly, the author reviewed the problem caused by the new words in Chinese texts, then introduced the algorithm of new words detection. The final section provided the official evaluation result in this bakeoff and gave conclusions.


WordSegmentation-BIT0723.pdf(80.5 KB)

You May Also Like

About the Author: nlpir

发表评论