﻿{"id":61,"date":"2017-07-21T00:00:00","date_gmt":"2018-05-23T09:55:10","guid":{"rendered":""},"modified":"2018-12-14T11:01:40","modified_gmt":"2018-12-14T03:01:40","slug":"hhmm-based-chinese-lexical-analyzer-ictclas","status":"publish","type":"post","link":"http:\/\/www.nlpir.org\/wordpress\/2017\/07\/21\/hhmm-based-chinese-lexical-analyzer-ictclas\/","title":{"rendered":"HHMM-based Chinese Lexical Analyzer ICTCLAS"},"content":{"rendered":"<p><P><FONT face=\"Times New Roman\">Hua-Ping ZHANG, Hong-Kui Yu, De-Yi Xiong, Qun LIU. HHMM-based Chinese Lexical Analyzer ICTCLAS, Second SIGHAN workshop affiliated with 41th ACL; <?XML:NAMESPACE PREFIX = ST1 \/><ST1:CITY>Sapporo<\/ST1:CITY> <ST1:PLACE><ST1:COUNTRY-REGION>Japan<\/ST1:COUNTRY-REGION><\/ST1:PLACE>, July, 2003, pp. 184-187<\/FONT><\/P><br \/>\n<P style=\"MARGIN: 12pt 0cm\" class=AbstractHeading><STRONG><FONT size=3><FONT face=\"Times New Roman\"><SPAN style=\"mso-spacerun: yes\">&nbsp;<\/SPAN><SPAN style=\"mso-ansi-language: EN-GB\"><SPAN style=\"mso-spacerun: yes\">&nbsp;<\/SPAN><\/SPAN><SPAN lang=EN-US>Abstract<\/SPAN><\/FONT><\/FONT><\/STRONG><\/P><br \/>\n<P style=\"MARGIN: 0cm 19.85pt 0pt\" class=Abstract><FONT size=3><FONT face=\"Times New Roman\"><SPAN style=\"mso-fareast-font-family: \u5b8b\u4f53; mso-fareast-language: ZH-CN\" lang=EN-US>This document presents the results from Inst. of Computing Tech., CAS in <\/SPAN><SPAN style=\"mso-bidi-font-size: 11.0pt; mso-fareast-font-family: \u5b8b\u4f53; mso-fareast-language: ZH-CN\" lang=EN-US>the ACL-SIGHAN-sponsored First International Chinese Word Segmentation Bakeoff.<\/SPAN><SPAN lang=EN-US> <\/SPAN><SPAN style=\"mso-fareast-font-family: \u5b8b\u4f53; mso-fareast-language: ZH-CN\" lang=EN-US>The authors introduce the unified HHMM-based frame of our Chinese lexical analyzer ICTCLAS and explain the operation of the six tracks. Then provide the evaluation results and give more analysis. Evaluation on ICTCLAS shows that its performance is competitive. Compared with other system, ICTCLAS has ranked top both in CTB and PK closed track. In PK open track, it ranks second position. ICTCLAS BIG5 version was transformed from GB version only in two days; however, it achieved well in two BIG5 closed tracks. Through the first bakeoff, we could learn more about the development in Chinese word segmentation and become more confident on our HHMM-based approach. At the same time, we really find our problems during the evaluation. The bakeoff is interesting and helpful. <\/SPAN><\/FONT><\/FONT><SPAN style=\"mso-bidi-font-size: 9.0pt; mso-fareast-language: ZH-CN\" lang=EN-US><?xml:namespace prefix = o ns = \"urn:schemas-microsoft-com:office:office\" \/><o:p><\/o:p><\/SPAN><\/P><br \/>\n<P><br \/>\n<P><FONT face=\"Times New Roman\">\u4e0b\u8f7d\u5730\u5740\uff1a<A href=\"http:\/\/www.nlpir.org\/wordpress\/attachments\/2011\/04\/HHMM-based Chinese Lexical Analyzer ICTCLAS.pdf\" target=_blank><IMG border=0 src=\"http:\/\/www.nlpir.org\/images\/base\/attachment.gif\"> HHMM-based Chinese Lexical Analyzer ICTCLAS.pdf(143 KB)<\/A><\/FONT><\/P><\/P><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Hua-Ping ZHANG, Hong-Kui Yu, De-Yi Xiong &hellip; <a href=\"http:\/\/www.nlpir.org\/wordpress\/2017\/07\/21\/hhmm-based-chinese-lexical-analyzer-ictclas\/\">\u7ee7\u7eed\u9605\u8bfb <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[38],"tags":[],"_links":{"self":[{"href":"http:\/\/www.nlpir.org\/wordpress\/wp-json\/wp\/v2\/posts\/61"}],"collection":[{"href":"http:\/\/www.nlpir.org\/wordpress\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/www.nlpir.org\/wordpress\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/www.nlpir.org\/wordpress\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"http:\/\/www.nlpir.org\/wordpress\/wp-json\/wp\/v2\/comments?post=61"}],"version-history":[{"count":1,"href":"http:\/\/www.nlpir.org\/wordpress\/wp-json\/wp\/v2\/posts\/61\/revisions"}],"predecessor-version":[{"id":1520,"href":"http:\/\/www.nlpir.org\/wordpress\/wp-json\/wp\/v2\/posts\/61\/revisions\/1520"}],"wp:attachment":[{"href":"http:\/\/www.nlpir.org\/wordpress\/wp-json\/wp\/v2\/media?parent=61"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/www.nlpir.org\/wordpress\/wp-json\/wp\/v2\/categories?post=61"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/www.nlpir.org\/wordpress\/wp-json\/wp\/v2\/tags?post=61"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}