﻿{"id":1077,"date":"2018-11-30T09:08:27","date_gmt":"2018-11-30T01:08:27","guid":{"rendered":"http:\/\/www.nlpir.org\/wordpress\/?p=1077"},"modified":"2018-12-13T22:17:24","modified_gmt":"2018-12-13T14:17:24","slug":"nlpir-ictcla2018-academic-seminar-9th-issue","status":"publish","type":"post","link":"http:\/\/www.nlpir.org\/wordpress\/2018\/11\/30\/nlpir-ictcla2018-academic-seminar-9th-issue\/","title":{"rendered":"An Unsupervised Method for Uncovering Morphological Chains"},"content":{"rendered":"<div id=\"article_body\">\n<p class=\"MsoNormal\" style=\"text-align: center; margin: 0cm 0cm 0pt;\" align=\"center\"><b style=\"mso-bidi-font-weight: normal;\"><span lang=\"EN-US\" style=\"font-size: 16pt; mso-bidi-font-size: 11.0pt; mso-bidi-font-family: 'Times New Roman';\"><span style=\"font-family: Times New Roman;\">NLPIR SEMINAR Y2018#9<br \/>\n<!--?xml:namespace prefix = \"o\" ns = \"urn:schemas-microsoft-com:office:office\" \/--><\/span><\/span><\/b><span style=\"display: none;\">\u81ea\u7136\u8bed\u8a00\u5904\u7406\u4e0e\u4fe1\u606f\u68c0\u7d22\u5171\u4eab\u5e73\u53f0\u0010e\/x\u0011u%K ^\u0016lG<\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin: 0cm 0cm 0pt; padding-left: 30px;\"><b style=\"mso-bidi-font-weight: normal;\"><span lang=\"EN-US\" style=\"font-size: 12pt; mso-bidi-font-size: 11.0pt; mso-bidi-font-family: 'Times New Roman';\"><span style=\"font-family: Times New Roman;\">INTRO<\/span><\/span><\/b><span style=\"display: none;\">\u81ea\u7136\u8bed\u8a00\u5904\u7406\u4e0e\u4fe1\u606f\u68c0\u7d22\u5171\u4eab\u5e73\u53f0\u001fd\u0005N V\u0016V#p8M!i<\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin: 0cm 0cm 0pt;\"><span lang=\"EN-US\" style=\"mso-bidi-font-family: 'Times New Roman';\"><span style=\"font-size: medium;\"><span style=\"font-family: Times New Roman;\"><span lang=\"EN-US\">\u00a0 \u00a0 \u00a0 \u00a0 <\/span>In the new semester, our Lab, Web Search Mining and Security Lab, plans to hold an academic seminar every Wednesdays, and each time a keynote speaker will share understanding of papers published in recent years with you.<\/span><\/span><\/span><\/p>\n<p><span style=\"display: none;\">;K\u0002h\u0012n<br \/>\n`\u001dF\u0016D1N\u000ex0<\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin: 0cm 0cm 0pt;\"><span lang=\"EN-US\"><span style=\"font-family: Times New Roman; font-size: medium;\">\u00a0<\/span><\/span><span style=\"display: none;\">\u81ea\u7136\u8bed\u8a00\u5904\u7406\u4e0e\u4fe1\u606f\u68c0\u7d22\u5171\u4eab\u5e73\u53f0$U\u0003G9n(u;|<\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin: 0cm 0cm 0pt; padding-left: 30px;\"><b style=\"mso-bidi-font-weight: normal;\"><span lang=\"EN-US\" style=\"font-size: 12pt; mso-bidi-font-size: 11.0pt; mso-bidi-font-family: 'Times New Roman';\"><span style=\"font-family: Times New Roman;\">Arrangement<\/span><\/span><\/b><span style=\"display: none;\">\u81ea\u7136\u8bed\u8a00\u5904\u7406\u4e0e\u4fe1\u606f\u68c0\u7d22\u5171\u4eab\u5e73\u53f0\u001b?\u001dL\/o\u0016q3\\&amp;L\u000eq,B<\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin: 0cm 0cm 0pt;\"><span lang=\"EN-US\" style=\"mso-bidi-font-family: 'Times New Roman';\"><span style=\"font-size: medium;\"><span style=\"font-family: Times New Roman;\"><span lang=\"EN-US\">\u00a0 \u00a0 \u00a0 \u00a0 <\/span>This week&#8217;s seminar is organized as follows:<\/span><\/span><\/span><span style=\"display: none;\">\u81ea\u7136\u8bed\u8a00\u5904\u7406\u4e0e\u4fe1\u606f\u68c0\u7d22\u5171\u4eab\u5e73\u53f0+C0R]:?&#8217;x\u001fs9p<\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin: 0cm 0cm 0pt;\"><span lang=\"EN-US\" style=\"mso-bidi-font-family: 'Times New Roman';\"><span style=\"font-size: medium;\"><span style=\"font-family: Times New Roman;\"><span lang=\"EN-US\">\u00a0 \u00a0 \u00a0 \u00a0 <\/span>1. The seminar time is <b style=\"mso-bidi-font-weight: normal;\">1.pm, Wed<\/b>, at Zhongguancun Technology Park ,Building 5, 1306.<\/span><\/span><\/span><span style=\"display: none;\">\u81ea\u7136\u8bed\u8a00\u5904\u7406\u4e0e\u4fe1\u606f\u68c0\u7d22\u5171\u4eab\u5e73\u53f0\u000bo\u000eW:z\u001b[*A%{!o9P\u0017_\u0001u<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-align: left; margin: 0cm 0cm 0pt;\" align=\"left\"><span lang=\"EN-US\" style=\"mso-bidi-font-family: 'Times New Roman';\"><span style=\"font-size: medium;\"><span style=\"font-family: Times New Roman;\"><span lang=\"EN-US\">\u00a0 \u00a0 \u00a0 \u00a0 <\/span>2. The lecturer is <b style=\"mso-bidi-font-weight: normal;\">Yaofei Yang<\/b>, the paper&#8217;s titles are <b style=\"mso-bidi-font-weight: normal;\">An Unsupervised Method for Uncovering Morphological Chains <\/b>and<b style=\"mso-bidi-font-weight: normal;\"> From Segmentation to Analyses A Probabilistic Model for Unsupervised Morphology Induction<\/b>.<\/span><\/span><\/span><span style=\"display: none;\">\u81ea\u7136\u8bed\u8a00\u5904\u7406\u4e0e\u4fe1\u606f\u68c0\u7d22\u5171\u4eab\u5e73\u53f0#T\u000eG&amp;X(R\u0011|\u0001L\u0006N<br \/>\nf<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-align: left; margin: 0cm 0cm 0pt;\" align=\"left\"><span lang=\"EN-US\" style=\"mso-bidi-font-family: 'Times New Roman';\"><span style=\"font-size: medium;\"><span style=\"font-family: Times New Roman;\"><span lang=\"EN-US\">\u00a0 \u00a0 \u00a0 \u00a0 <\/span>3. The seminar will be hosted Wang Gang.<\/span><\/span><\/span><span style=\"display: none;\">\u81ea\u7136\u8bed\u8a00\u5904\u7406\u4e0e\u4fe1\u606f\u68c0\u7d22\u5171\u4eab\u5e73\u53f0\u0013?\u0005I\u000er\u001av\u0012e\/S\u001dP\u001bf<\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin: 0cm 0cm 0pt;\"><span lang=\"EN-US\" style=\"mso-bidi-font-family: 'Times New Roman';\"><span style=\"font-size: medium;\"><span style=\"font-family: Times New Roman;\"><span lang=\"EN-US\">\u00a0 \u00a0 \u00a0 \u00a0 <\/span>4. Attachment is the paper of this seminar, please download in advance<\/span><\/span><\/span><\/p>\n<p><span style=\"display: none;\">0~<br \/>\nX!t d\u001fG\/Y F0<\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin: 0cm 0cm 0pt;\"><span lang=\"EN-US\"><span style=\"font-family: Times New Roman; font-size: medium;\">\u00a0<\/span><\/span><\/p>\n<p><span style=\"display: none;\">\u000f[M&amp;~,j\u0006\\\u0004S\u0011m.V0<\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin: 0cm 0cm 0pt;\"><span lang=\"EN-US\" style=\"mso-bidi-font-family: 'Times New Roman';\"><span style=\"font-size: medium;\"><span style=\"font-family: Times New Roman;\"><span lang=\"EN-US\">\u00a0 \u00a0 \u00a0 \u00a0 <\/span>Everyone interested in this topic is welcomed to join us. the following is the abstract for this week\u2019s paper<\/span><\/span><\/span><span style=\"display: none;\">\u81ea\u7136\u8bed\u8a00\u5904\u7406\u4e0e\u4fe1\u606f\u68c0\u7d22\u5171\u4eab\u5e73\u53f0\u000em*_,sN\u0018_\u0013u#{<\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin: 0cm 0cm 0pt;\"><span lang=\"EN-US\"><span style=\"font-family: Times New Roman; font-size: medium;\">\u00a0<\/span><\/span><span style=\"display: none;\">\u81ea\u7136\u8bed\u8a00\u5904\u7406\u4e0e\u4fe1\u606f\u68c0\u7d22\u5171\u4eab\u5e73\u53f0(\\\u0019p+o\fx\u0013B#U\u0002]<\/span><\/p>\n<div style=\"mso-element: para-border-div; mso-border-alt: dotted windowtext .5pt; border: windowtext 1pt dotted; padding: 1pt 4pt 1pt 4pt;\">\n<p class=\"MsoNormal\" style=\"layout-grid-mode: char; text-align: center; margin: 0cm 0cm 0pt; mso-border-alt: dotted windowtext .5pt; mso-padding-alt: 1.0pt 4.0pt 1.0pt 4.0pt; padding: 0cm;\" align=\"center\"><span lang=\"EN-US\" style=\"font-size: 14pt; mso-bidi-font-size: 11.0pt;\"><span style=\"font-family: Times New Roman;\">An Unsupervised Method for Uncovering Morphological Chains<\/span><\/span><span style=\"display: none;\">\u81ea\u7136\u8bed\u8a00\u5904\u7406\u4e0e\u4fe1\u606f\u68c0\u7d22\u5171\u4eab\u5e73\u53f0&amp;G\u0005o2w\u0018q:n<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; margin: 0cm 0cm 0pt; mso-border-alt: dotted windowtext .5pt; mso-padding-alt: 1.0pt 4.0pt 1.0pt 4.0pt; padding: 0cm;\" align=\"center\"><span lang=\"EN-US\" style=\"font-size: 9pt; mso-bidi-font-size: 11.0pt;\"><span style=\"font-family: Times New Roman;\">Karthik Narasimhan<span style=\"mso-tab-count: 2;\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 <\/span>Regina Barzilay<span style=\"mso-tab-count: 2;\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 <\/span>Tommi Jaakkola<\/span><\/span><\/p>\n<p><span style=\"display: none;\">\u0005c\u001d?\bQ\/e\u0006e#e\u0012}0<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; margin: 0cm 0cm 0pt; mso-border-alt: dotted windowtext .5pt; mso-padding-alt: 1.0pt 4.0pt 1.0pt 4.0pt; padding: 0cm;\" align=\"center\"><span lang=\"EN-US\" style=\"font-size: 12pt; mso-bidi-font-size: 11.0pt;\"><span style=\"font-family: Times New Roman;\">Abstract<\/span><\/span><\/p>\n<p><span style=\"display: none;\">#O\u001cn9?*]*A0<\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin: 0cm 0cm 0pt; mso-border-alt: dotted windowtext .5pt; mso-padding-alt: 1.0pt 4.0pt 1.0pt 4.0pt; padding: 0cm;\"><span lang=\"EN-US\"><span style=\"font-size: medium;\"><span style=\"font-family: Times New Roman;\">Most state-of-the-art systems today produce morphological analysis based only on orthographic patterns. In contrast, we propose a model for unsupervised morphological analysis that integrates orthographic and semantic views of words. We model word formation in terms of morphological chains, from base words to the observed words, breaking the chains into parent-child relations. We use log-linear models with morpheme and wordlevel features to predict possible parents, including their modifications, for each word. The limited set of candidate parents for each word render contrastive estimation feasible. Our model consistently matches or outperforms five state-of-the-art systems on Arabic, English and Turkish.<\/span><\/span><\/span><\/p>\n<p><span style=\"display: none;\">\u0016M;j<br \/>\nH0b;Y0<\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin: 0cm 0cm 0pt; mso-border-alt: dotted windowtext .5pt; mso-padding-alt: 1.0pt 4.0pt 1.0pt 4.0pt; padding: 0cm;\"><span lang=\"EN-US\"><span style=\"font-family: Times New Roman; font-size: medium;\">\u00a0<\/span><\/span><span style=\"display: none;\">\u81ea\u7136\u8bed\u8a00\u5904\u7406\u4e0e\u4fe1\u606f\u68c0\u7d22\u5171\u4eab\u5e73\u53f0\u001aJ\u001dj\u0010_+D\u0015i\u001bS4F\u001fB\u0003T\u0011A<\/span><\/p>\n<p class=\"MsoNormal\" style=\"layout-grid-mode: char; text-align: center; margin: 0cm 0cm 0pt; mso-border-alt: dotted windowtext .5pt; mso-padding-alt: 1.0pt 4.0pt 1.0pt 4.0pt; padding: 0cm;\" align=\"center\"><span lang=\"EN-US\" style=\"font-size: 14pt; mso-bidi-font-size: 11.0pt;\"><span style=\"font-family: Times New Roman;\">From Segmentation to Analyses:<\/span><\/span><\/p>\n<p><span style=\"display: none;\">\u001e`\u0014x\u000f]$?*d\bi*h\fU0<\/span><\/p>\n<p class=\"MsoNormal\" style=\"layout-grid-mode: char; text-align: center; margin: 0cm 0cm 0pt; mso-border-alt: dotted windowtext .5pt; mso-padding-alt: 1.0pt 4.0pt 1.0pt 4.0pt; padding: 0cm;\" align=\"center\"><span lang=\"EN-US\" style=\"font-size: 14pt; mso-bidi-font-size: 11.0pt;\"><span style=\"font-family: Times New Roman;\">A Probabilistic Model for Unsupervised Morphology Induction<\/span><\/span><\/p>\n<p><span style=\"display: none;\">\u000bA0Q<br \/>\nj\u001b?\u001f{\u0007\\\u0004{4z\u0010p0<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; margin: 0cm 0cm 0pt; mso-border-alt: dotted windowtext .5pt; mso-padding-alt: 1.0pt 4.0pt 1.0pt 4.0pt; padding: 0cm;\" align=\"center\"><span lang=\"EN-US\" style=\"font-size: 9pt; mso-bidi-font-size: 11.0pt;\"><span style=\"font-family: Times New Roman;\">Toms Bergmanis<span style=\"mso-tab-count: 2;\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 <\/span>Sharon Goldwater<\/span><\/span><\/p>\n<p><span style=\"display: none;\">2S\u0010z\u0010v\b{\u001eN\u0005X\/M0<\/span><\/p>\n<p class=\"MsoNormal\" style=\"text-align: center; margin: 0cm 0cm 0pt; mso-border-alt: dotted windowtext .5pt; mso-padding-alt: 1.0pt 4.0pt 1.0pt 4.0pt; padding: 0cm;\" align=\"center\"><span lang=\"EN-US\" style=\"font-size: 12pt; mso-bidi-font-size: 11.0pt;\"><span style=\"font-family: Times New Roman;\">Abstract<\/span><\/span><\/p>\n<p><span style=\"display: none;\">\u0011?\u000bf1\\\u001dl<br \/>\ny-d8b\u000bt0<\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin: 0cm 0cm 0pt; mso-border-alt: dotted windowtext .5pt; mso-padding-alt: 1.0pt 4.0pt 1.0pt 4.0pt; padding: 0cm;\"><span lang=\"EN-US\"><span style=\"font-size: medium;\"><span style=\"font-family: Times New Roman;\">\u00a0 \u00a0 \u00a0 \u00a0 A major motivation for unsupervised morphological analysis is to reduce the sparse data problem in under-resourced languages. Most previous work focuses on segmenting surface forms into their constituent morphs (e.g., taking: tak +ing), but surface form segmentation does not solve the sparse data problem as the analyses of take and taking are not connected to each other. We extend the MorphoChains system (Narasimhan et al., 2015) to provide morphological analyses that can abstract over spelling differences in functionally similar morphs. These analyses are not required to use all the orthographic material of a word (stopping: stop +ing), nor are they limited to only that material (acidified: acid +ify +ed). On average across six typologically varied languages our system has a similar or better F-score on EMMA (a measure of underlying morpheme accuracy) than three strong baselines; moreover, the total number of distinct morphemes identified by our system is on average 12.8% lower than for Morfessor (Virpioja et al., 2013), a state-of-the-art surface segmentation system.<\/span><\/span><\/span><\/p>\n<p><span style=\"display: none;\">(L\u000eY:v8e$F%j\u0017m<br \/>\nw\u0017h\u0005i<br \/>\n]0<\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin: 0cm 0cm 0pt; mso-border-alt: dotted windowtext .5pt; mso-padding-alt: 1.0pt 4.0pt 1.0pt 4.0pt; padding: 0cm;\"><span lang=\"EN-US\"><span style=\"font-family: Times New Roman; font-size: medium;\">\u00a0<\/span><\/span><\/p>\n<p><span style=\"display: none;\">#[ o3O!z$R\u0014J S\u0014A0<\/span><\/p>\n<\/div>\n<p class=\"MsoNormal\" style=\"margin: 0cm 0cm 0pt;\"><span lang=\"EN-US\"><span style=\"font-family: Times New Roman; font-size: medium;\">\u00a0<\/span><\/span><\/p>\n<p><span style=\"display: none;\">\u001fu&#8217;u:{9X\u0019q\fp0u0<\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin: 0cm 0cm 0pt;\"><span lang=\"EN-US\"><span style=\"font-family: Times New Roman; font-size: medium;\">\u00a0<\/span><\/span><\/p>\n<p><span style=\"display: none;\">\fwT\u0012R$J\/~5C0<\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin: 0cm 0cm 0pt;\"><span lang=\"EN-US\"><span style=\"font-family: Times New Roman; font-size: medium;\">\u00a0<\/span><\/span><span style=\"display: none;\">\u81ea\u7136\u8bed\u8a00\u5904\u7406\u4e0e\u4fe1\u606f\u68c0\u7d22\u5171\u4eab\u5e73\u53f0&#8221;c&#8217;I\u0018a%S\u0017L\u0017Q:P&amp;^\u0015z<\/span><\/p>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>NLPIR SEMINAR Y2018#9 \u81ea\u7136\u8bed\u8a00\u5904\u7406\u4e0e\u4fe1\u606f\u68c0\u7d22\u5171\u4eab\u5e73\u53f0\u0010e\/ &hellip; <a href=\"http:\/\/www.nlpir.org\/wordpress\/2018\/11\/30\/nlpir-ictcla2018-academic-seminar-9th-issue\/\">\u7ee7\u7eed\u9605\u8bfb <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[37],"tags":[],"_links":{"self":[{"href":"http:\/\/www.nlpir.org\/wordpress\/wp-json\/wp\/v2\/posts\/1077"}],"collection":[{"href":"http:\/\/www.nlpir.org\/wordpress\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/www.nlpir.org\/wordpress\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/www.nlpir.org\/wordpress\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"http:\/\/www.nlpir.org\/wordpress\/wp-json\/wp\/v2\/comments?post=1077"}],"version-history":[{"count":3,"href":"http:\/\/www.nlpir.org\/wordpress\/wp-json\/wp\/v2\/posts\/1077\/revisions"}],"predecessor-version":[{"id":6154,"href":"http:\/\/www.nlpir.org\/wordpress\/wp-json\/wp\/v2\/posts\/1077\/revisions\/6154"}],"wp:attachment":[{"href":"http:\/\/www.nlpir.org\/wordpress\/wp-json\/wp\/v2\/media?parent=1077"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/www.nlpir.org\/wordpress\/wp-json\/wp\/v2\/categories?post=1077"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/www.nlpir.org\/wordpress\/wp-json\/wp\/v2\/tags?post=1077"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}