{"id":3247,"date":"2016-10-04T09:41:13","date_gmt":"2016-10-04T09:41:13","guid":{"rendered":"http:\/\/www.clarin.si\/info\/?page_id=3247"},"modified":"2017-09-12T18:38:40","modified_gmt":"2017-09-12T18:38:40","slug":"mcat-workshop","status":"publish","type":"page","link":"https:\/\/www.clarin.si\/info\/events\/mcat-workshop\/","title":{"rendered":"Workshop &#8220;Multilingual corpus annotation tools: development and integration&#8221;"},"content":{"rendered":"<p style=\"text-align: center; font-size: 1.6em; line-height: 1.5em;\">CLARIN workshop<br \/>\n<strong>Multilingual corpus annotation tools:<br \/>\ndevelopment and integration<\/strong><br \/>\nLjubljana, November 10 \u2013 11, 2016<\/p>\n<h2>Introduction<\/h2>\n<p>Basic annotation of language corpora is a prerequisite for corpus linguistics or any advanced explorations of information content of language. Yet, for many CLARIN languages, online annotation tools are not available. This two-day workshop aimed to close this gap by joining CLARIN members that have locally developed annotation tools or resources in order to integrate them in terms of specifications and offer them as web services in the scope of the WebLicht architecture. The planned multilingual web services to be developed will enhance the utility of workflow construction and execution workflows and feed back into their development and documentation.<\/p>\n<p>The workshop catalogued available tools, resources and encoding standards of the participants and proposed a workplan on how to integrate them with WebLicht, also considering other such environments, such as TextFlows, developed at JSI. The concrete result of the workshop is an implementation plan with its timeline.<\/p>\n<h2><strong>Agenda<\/strong><\/h2>\n<h3>First Day<\/h3>\n<p>Thursday, November 10th, Physics seminar room:<\/p>\n<table border=\"border\" cellpadding=\"3\">\n<tbody>\n<tr>\n<td>9:00 &#8211; 9:30<\/td>\n<td><a href=\"http:\/\/www.clarin.si\/info\/wp-content\/uploads\/2016\/10\/clarinws2016_intro.pdf\">Introduction<\/a><\/td>\n<td>T. Erjavec, D. Fi\u0161er<\/td>\n<\/tr>\n<tr>\n<td>9:30 &#8211; 10:30<\/td>\n<td><a href=\"https:\/\/www.clarin.si\/info\/wp-content\/uploads\/2016\/10\/2016-11-10-WebLichtToolIntegration-Slovenia.pdf\">WebLicht<\/a><\/td>\n<td>M. Hinrichs, W. Qiu<\/td>\n<\/tr>\n<tr>\n<td>10:30 &#8211; 10:45<\/td>\n<td>Coffee break<\/td>\n<td><\/td>\n<\/tr>\n<tr>\n<td>10:45 &#8211; 11:15<\/td>\n<td><a href=\"http:\/\/www.clarin.si\/info\/wp-content\/uploads\/2016\/10\/textflows.pdf\">TextFlows<\/a><\/td>\n<td>S. Pollak, M. Martinc, M. Perov\u0161ek<\/td>\n<\/tr>\n<tr>\n<td>11:15 &#8211; 12:15<\/td>\n<td><a href=\"https:\/\/www.clarin.si\/info\/wp-content\/uploads\/2016\/10\/nl_clarin_ws_presentation.pdf\">ReLDI data &amp; tools<\/a><\/td>\n<td>N. Ljube\u0161i\u0107<\/td>\n<\/tr>\n<tr>\n<td>12:15 &#8211; 12:45<\/td>\n<td>Estonian\u00a0data &amp; tools<\/td>\n<td>K. Liin<\/td>\n<\/tr>\n<tr>\n<td>12:45 &#8211; 13:45<\/td>\n<td>Lunch<\/td>\n<td><\/td>\n<\/tr>\n<tr>\n<td>13:45 &#8211; 14:15<\/td>\n<td><a href=\"http:\/\/www.clarin.si\/info\/wp-content\/uploads\/2016\/10\/CLARIN-LV.pdf\">Latvian data &amp; tools<\/a><\/td>\n<td>I. Skadi\u0146a, R. Dar\u0123is, L. Pretkalni\u0146a<\/td>\n<\/tr>\n<tr>\n<td>14:15 &#8211; 14:45<\/td>\n<td>Discussion<\/td>\n<td>all<\/td>\n<\/tr>\n<tr>\n<td>14:45 &#8211; 15:00<\/td>\n<td>Coffee break<\/td>\n<td><\/td>\n<\/tr>\n<tr>\n<td>15:00 &#8211; 16:30<\/td>\n<td>Discussion<\/td>\n<td>all<\/td>\n<\/tr>\n<tr>\n<td><\/td>\n<td><\/td>\n<td><\/td>\n<\/tr>\n<tr>\n<td>19:00 &#8211;<\/td>\n<td>Dinner at &#8220;<a href=\"https:\/\/www.google.com\/maps\/place\/%C5%A0pajza+Restaurant\/@46.045756,14.508699,14z\/data=!4m5!3m4!1s0x0:0xcd53bf9c888e739e!8m2!3d46.045756!4d14.5086992?hl=en\">\u0160pajza<\/a>&#8220;<\/td>\n<td><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<h3>\u00a0Second Day<\/h3>\n<p>Friday, November 11th, Biochemistry seminar room:<\/p>\n<table border=\"border\" cellspacing=\"3\">\n<tbody>\n<tr>\n<td>9:00 &#8211; 9:30<\/td>\n<td><a href=\"http:\/\/www.clarin.si\/info\/wp-content\/uploads\/2016\/10\/WSLUBIANA.pdf\">Italian\u00a0data &amp; tools<\/a><\/td>\n<td>R. Del Gratta<\/td>\n<\/tr>\n<tr>\n<td>9:30 &#8211; 10:00<\/td>\n<td>Czech data &amp; tools<\/td>\n<td>P. Stranak<\/td>\n<\/tr>\n<tr>\n<td>10:00 &#8211; 11:00<\/td>\n<td>\n<div>WebLicht Hackaton<\/div>\n<\/td>\n<td>all<\/td>\n<\/tr>\n<tr>\n<td>11:00 &#8211; 11:15<\/td>\n<td>Coffee break<\/td>\n<td><\/td>\n<\/tr>\n<tr>\n<td>11:15 &#8211; 12:45<\/td>\n<td>\n<div>WebLicht Hackaton +<\/div>\n<div>Drafting the workplan<\/div>\n<\/td>\n<td>all<\/td>\n<\/tr>\n<tr>\n<td>12:45 &#8211; 13:45<\/td>\n<td>Lunch<\/td>\n<td><\/td>\n<\/tr>\n<tr>\n<td>13:45 &#8211; 14:45<\/td>\n<td>Drafting the workplan<\/td>\n<td>all<\/td>\n<\/tr>\n<tr>\n<td>14:45 &#8211; 15:00<\/td>\n<td>Coffee break<\/td>\n<td><\/td>\n<\/tr>\n<tr>\n<td>15:00 &#8211; 16:30<\/td>\n<td>\n<div>Workplan discussion<\/div>\n<\/td>\n<td>all<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<h2><strong>Envisaged implementation project<\/strong><\/h2>\n<p>Note that the plan is still under development!<\/p>\n<ol>\n<li>Basic annotation services in WebLicht:\n<ul>\n<li>Tools for tokenisation,sentence segmentation, morphosyntactic tagging and lemmatisation exposed as Web services and intergrated with WebLicht (internet protocol, TCF I\/O)<\/li>\n<li>Languages covered: sl, hr, sr, lv, et, cs, it<\/li>\n<li>Basic WebLicht documentation and a short video tutorial will be prepared in national languages<\/li>\n<\/ul>\n<\/li>\n<li>Normalisation of words will be added to WebLicht, in the first instance covering sl CMC<\/li>\n<li>Evaluation\n<ul>\n<li>The functioning of the tools will be tested with Bombard and Awesome profilers<\/li>\n<li>A user centred evaluation will be prepared and carried out<\/li>\n<\/ul>\n<\/li>\n<\/ol>\n<div id=\"themify_builder_content-3247\" data-postid=\"3247\" class=\"themify_builder_content themify_builder_content-3247 themify_builder\">\n    <\/div>\n<!-- \/themify_builder_content -->\n","protected":false},"excerpt":{"rendered":"<p>CLARIN workshop Multilingual corpus annotation tools: development and integration Ljubljana, November 10 \u2013 11, 2016 Introduction Basic annotation of language corpora is a prerequisite for corpus linguistics or any advanced explorations of information content of language. Yet, for many CLARIN languages, online annotation tools are not available. This two-day workshop aimed to close this gap [&hellip;]<\/p>\n","protected":false},"author":3,"featured_media":0,"parent":3238,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-3247","page","type-page","status-publish","hentry","has-post-title","has-post-date","has-post-category","has-post-tag","has-post-comment","has-post-author",""],"_links":{"self":[{"href":"https:\/\/www.clarin.si\/info\/wp-json\/wp\/v2\/pages\/3247","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.clarin.si\/info\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/www.clarin.si\/info\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/www.clarin.si\/info\/wp-json\/wp\/v2\/users\/3"}],"replies":[{"embeddable":true,"href":"https:\/\/www.clarin.si\/info\/wp-json\/wp\/v2\/comments?post=3247"}],"version-history":[{"count":41,"href":"https:\/\/www.clarin.si\/info\/wp-json\/wp\/v2\/pages\/3247\/revisions"}],"predecessor-version":[{"id":3414,"href":"https:\/\/www.clarin.si\/info\/wp-json\/wp\/v2\/pages\/3247\/revisions\/3414"}],"up":[{"embeddable":true,"href":"https:\/\/www.clarin.si\/info\/wp-json\/wp\/v2\/pages\/3238"}],"wp:attachment":[{"href":"https:\/\/www.clarin.si\/info\/wp-json\/wp\/v2\/media?parent=3247"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}