{"id":7264,"date":"2024-03-28T10:02:38","date_gmt":"2024-03-28T10:02:38","guid":{"rendered":"https:\/\/www.clarin.si\/info\/?p=7264"},"modified":"2024-03-28T10:05:10","modified_gmt":"2024-03-28T10:05:10","slug":"clarin-cafe-the-crowll-project","status":"publish","type":"post","link":"https:\/\/www.clarin.si\/info\/clarin-cafe-the-crowll-project\/","title":{"rendered":"CLARIN Caf\u00e9 &#8211;  the CrowLL project"},"content":{"rendered":"<p>We would like to cordially invite you to the latest CLARIN Caf\u00e9, which will be held on <strong><span class=\"smart-date--date\"><time class=\"smart-date--localize datetime\" datetime=\"2024-04-04T14:00:00+02:00\" data-format=\"l, j F Y\" data-tzoffset=\"-60\" data-once=\"smartDateLocalize\">Thursday, April 4, 2024<\/time><\/span>,\u00a0<\/strong><span class=\"smart-date--time\"><strong><time class=\"smart-date--localize datetime\" datetime=\"2024-04-04T14:00:00+02:00\" data-format=\"H:i\" data-tzoffset=\"-60\" data-once=\"smartDateLocalize\">14:00<\/time>\u00a0&#8211;\u00a0<\/strong><time class=\"smart-date--localize datetime\" datetime=\"2024-04-04T16:00:00+02:00\" data-format=\"H:i\" data-tzoffset=\"-60\" data-once=\"smartDateLocalize\"><strong>16:00 (CEST) <\/strong>on <\/time><\/span>the topic of the <a href=\"https:\/\/ucpages.uc.pt\/celga-iltec\/crowll\/\">CrowLL project<\/a> &#8211; Creating pedagogical corpora with annotation of sensitive content and offensive language.<\/p>\n<p>More information about the event is available below and on the following <a href=\"https:\/\/www.clarin.eu\/event\/2024\/clarin-cafe-creating-pedagogical-corpora-annotation-sensitive-content-and-offensive\">link<\/a>.<\/p>\n<p><!--more--><\/p>\n<p><strong>CLARIN Caf\u00e9 &#8211; Creating pedagogical corpora with annotation of sensitive content and offensive language &#8211; the CrowLL project<\/strong><\/p>\n<p>Date: 4 April 2024<br \/>\nTime: 14:00 &#8211; 16:00 (CEST)<br \/>\nVenue: CLARIN virtual Zoom meeting<\/p>\n<p><strong>ABOUT<\/strong>: The main goal of the<a href=\"https:\/\/ucpages.uc.pt\/celga-iltec\/crowll\/\"> CrowLL project<\/a> was to create manually annotated pedagogical corpora that can be used by lexicographers, language teachers, and NLP researchers. The languages were Brazilian Portuguese, Dutch, Estonian, and Slovene. Corpus sentences are annotated as &#8220;problematic&#8221; or &#8220;non-problematic&#8221; from the point of usage for pedagogical purposes. Sentences labelled as problematic also have annotations defining the category of the problem (offensive, vulgar, sensitive content, grammar\/spelling problems, incomprehensible\/lack of context). For each language, the corpus consists of 10,000 sentences annotated by language experts. These corpora, together with annotation guidelines in each language and in English, are available on PORTULAN CLARIN. In this CLARIN Caf\u00e9, we will share the steps that were followed to create these manually annotated corpora and will discuss some of the challenges that were faced. We will also demo the game to foster further expansion of this type of data collection to other languages. Finally, we will reflect on future steps of this project.<\/p>\n<p>You can register for free using\u00a0<a href=\"https:\/\/docs.google.com\/forms\/d\/e\/1FAIpQLScVbcCT6ksnv3ghjdy5Hja026cVg3ekl7S_eWtiDYuu29isWw\/viewform?usp=sf_link\">this link<\/a>\u00a0in order to receive the meeting room details.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>We would like to cordially invite you to the latest CLARIN Caf\u00e9, which will be held on Thursday, April 4, 2024,\u00a014:00\u00a0&#8211;\u00a016:00 (CEST) on the topic of the CrowLL project &#8211; Creating pedagogical corpora with annotation of sensitive content and offensive language. More information about the event is available below and on the following link.<\/p>\n","protected":false},"author":12,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[34],"tags":[],"class_list":["post-7264","post","type-post","status-publish","format-standard","hentry","category-events","has-post-title","has-post-date","has-post-category","has-post-tag","has-post-comment","has-post-author",""],"_links":{"self":[{"href":"https:\/\/www.clarin.si\/info\/wp-json\/wp\/v2\/posts\/7264","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.clarin.si\/info\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.clarin.si\/info\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.clarin.si\/info\/wp-json\/wp\/v2\/users\/12"}],"replies":[{"embeddable":true,"href":"https:\/\/www.clarin.si\/info\/wp-json\/wp\/v2\/comments?post=7264"}],"version-history":[{"count":8,"href":"https:\/\/www.clarin.si\/info\/wp-json\/wp\/v2\/posts\/7264\/revisions"}],"predecessor-version":[{"id":7272,"href":"https:\/\/www.clarin.si\/info\/wp-json\/wp\/v2\/posts\/7264\/revisions\/7272"}],"wp:attachment":[{"href":"https:\/\/www.clarin.si\/info\/wp-json\/wp\/v2\/media?parent=7264"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.clarin.si\/info\/wp-json\/wp\/v2\/categories?post=7264"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.clarin.si\/info\/wp-json\/wp\/v2\/tags?post=7264"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}