{"id":8126,"date":"2025-04-27T09:21:48","date_gmt":"2025-04-27T09:21:48","guid":{"rendered":"https:\/\/www.clarin.si\/info\/?p=8126"},"modified":"2025-04-28T08:39:04","modified_gmt":"2025-04-28T08:39:04","slug":"problems-with-access-to-clarin-si","status":"publish","type":"post","link":"https:\/\/www.clarin.si\/info\/problems-with-access-to-clarin-si\/","title":{"rendered":"Problems with access to CLARIN.SI"},"content":{"rendered":"<p class=\"\" data-start=\"110\" data-end=\"427\">Over the past week or so, there have been intermittent issues with access to CLARIN.SI. The problems are caused by several large AI companies aggressively harvesting all content under the <a class=\"\" href=\"http:\/\/www.clarin.si\" target=\"_new\" rel=\"noopener\" data-start=\"321\" data-end=\"334\">www.clarin.si<\/a> domain, while ignoring our <em data-start=\"362\" data-end=\"374\">robots.txt<\/em> directives that explicitly prohibit such activities. This harvesting includes our services (e.g., the concordancers), which require some time to process each request. As these requests are sent repeatedly, several times per second, the infrastructure becomes overloaded and unresponsive, much like during a typical DDoS attack.<\/p>\n<p class=\"\" data-start=\"706\" data-end=\"910\">We are actively blocking the offending IP addresses, but new ones keep appearing. We are implementing various measures to address the situation, but unfortunately, it may take some time to fully resolve. We sincerely apologise for any inconvenience and thank you for your understanding!<\/p>\n<p>CLARIN.SI Team<\/p>\n<!--themify_builder_content-->\n<div id=\"themify_builder_content-8126\" data-postid=\"8126\" class=\"themify_builder_content themify_builder_content-8126 themify_builder tf_clear\">\n    <\/div>\n<!--\/themify_builder_content-->\n","protected":false},"excerpt":{"rendered":"<p>Over the past week or so, there have been intermittent issues with access to CLARIN.SI. The problems are caused by several large AI companies aggressively harvesting all content under the www.clarin.si domain, while ignoring our robots.txt directives that explicitly prohibit such activities. This harvesting includes our services (e.g., the concordancers), which require some time to [&hellip;]<\/p>\n","protected":false},"author":12,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"footnotes":""},"categories":[34],"tags":[],"class_list":["post-8126","post","type-post","status-publish","format-standard","hentry","category-events","has-post-title","has-post-date","has-post-category","has-post-tag","has-post-comment","has-post-author",""],"aioseo_notices":[],"aioseo_head":"\n\t\t<!-- All in One SEO 4.9.8 - aioseo.com -->\n\t<meta name=\"description\" content=\"Over the past week or so, there have been intermittent issues with access to CLARIN.SI. The problems are caused by several large AI companies aggressively harvesting all content under the www.clarin.si domain, while ignoring our robots.txt directives that explicitly prohibit such activities. This harvesting includes our services (e.g., the concordancers), which require some time to\" \/>\n\t<meta name=\"robots\" content=\"max-image-preview:large\" \/>\n\t<meta name=\"author\" content=\"Katja Meden\"\/>\n\t<meta name=\"google-site-verification\" content=\"LiA10aq97L10baWhrk27m-8KV46nP_6qo6Z8pFmPF88\" \/>\n\t<link rel=\"canonical\" href=\"https:\/\/www.clarin.si\/info\/problems-with-access-to-clarin-si\/\" \/>\n\t<meta name=\"generator\" content=\"All in One SEO (AIOSEO) 4.9.8\" \/>\n\t\t<meta property=\"og:locale\" content=\"en_GB\" \/>\n\t\t<meta property=\"og:site_name\" content=\"CLARIN Slovenija - Slovenska raziskovalna infrastruktura za jezikovne vire in tehnologije\" \/>\n\t\t<meta property=\"og:type\" content=\"article\" \/>\n\t\t<meta property=\"og:title\" content=\"Problems with access to CLARIN.SI - CLARIN Slovenija\" \/>\n\t\t<meta property=\"og:description\" content=\"Over the past week or so, there have been intermittent issues with access to CLARIN.SI. The problems are caused by several large AI companies aggressively harvesting all content under the www.clarin.si domain, while ignoring our robots.txt directives that explicitly prohibit such activities. This harvesting includes our services (e.g., the concordancers), which require some time to\" \/>\n\t\t<meta property=\"og:url\" content=\"https:\/\/www.clarin.si\/info\/problems-with-access-to-clarin-si\/\" \/>\n\t\t<meta property=\"article:published_time\" content=\"2025-04-27T09:21:48+00:00\" \/>\n\t\t<meta property=\"article:modified_time\" content=\"2025-04-28T08:39:04+00:00\" \/>\n\t\t<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n\t\t<meta name=\"twitter:title\" content=\"Problems with access to CLARIN.SI - CLARIN Slovenija\" \/>\n\t\t<meta name=\"twitter:description\" content=\"Over the past week or so, there have been intermittent issues with access to CLARIN.SI. The problems are caused by several large AI companies aggressively harvesting all content under the www.clarin.si domain, while ignoring our robots.txt directives that explicitly prohibit such activities. This harvesting includes our services (e.g., the concordancers), which require some time to\" \/>\n\t\t<script type=\"application\/ld+json\" class=\"aioseo-schema\">\n\t\t\t{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"BlogPosting\",\"@id\":\"https:\\\/\\\/www.clarin.si\\\/info\\\/problems-with-access-to-clarin-si\\\/#blogposting\",\"name\":\"Problems with access to CLARIN.SI - CLARIN Slovenija\",\"headline\":\"Problems with access to CLARIN.SI\",\"author\":{\"@id\":\"https:\\\/\\\/www.clarin.si\\\/info\\\/author\\\/katja\\\/#author\"},\"publisher\":{\"@id\":\"https:\\\/\\\/www.clarin.si\\\/info\\\/#organization\"},\"image\":{\"@type\":\"ImageObject\",\"url\":\"https:\\\/\\\/www.clarin.si\\\/info\\\/wp-content\\\/uploads\\\/2014\\\/08\\\/Clarin-SI-logo.png\",\"@id\":\"https:\\\/\\\/www.clarin.si\\\/info\\\/#articleImage\",\"width\":359,\"height\":150},\"datePublished\":\"2025-04-27T09:21:48+00:00\",\"dateModified\":\"2025-04-28T08:39:04+00:00\",\"inLanguage\":\"en-GB\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/www.clarin.si\\\/info\\\/problems-with-access-to-clarin-si\\\/#webpage\"},\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.clarin.si\\\/info\\\/problems-with-access-to-clarin-si\\\/#webpage\"},\"articleSection\":\"Events, English, pll_680df8bca723d\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/www.clarin.si\\\/info\\\/problems-with-access-to-clarin-si\\\/#breadcrumblist\",\"itemListElement\":[{\"@type\":\"ListItem\",\"@id\":\"https:\\\/\\\/www.clarin.si\\\/info#listItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/www.clarin.si\\\/info\",\"nextItem\":{\"@type\":\"ListItem\",\"@id\":\"https:\\\/\\\/www.clarin.si\\\/info\\\/category\\\/events\\\/#listItem\",\"name\":\"Events\"}},{\"@type\":\"ListItem\",\"@id\":\"https:\\\/\\\/www.clarin.si\\\/info\\\/category\\\/events\\\/#listItem\",\"position\":2,\"name\":\"Events\",\"item\":\"https:\\\/\\\/www.clarin.si\\\/info\\\/category\\\/events\\\/\",\"nextItem\":{\"@type\":\"ListItem\",\"@id\":\"https:\\\/\\\/www.clarin.si\\\/info\\\/problems-with-access-to-clarin-si\\\/#listItem\",\"name\":\"Problems with access to CLARIN.SI\"},\"previousItem\":{\"@type\":\"ListItem\",\"@id\":\"https:\\\/\\\/www.clarin.si\\\/info#listItem\",\"name\":\"Home\"}},{\"@type\":\"ListItem\",\"@id\":\"https:\\\/\\\/www.clarin.si\\\/info\\\/problems-with-access-to-clarin-si\\\/#listItem\",\"position\":3,\"name\":\"Problems with access to CLARIN.SI\",\"previousItem\":{\"@type\":\"ListItem\",\"@id\":\"https:\\\/\\\/www.clarin.si\\\/info\\\/category\\\/events\\\/#listItem\",\"name\":\"Events\"}}]},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/www.clarin.si\\\/info\\\/#organization\",\"name\":\"CLARIN Slovenija\",\"description\":\"Slovenska raziskovalna infrastruktura za jezikovne vire in tehnologije\",\"url\":\"https:\\\/\\\/www.clarin.si\\\/info\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"url\":\"https:\\\/\\\/www.clarin.si\\\/info\\\/wp-content\\\/uploads\\\/2014\\\/08\\\/Clarin-SI-logo.png\",\"@id\":\"https:\\\/\\\/www.clarin.si\\\/info\\\/problems-with-access-to-clarin-si\\\/#organizationLogo\",\"width\":359,\"height\":150},\"image\":{\"@id\":\"https:\\\/\\\/www.clarin.si\\\/info\\\/problems-with-access-to-clarin-si\\\/#organizationLogo\"}},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/www.clarin.si\\\/info\\\/author\\\/katja\\\/#author\",\"url\":\"https:\\\/\\\/www.clarin.si\\\/info\\\/author\\\/katja\\\/\",\"name\":\"Katja Meden\"},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/www.clarin.si\\\/info\\\/problems-with-access-to-clarin-si\\\/#webpage\",\"url\":\"https:\\\/\\\/www.clarin.si\\\/info\\\/problems-with-access-to-clarin-si\\\/\",\"name\":\"Problems with access to CLARIN.SI - CLARIN Slovenija\",\"description\":\"Over the past week or so, there have been intermittent issues with access to CLARIN.SI. The problems are caused by several large AI companies aggressively harvesting all content under the www.clarin.si domain, while ignoring our robots.txt directives that explicitly prohibit such activities. This harvesting includes our services (e.g., the concordancers), which require some time to\",\"inLanguage\":\"en-GB\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.clarin.si\\\/info\\\/#website\"},\"breadcrumb\":{\"@id\":\"https:\\\/\\\/www.clarin.si\\\/info\\\/problems-with-access-to-clarin-si\\\/#breadcrumblist\"},\"author\":{\"@id\":\"https:\\\/\\\/www.clarin.si\\\/info\\\/author\\\/katja\\\/#author\"},\"creator\":{\"@id\":\"https:\\\/\\\/www.clarin.si\\\/info\\\/author\\\/katja\\\/#author\"},\"datePublished\":\"2025-04-27T09:21:48+00:00\",\"dateModified\":\"2025-04-28T08:39:04+00:00\"},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/www.clarin.si\\\/info\\\/#website\",\"url\":\"https:\\\/\\\/www.clarin.si\\\/info\\\/\",\"name\":\"CLARIN Slovenija\",\"description\":\"Slovenska raziskovalna infrastruktura za jezikovne vire in tehnologije\",\"inLanguage\":\"en-GB\",\"publisher\":{\"@id\":\"https:\\\/\\\/www.clarin.si\\\/info\\\/#organization\"}}]}\n\t\t<\/script>\n\t\t<!-- All in One SEO -->\n\n","aioseo_head_json":{"title":"Problems with access to CLARIN.SI - CLARIN Slovenija","description":"Over the past week or so, there have been intermittent issues with access to CLARIN.SI. The problems are caused by several large AI companies aggressively harvesting all content under the www.clarin.si domain, while ignoring our robots.txt directives that explicitly prohibit such activities. This harvesting includes our services (e.g., the concordancers), which require some time to","canonical_url":"https:\/\/www.clarin.si\/info\/problems-with-access-to-clarin-si\/","robots":"max-image-preview:large","keywords":"","webmasterTools":{"google-site-verification":"LiA10aq97L10baWhrk27m-8KV46nP_6qo6Z8pFmPF88","miscellaneous":""},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"BlogPosting","@id":"https:\/\/www.clarin.si\/info\/problems-with-access-to-clarin-si\/#blogposting","name":"Problems with access to CLARIN.SI - CLARIN Slovenija","headline":"Problems with access to CLARIN.SI","author":{"@id":"https:\/\/www.clarin.si\/info\/author\/katja\/#author"},"publisher":{"@id":"https:\/\/www.clarin.si\/info\/#organization"},"image":{"@type":"ImageObject","url":"https:\/\/www.clarin.si\/info\/wp-content\/uploads\/2014\/08\/Clarin-SI-logo.png","@id":"https:\/\/www.clarin.si\/info\/#articleImage","width":359,"height":150},"datePublished":"2025-04-27T09:21:48+00:00","dateModified":"2025-04-28T08:39:04+00:00","inLanguage":"en-GB","mainEntityOfPage":{"@id":"https:\/\/www.clarin.si\/info\/problems-with-access-to-clarin-si\/#webpage"},"isPartOf":{"@id":"https:\/\/www.clarin.si\/info\/problems-with-access-to-clarin-si\/#webpage"},"articleSection":"Events, English, pll_680df8bca723d"},{"@type":"BreadcrumbList","@id":"https:\/\/www.clarin.si\/info\/problems-with-access-to-clarin-si\/#breadcrumblist","itemListElement":[{"@type":"ListItem","@id":"https:\/\/www.clarin.si\/info#listItem","position":1,"name":"Home","item":"https:\/\/www.clarin.si\/info","nextItem":{"@type":"ListItem","@id":"https:\/\/www.clarin.si\/info\/category\/events\/#listItem","name":"Events"}},{"@type":"ListItem","@id":"https:\/\/www.clarin.si\/info\/category\/events\/#listItem","position":2,"name":"Events","item":"https:\/\/www.clarin.si\/info\/category\/events\/","nextItem":{"@type":"ListItem","@id":"https:\/\/www.clarin.si\/info\/problems-with-access-to-clarin-si\/#listItem","name":"Problems with access to CLARIN.SI"},"previousItem":{"@type":"ListItem","@id":"https:\/\/www.clarin.si\/info#listItem","name":"Home"}},{"@type":"ListItem","@id":"https:\/\/www.clarin.si\/info\/problems-with-access-to-clarin-si\/#listItem","position":3,"name":"Problems with access to CLARIN.SI","previousItem":{"@type":"ListItem","@id":"https:\/\/www.clarin.si\/info\/category\/events\/#listItem","name":"Events"}}]},{"@type":"Organization","@id":"https:\/\/www.clarin.si\/info\/#organization","name":"CLARIN Slovenija","description":"Slovenska raziskovalna infrastruktura za jezikovne vire in tehnologije","url":"https:\/\/www.clarin.si\/info\/","logo":{"@type":"ImageObject","url":"https:\/\/www.clarin.si\/info\/wp-content\/uploads\/2014\/08\/Clarin-SI-logo.png","@id":"https:\/\/www.clarin.si\/info\/problems-with-access-to-clarin-si\/#organizationLogo","width":359,"height":150},"image":{"@id":"https:\/\/www.clarin.si\/info\/problems-with-access-to-clarin-si\/#organizationLogo"}},{"@type":"Person","@id":"https:\/\/www.clarin.si\/info\/author\/katja\/#author","url":"https:\/\/www.clarin.si\/info\/author\/katja\/","name":"Katja Meden"},{"@type":"WebPage","@id":"https:\/\/www.clarin.si\/info\/problems-with-access-to-clarin-si\/#webpage","url":"https:\/\/www.clarin.si\/info\/problems-with-access-to-clarin-si\/","name":"Problems with access to CLARIN.SI - CLARIN Slovenija","description":"Over the past week or so, there have been intermittent issues with access to CLARIN.SI. The problems are caused by several large AI companies aggressively harvesting all content under the www.clarin.si domain, while ignoring our robots.txt directives that explicitly prohibit such activities. This harvesting includes our services (e.g., the concordancers), which require some time to","inLanguage":"en-GB","isPartOf":{"@id":"https:\/\/www.clarin.si\/info\/#website"},"breadcrumb":{"@id":"https:\/\/www.clarin.si\/info\/problems-with-access-to-clarin-si\/#breadcrumblist"},"author":{"@id":"https:\/\/www.clarin.si\/info\/author\/katja\/#author"},"creator":{"@id":"https:\/\/www.clarin.si\/info\/author\/katja\/#author"},"datePublished":"2025-04-27T09:21:48+00:00","dateModified":"2025-04-28T08:39:04+00:00"},{"@type":"WebSite","@id":"https:\/\/www.clarin.si\/info\/#website","url":"https:\/\/www.clarin.si\/info\/","name":"CLARIN Slovenija","description":"Slovenska raziskovalna infrastruktura za jezikovne vire in tehnologije","inLanguage":"en-GB","publisher":{"@id":"https:\/\/www.clarin.si\/info\/#organization"}}]},"og:locale":"en_GB","og:site_name":"CLARIN Slovenija - Slovenska raziskovalna infrastruktura za jezikovne vire in tehnologije","og:type":"article","og:title":"Problems with access to CLARIN.SI - CLARIN Slovenija","og:description":"Over the past week or so, there have been intermittent issues with access to CLARIN.SI. The problems are caused by several large AI companies aggressively harvesting all content under the www.clarin.si domain, while ignoring our robots.txt directives that explicitly prohibit such activities. This harvesting includes our services (e.g., the concordancers), which require some time to","og:url":"https:\/\/www.clarin.si\/info\/problems-with-access-to-clarin-si\/","article:published_time":"2025-04-27T09:21:48+00:00","article:modified_time":"2025-04-28T08:39:04+00:00","twitter:card":"summary_large_image","twitter:title":"Problems with access to CLARIN.SI - CLARIN Slovenija","twitter:description":"Over the past week or so, there have been intermittent issues with access to CLARIN.SI. The problems are caused by several large AI companies aggressively harvesting all content under the www.clarin.si domain, while ignoring our robots.txt directives that explicitly prohibit such activities. This harvesting includes our services (e.g., the concordancers), which require some time to"},"aioseo_meta_data":{"post_id":"8126","title":null,"description":null,"keywords":null,"keyphrases":null,"primary_term":null,"canonical_url":null,"og_title":null,"og_description":null,"og_object_type":"default","og_image_type":"default","og_image_custom_url":null,"og_image_custom_fields":null,"og_image_url":null,"og_image_width":null,"og_image_height":null,"og_video":null,"og_custom_url":null,"og_article_section":null,"og_article_tags":null,"twitter_use_og":false,"twitter_card":"default","twitter_image_type":"default","twitter_image_custom_url":null,"twitter_image_custom_fields":null,"twitter_image_url":null,"twitter_title":null,"twitter_description":null,"schema_type":"default","schema_type_options":null,"schema":{"blockGraphs":[],"customGraphs":[],"default":{"data":{"Article":[],"Course":[],"Dataset":[],"FAQPage":[],"Movie":[],"Person":[],"Product":[],"ProductReview":[],"Car":[],"Recipe":[],"Service":[],"SoftwareApplication":[],"WebPage":[]},"graphName":"","isEnabled":true},"graphs":[]},"pillar_content":false,"robots_default":true,"robots_noindex":false,"robots_noarchive":false,"robots_nosnippet":false,"robots_nofollow":false,"robots_noimageindex":false,"robots_noodp":false,"robots_notranslate":false,"robots_max_snippet":null,"robots_max_videopreview":null,"robots_max_imagepreview":"large","priority":null,"frequency":null,"local_seo":null,"limit_modified_date":false,"ai":null,"breadcrumb_settings":null,"seo_analyzer_scan_date":null,"created":"2026-05-21 09:03:14","updated":"2026-05-21 09:03:14"},"aioseo_breadcrumb":"<div class=\"aioseo-breadcrumbs\"><span class=\"aioseo-breadcrumb\">\n\t\t\t<a href=\"https:\/\/www.clarin.si\/info\" title=\"Home\">Home<\/a>\n\t\t<\/span><span class=\"aioseo-breadcrumb-separator\">&raquo;<\/span><span class=\"aioseo-breadcrumb\">\n\t\t\t<a href=\"https:\/\/www.clarin.si\/info\/category\/events\/\" title=\"Events\">Events<\/a>\n\t\t<\/span><span class=\"aioseo-breadcrumb-separator\">&raquo;<\/span><span class=\"aioseo-breadcrumb\">\n\t\t\tProblems with access to CLARIN.SI\n\t\t<\/span><\/div>","aioseo_breadcrumb_json":[{"label":"Home","link":"https:\/\/www.clarin.si\/info"},{"label":"Events","link":"https:\/\/www.clarin.si\/info\/category\/events\/"},{"label":"Problems with access to CLARIN.SI","link":"https:\/\/www.clarin.si\/info\/problems-with-access-to-clarin-si\/"}],"_links":{"self":[{"href":"https:\/\/www.clarin.si\/info\/wp-json\/wp\/v2\/posts\/8126","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.clarin.si\/info\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.clarin.si\/info\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.clarin.si\/info\/wp-json\/wp\/v2\/users\/12"}],"replies":[{"embeddable":true,"href":"https:\/\/www.clarin.si\/info\/wp-json\/wp\/v2\/comments?post=8126"}],"version-history":[{"count":9,"href":"https:\/\/www.clarin.si\/info\/wp-json\/wp\/v2\/posts\/8126\/revisions"}],"predecessor-version":[{"id":8145,"href":"https:\/\/www.clarin.si\/info\/wp-json\/wp\/v2\/posts\/8126\/revisions\/8145"}],"wp:attachment":[{"href":"https:\/\/www.clarin.si\/info\/wp-json\/wp\/v2\/media?parent=8126"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.clarin.si\/info\/wp-json\/wp\/v2\/categories?post=8126"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.clarin.si\/info\/wp-json\/wp\/v2\/tags?post=8126"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}