Problems with access to CLARIN.SI
Over the past week or so, there have been intermittent issues with access to CLARIN.SI. The problems are caused by several large AI companies aggressively harvesting all content under the www.clarin.si domain, while ignoring our robots.txt directives that explicitly prohibit such activities. This harvesting includes our services (e.g., the concordancers), which require some time to process each request. As these requests are sent repeatedly, several times per second, the infrastructure becomes overloaded and unresponsive, much like during a typical DDoS attack.
We are actively blocking the offending IP addresses, but new ones keep appearing. We are implementing various measures to address the situation, but unfortunately, it may take some time to fully resolve. We sincerely apologise for any inconvenience and thank you for your understanding!
CLARIN.SI Team