{"id":2847,"date":"2014-10-08T23:08:34","date_gmt":"2014-08-06T17:18:42","guid":{"rendered":"http:\/\/www.clarin.si\/info\/repozitorij-jezikovnih-virov-copy\/"},"modified":"2025-04-25T17:01:46","modified_gmt":"2025-04-25T17:01:46","slug":"about-repository","status":"publish","type":"page","link":"https:\/\/www.clarin.si\/info\/about-repository\/","title":{"rendered":"About the CLARIN.SI repository"},"content":{"rendered":"<p>One of the primary purposes of the CLARIN infrastructure is to provide reliable archiving and access to language resources such as corpora, lexicons, audio and video recordings, grammars, language models, etc.<\/p>\n<p>CLARIN.SI maintains a certified repository with, currently, over 500 language resources and tools or approximately 3.7 TB of data for 90 languages. The majority of entries focuses on Slovenian and other South Slavic languages. The repository includes a broad set of large corpora (i. e., structured collections of texts) for studying these languages, as well as a number of parallel and manually tagged corpora, lexicons and language models to be used in language tools.<\/p>\n<p>The repository is regularly maintained and <a href=\"https:\/\/doi.org\/10.34894\/EBQJRF\">Core Trust Seal<\/a> certified. It enables storing and download of language resources in accordance with clearly defined technical and legal standards. It supports easy user authentication and authorisation, as well as allocation of persistent identifiers to uploaded resources. The repository follows the <a href=\"https:\/\/www.go-fair.org\/fair-principles\/\">FAIR<\/a> principles and the conditions of the applicable licence for the archived resources and tools. It ensures long-term archiving since all resources with their persistent identifiers could be easily transferred to repositories of other CLARIN centres provided that CLARIN.SI would stop operating.<\/p>\n<p>The CLARIN.SI repository is registered in several catalogues of research data repositories, such as <a href=\"https:\/\/explore.openaire.eu\/search\/dataprovider?datasourceId=re3data_____::fe0d76581a60e1287a93e2ed2cb29339\">OpenAIRE<\/a>\u00a0and\u00a0<a href=\"https:\/\/www.re3data.org\/repository\/r3d100011922\">re3data<\/a>. Furthermore, CLARIN developed the <a href=\"https:\/\/vlo.clarin.eu\/\">Virtual Language Observatory (VLO)<\/a> which is a faceted browser that enables searching within all CLARIN centres.<\/p>\n<p style=\"text-align: center;\"><strong><a href=\"https:\/\/www.clarin.si\/repository\/xmlui\/?locale-attribute=en\">Go to the CLARIN.SI repository<\/a><\/strong><\/p>\n<p>For more information, follow the links below:<\/p>\n<ul>\n<li><a href=\"https:\/\/www.clarin.si\/repository\/xmlui\/page\/about?locale-attribute=en\">more about the repository<\/a><\/li>\n<li><a href=\"https:\/\/www.clarin.si\/repository\/xmlui\/page\/deposit\">how to deposit data or tools<\/a><\/li>\n<li><a href=\"https:\/\/www.clarin.si\/repository\/xmlui\/page\/faq\">FAQ<\/a><\/li>\n<li><a href=\"https:\/\/www.clarin.si\/repository\/xmlui\/page\/cite\">citing data policy<\/a><\/li>\n<li><a href=\"https:\/\/www.clarin.si\/repository\/xmlui\/page\/item-lifecycle\">submission lifecycle<\/a><\/li>\n<\/ul>\n<!--themify_builder_content-->\n<div id=\"themify_builder_content-2847\" data-postid=\"2847\" class=\"themify_builder_content themify_builder_content-2847 themify_builder tf_clear\">\n    <\/div>\n<!--\/themify_builder_content-->\n","protected":false},"excerpt":{"rendered":"<p>One of the primary purposes of the CLARIN infrastructure is to provide reliable archiving and access to language resources such as corpora, lexicons, audio and video recordings, grammars, language models, etc. CLARIN.SI maintains a certified repository with, currently, over 500 language resources and tools or approximately 3.7 TB of data for 90 languages. The majority [&hellip;]<\/p>\n","protected":false},"author":3,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"footnotes":""},"class_list":["post-2847","page","type-page","status-publish","hentry","has-post-title","has-post-date","has-post-category","has-post-tag","has-post-comment","has-post-author",""],"aioseo_notices":[],"aioseo_head":"\n\t\t<!-- All in One SEO 4.9.8 - aioseo.com -->\n\t<meta name=\"description\" content=\"One of the primary purposes of the CLARIN infrastructure is to provide reliable archiving and access to language resources such as corpora, lexicons, audio and video recordings, grammars, language models, etc. CLARIN.SI maintains a certified repository with, currently, over 500 language resources and tools or approximately 3.7 TB of data for 90 languages. The majority\" \/>\n\t<meta name=\"robots\" content=\"max-image-preview:large\" \/>\n\t<meta name=\"google-site-verification\" content=\"LiA10aq97L10baWhrk27m-8KV46nP_6qo6Z8pFmPF88\" \/>\n\t<link rel=\"canonical\" href=\"https:\/\/www.clarin.si\/info\/about-repository\/\" \/>\n\t<meta name=\"generator\" content=\"All in One SEO (AIOSEO) 4.9.8\" \/>\n\t\t<meta property=\"og:locale\" content=\"en_GB\" \/>\n\t\t<meta property=\"og:site_name\" content=\"CLARIN Slovenija - Slovenska raziskovalna infrastruktura za jezikovne vire in tehnologije\" \/>\n\t\t<meta property=\"og:type\" content=\"article\" \/>\n\t\t<meta property=\"og:title\" content=\"About the CLARIN.SI repository - CLARIN Slovenija\" \/>\n\t\t<meta property=\"og:description\" content=\"One of the primary purposes of the CLARIN infrastructure is to provide reliable archiving and access to language resources such as corpora, lexicons, audio and video recordings, grammars, language models, etc. CLARIN.SI maintains a certified repository with, currently, over 500 language resources and tools or approximately 3.7 TB of data for 90 languages. The majority\" \/>\n\t\t<meta property=\"og:url\" content=\"https:\/\/www.clarin.si\/info\/about-repository\/\" \/>\n\t\t<meta property=\"article:published_time\" content=\"2014-08-06T17:18:42+00:00\" \/>\n\t\t<meta property=\"article:modified_time\" content=\"2025-04-25T17:01:46+00:00\" \/>\n\t\t<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n\t\t<meta name=\"twitter:title\" content=\"About the CLARIN.SI repository - CLARIN Slovenija\" \/>\n\t\t<meta name=\"twitter:description\" content=\"One of the primary purposes of the CLARIN infrastructure is to provide reliable archiving and access to language resources such as corpora, lexicons, audio and video recordings, grammars, language models, etc. CLARIN.SI maintains a certified repository with, currently, over 500 language resources and tools or approximately 3.7 TB of data for 90 languages. The majority\" \/>\n\t\t<script type=\"application\/ld+json\" class=\"aioseo-schema\">\n\t\t\t{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/www.clarin.si\\\/info\\\/about-repository\\\/#breadcrumblist\",\"itemListElement\":[{\"@type\":\"ListItem\",\"@id\":\"https:\\\/\\\/www.clarin.si\\\/info#listItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/www.clarin.si\\\/info\",\"nextItem\":{\"@type\":\"ListItem\",\"@id\":\"https:\\\/\\\/www.clarin.si\\\/info\\\/about-repository\\\/#listItem\",\"name\":\"About the CLARIN.SI repository\"}},{\"@type\":\"ListItem\",\"@id\":\"https:\\\/\\\/www.clarin.si\\\/info\\\/about-repository\\\/#listItem\",\"position\":2,\"name\":\"About the CLARIN.SI repository\",\"previousItem\":{\"@type\":\"ListItem\",\"@id\":\"https:\\\/\\\/www.clarin.si\\\/info#listItem\",\"name\":\"Home\"}}]},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/www.clarin.si\\\/info\\\/#organization\",\"name\":\"CLARIN Slovenija\",\"description\":\"Slovenska raziskovalna infrastruktura za jezikovne vire in tehnologije\",\"url\":\"https:\\\/\\\/www.clarin.si\\\/info\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"url\":\"https:\\\/\\\/www.clarin.si\\\/info\\\/wp-content\\\/uploads\\\/2014\\\/08\\\/Clarin-SI-logo.png\",\"@id\":\"https:\\\/\\\/www.clarin.si\\\/info\\\/about-repository\\\/#organizationLogo\",\"width\":359,\"height\":150},\"image\":{\"@id\":\"https:\\\/\\\/www.clarin.si\\\/info\\\/about-repository\\\/#organizationLogo\"}},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/www.clarin.si\\\/info\\\/about-repository\\\/#webpage\",\"url\":\"https:\\\/\\\/www.clarin.si\\\/info\\\/about-repository\\\/\",\"name\":\"About the CLARIN.SI repository - CLARIN Slovenija\",\"description\":\"One of the primary purposes of the CLARIN infrastructure is to provide reliable archiving and access to language resources such as corpora, lexicons, audio and video recordings, grammars, language models, etc. CLARIN.SI maintains a certified repository with, currently, over 500 language resources and tools or approximately 3.7 TB of data for 90 languages. The majority\",\"inLanguage\":\"en-GB\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.clarin.si\\\/info\\\/#website\"},\"breadcrumb\":{\"@id\":\"https:\\\/\\\/www.clarin.si\\\/info\\\/about-repository\\\/#breadcrumblist\"},\"datePublished\":\"2014-10-08T23:08:34+00:00\",\"dateModified\":\"2025-04-25T17:01:46+00:00\"},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/www.clarin.si\\\/info\\\/#website\",\"url\":\"https:\\\/\\\/www.clarin.si\\\/info\\\/\",\"name\":\"CLARIN Slovenija\",\"description\":\"Slovenska raziskovalna infrastruktura za jezikovne vire in tehnologije\",\"inLanguage\":\"en-GB\",\"publisher\":{\"@id\":\"https:\\\/\\\/www.clarin.si\\\/info\\\/#organization\"}}]}\n\t\t<\/script>\n\t\t<!-- All in One SEO -->\n\n","aioseo_head_json":{"title":"About the CLARIN.SI repository - CLARIN Slovenija","description":"One of the primary purposes of the CLARIN infrastructure is to provide reliable archiving and access to language resources such as corpora, lexicons, audio and video recordings, grammars, language models, etc. CLARIN.SI maintains a certified repository with, currently, over 500 language resources and tools or approximately 3.7 TB of data for 90 languages. The majority","canonical_url":"https:\/\/www.clarin.si\/info\/about-repository\/","robots":"max-image-preview:large","keywords":"","webmasterTools":{"google-site-verification":"LiA10aq97L10baWhrk27m-8KV46nP_6qo6Z8pFmPF88","miscellaneous":""},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"BreadcrumbList","@id":"https:\/\/www.clarin.si\/info\/about-repository\/#breadcrumblist","itemListElement":[{"@type":"ListItem","@id":"https:\/\/www.clarin.si\/info#listItem","position":1,"name":"Home","item":"https:\/\/www.clarin.si\/info","nextItem":{"@type":"ListItem","@id":"https:\/\/www.clarin.si\/info\/about-repository\/#listItem","name":"About the CLARIN.SI repository"}},{"@type":"ListItem","@id":"https:\/\/www.clarin.si\/info\/about-repository\/#listItem","position":2,"name":"About the CLARIN.SI repository","previousItem":{"@type":"ListItem","@id":"https:\/\/www.clarin.si\/info#listItem","name":"Home"}}]},{"@type":"Organization","@id":"https:\/\/www.clarin.si\/info\/#organization","name":"CLARIN Slovenija","description":"Slovenska raziskovalna infrastruktura za jezikovne vire in tehnologije","url":"https:\/\/www.clarin.si\/info\/","logo":{"@type":"ImageObject","url":"https:\/\/www.clarin.si\/info\/wp-content\/uploads\/2014\/08\/Clarin-SI-logo.png","@id":"https:\/\/www.clarin.si\/info\/about-repository\/#organizationLogo","width":359,"height":150},"image":{"@id":"https:\/\/www.clarin.si\/info\/about-repository\/#organizationLogo"}},{"@type":"WebPage","@id":"https:\/\/www.clarin.si\/info\/about-repository\/#webpage","url":"https:\/\/www.clarin.si\/info\/about-repository\/","name":"About the CLARIN.SI repository - CLARIN Slovenija","description":"One of the primary purposes of the CLARIN infrastructure is to provide reliable archiving and access to language resources such as corpora, lexicons, audio and video recordings, grammars, language models, etc. CLARIN.SI maintains a certified repository with, currently, over 500 language resources and tools or approximately 3.7 TB of data for 90 languages. The majority","inLanguage":"en-GB","isPartOf":{"@id":"https:\/\/www.clarin.si\/info\/#website"},"breadcrumb":{"@id":"https:\/\/www.clarin.si\/info\/about-repository\/#breadcrumblist"},"datePublished":"2014-10-08T23:08:34+00:00","dateModified":"2025-04-25T17:01:46+00:00"},{"@type":"WebSite","@id":"https:\/\/www.clarin.si\/info\/#website","url":"https:\/\/www.clarin.si\/info\/","name":"CLARIN Slovenija","description":"Slovenska raziskovalna infrastruktura za jezikovne vire in tehnologije","inLanguage":"en-GB","publisher":{"@id":"https:\/\/www.clarin.si\/info\/#organization"}}]},"og:locale":"en_GB","og:site_name":"CLARIN Slovenija - Slovenska raziskovalna infrastruktura za jezikovne vire in tehnologije","og:type":"article","og:title":"About the CLARIN.SI repository - CLARIN Slovenija","og:description":"One of the primary purposes of the CLARIN infrastructure is to provide reliable archiving and access to language resources such as corpora, lexicons, audio and video recordings, grammars, language models, etc. CLARIN.SI maintains a certified repository with, currently, over 500 language resources and tools or approximately 3.7 TB of data for 90 languages. The majority","og:url":"https:\/\/www.clarin.si\/info\/about-repository\/","article:published_time":"2014-08-06T17:18:42+00:00","article:modified_time":"2025-04-25T17:01:46+00:00","twitter:card":"summary_large_image","twitter:title":"About the CLARIN.SI repository - CLARIN Slovenija","twitter:description":"One of the primary purposes of the CLARIN infrastructure is to provide reliable archiving and access to language resources such as corpora, lexicons, audio and video recordings, grammars, language models, etc. CLARIN.SI maintains a certified repository with, currently, over 500 language resources and tools or approximately 3.7 TB of data for 90 languages. The majority"},"aioseo_meta_data":{"post_id":"2847","title":null,"description":null,"keywords":null,"keyphrases":null,"primary_term":null,"canonical_url":null,"og_title":null,"og_description":null,"og_object_type":"default","og_image_type":"default","og_image_custom_url":null,"og_image_custom_fields":null,"og_image_url":null,"og_image_width":null,"og_image_height":null,"og_video":null,"og_custom_url":null,"og_article_section":null,"og_article_tags":null,"twitter_use_og":false,"twitter_card":"default","twitter_image_type":"default","twitter_image_custom_url":null,"twitter_image_custom_fields":null,"twitter_image_url":null,"twitter_title":null,"twitter_description":null,"schema_type":"default","schema_type_options":null,"schema":{"blockGraphs":[],"customGraphs":[],"default":{"data":{"Article":[],"Course":[],"Dataset":[],"FAQPage":[],"Movie":[],"Person":[],"Product":[],"ProductReview":[],"Car":[],"Recipe":[],"Service":[],"SoftwareApplication":[],"WebPage":[]},"graphName":"","isEnabled":true},"graphs":[]},"pillar_content":false,"robots_default":true,"robots_noindex":false,"robots_noarchive":false,"robots_nosnippet":false,"robots_nofollow":false,"robots_noimageindex":false,"robots_noodp":false,"robots_notranslate":false,"robots_max_snippet":null,"robots_max_videopreview":null,"robots_max_imagepreview":"large","priority":null,"frequency":null,"local_seo":null,"limit_modified_date":false,"ai":null,"breadcrumb_settings":null,"seo_analyzer_scan_date":null,"created":"2026-05-21 09:22:05","updated":"2026-05-21 09:22:05"},"aioseo_breadcrumb":"<div class=\"aioseo-breadcrumbs\"><span class=\"aioseo-breadcrumb\">\n\t\t\t<a href=\"https:\/\/www.clarin.si\/info\" title=\"Home\">Home<\/a>\n\t\t<\/span><span class=\"aioseo-breadcrumb-separator\">&raquo;<\/span><span class=\"aioseo-breadcrumb\">\n\t\t\tAbout the CLARIN.SI repository\n\t\t<\/span><\/div>","aioseo_breadcrumb_json":[{"label":"Home","link":"https:\/\/www.clarin.si\/info"},{"label":"About the CLARIN.SI repository","link":"https:\/\/www.clarin.si\/info\/about-repository\/"}],"_links":{"self":[{"href":"https:\/\/www.clarin.si\/info\/wp-json\/wp\/v2\/pages\/2847","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.clarin.si\/info\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/www.clarin.si\/info\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/www.clarin.si\/info\/wp-json\/wp\/v2\/users\/3"}],"replies":[{"embeddable":true,"href":"https:\/\/www.clarin.si\/info\/wp-json\/wp\/v2\/comments?post=2847"}],"version-history":[{"count":26,"href":"https:\/\/www.clarin.si\/info\/wp-json\/wp\/v2\/pages\/2847\/revisions"}],"predecessor-version":[{"id":8121,"href":"https:\/\/www.clarin.si\/info\/wp-json\/wp\/v2\/pages\/2847\/revisions\/8121"}],"wp:attachment":[{"href":"https:\/\/www.clarin.si\/info\/wp-json\/wp\/v2\/media?parent=2847"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}