{"value":"\n<cmd:CMD xmlns:cmd=\"http://www.clarin.eu/cmd/\" xmlns:lindat=\"http://lindat.mff.cuni.cz/ns/experimental/cmdi\" xmlns:olac=\"http://www.clarin.eu/cmd/\" xmlns:ms=\"http://www.clarin.eu/cmd/\" xmlns:xsi=\"http://www.w3.org/2001/XMLSchema-instance\" CMDVersion=\"1.1\" xsi:schemaLocation=\"http://www.clarin.eu/cmd/ http://catalog.clarin.eu/ds/ComponentRegistry/rest/registry/profiles/clarin.eu:cr1:p_1403526079380/xsd\">  \n  <cmd:Header> \n    <cmd:MdCreationDate>2026-02-16</cmd:MdCreationDate>  \n    <cmd:MdSelfLink>https://hdl.handle.net/11356/2085@format=cmdi</cmd:MdSelfLink>  \n    <cmd:MdProfile>clarin.eu:cr1:p_1403526079380</cmd:MdProfile>  \n    <cmd:MdCollectionDisplayName>CLARIN.SI data &amp; tools</cmd:MdCollectionDisplayName> \n  </cmd:Header>  \n  <cmd:Resources> \n    <cmd:ResourceProxyList> \n      <cmd:ResourceProxy id=\"lp_2619\"> \n        <cmd:ResourceType>LandingPage</cmd:ResourceType>  \n        <cmd:ResourceRef>https://hdl.handle.net/11356/2085</cmd:ResourceRef> \n      </cmd:ResourceProxy>  \n      <cmd:ResourceProxy id=\"uri_1\"> \n        <cmd:ResourceType mimetype=\"text/html\">Resource</cmd:ResourceType>  \n        <cmd:ResourceRef>https://www.cjvt.si/povejmo/</cmd:ResourceRef> \n      </cmd:ResourceProxy>  \n      <cmd:ResourceProxy id=\"_7467\"> \n        <cmd:ResourceType mimetype=\"application/zip\">Resource</cmd:ResourceType>  \n        <cmd:ResourceRef lindat:md5_checksum=\"46b8b74cfdbe967852d3bf233cdafd76\">https://www.clarin.si/repository/xmlui/bitstream/handle/11356/2085/GaMS-Instruct-MED-Anatomy_1.0.zip?sequence=1</cmd:ResourceRef> \n      </cmd:ResourceProxy> \n    </cmd:ResourceProxyList>  \n    <cmd:JournalFileProxyList/>  \n    <cmd:ResourceRelationList/> \n  </cmd:Resources>  \n  <cmd:Components> \n    <cmd:LINDAT_CLARIN> \n      <cmd:bibliographicInfo> \n        <cmd:projectUrl>https://www.cjvt.si/povejmo/</cmd:projectUrl>  \n        <cmd:titles> \n          <cmd:title xml:lang=\"en\">Slovene instruction-following 
dataset for large language models GaMS-Instruct-MED-Anatomy 1.0</cmd:title> \n        </cmd:titles>  \n        <cmd:authors> \n          <author xmlns=\"http://www.clarin.eu/cmd/\">  \n            <lastName>Plesnik</lastName>  \n            <firstName>Emil</firstName> \n          </author>  \n          <author xmlns=\"http://www.clarin.eu/cmd/\">  \n            <lastName>Tovornik</lastName>  \n            <firstName>Robert</firstName> \n          </author>  \n          <author xmlns=\"http://www.clarin.eu/cmd/\">  \n            <lastName>Fabjan</lastName>  \n            <firstName>Borut</firstName> \n          </author>  \n          <author xmlns=\"http://www.clarin.eu/cmd/\">  \n            <lastName>Korošec</lastName>  \n            <firstName>Filip</firstName> \n          </author>  \n          <author xmlns=\"http://www.clarin.eu/cmd/\">  \n            <lastName>Žabkar</lastName>  \n            <firstName>Ines</firstName> \n          </author>  \n          <author xmlns=\"http://www.clarin.eu/cmd/\">  \n            <lastName>Kuzman</lastName>  \n            <firstName>Ema</firstName> \n          </author>  \n          <author xmlns=\"http://www.clarin.eu/cmd/\">  \n            <lastName>Rigler</lastName>  \n            <firstName>Martin</firstName> \n          </author>  \n          <author xmlns=\"http://www.clarin.eu/cmd/\">  \n            <lastName>Škufca</lastName>  \n            <firstName>Lara</firstName> \n          </author> \n        </cmd:authors>  \n        <cmd:dates> \n          <cmd:dateIssued>2026-02-03</cmd:dateIssued> \n        </cmd:dates>  \n        <cmd:identifiers> \n          <cmd:identifier type=\"Handle\">https://hdl.handle.net/11356/2085</cmd:identifier> \n        </cmd:identifiers>  \n        <cmd:funds> \n          <funding xmlns=\"http://www.clarin.eu/cmd/\">  \n            <organization>ARIS (Slovenian Research and Innovation Agency)</organization>  \n            <code>NOO</code>  \n            <projectName>PoVeJMo research project 
(Adaptive Natural Language Processing with Large Language Models)</projectName>  \n            <fundsType>nationalFunds</fundsType> \n          </funding>  \n          <funding xmlns=\"http://www.clarin.eu/cmd/\">  \n            <organization>ARRS (Slovenian Research Agency)</organization>  \n            <code>P6-0411</code>  \n            <projectName>Language Resources and Technologies for Slovene</projectName>  \n            <fundsType>nationalFunds</fundsType> \n          </funding> \n        </cmd:funds>  \n        <contactPerson xmlns=\"http://www.clarin.eu/cmd/\">  \n          <firstName>Borut</firstName>  \n          <lastName>Fabjan</lastName>  \n          <email>info@better.care</email>  \n          <affiliation>Better, d.o.o.</affiliation> \n        </contactPerson>  \n        <cmd:publishers> \n          <cmd:publisher>Better, d.o.o.</cmd:publisher> \n        </cmd:publishers> \n      </cmd:bibliographicInfo>  \n      <cmd:dataInfo> \n        <cmd:type>corpus</cmd:type>  \n        <cmd:description>GaMS-Instruct-MED-Anatomy is an instruction-following dataset containing 711,805 prompt-response units in Slovene (with English and Latin terminology). The units form a structured, machine-readable database of Slovenian anatomical terminology for training language models. The collection is based on anatomical data collected, translated and validated by medical experts. The data was processed, structured and enriched with automatic scripts and explanations generated using large language models. 
It includes: • Anatomical terminology in Slovene, English and Latin • SNOMED CT classification (standardized medical coding system) • Classification by body systems • Synonyms and alternative terms (original and generated) • Popular explanations of anatomical structures for the general public • Expert explanations of anatomical structures for medical experts The result is a standardized database in an instructional format that is suitable for use in computational linguistics, natural language processing (NLP), medical informatics and for training and adapting large language models. The corpus is intended for research and development in language model fine-tuning, training and adapting large language models for the medical and anatomical domains, development of medical chatbots and assistants in Slovene, support for healthcare professionals in anatomical terminology, translation of medical documentation, standardization of medical terminology in Slovene, and education in the field of anatomy. 
For more details on the structure of the dataset, please consult 00README.txt.</cmd:description>  \n        <cmd:languages> \n          <cmd:language> \n            <cmd:code>slv</cmd:code>  \n            <cmd:name>Slovenian</cmd:name> \n          </cmd:language> \n        </cmd:languages>  \n        <cmd:keywords> \n          <cmd:keyword>instruction following dataset</cmd:keyword>  \n          <cmd:keyword>medical texts</cmd:keyword>  \n          <cmd:keyword>large language models</cmd:keyword>  \n          <cmd:keyword>anatomy</cmd:keyword> \n        </cmd:keywords>  \n        <cmd:sizeInfo> \n          <size xmlns=\"http://www.clarin.eu/cmd/\">  \n            <size>711805</size>  \n            <unit>units</unit> \n          </size> \n        </cmd:sizeInfo> \n      </cmd:dataInfo>  \n      <cmd:licenseInfo> \n        <cmd:license> \n          <cmd:uri>https://creativecommons.org/licenses/by/4.0/</cmd:uri> \n        </cmd:license> \n      </cmd:licenseInfo> \n    </cmd:LINDAT_CLARIN> \n  </cmd:Components> \n</cmd:CMD>"}