Prikaži enostavni zapis vnosa

 
dc.contributor.author Perdih, Andrej
dc.contributor.author Bizjak Končar, Aleksandra
dc.contributor.author Divjak Race, Duša
dc.contributor.author Gabrovšek, Dejan
dc.contributor.author Ježovnik, Janoš
dc.contributor.author Krvina, Domen
dc.contributor.author Ledinek, Nina
dc.contributor.author Michelizza, Mija
dc.contributor.author Mirtič, Tanja
dc.contributor.author Petric Žižić, Špela
dc.contributor.author Sušnik, Miha
dc.contributor.author Trojar, Mitja
dc.date.accessioned 2026-06-30T08:46:36Z
dc.date.available 2026-06-30T08:46:36Z
dc.date.issued 2026-06-22
dc.identifier.uri http://hdl.handle.net/11356/2254
dc.description The datasets contain sense–genus combinations from the Dictionary of the Slovenian Standard Language, 2nd Edition (Slovar slovenskega knjižnega jezika, druga, dopolnjena in deloma prenovljena izdaja; https://www.fran.si/133/sskj2-slovar-slovenskega-knjiznega-jezika-2). Genus is defined as a word denoting a broad, general category or superordinate class to which a defined word belongs. In the current version, 48,028 noun senses with 3,985 genera are included. Genera were attributed automatically and manually curated. The first dataset (SSKJ2_headword_genus.xml) is focused on senses. Each dictionary sense contains the following information: headword or subheadword, entry ID, sense ID and one or more genera. The second dataset (SSKJ2_genusGroups.xml) is focused on genera. One or more dictionary senses are attributed to each genus; for each dictionary sense, the following information are provided: headword or subheadword, entry ID and sense ID. No distinction between genera has been made with regard to homographs and homonyms. For both XML files, the corresponding XML schemas are provided. In rare cases, adjectival headwords are included, when the sense pertains to a multi-word unit containing an adjective and a substantive. Similarly, some noun senses are excluded, if they pertain to non-nominal phrases or are defined only by synonyms. In the current version, words such as vsak, vsaka, vsako, and del, which form syntactic heads, are treated as genera. All genera are single-word units, even in cases where multi-word units would be expected.
dc.language.iso slv
dc.publisher ZRC SAZU
dc.rights Creative Commons - Attribution 4.0 International (CC BY 4.0)
dc.rights.uri https://creativecommons.org/licenses/by/4.0/
dc.rights.label PUB
dc.source.uri https://www.cjvt.si/povejmo/en/project/
dc.subject dictionary sense
dc.subject genus
dc.title Genus (proximum) in the SSKJ2 dictionary senses
dc.type lexicalConceptualResource
metashare.ResourceInfo#ContentInfo.detailedType ontology
metashare.ResourceInfo#ContentInfo.mediaType text
has.files yes
branding CLARIN.SI data & tools
contact.person Andrej Perdih andrej.perdih@zrc-sazu.si ZRC SAZU
sponsor ARIS (Slovenian Research and Innovation Agency) NOO PoVeJMo research project (Adaptive Natural Language Processing with Large Language Models) nationalFunds
size.info 48028 semanticUnits
files.count 1
files.size 1451453


 Datoteke v tem vnosu

To je vnos
Publicly Available
z licenco:
Creative Commons - Attribution 4.0 International (CC BY 4.0)
Distributed under Creative Commons Attribution Required
Icon
Ime
SSKJ2_sense_genus.zip
Velikost
1.38 MB
Format
application/zip
Opis
SSKJ2 sense-genus XML and schema
MD5
2022266893e2ed849f00d0f47f7e50e9
 Prenesi datoteko  Predogled
 Predogled datoteke  
    • SSKJ2_genusGroups.xml8 MB
    • SSKJ2_sense_genus.xsd966 B
    • SSKJ2_sense_genus.xml7 MB
    • 00README.txt2 kB
    • SSKJ2_genusGroups.xsd1 kB

Prikaži enostavni zapis vnosa