| dc.contributor.author | Perdih, Andrej |
| dc.contributor.author | Bizjak Končar, Aleksandra |
| dc.contributor.author | Divjak Race, Duša |
| dc.contributor.author | Gabrovšek, Dejan |
| dc.contributor.author | Ježovnik, Janoš |
| dc.contributor.author | Krvina, Domen |
| dc.contributor.author | Ledinek, Nina |
| dc.contributor.author | Michelizza, Mija |
| dc.contributor.author | Mirtič, Tanja |
| dc.contributor.author | Petric Žižić, Špela |
| dc.contributor.author | Sušnik, Miha |
| dc.contributor.author | Trojar, Mitja |
| dc.date.accessioned | 2026-06-30T08:46:36Z |
| dc.date.available | 2026-06-30T08:46:36Z |
| dc.date.issued | 2026-06-22 |
| dc.identifier.uri | http://hdl.handle.net/11356/2254 |
| dc.description | The datasets contain sense–genus combinations from the Dictionary of the Slovenian Standard Language, 2nd Edition (Slovar slovenskega knjižnega jezika, druga, dopolnjena in deloma prenovljena izdaja; https://www.fran.si/133/sskj2-slovar-slovenskega-knjiznega-jezika-2). Genus is defined as a word denoting a broad, general category or superordinate class to which a defined word belongs. In the current version, 48,028 noun senses with 3,985 genera are included. Genera were attributed automatically and manually curated. The first dataset (SSKJ2_headword_genus.xml) is focused on senses. Each dictionary sense contains the following information: headword or subheadword, entry ID, sense ID and one or more genera. The second dataset (SSKJ2_genusGroups.xml) is focused on genera. One or more dictionary senses are attributed to each genus; for each dictionary sense, the following information are provided: headword or subheadword, entry ID and sense ID. No distinction between genera has been made with regard to homographs and homonyms. For both XML files, the corresponding XML schemas are provided. In rare cases, adjectival headwords are included, when the sense pertains to a multi-word unit containing an adjective and a substantive. Similarly, some noun senses are excluded, if they pertain to non-nominal phrases or are defined only by synonyms. In the current version, words such as vsak, vsaka, vsako, and del, which form syntactic heads, are treated as genera. All genera are single-word units, even in cases where multi-word units would be expected. |
| dc.language.iso | slv |
| dc.publisher | ZRC SAZU |
| dc.rights | Creative Commons - Attribution 4.0 International (CC BY 4.0) |
| dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ |
| dc.rights.label | PUB |
| dc.source.uri | https://www.cjvt.si/povejmo/en/project/ |
| dc.subject | dictionary sense |
| dc.subject | genus |
| dc.title | Genus (proximum) in the SSKJ2 dictionary senses |
| dc.type | lexicalConceptualResource |
| metashare.ResourceInfo#ContentInfo.detailedType | ontology |
| metashare.ResourceInfo#ContentInfo.mediaType | text |
| has.files | yes |
| branding | CLARIN.SI data & tools |
| contact.person | Andrej Perdih andrej.perdih@zrc-sazu.si ZRC SAZU |
| sponsor | ARIS (Slovenian Research and Innovation Agency) NOO PoVeJMo research project (Adaptive Natural Language Processing with Large Language Models) nationalFunds |
| size.info | 48028 semanticUnits |
| files.count | 1 |
| files.size | 1451453 |
Files in this item
This item is
Creative Commons - Attribution 4.0 International (CC BY 4.0)
Publicly Available
and licensed under:Creative Commons - Attribution 4.0 International (CC BY 4.0)
- Name
- SSKJ2_sense_genus.zip
- Size
- 1.38 MB
- Format
- application/zip
- Description
- SSKJ2 sense-genus XML and schema
- MD5
- 2022266893e2ed849f00d0f47f7e50e9