Files in this item
All files in item: 60.29 KB
This item is Publicly Available and licensed under: Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
- Name
- ONTEM-v1-DATA.csv
- Size
- 44.75 KB
- Format
- CSV file
- Description
- Unknown
- MD5
- 4c5fea38ade5b8815f5aa399b73738d9
- Name
- ONTEM-v1-README.txt
- Size
- 15.55 KB
- Format
- Text file
- Description
- Unknown
- MD5
- 7d5d83188865f2b628002c40ad986f51
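The MD5 values listed above can be used to check that a download arrived intact. The following is a minimal sketch, assuming both files were saved under their listed names in one local directory; the helper names `md5_of_file` and `verify_downloads` are illustrative and not part of the dataset.

```python
import hashlib
from pathlib import Path

# Checksums copied from the file listing above.
EXPECTED_MD5 = {
    "ONTEM-v1-DATA.csv": "4c5fea38ade5b8815f5aa399b73738d9",
    "ONTEM-v1-README.txt": "7d5d83188865f2b628002c40ad986f51",
}


def md5_of_file(path, chunk_size=65536):
    """Return the hexadecimal MD5 digest of the file at `path`."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        # Read in chunks so the whole file never has to sit in memory.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_downloads(directory="."):
    """Map each listed file name to True if it exists and its MD5 matches."""
    results = {}
    for name, expected in EXPECTED_MD5.items():
        path = Path(directory) / name
        results[name] = path.is_file() and md5_of_file(path) == expected
    return results
```

Calling `verify_downloads("path/to/download/folder")` returns a dictionary such as `{"ONTEM-v1-DATA.csv": True, ...}`; a `False` entry means the file is missing or differs from the published version. MD5 is used here only because it is the checksum the listing provides; it detects accidental corruption, not deliberate tampering.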
README – Ontology of Topics for Slovenian as a Second and Foreign Language ONTEM 1.0
The data in tabular format comprises 8 columns:
A: Lemma / Lema includes a list of 1,019 lemmas from the KUUS corpus.
B: Part-of-speech / Besedna vrsta provides information about the part-of-speech of the included words following the MULTEXT-East tagset for Slovenian (https://nl.ijs.si/ME/V6/msd/html/msd-sl.html).
C: CEFR level / Raven SEJO provides information on the classification of lemmas according to the CEFR proficiency levels. The assignment is based on Core vocabulary for Slovenian as L2 (http://hdl.handle.net/11356/1697), which organises lexical items into levels A1, A2, and B1. If the lemma is not included in the Core vocabulary for Slovenian as L2, no information is provided in this column.
D: Confirmation of the CEFR level / Potrditev ravni SEJO indicates whether a lemma was validated as belonging to the A1 level. Specialists in Slovenian as a foreign and second language conducted in . . .
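
As an illustration of how the column layout described so far might be used, the sketch below tallies lemmas by the CEFR level in column C. It rests on several assumptions that the README excerpt does not confirm: that the CSV is UTF-8 encoded, comma-delimited, and begins with one header row. Columns are addressed by position (A = 0, B = 1, C = 2) because the exact header strings are not known; `read_rows` and `count_cefr_levels` are hypothetical helper names.

```python
import csv
from collections import Counter

# Zero-based positions of the columns described in the README.
LEMMA_COL = 0  # column A: Lemma / Lema
POS_COL = 1    # column B: Part-of-speech / Besedna vrsta
CEFR_COL = 2   # column C: CEFR level / Raven SEJO


def read_rows(path, delimiter=","):
    """Read the CSV into a list of rows (assumed UTF-8, assumed delimiter)."""
    with open(path, newline="", encoding="utf-8") as fh:
        return list(csv.reader(fh, delimiter=delimiter))


def count_cefr_levels(rows, has_header=True):
    """Count lemmas per CEFR level; empty cells are tallied as '(none)'."""
    counts = Counter()
    data = rows[1:] if has_header else rows
    for row in data:
        # Column C is left empty when the lemma is not in the Core vocabulary.
        level = row[CEFR_COL].strip() if len(row) > CEFR_COL else ""
        counts[level or "(none)"] += 1
    return counts
```

With the file downloaded, `count_cefr_levels(read_rows("ONTEM-v1-DATA.csv"))` would return a `Counter` keyed by the level labels found in column C, plus `"(none)"` for lemmas without a level. If the file turns out to use a different delimiter, pass it to `read_rows` explicitly.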