Files in this item

 Download all files in item (60.29 KB)
Icon
Name
ONTEM-v1-DATA.csv
Size
44.75 KB
Format
CSV file
Description
Neznano
MD5
4c5fea38ade5b8815f5aa399b73738d9
 Download file
Icon
Name
ONTEM-v1-README.txt
Size
15.55 KB
Format
Text file
Description
Neznano
MD5
7d5d83188865f2b628002c40ad986f51
 Download file  Preview
 File Preview  
README – Ontology of Topics for Slovenian as a Second and Foreign Language ONTEM 1.0
The data in tabular format comprises 8 columns:
A: Lemma / Lema includes a list of 1,019 lemmas from the KUUS corpus. 
B: Part-of-speech / Besedna vrsta provides information about the part-of-speech of the included words following the MULTEXT-East tagset for Slovenian (https://nl.ijs.si/ME/V6/msd/html/msd-sl.html).
C: CEFR level / Raven SEJO provides information on the classification of lemmas according to the CEFR proficiency levels. The assignment is based on Core vocabulary for Slovenian as L2 (http://hdl.handle.net/11356/1697), which organises lexical items into levels A1, A2, and B1. If the lemma is not included in the Core vocabulary for Slovenian as L2, no information is provided in this column.
D: Confirmation of the CEFR level / Potrditev ravni SEJO indicates whether a lemma was validated as belonging to the A1 level. Specialists in Slovenian as a foreign and second language conducted in . . .