dc.contributor.author Čibej, Jaka
dc.contributor.author Kosem, Iztok
dc.date.accessioned 2022-11-15T08:51:32Z
dc.date.available 2022-11-15T08:51:32Z
dc.date.issued 2022-10-28
dc.identifier.uri http://hdl.handle.net/11356/1705
dc.description This frequency list of words was prepared by using the LIST corpus extraction tool (http://hdl.handle.net/11356/1227) to extract words (i.e. lemmas with their lexical features) from the part of the Trendi Monitor Corpus of Slovene (http://hdl.handle.net/11356/1590) that covers the period between 1 January 2020 and 31 December 2020. The Trendi frequency list was then compared with the frequency list of words from the Gigafida 2.0 Corpus of Slovene (http://hdl.handle.net/11356/1320), which covers the period between 1991 and 2018, and the frequency list of words from Trendi for 2019; together these two lists represent the first period (1991–2019). The words were compared using the simple maths formula implemented by Sketch Engine (see https://www.sketchengine.eu/documentation/simple-maths/). The final list contains lemmas, their lexical features, their absolute and relative frequencies in the first period (1991–2019) and the second period (2020), and the simple maths value, which indicates whether the word is more frequent in 2020 (simple maths > 1.00) or in 1991–2019 (simple maths < 1.00). For frequency lists of words that are typical of previous years according to the simple maths measure (e.g. 2019 vs. 1991–2018), please refer to earlier versions of this entry.
dc.language.iso slv
dc.publisher Jožef Stefan Institute
dc.relation.replaces http://hdl.handle.net/11356/1701
dc.relation.isreplacedby http://hdl.handle.net/11356/1712
dc.rights Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
dc.rights.uri https://creativecommons.org/licenses/by-sa/4.0/
dc.rights.label PUB
dc.source.uri https://sled.ijs.si/
dc.subject frequency list
dc.subject words
dc.subject monitor corpus
dc.title Frequency list of words from the Trendi corpus 2020
dc.type lexicalConceptualResource
metashare.ResourceInfo#ContentInfo.detailedType wordList
metashare.ResourceInfo#ContentInfo.mediaType text
hidden hidden
has.files yes
branding CLARIN.SI data & tools
contact.person Jaka Čibej jaka.cibej@ijs.si Jožef Stefan Institute
sponsor Ministry of Culture of the Republic of Slovenia JR-infrastruktura-SJ-2021-2022 SLED - Monitor corpus of Slovene and related resources nationalFunds
size.info 4555454 words
files.count 1
files.size 25204009
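
The comparison described in dc.description can be illustrated with a minimal sketch of the simple maths score as documented by Sketch Engine: the ratio of the two frequencies per million, each increased by a smoothing constant. The function name, the parameter names, and the smoothing value of 1 are assumptions made for illustration; this record does not state which smoothing value was used or how the relative frequencies in the list are normalised.

```python
def simple_maths(freq_focus, size_focus, freq_ref, size_ref, smoothing=1.0):
    """Return the simple maths score of one word.

    freq_focus / size_focus: absolute frequency and corpus size (in tokens)
    for the focus period (here, 2020).
    freq_ref / size_ref: the same for the reference period (here, 1991-2019).
    smoothing: constant added to both normalised frequencies
    (assumed to be 1 here; not stated in the record).
    """
    # Normalise both absolute frequencies to occurrences per million tokens.
    fpm_focus = freq_focus / size_focus * 1_000_000
    fpm_ref = freq_ref / size_ref * 1_000_000
    # The smoothing constant keeps the ratio defined when the word
    # does not occur in the reference period at all.
    return (fpm_focus + smoothing) / (fpm_ref + smoothing)


# Hypothetical counts: 30 vs. 10 occurrences per million tokens.
score = simple_maths(30, 1_000_000, 10, 1_000_000)
print(round(score, 2))  # above 1.00, so the word is more typical of 2020
```

A score above 1.00 marks a word as more frequent in the focus period and a score below 1.00 as more frequent in the reference period, matching the interpretation given in the description.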


 Files in this item

This item is publicly available and licensed under:
Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
Name: sled_words_2020_vs_1991-2019.zip
Size: 24.04 MB
Format: application/zip
Description: sled_words_2020_vs_1991-2019
MD5: 6982098bea1259d409fd0e2b793a7913
File preview
    • sled_words_2020_vs_1991-2019.tsv (241 MB)
    • 00README.txt (1 kB)
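
The MD5 value listed for the archive can be used to check that a download is complete. The following is a minimal sketch, assuming the archive was saved under its original name in the current directory; the function name and the chunk size are illustrative choices.

```python
import hashlib

# Checksum and file name as listed in this record.
EXPECTED_MD5 = "6982098bea1259d409fd0e2b793a7913"
ARCHIVE_NAME = "sled_words_2020_vs_1991-2019.zip"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hexadecimal MD5 digest of the file at `path`."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read in chunks so that large files are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


if __name__ == "__main__":
    # Assumes the archive sits in the current working directory.
    actual = md5_of_file(ARCHIVE_NAME)
    print("checksum OK" if actual == EXPECTED_MD5 else "checksum mismatch")
```

A mismatch usually points to an interrupted or corrupted download, in which case fetching the archive again is the simplest remedy.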
