Lemma list of the SYN-series corpora (ELEXIS)

 
CLARIN.SI data & tools
  Authors
Křen, Michal
  Item identifier
http://hdl.handle.net/11356/1554
 Project URL
https://wiki.korpus.cz/doku.php/en:cnk:syn
 Demo URL
https://korpus.cz/lists/
 Referenced by
http://www.lrec-conf.org/proceedings/lrec2014/pdf/294_Paper.pdf
 Date issued
2020-06-26
 Type
lexicalConceptualResource, text
 Size
169934 other
 Language(s)
Czech
 Description
Lemma list derived from the representative synchronic written corpora of the SYN series. The format is a simple TSV file with the columns in the following order: lemma, POS, SYN2000, SYN2005, SYN2010, SYN2015. Each corpus is represented by two columns, frequency and i.p.m. (instances per million), so the file has 10 columns in total. The list is filtered to include only alphabetical lemmas with non-zero frequency in all four corpora.
 Publisher
Institute of the Czech National Corpus
 Subject(s)
monolingual, lemma list, pos, part of speech
 Collection(s)
CLARIN.SI ELEXIS
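
The 10-column layout given in the description can be read with a few lines of Python. The following is a minimal sketch, not an official loader. It assumes that each corpus pair is ordered frequency first and i.p.m. second, that frequencies are integers, and that i.p.m. values use a decimal point. The names `LemmaEntry`, `parse_lemma_list` and `CORPORA` are hypothetical, and the sample row is a made-up placeholder, not data from the resource.

```python
import csv
import io
from dataclasses import dataclass

# Corpus order as given in the item description.
CORPORA = ("SYN2000", "SYN2005", "SYN2010", "SYN2015")


@dataclass
class LemmaEntry:
    lemma: str
    pos: str
    freq: dict  # corpus name -> absolute frequency
    ipm: dict   # corpus name -> instances per million


def parse_lemma_list(stream):
    """Yield one LemmaEntry per well-formed 10-column TSV row."""
    reader = csv.reader(stream, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        if len(row) != 10:
            continue  # skip blank or malformed lines
        lemma, pos, *numbers = row
        try:
            # Assumption: each corpus contributes (frequency, i.p.m.) in that order.
            freq = {c: int(numbers[2 * i]) for i, c in enumerate(CORPORA)}
            ipm = {c: float(numbers[2 * i + 1]) for i, c in enumerate(CORPORA)}
        except ValueError:
            continue  # non-numeric cells, e.g. a header row if the file has one
        yield LemmaEntry(lemma, pos, freq, ipm)


# Made-up placeholder row, NOT taken from the actual lemma list.
sample = "lemma_x\tN\t10\t0.1\t20\t0.2\t30\t0.3\t40\t0.4\n"
entries = list(parse_lemma_list(io.StringIO(sample)))
print(entries[0].freq["SYN2010"], entries[0].ipm["SYN2015"])
```

To read the downloaded file instead of the in-memory sample, pass an open file object, for example `open(path, encoding="utf-8", newline="")`, where `path` points to wherever the TSV was saved. Rows that do not fit the assumed layout are skipped silently, so it is worth comparing the number of parsed entries with the size stated in the record.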