Show simple item record

 
dc.contributor.author Čibej, Jaka
dc.contributor.author Arhar Holdt, Špela
dc.contributor.author Dobrovoljc, Kaja
dc.contributor.author Krek, Simon
dc.date.accessioned 2019-11-13T11:00:28Z
dc.date.available 2019-11-13T11:00:28Z
dc.date.issued 2019-11-18
dc.identifier.uri http://hdl.handle.net/11356/1275
dc.description Frequency lists of words split into word parts were extracted from the Gigafida 2.0 Corpus of Written Standard Slovene (https://viri.cjvt.si/gigafida/) using the LIST corpus extraction tool (http://hdl.handle.net/11356/1227). The lists contain all lemmas or lower-case word forms occurring in the corpus, split into their initial or final part (i.e. the initial or final string of 1, 2, 3, 4 or 5 characters in the word) and the rest of the word. In addition, the lists also contain absolute and relative frequencies, percentages, and distribution across the text-types included in the corpus taxonomy. The lists were extracted for each part-of-speech category. For each part-of-speech, a total of 20 lists were extracted: 1) 10 lists for initial or final word parts extracted from lemmas, 2) 10 lists for initial or final word parts extracted from lower-case word forms. In addition, 20 lists were extracted from all words (regardless of their part-of-speech category). For easier processing in statistical analysis software, shortened versions of longer lists were made containing the first 150,000 lines.
dc.language.iso slv
dc.publisher Centre for Language Resources and Technologies, University of Ljubljana
dc.publisher Jožef Stefan Institute
dc.rights Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
dc.rights.uri https://creativecommons.org/licenses/by-sa/4.0/
dc.rights.label PUB
dc.source.uri http://slovnica.ijs.si/
dc.subject word parts
dc.subject morphology
dc.subject standard language
dc.subject frequency list
dc.subject initial part of the word
dc.subject final part of the word
dc.title Frequency lists of word parts from the Gigafida 2.0 corpus
dc.type lexicalConceptualResource
metashare.ResourceInfo#ContentInfo.detailedType wordList
metashare.ResourceInfo#ContentInfo.mediaType text
has.files yes
branding CLARIN.SI data & tools
contact.person Jaka Čibej jaka.cibej@cjvt.si Centre for Language Resources and Technologies, University of Ljubljana
sponsor ARRS (Slovenian Research Agency) J6-8256 New grammar of contemporary standard Slovene: sources and methods nationalFunds
files.count 30
files.size 2119542857


 Files in this item

This item is
Publicly Available
and licensed under:
Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
Distributed under Creative Commons Attribution Required Share Alike
Icon
Name
GF2.0-word_parts-all-lemmas-short.zip
Size
18.84 MB
Format
application/zip
Description
Shortened frequency lists of word parts from lemmas in Gigafida 2.0
MD5
8117455dd54961553b1a2782801020b8
 Download file  Preview
 File Preview  
    • GF2.0-word_parts-all-lemmas-final-1grams-short.tsv7 MB
    • GF2.0-word_parts-all-lemmas-initial-1grams-short.tsv7 MB
    • GF2.0-word_parts-all-lemmas-final-2grams-short.tsv7 MB
    • GF2.0-word_parts-all-lemmas-final-4grams-short.tsv7 MB
    • GF2.0-word_parts-all-lemmas-initial-2grams-short.tsv7 MB
    • GF2.0-word_parts-all-lemmas-final-3grams-short.tsv7 MB
    • GF2.0-word_parts-all-lemmas-final-5grams-short.tsv8 MB
    • GF2.0-word_parts-all-lemmas-initial-5grams-short.tsv8 MB
    • GF2.0-word_parts-all-lemmas-initial-3grams-short.tsv7 MB
    • GF2.0-word_parts-all-lemmas-initial-4grams-short.tsv7 MB
Icon
Name
GF2.0-word_parts-all-lemmas-entire.zip
Size
354.89 MB
Format
application/zip
Description
Frequency lists of word parts from all lemmas in Gigafida 2.0
MD5
2ce468a15cc9a76f7e511d5c3ed1c81d
 Download file  Preview
 File Preview  
    • GF2.0-word_parts-all-lemmas-final-4grams-entire.tsv180 MB
    • GF2.0-word_parts-all-lemmas-final-5grams-entire.tsv174 MB
    • GF2.0-word_parts-all-lemmas-initial-2grams-entire.tsv182 MB
    • GF2.0-word_parts-all-lemmas-initial-1grams-entire.tsv182 MB
    • GF2.0-word_parts-all-lemmas-initial-3grams-entire.tsv182 MB
    • GF2.0-word_parts-all-lemmas-final-1grams-entire.tsv182 MB
    • GF2.0-word_parts-all-lemmas-initial-5grams-entire.tsv174 MB
    • GF2.0-word_parts-all-lemmas-initial-4grams-entire.tsv180 MB
    • GF2.0-word_parts-all-lemmas-final-3grams-entire.tsv182 MB
    • GF2.0-word_parts-all-lemmas-final-2grams-entire.tsv182 MB
Icon
Name
GF2.0-word_parts-all-lowercase_forms-short.zip
Size
16.59 MB
Format
application/zip
Description
Shortened frequency lists of word parts of lower-case forms in Gigafida 2.0
MD5
7cf3f344f475dbcdbb2de70bd9769621
 Download file  Preview
 File Preview  
    • GF2.0-word_parts-all-lowercase_forms-initial-2grams-short.tsv6 MB
    • GF2.0-word_parts-all-lowercase_forms-initial-4grams-short.tsv6 MB
    • GF2.0-word_parts-all-lowercase_forms-final-1grams-short.tsv6 MB
    • GF2.0-word_parts-all-lowercase_forms-initial-5grams-short.tsv6 MB
    • GF2.0-word_parts-all-lowercase_forms-final-2grams-short.tsv6 MB
    • GF2.0-word_parts-all-lowercase_forms-final-3grams-short.tsv6 MB
    • GF2.0-word_parts-all-lowercase_forms-final-4grams-short.tsv6 MB
    • GF2.0-word_parts-all-lowercase_forms-final-5grams-short.tsv6 MB
    • GF2.0-word_parts-all-lowercase_forms-initial-1grams-short.tsv6 MB
    • GF2.0-word_parts-all-lowercase_forms-initial-3grams-short.tsv6 MB
Icon
Name
GF2.0-word_parts-all-lowercase_forms-entire.zip
Size
335.24 MB
Format
application/zip
Description
Frequency lists of word parts from all lower-case forms in Gigafida 2.0
MD5
d5c4eae389c0a9ec3b6e4e4a568b52a8
 Download file  Preview
 File Preview  
    • GF2.0-word_parts-all-lowercase_forms-final-1grams-entire.tsv176 MB
    • GF2.0-word_parts-all-lowercase_forms-initial-4grams-entire.tsv175 MB
    • GF2.0-word_parts-all-lowercase_forms-initial-3grams-entire.tsv176 MB
    • GF2.0-word_parts-all-lowercase_forms-final-3grams-entire.tsv176 MB
    • GF2.0-word_parts-all-lowercase_forms-final-2grams-entire.tsv176 MB
    • GF2.0-word_parts-all-lowercase_forms-initial-5grams-entire.tsv172 MB
    • GF2.0-word_parts-all-lowercase_forms-final-4grams-entire.tsv175 MB
    • GF2.0-word_parts-all-lowercase_forms-final-5grams-entire.tsv172 MB
    • GF2.0-word_parts-all-lowercase_forms-initial-2grams-entire.tsv176 MB
    • GF2.0-word_parts-all-lowercase_forms-initial-1grams-entire.tsv176 MB
Icon
Name
GF2.0-word_parts-nouns-lemmas-short.zip
Size
39.43 MB
Format
application/zip
Description
Shortened frequency lists of word parts of noun lemmas in Gigafida 2.0
MD5
f666c5549fcf3eab5e88c70650c3bf81
 Download file  Preview
 File Preview  
    • GF2.0-word_parts-nouns-lemmas-initial-5grams-taxonomy-short.tsv24 MB
    • GF2.0-word_parts-nouns-lemmas-final-4grams-taxonomy-short.tsv24 MB
    • GF2.0-word_parts-nouns-lemmas-initial-3grams-taxonomy-short.tsv24 MB
    • GF2.0-word_parts-nouns-lemmas-final-2grams-taxonomy-short.tsv24 MB
    • GF2.0-word_parts-nouns-lemmas-initial-1grams-taxonomy-short.tsv24 MB
    • GF2.0-word_parts-nouns-lemmas-final-5grams-taxonomy-short.tsv24 MB
    • GF2.0-word_parts-nouns-lemmas-initial-4grams-taxonomy-short.tsv24 MB
    • GF2.0-word_parts-nouns-lemmas-final-3grams-taxonomy-short.tsv24 MB
    • GF2.0-word_parts-nouns-lemmas-initial-2grams-taxonomy-short.tsv24 MB
    • GF2.0-word_parts-nouns-lemmas-final-1grams-taxonomy-short.tsv24 MB
Icon
Name
GF2.0-word_parts-nouns-lemmas-entire.zip
Size
308.13 MB
Format
application/zip
Description
Frequency lists of word parts of noun lemmas in Gigafida 2.0
MD5
f4ca93ae374ed6fe97c7663437c802e1
 Download file  Preview
 File Preview  
    • GF2.0-word_parts-nouns-lemmas-initial-2grams-taxonomy-entire.tsv345 MB
    • GF2.0-word_parts-nouns-lemmas-final-4grams-taxonomy-entire.tsv339 MB
    • GF2.0-word_parts-nouns-lemmas-initial-5grams-taxonomy-entire.tsv324 MB
    • GF2.0-word_parts-nouns-lemmas-initial-1grams-taxonomy-entire.tsv345 MB
    • GF2.0-word_parts-nouns-lemmas-final-3grams-taxonomy-entire.tsv345 MB
    • GF2.0-word_parts-nouns-lemmas-initial-4grams-taxonomy-entire.tsv339 MB
    • GF2.0-word_parts-nouns-lemmas-final-2grams-taxonomy-entire.tsv345 MB
    • GF2.0-word_parts-nouns-lemmas-final-5grams-taxonomy-entire.tsv324 MB
    • GF2.0-word_parts-nouns-lemmas-initial-3grams-taxonomy-entire.tsv345 MB
    • GF2.0-word_parts-nouns-lemmas-final-1grams-taxonomy-entire.tsv345 MB
Icon
Name
GF2.0-word_parts-nouns-lowercase_forms-short.zip
Size
42.86 MB
Format
application/zip
Description
Shortened frequency lists of word parts of noun lower-case forms in Gigafida 2.0
MD5
1a72a644faa50fa43eabfca41c41c6ad
 Download file  Preview
 File Preview  
    • GF2.0-word_parts-nouns-lowercase_forms-initial-2grams-taxonomy-short.tsv23 MB
    • GF2.0-word_parts-nouns-lowercase_forms-final-4grams-taxonomy-short.tsv23 MB
    • GF2.0-word_parts-nouns-lowercase_forms-final-2grams-taxonomy-short.tsv23 MB
    • GF2.0-word_parts-nouns-lowercase_forms-initial-5grams-taxonomy-short.tsv23 MB
    • GF2.0-word_parts-nouns-lowercase_forms-initial-3grams-taxonomy-short.tsv23 MB
    • GF2.0-word_parts-nouns-lowercase_forms-final-5grams-taxonomy-short.tsv23 MB
    • GF2.0-word_parts-nouns-lowercase_forms-initial-1grams-taxonomy-short.tsv23 MB
    • GF2.0-word_parts-nouns-lowercase_forms-final-3grams-taxonomy-short.tsv23 MB
    • GF2.0-word_parts-nouns-lowercase_forms-final-1grams-taxonomy-short.tsv23 MB
    • GF2.0-word_parts-nouns-lowercase_forms-initial-4grams-taxonomy-short.tsv23 MB
Icon
Name
GF2.0-word_parts-nouns-lowercase_forms-entire.zip
Size
297.68 MB
Format
application/zip
Description
Frequency lists of word parts of noun lower-case forms in Gigafida 2.0
MD5
9394bf81f2ad5de5ef0502aef85f1f33
 Download file  Preview
 File Preview  
    • GF2.0-word_parts-nouns-lowercase_forms-final-3grams-taxonomy-entire.tsv359 MB
    • GF2.0-word_parts-nouns-lowercase_forms-initial-5grams-taxonomy-entire.tsv346 MB
    • GF2.0-word_parts-nouns-lowercase_forms-initial-1grams-taxonomy-entire.tsv359 MB
    • GF2.0-word_parts-nouns-lowercase_forms-final-2grams-taxonomy-entire.tsv359 MB
    • GF2.0-word_parts-nouns-lowercase_forms-initial-4grams-taxonomy-entire.tsv356 MB
    • GF2.0-word_parts-nouns-lowercase_forms-final-5grams-taxonomy-entire.tsv346 MB
    • GF2.0-word_parts-nouns-lowercase_forms-final-1grams-taxonomy-entire.tsv359 MB
    • GF2.0-word_parts-nouns-lowercase_forms-initial-3grams-taxonomy-entire.tsv359 MB
    • GF2.0-word_parts-nouns-lowercase_forms-final-4grams-taxonomy-entire.tsv356 MB
    • GF2.0-word_parts-nouns-lowercase_forms-initial-2grams-taxonomy-entire.tsv359 MB
Icon
Name
GF2.0-word_parts-verbs-lemmas-entire.zip
Size
12.43 MB
Format
application/zip
Description
Frequency lists of word parts of verb lemmas in Gigafida 2.0
MD5
a24043cec9a2e0880a2e5f45b28e3368
 Download file  Preview
 File Preview  
    • GF2.0-word_parts-verbs-lemmas-initial-5grams-taxonomy-entire.tsv13 MB
    • GF2.0-word_parts-verbs-lemmas-initial-1grams-taxonomy-entire.tsv13 MB
    • GF2.0-word_parts-verbs-lemmas-final-3grams-taxonomy-entire.tsv13 MB
    • GF2.0-word_parts-verbs-lemmas-initial-4grams-taxonomy-entire.tsv13 MB
    • GF2.0-word_parts-verbs-lemmas-final-2grams-taxonomy-entire.tsv13 MB
    • GF2.0-word_parts-verbs-lemmas-initial-3grams-taxonomy-entire.tsv13 MB
    • GF2.0-word_parts-verbs-lemmas-final-5grams-taxonomy-entire.tsv13 MB
    • GF2.0-word_parts-verbs-lemmas-final-1grams-taxonomy-entire.tsv13 MB
    • GF2.0-word_parts-verbs-lemmas-initial-2grams-taxonomy-entire.tsv13 MB
    • GF2.0-word_parts-verbs-lemmas-final-4grams-taxonomy-entire.tsv13 MB
Icon
Name
GF2.0-word_parts-verbs-lowercase_forms-short.zip
Size
26.28 MB
Format
application/zip
Description
Shortened frequency lists of word parts of verb lower-case forms in Gigafida 2.0
MD5
f456c35879721e0e454741c232ab1035
 Download file  Preview
 File Preview  
    • GF2.0-word_parts-verbs-lowercase_forms-final-5grams-taxonomy-short.tsv22 MB
    • GF2.0-word_parts-verbs-lowercase_forms-initial-5grams-taxonomy-short.tsv22 MB
    • GF2.0-word_parts-verbs-lowercase_forms-final-3grams-taxonomy-short.tsv22 MB
    • GF2.0-word_parts-verbs-lowercase_forms-initial-3grams-taxonomy-short.tsv22 MB
    • GF2.0-word_parts-verbs-lowercase_forms-final-1grams-taxonomy-short.tsv22 MB
    • GF2.0-word_parts-verbs-lowercase_forms-initial-1grams-taxonomy-short.tsv22 MB
    • GF2.0-word_parts-verbs-lowercase_forms-final-4grams-taxonomy-short.tsv22 MB
    • GF2.0-word_parts-verbs-lowercase_forms-initial-4grams-taxonomy-short.tsv22 MB
    • GF2.0-word_parts-verbs-lowercase_forms-final-2grams-taxonomy-short.tsv22 MB
    • GF2.0-word_parts-verbs-lowercase_forms-initial-2grams-taxonomy-short.tsv22 MB
Icon
Name
GF2.0-word_parts-verbs-lowercase_forms-entire.zip
Size
32.48 MB
Format
application/zip
Description
Frequency lists of word parts of verb lower-case forms in Gigafida 2.0
MD5
89d3b639e9d245b358c779a738465019
 Download file  Preview
 File Preview  
    • GF2.0-word_parts-verbs-lowercase_forms-final-5grams-taxonomy-entire.tsv31 MB
    • GF2.0-word_parts-verbs-lowercase_forms-final-1grams-taxonomy-entire.tsv31 MB
    • GF2.0-word_parts-verbs-lowercase_forms-initial-3grams-taxonomy-entire.tsv31 MB
    • GF2.0-word_parts-verbs-lowercase_forms-final-4grams-taxonomy-entire.tsv31 MB
    • GF2.0-word_parts-verbs-lowercase_forms-initial-2grams-taxonomy-entire.tsv31 MB
    • GF2.0-word_parts-verbs-lowercase_forms-final-3grams-taxonomy-entire.tsv31 MB
    • GF2.0-word_parts-verbs-lowercase_forms-initial-5grams-taxonomy-entire.tsv31 MB
    • GF2.0-word_parts-verbs-lowercase_forms-initial-1grams-taxonomy-entire.tsv31 MB
    • GF2.0-word_parts-verbs-lowercase_forms-final-2grams-taxonomy-entire.tsv31 MB
    • GF2.0-word_parts-verbs-lowercase_forms-initial-4grams-taxonomy-entire.tsv31 MB
Icon
Name
GF2.0-word_parts-adjectives-lemmas-short.zip
Size
26.8 MB
Format
application/zip
Description
Shortened frequency lists of word parts of adjective lemmas from Gigafida 2.0
MD5
814eeb61b298ba4182b4b8cbef6ae26f
 Download file  Preview
 File Preview  
    • GF2.0-word_parts-adjectives-lemmas-final-4grams-taxonomy-short.tsv23 MB
    • GF2.0-word_parts-adjectives-lemmas-final-2grams-taxonomy-short.tsv23 MB
    • GF2.0-word_parts-adjectives-lemmas-initial-4grams-taxonomy-short.tsv23 MB
    • GF2.0-word_parts-adjectives-lemmas-initial-2grams-taxonomy-short.tsv23 MB
    • GF2.0-word_parts-adjectives-lemmas-final-5grams-taxonomy-short.tsv23 MB
    • GF2.0-word_parts-adjectives-lemmas-final-3grams-taxonomy-short.tsv23 MB
    • GF2.0-word_parts-adjectives-lemmas-initial-5grams-taxonomy-short.tsv23 MB
    • GF2.0-word_parts-adjectives-lemmas-final-1grams-taxonomy-short.tsv23 MB
    • GF2.0-word_parts-adjectives-lemmas-initial-3grams-taxonomy-short.tsv23 MB
    • GF2.0-word_parts-adjectives-lemmas-initial-1grams-taxonomy-short.tsv23 MB
Icon
Name
GF2.0-word_parts-adjectives-lemmas-entire.zip
Size
58.92 MB
Format
application/zip
Description
Frequency lists of word parts of adjective lemmas from Gigafida 2.0
MD5
ce90cbbce9636f0cc43118b63f93115c
 Download file  Preview
 File Preview  
    • GF2.0-word_parts-adjectives-lemmas-initial-5grams-taxonomy-entire.tsv64 MB
    • GF2.0-word_parts-adjectives-lemmas-final-5grams-taxonomy-entire.tsv64 MB
    • GF2.0-word_parts-adjectives-lemmas-initial-1grams-taxonomy-entire.tsv66 MB
    • GF2.0-word_parts-adjectives-lemmas-final-4grams-taxonomy-entire.tsv65 MB
    • GF2.0-word_parts-adjectives-lemmas-initial-4grams-taxonomy-entire.tsv65 MB
    • GF2.0-word_parts-adjectives-lemmas-initial-3grams-taxonomy-entire.tsv66 MB
    • GF2.0-word_parts-adjectives-lemmas-final-3grams-taxonomy-entire.tsv66 MB
    • GF2.0-word_parts-adjectives-lemmas-initial-2grams-taxonomy-entire.tsv66 MB
    • GF2.0-word_parts-adjectives-lemmas-final-1grams-taxonomy-entire.tsv66 MB
    • GF2.0-word_parts-adjectives-lemmas-final-2grams-taxonomy-entire.tsv66 MB
Icon
Name
GF2.0-word_parts-adjectives-lowercase_forms-short.zip
Size
36.36 MB
Format
application/zip
Description
Shortened frequency lists of word parts of adjective lower-case forms from Gigafida 2.0
MD5
add4d319486f8da6c1dff4d067651570
 Download file  Preview
 File Preview  
    • GF2.0-word_parts-adjectives-lowercase_forms-initial-3grams-taxonomy-short.tsv24 MB
    • GF2.0-word_parts-adjectives-lowercase_forms-final-4grams-taxonomy-short.tsv24 MB
    • GF2.0-word_parts-adjectives-lowercase_forms-initial-1grams-taxonomy-short.tsv24 MB
    • GF2.0-word_parts-adjectives-lowercase_forms-final-2grams-taxonomy-short.tsv24 MB
    • GF2.0-word_parts-adjectives-lowercase_forms-initial-4grams-taxonomy-short.tsv24 MB
    • GF2.0-word_parts-adjectives-lowercase_forms-initial-2grams-taxonomy-short.tsv24 MB
    • GF2.0-word_parts-adjectives-lowercase_forms-final-3grams-taxonomy-short.tsv24 MB
    • GF2.0-word_parts-adjectives-lowercase_forms-final-1grams-taxonomy-short.tsv24 MB
    • GF2.0-word_parts-adjectives-lowercase_forms-final-5grams-taxonomy-short.tsv24 MB
    • GF2.0-word_parts-adjectives-lowercase_forms-initial-5grams-taxonomy-short.tsv24 MB
Icon
Name
GF2.0-word_parts-adjectives-lowercase_forms-entire.zip
Size
114.6 MB
Format
application/zip
Description
Frequency lists of word parts of adjective lower-case forms from Gigafida 2.0
MD5
020d8c6e20091bfddeab1e1eaad42eca
 Download file  Preview
 File Preview  
    • GF2.0-word_parts-adjectives-lowercase_forms-initial-2grams-taxonomy-entire.tsv131 MB
    • GF2.0-word_parts-adjectives-lowercase_forms-final-2grams-taxonomy-entire.tsv131 MB
    • GF2.0-word_parts-adjectives-lowercase_forms-initial-5grams-taxonomy-entire.tsv130 MB
    • GF2.0-word_parts-adjectives-lowercase_forms-initial-1grams-taxonomy-entire.tsv131 MB
    • GF2.0-word_parts-adjectives-lowercase_forms-final-5grams-taxonomy-entire.tsv130 MB
    • GF2.0-word_parts-adjectives-lowercase_forms-final-1grams-taxonomy-entire.tsv131 MB
    • GF2.0-word_parts-adjectives-lowercase_forms-initial-4grams-taxonomy-entire.tsv131 MB
    • GF2.0-word_parts-adjectives-lowercase_forms-final-4grams-taxonomy-entire.tsv131 MB
    • GF2.0-word_parts-adjectives-lowercase_forms-initial-3grams-taxonomy-entire.tsv131 MB
    • GF2.0-word_parts-adjectives-lowercase_forms-final-3grams-taxonomy-entire.tsv131 MB
Icon
Name
GF2.0-word_parts-numerals-lemmas-short.zip
Size
17.54 MB
Format
application/zip
Description
Shortened frequency lists of word parts of numeral lemmas in Gigafida 2.0
MD5
6aa1dcade0c6084c181db11fa8e089a7
 Download file  Preview
 File Preview  
    • GF2.0-word_parts-numerals-lemmas-initial-1grams-taxonomy-short.tsv21 MB
    • GF2.0-word_parts-numerals-lemmas-final-5grams-taxonomy-short.tsv21 MB
    • GF2.0-word_parts-numerals-lemmas-final-3grams-taxonomy-short.tsv21 MB
    • GF2.0-word_parts-numerals-lemmas-final-1grams-taxonomy-short.tsv21 MB
    • GF2.0-word_parts-numerals-lemmas-initial-4grams-taxonomy-short.tsv21 MB
    • GF2.0-word_parts-numerals-lemmas-initial-2grams-taxonomy-short.tsv21 MB
    • GF2.0-word_parts-numerals-lemmas-final-4grams-taxonomy-short.tsv21 MB
    • GF2.0-word_parts-numerals-lemmas-final-2grams-taxonomy-short.tsv21 MB
    • GF2.0-word_parts-numerals-lemmas-initial-5grams-taxonomy-short.tsv21 MB
    • GF2.0-word_parts-numerals-lemmas-initial-3grams-taxonomy-short.tsv21 MB
Icon
Name
GF2.0-word_parts-numerals-lemmas-entire.zip
Size
45.85 MB
Format
application/zip
Description
Frequency lists of word parts of numeral lemmas in Gigafida 2.0
MD5
3bce0ad407095a0588c6414ba6c79615
 Download file  Preview
 File Preview  
    • GF2.0-word_parts-numerals-lemmas-initial-5grams-taxonomy-entire.tsv58 MB
    • GF2.0-word_parts-numerals-lemmas-initial-1grams-taxonomy-entire.tsv62 MB
    • GF2.0-word_parts-numerals-lemmas-final-4grams-taxonomy-entire.tsv61 MB
    • GF2.0-word_parts-numerals-lemmas-initial-4grams-taxonomy-entire.tsv61 MB
    • GF2.0-word_parts-numerals-lemmas-final-3grams-taxonomy-entire.tsv62 MB
    • GF2.0-word_parts-numerals-lemmas-initial-3grams-taxonomy-entire.tsv62 MB
    • GF2.0-word_parts-numerals-lemmas-final-2grams-taxonomy-entire.tsv62 MB
    • GF2.0-word_parts-numerals-lemmas-initial-2grams-taxonomy-entire.tsv62 MB
    • GF2.0-word_parts-numerals-lemmas-final-5grams-taxonomy-entire.tsv58 MB
    • GF2.0-word_parts-numerals-lemmas-final-1grams-taxonomy-entire.tsv62 MB
Icon
Name
GF2.0-word_parts-numerals-lowercase_forms-short.zip
Size
15.11 MB
Format
application/zip
Description
Shortened frequency lists of word parts of numeral lower-case forms in Gigafida 2.0
MD5
9f9f4cc2813113ea6a0591c1e0552d90
 Download file  Preview
 File Preview  
    • GF2.0-word_parts-numerals-lowercase_forms-initial-1grams-taxonomy-short.tsv20 MB
    • GF2.0-word_parts-numerals-lowercase_forms-initial-5grams-taxonomy-short.tsv20 MB
    • GF2.0-word_parts-numerals-lowercase_forms-final-4grams-taxonomy-short.tsv20 MB
    • GF2.0-word_parts-numerals-lowercase_forms-initial-3grams-taxonomy-short.tsv20 MB
    • GF2.0-word_parts-numerals-lowercase_forms-final-2grams-taxonomy-short.tsv20 MB
    • GF2.0-word_parts-numerals-lowercase_forms-initial-2grams-taxonomy-short.tsv20 MB
    • GF2.0-word_parts-numerals-lowercase_forms-initial-4grams-taxonomy-short.tsv20 MB
    • GF2.0-word_parts-numerals-lowercase_forms-final-3grams-taxonomy-short.tsv20 MB
    • GF2.0-word_parts-numerals-lowercase_forms-final-5grams-taxonomy-short.tsv20 MB
    • GF2.0-word_parts-numerals-lowercase_forms-final-1grams-taxonomy-short.tsv20 MB
Icon
Name
GF2.0-word_parts-numerals-lowercase_forms-entire.zip
Size
37.92 MB
Format
application/zip
Description
Frequency lists of word parts of numeral lower-case forms in Gigafida 2.0
MD5
ec1ed058b580de5923bf31efe82a2a10
 Download file  Preview
 File Preview  
    • GF2.0-word_parts-numerals-lowercase_forms-final-3grams-taxonomy-entire.tsv58 MB
    • GF2.0-word_parts-numerals-lowercase_forms-initial-3grams-taxonomy-entire.tsv58 MB
    • GF2.0-word_parts-numerals-lowercase_forms-final-2grams-taxonomy-entire.tsv58 MB
    • GF2.0-word_parts-numerals-lowercase_forms-initial-2grams-taxonomy-entire.tsv58 MB
    • GF2.0-word_parts-numerals-lowercase_forms-final-5grams-taxonomy-entire.tsv54 MB
    • GF2.0-word_parts-numerals-lowercase_forms-final-1grams-taxonomy-entire.tsv58 MB
    • GF2.0-word_parts-numerals-lowercase_forms-initial-5grams-taxonomy-entire.tsv54 MB
    • GF2.0-word_parts-numerals-lowercase_forms-initial-1grams-taxonomy-entire.tsv58 MB
    • GF2.0-word_parts-numerals-lowercase_forms-final-4grams-taxonomy-entire.tsv57 MB
    • GF2.0-word_parts-numerals-lowercase_forms-initial-4grams-taxonomy-entire.tsv57 MB
Icon
Name
GF2.0-word_parts-pronouns.zip
Size
1.15 MB
Format
application/zip
Description
Frequency lists of word parts of pronouns in Gigafida 2.0
MD5
d796d65c12a75854248a867cbff2b256
 Download file  Preview
 File Preview  
    • GF2.0-word_parts-pronouns-lemmas-final-1grams-taxonomy-entire.tsv465 kB
    • GF2.0-word_parts-pronouns-lowercase_forms-initial-2grams-taxonomy-entire.tsv573 kB
    • GF2.0-word_parts-pronouns-lowercase_forms-initial-3grams-taxonomy-entire.tsv561 kB
    • GF2.0-word_parts-pronouns-lemmas-final-2grams-taxonomy-entire.tsv464 kB
    • GF2.0-word_parts-pronouns-lowercase_forms-initial-4grams-taxonomy-entire.tsv527 kB
    • GF2.0-word_parts-pronouns-lemmas-final-3grams-taxonomy-entire.tsv451 kB
    • GF2.0-word_parts-pronouns-lowercase_forms-final-1grams-taxonomy-entire.tsv574 kB
    • GF2.0-word_parts-pronouns-lemmas-final-4grams-taxonomy-entire.tsv415 kB
    • GF2.0-word_parts-pronouns-lowercase_forms-initial-5grams-taxonomy-entire.tsv475 kB
    • GF2.0-word_parts-pronouns-lowercase_forms-final-2grams-taxonomy-entire.tsv573 kB
    • GF2.0-word_parts-pronouns-lemmas-final-5grams-taxonomy-entire.tsv369 kB
    • GF2.0-word_parts-pronouns-lemmas-initial-1grams-taxonomy-entire.tsv465 kB
    • GF2.0-word_parts-pronouns-lowercase_forms-final-3grams-taxonomy-entire.tsv561 kB
    • GF2.0-word_parts-pronouns-lemmas-initial-2grams-taxonomy-entire.tsv464 kB
    • GF2.0-word_parts-pronouns-lowercase_forms-final-4grams-taxonomy-entire.tsv527 kB
    • GF2.0-word_parts-pronouns-lemmas-initial-3grams-taxonomy-entire.tsv451 kB
    • GF2.0-word_parts-pronouns-lowercase_forms-final-5grams-taxonomy-entire.tsv475 kB
    • GF2.0-word_parts-pronouns-lemmas-initial-4grams-taxonomy-entire.tsv415 kB
    • GF2.0-word_parts-pronouns-lemmas-initial-5grams-taxonomy-entire.tsv369 kB
    • GF2.0-word_parts-pronouns-lowercase_forms-initial-1grams-taxonomy-entire.tsv574 kB
Icon
Name
GF2.0-word_parts-adverbs.zip
Size
17.39 MB
Format
application/zip
Description
Frequency lists of adverb word parts in Gigafida 2.0
MD5
51d60f89f79c95d5eb7ae46ef7c9f17e
 Download file  Preview
 File Preview  
    • GF2.0-word_parts-adverbs-lowercase_forms-initial-5grams-taxonomy-entire.tsv8 MB
    • GF2.0-word_parts-adverbs-lemmas-initial-2grams-taxonomy-entire.tsv10 MB
    • GF2.0-word_parts-adverbs-lemmas-initial-3grams-taxonomy-entire.tsv10 MB
    • GF2.0-word_parts-adverbs-lemmas-final-1grams-taxonomy-entire.tsv10 MB
    • GF2.0-word_parts-adverbs-lemmas-initial-4grams-taxonomy-entire.tsv10 MB
    • GF2.0-word_parts-adverbs-lemmas-final-2grams-taxonomy-entire.tsv10 MB
    • GF2.0-word_parts-adverbs-lemmas-initial-5grams-taxonomy-entire.tsv9 MB
    • GF2.0-word_parts-adverbs-lemmas-final-3grams-taxonomy-entire.tsv10 MB
    • GF2.0-word_parts-adverbs-lowercase_forms-final-1grams-taxonomy-entire.tsv8 MB
    • GF2.0-word_parts-adverbs-lemmas-final-4grams-taxonomy-entire.tsv10 MB
    • GF2.0-word_parts-adverbs-lowercase_forms-final-2grams-taxonomy-entire.tsv8 MB
    • GF2.0-word_parts-adverbs-lowercase_forms-initial-1grams-taxonomy-entire.tsv8 MB
    • GF2.0-word_parts-adverbs-lemmas-final-5grams-taxonomy-entire.tsv9 MB
    • GF2.0-word_parts-adverbs-lowercase_forms-final-3grams-taxonomy-entire.tsv8 MB
    • GF2.0-word_parts-adverbs-lowercase_forms-initial-2grams-taxonomy-entire.tsv8 MB
    • GF2.0-word_parts-adverbs-lowercase_forms-final-4grams-taxonomy-entire.tsv8 MB
    • GF2.0-word_parts-adverbs-lowercase_forms-initial-3grams-taxonomy-entire.tsv8 MB
    • GF2.0-word_parts-adverbs-lowercase_forms-initial-4grams-taxonomy-entire.tsv8 MB
    • GF2.0-word_parts-adverbs-lowercase_forms-final-5grams-taxonomy-entire.tsv8 MB
    • GF2.0-word_parts-adverbs-lemmas-initial-1grams-taxonomy-entire.tsv10 MB
Icon
Name
GF2.0-word_parts-prepositions.zip
Size
436.56 KB
Format
application/zip
Description
Frequency lists of word parts of prepositions in Gigafida 2.0
MD5
81321725754be76f3e33c81dce15de24
 Download file  Preview
 File Preview  
    • GF2.0-word_parts-prepositions-lemmas-initial-5grams-taxonomy-entire.tsv165 kB
    • GF2.0-word_parts-prepositions-lowercase_forms-initial-1grams-taxonomy-entire.tsv200 kB
    • GF2.0-word_parts-prepositions-lowercase_forms-initial-2grams-taxonomy-entire.tsv195 kB
    • GF2.0-word_parts-prepositions-lowercase_forms-final-1grams-taxonomy-entire.tsv200 kB
    • GF2.0-word_parts-prepositions-lowercase_forms-initial-3grams-taxonomy-entire.tsv189 kB
    • GF2.0-word_parts-prepositions-lowercase_forms-final-2grams-taxonomy-entire.tsv195 kB
    • GF2.0-word_parts-prepositions-lowercase_forms-initial-4grams-taxonomy-entire.tsv169 kB
    • GF2.0-word_parts-prepositions-lemmas-final-1grams-taxonomy-entire.tsv225 kB
    • GF2.0-word_parts-prepositions-lowercase_forms-final-3grams-taxonomy-entire.tsv189 kB
    • GF2.0-word_parts-prepositions-lowercase_forms-initial-5grams-taxonomy-entire.tsv148 kB
    • GF2.0-word_parts-prepositions-lemmas-final-2grams-taxonomy-entire.tsv220 kB
    • GF2.0-word_parts-prepositions-lemmas-initial-1grams-taxonomy-entire.tsv225 kB
    • GF2.0-word_parts-prepositions-lowercase_forms-final-4grams-taxonomy-entire.tsv169 kB
    • GF2.0-word_parts-prepositions-lemmas-final-3grams-taxonomy-entire.tsv213 kB
    • GF2.0-word_parts-prepositions-lemmas-initial-2grams-taxonomy-entire.tsv220 kB
    • GF2.0-word_parts-prepositions-lowercase_forms-final-5grams-taxonomy-entire.tsv148 kB
    • GF2.0-word_parts-prepositions-lemmas-final-4grams-taxonomy-entire.tsv189 kB
    • GF2.0-word_parts-prepositions-lemmas-initial-3grams-taxonomy-entire.tsv213 kB
    • GF2.0-word_parts-prepositions-lemmas-final-5grams-taxonomy-entire.tsv165 kB
    • GF2.0-word_parts-prepositions-lemmas-initial-4grams-taxonomy-entire.tsv189 kB
Icon
Name
GF2.0-word_parts-conjunctions.zip
Size
319.14 KB
Format
application/zip
Description
Frequency lists of word parts of conjunctions in Gigafida 2.0
MD5
dae16d7a2bf6b659d58e684056e5af6e
 Download file  Preview
 File Preview  
    • GF2.0-word_parts-conjunctions-lemmas-initial-4grams-taxonomy-entire.tsv128 kB
    • GF2.0-word_parts-conjunctions-lowercase_forms-final-1grams-taxonomy-entire.tsv139 kB
    • GF2.0-word_parts-conjunctions-lemmas-initial-5grams-taxonomy-entire.tsv112 kB
    • GF2.0-word_parts-conjunctions-lowercase_forms-final-2grams-taxonomy-entire.tsv138 kB
    • GF2.0-word_parts-conjunctions-lowercase_forms-initial-1grams-taxonomy-entire.tsv139 kB
    • GF2.0-word_parts-conjunctions-lemmas-final-1grams-taxonomy-entire.tsv151 kB
    • GF2.0-word_parts-conjunctions-lowercase_forms-final-3grams-taxonomy-entire.tsv133 kB
    • GF2.0-word_parts-conjunctions-lowercase_forms-initial-2grams-taxonomy-entire.tsv138 kB
    • GF2.0-word_parts-conjunctions-lemmas-final-2grams-taxonomy-entire.tsv150 kB
    • GF2.0-word_parts-conjunctions-lowercase_forms-initial-3grams-taxonomy-entire.tsv133 kB
    • GF2.0-word_parts-conjunctions-lemmas-final-3grams-taxonomy-entire.tsv145 kB
    • GF2.0-word_parts-conjunctions-lowercase_forms-initial-4grams-taxonomy-entire.tsv118 kB
    • GF2.0-word_parts-conjunctions-lemmas-final-4grams-taxonomy-entire.tsv128 kB
    • GF2.0-word_parts-conjunctions-lowercase_forms-initial-5grams-taxonomy-entire.tsv103 kB
    • GF2.0-word_parts-conjunctions-lemmas-final-5grams-taxonomy-entire.tsv112 kB
    • GF2.0-word_parts-conjunctions-lemmas-initial-1grams-taxonomy-entire.tsv151 kB
    • GF2.0-word_parts-conjunctions-lemmas-initial-2grams-taxonomy-entire.tsv150 kB
    • GF2.0-word_parts-conjunctions-lowercase_forms-final-4grams-taxonomy-entire.tsv118 kB
    • GF2.0-word_parts-conjunctions-lemmas-initial-3grams-taxonomy-entire.tsv145 kB
    • GF2.0-word_parts-conjunctions-lowercase_forms-final-5grams-taxonomy-entire.tsv103 kB
Icon
Name
GF2.0-word_parts-particles.zip
Size
179.57 KB
Format
application/zip
Description
Frequency lists of word parts of particles in Gigafida 2.0
MD5
a60bafb20cdcc9146e72d2b8e57d94da
 Download file  Preview
 File Preview  
    • GF2.0-word_parts-particles-lowercase_forms-final-2grams-taxonomy-entire.tsv52 kB
    • GF2.0-word_parts-particles-lemmas-final-5grams-taxonomy-entire.tsv38 kB
    • GF2.0-word_parts-particles-lowercase_forms-initial-5grams-taxonomy-entire.tsv37 kB
    • GF2.0-word_parts-particles-lemmas-initial-1grams-taxonomy-entire.tsv57 kB
    • GF2.0-word_parts-particles-lemmas-initial-2grams-taxonomy-entire.tsv57 kB
    • GF2.0-word_parts-particles-lowercase_forms-final-3grams-taxonomy-entire.tsv49 kB
    • GF2.0-word_parts-particles-lemmas-initial-3grams-taxonomy-entire.tsv53 kB
    • GF2.0-word_parts-particles-lemmas-final-1grams-taxonomy-entire.tsv57 kB
    • GF2.0-word_parts-particles-lowercase_forms-initial-1grams-taxonomy-entire.tsv52 kB
    • GF2.0-word_parts-particles-lowercase_forms-final-4grams-taxonomy-entire.tsv43 kB
    • GF2.0-word_parts-particles-lemmas-initial-4grams-taxonomy-entire.tsv46 kB
    • GF2.0-word_parts-particles-lemmas-final-2grams-taxonomy-entire.tsv57 kB
    • GF2.0-word_parts-particles-lowercase_forms-initial-2grams-taxonomy-entire.tsv52 kB
    • GF2.0-word_parts-particles-lowercase_forms-final-5grams-taxonomy-entire.tsv37 kB
    • GF2.0-word_parts-particles-lowercase_forms-initial-3grams-taxonomy-entire.tsv49 kB
    • GF2.0-word_parts-particles-lemmas-initial-5grams-taxonomy-entire.tsv38 kB
    • GF2.0-word_parts-particles-lemmas-final-3grams-taxonomy-entire.tsv53 kB
    • GF2.0-word_parts-particles-lowercase_forms-final-1grams-taxonomy-entire.tsv52 kB
    • GF2.0-word_parts-particles-lowercase_forms-initial-4grams-taxonomy-entire.tsv43 kB
    • GF2.0-word_parts-particles-lemmas-final-4grams-taxonomy-entire.tsv46 kB
Icon
Name
GF2.0-word_parts-interjections.zip
Size
810.74 KB
Format
application/zip
Description
Frequency lists of word parts of interjections in Gigafida 2.0
MD5
6937d6fac2b77ed8ae3c929c4ad9e3d5
 Download file  Preview
 File Preview  
    • GF2.0-word_parts-interjections-lemmas-final-1grams-taxonomy-entire.tsv510 kB
    • GF2.0-word_parts-interjections-lemmas-initial-2grams-taxonomy-entire.tsv510 kB
    • GF2.0-word_parts-interjections-lowercase_forms-initial-3grams-taxonomy-entire.tsv466 kB
    • GF2.0-word_parts-interjections-lemmas-final-2grams-taxonomy-entire.tsv510 kB
    • GF2.0-word_parts-interjections-lemmas-initial-3grams-taxonomy-entire.tsv501 kB
    • GF2.0-word_parts-interjections-lowercase_forms-initial-4grams-taxonomy-entire.tsv434 kB
    • GF2.0-word_parts-interjections-lemmas-final-3grams-taxonomy-entire.tsv501 kB
    • GF2.0-word_parts-interjections-lemmas-initial-4grams-taxonomy-entire.tsv467 kB
    • GF2.0-word_parts-interjections-lowercase_forms-initial-5grams-taxonomy-entire.tsv380 kB
    • GF2.0-word_parts-interjections-lowercase_forms-final-1grams-taxonomy-entire.tsv474 kB
    • GF2.0-word_parts-interjections-lemmas-final-4grams-taxonomy-entire.tsv467 kB
    • GF2.0-word_parts-interjections-lemmas-initial-5grams-taxonomy-entire.tsv411 kB
    • GF2.0-word_parts-interjections-lowercase_forms-final-2grams-taxonomy-entire.tsv474 kB
    • GF2.0-word_parts-interjections-lemmas-final-5grams-taxonomy-entire.tsv411 kB
    • GF2.0-word_parts-interjections-lowercase_forms-final-3grams-taxonomy-entire.tsv466 kB
    • GF2.0-word_parts-interjections-lowercase_forms-final-4grams-taxonomy-entire.tsv434 kB
    • GF2.0-word_parts-interjections-lowercase_forms-final-5grams-taxonomy-entire.tsv380 kB
    • GF2.0-word_parts-interjections-lowercase_forms-initial-1grams-taxonomy-entire.tsv474 kB
    • GF2.0-word_parts-interjections-lemmas-initial-1grams-taxonomy-entire.tsv510 kB
    • GF2.0-word_parts-interjections-lowercase_forms-initial-2grams-taxonomy-entire.tsv474 kB
Icon
Name
GF2.0-word_parts-abbreviations.zip
Size
1.5 MB
Format
application/zip
Description
Frequency lists of abbreviation word parts from Gigafida 2.0
MD5
75edf8b13f51eda46dac4c6da8797ecc
 Download file  Preview
 File Preview  
    • GF2.0-word_parts-abbreviations-lemmas-final-3grams-taxonomy-entire.tsv912 kB
    • GF2.0-word_parts-abbreviations-lemmas-final-4grams-taxonomy-entire.tsv868 kB
    • GF2.0-word_parts-abbreviations-lowercase_forms-initial-1grams-taxonomy-entire.tsv849 kB
    • GF2.0-word_parts-abbreviations-lemmas-final-5grams-taxonomy-entire.tsv763 kB
    • GF2.0-word_parts-abbreviations-lemmas-initial-1grams-taxonomy-entire.tsv923 kB
    • GF2.0-word_parts-abbreviations-lowercase_forms-initial-2grams-taxonomy-entire.tsv849 kB
    • GF2.0-word_parts-abbreviations-lemmas-initial-2grams-taxonomy-entire.tsv923 kB
    • GF2.0-word_parts-abbreviations-lowercase_forms-initial-3grams-taxonomy-entire.tsv845 kB
    • GF2.0-word_parts-abbreviations-lemmas-initial-3grams-taxonomy-entire.tsv912 kB
    • GF2.0-word_parts-abbreviations-lowercase_forms-final-1grams-taxonomy-entire.tsv849 kB
    • GF2.0-word_parts-abbreviations-lowercase_forms-initial-4grams-taxonomy-entire.tsv806 kB
    • GF2.0-word_parts-abbreviations-lemmas-initial-4grams-taxonomy-entire.tsv868 kB
    • GF2.0-word_parts-abbreviations-lowercase_forms-final-2grams-taxonomy-entire.tsv849 kB
    • GF2.0-word_parts-abbreviations-lowercase_forms-initial-5grams-taxonomy-entire.tsv710 kB
    • GF2.0-word_parts-abbreviations-lemmas-initial-5grams-taxonomy-entire.tsv763 kB
    • GF2.0-word_parts-abbreviations-lowercase_forms-final-3grams-taxonomy-entire.tsv845 kB
    • GF2.0-word_parts-abbreviations-lowercase_forms-final-4grams-taxonomy-entire.tsv806 kB
    • GF2.0-word_parts-abbreviations-lemmas-final-1grams-taxonomy-entire.tsv923 kB
    • GF2.0-word_parts-abbreviations-lowercase_forms-final-5grams-taxonomy-entire.tsv710 kB
    • GF2.0-word_parts-abbreviations-lemmas-final-2grams-taxonomy-entire.tsv923 kB
Icon
Name
GF2.0-word_parts-residual-lemmas-short.zip
Size
22.47 MB
Format
application/zip
Description
Shortened frequency lists of word parts of residual lemmas in Gigafida 2.0
MD5
f7a3b8835dcdde966f284b83d1c16fb6
 Download file  Preview
 File Preview  
    • GF2.0-word_parts-residual-lemmas-initial-5grams-taxonomy-short.tsv23 MB
    • GF2.0-word_parts-residual-lemmas-final-4grams-taxonomy-short.tsv22 MB
    • GF2.0-word_parts-residual-lemmas-initial-3grams-taxonomy-short.tsv22 MB
    • GF2.0-word_parts-residual-lemmas-final-2grams-taxonomy-short.tsv22 MB
    • GF2.0-word_parts-residual-lemmas-initial-1grams-taxonomy-short.tsv22 MB
    • GF2.0-word_parts-residual-lemmas-final-5grams-taxonomy-short.tsv23 MB
    • GF2.0-word_parts-residual-lemmas-initial-4grams-taxonomy-short.tsv22 MB
    • GF2.0-word_parts-residual-lemmas-final-3grams-taxonomy-short.tsv22 MB
    • GF2.0-word_parts-residual-lemmas-initial-2grams-taxonomy-short.tsv22 MB
    • GF2.0-word_parts-residual-lemmas-final-1grams-taxonomy-short.tsv22 MB
Icon
Name
GF2.0-word_parts-residual-lemmas-entire.zip
Size
67.18 MB
Format
application/zip
Description
Frequency lists of word parts of residual lemmas in Gigafida 2.0
MD5
058263acb9f08035980668830c3785f0
 Download file  Preview
 File Preview  
    • GF2.0-word_parts-residual-lemmas-initial-2grams-taxonomy-entire.tsv78 MB
    • GF2.0-word_parts-residual-lemmas-final-3grams-taxonomy-entire.tsv78 MB
    • GF2.0-word_parts-residual-lemmas-initial-5grams-taxonomy-entire.tsv71 MB
    • GF2.0-word_parts-residual-lemmas-initial-1grams-taxonomy-entire.tsv78 MB
    • GF2.0-word_parts-residual-lemmas-final-2grams-taxonomy-entire.tsv78 MB
    • GF2.0-word_parts-residual-lemmas-initial-4grams-taxonomy-entire.tsv75 MB
    • GF2.0-word_parts-residual-lemmas-final-5grams-taxonomy-entire.tsv71 MB
    • GF2.0-word_parts-residual-lemmas-final-1grams-taxonomy-entire.tsv78 MB
    • GF2.0-word_parts-residual-lemmas-initial-3grams-taxonomy-entire.tsv78 MB
    • GF2.0-word_parts-residual-lemmas-final-4grams-taxonomy-entire.tsv75 MB
Icon
Name
GF2.0-word_parts-residual-lowercase_forms-short.zip
Size
19.33 MB
Format
application/zip
Description
Shortened frequency lists of word parts of residual lower-case forms in Gigafida 2.0
MD5
22077f59baf8499951229a98f71043bc
 Download file  Preview
 File Preview  
    • GF2.0-word_parts-residual-lowercase_forms-final-4grams-taxonomy-short.tsv21 MB
    • GF2.0-word_parts-residual-lowercase_forms-initial-5grams-taxonomy-short.tsv21 MB
    • GF2.0-word_parts-residual-lowercase_forms-final-2grams-taxonomy-short.tsv21 MB
    • GF2.0-word_parts-residual-lowercase_forms-initial-3grams-taxonomy-short.tsv21 MB
    • GF2.0-word_parts-residual-lowercase_forms-initial-1grams-taxonomy-short.tsv21 MB
    • GF2.0-word_parts-residual-lowercase_forms-final-5grams-taxonomy-short.tsv21 MB
    • GF2.0-word_parts-residual-lowercase_forms-final-3grams-taxonomy-short.tsv21 MB
    • GF2.0-word_parts-residual-lowercase_forms-initial-4grams-taxonomy-short.tsv21 MB
    • GF2.0-word_parts-residual-lowercase_forms-final-1grams-taxonomy-short.tsv21 MB
    • GF2.0-word_parts-residual-lowercase_forms-initial-2grams-taxonomy-short.tsv21 MB
Icon
Name
GF2.0-word_parts-residual-lowercase_forms-entire.zip
Size
52.68 MB
Format
application/zip
Description
Frequency lists of word parts of residual lower-case forms in Gigafida 2.0
MD5
444fd81026d088a2456b165d8a8e60be
 Download file  Preview
 File Preview  
    • GF2.0-word_parts-residual-lowercase_forms-initial-5grams-taxonomy-entire.tsv62 MB
    • GF2.0-word_parts-residual-lowercase_forms-initial-1grams-taxonomy-entire.tsv67 MB
    • GF2.0-word_parts-residual-lowercase_forms-final-2grams-taxonomy-entire.tsv67 MB
    • GF2.0-word_parts-residual-lowercase_forms-initial-4grams-taxonomy-entire.tsv65 MB
    • GF2.0-word_parts-residual-lowercase_forms-final-5grams-taxonomy-entire.tsv62 MB
    • GF2.0-word_parts-residual-lowercase_forms-final-1grams-taxonomy-entire.tsv67 MB
    • GF2.0-word_parts-residual-lowercase_forms-initial-3grams-taxonomy-entire.tsv67 MB
    • GF2.0-word_parts-residual-lowercase_forms-final-4grams-taxonomy-entire.tsv65 MB
    • GF2.0-word_parts-residual-lowercase_forms-initial-2grams-taxonomy-entire.tsv67 MB
    • GF2.0-word_parts-residual-lowercase_forms-final-3grams-taxonomy-entire.tsv67 MB

Show simple item record