Files in this item
Download all files in item (4.14 MB)This item is
Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
Publicly Available
and licensed under:Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
- Name
- gos_ngrams_word_1-5.zip
- Size
- 946.5 KB
- Format
- application/zip
- Description
- 1- to 5-grams of words (pronunciation-based spelling) excluding punctuation. The minimum frequency threshold is 2.
- MD5
- ac638e81a8a7bae5b0bc4dae484d0389
- Name
- gos_ngrams_norm_1-5.zip
- Size
- 981.5 KB
- Format
- application/zip
- Description
- 1- to 5-grams of normalized words (standardized spelling) excluding punctuation. The minimum frequency threshold is 2.
- MD5
- 586b75baa4a7ceb86825c79a900cf073
- Name
- gos_ngrams_word-norm-lemma-tag_1-5.zip
- Size
- 1.86 MB
- Format
- application/zip
- Description
- 1- to 5-grams of words with normalized form, lemma and morphosyntactic tag including punctuation. The minimum frequency threshold is 2.
- MD5
- 98e7e7f91a0ad35f367ded64bbd35f43
- Name
- kres_AFL_norm_1-5_min5M.zip
- Size
- 411.66 KB
- Format
- application/zip
- Description
- Adjusted frequency list for 1- to 5-grams of normalized words (standardized spelling) excluding punctuation. The minimum relative frequency threshold for substring reduction is 5. Column 1: n-gram; column 2: length of n-gram, column 3: adjusted corpus frequency.
- MD5
- 4e38a0184cc591847a20d34c397b41b4