Najnovejše

 lexicalConceptualResource 
lexicalConceptualResource
Opis:
Wordlists, keywords and n-grams were extracted from a corpus of textbooks for Slovenian elementary and secondary schools. The corpus contains 4,302,857 words (5,373,268 tokens), and consists of 127 textbooks from 16 different ...
 Ta vnos vsebuje 1 datoteko (864.93 KB).
 
Publicly Available Distributed under Creative Commons Attribution Required
 toolService 
toolService
Opis:
Part-of-speech tagger for Slovene language implemented using convolutional and LSTM neural networks. Tagger uses character-level representation of sentences. The tagger has been trained on the ssj500k 2.1 corpus, ...
 Ta vnos vsebuje 14 datotek(e) (140.75 MB).
 
Publicly Available
 lexicalConceptualResource 
lexicalConceptualResource
Opis:
SenseGraph a graph-like structure of word senses of most common words of the standard Croatian language, obtained by relying on human-provided lexical substitutes for target words in context. SenseGraph is encoded in the ...
 Ta vnos vsebuje 1 datoteko (955.93 KB).
 
Publicly Available Distributed under Creative Commons Attribution Required Share Alike

Največ ogledov

V zadnjem tednu
 lexicalConceptualResource 
lexicalConceptualResource
Opis:
Wordlists, keywords and n-grams were extracted from a corpus of textbooks for Slovenian elementary and secondary schools. The corpus contains 4,302,857 words (5,373,268 tokens), and consists of 127 textbooks from 16 different ...
 Ta vnos vsebuje 1 datoteko (864.93 KB).
 
Publicly Available Distributed under Creative Commons Attribution Required
 corpus 
corpus
Avtor(ji):
Opis:
The corpus contains 256,567 documents from the Slovenian news portals 24ur, Dnevnik, Finance, Rtvslo, and Žurnal24. These portals contain political, business, economic and financial content. The submission contains 7 files: ...
 Ta vnos vsebuje 8 datotek(e) (616.88 MB).
 
Publicly Available Distributed under Creative Commons Attribution Required Share Alike
 lexicalConceptualResource 
lexicalConceptualResource
Opis:
srLex is a large inflectional lexicon of Serbian language where each entry consists of a (wordform, lemma, MSD, frequency, per-million frequency) 5-tuple. The (wordform, lemma, MSD) triple frequencies are calculated on the ...
 Ta vnos vsebuje 1 datoteko (29.54 MB).
 
Publicly Available