Datoteke v tem vnosu

 Prenesi vse datoteke v vnosu (35.84 MB)
Icon
Ime
senticoref_private_corefud_unlabeled.conllu
Velikost
2.77 MB
Format
Neznano
Opis
Unlabeled SentiCoref test set in CoNLL-U format.
MD5
3edae210479e8e79b1fef9f968cb91a4
 Prenesi datoteko
Icon
Ime
senticoref_corefud.conllu
Velikost
33.06 MB
Format
Neznano
Opis
Labeled SentiCoref training set in CoNLL-U format.
MD5
0e9d5f3fbe4698cc96c3f92a2dd4f7fb
 Prenesi datoteko
Icon
Ime
README.txt
Velikost
2.18 KB
Format
Besedilna datoteka
Opis
Description of the resource.
MD5
e9d91eea5c52934731d0cb1d1f955832
 Prenesi datoteko  Predogled
 Predogled datoteke  
CorefUD conversion of Slovene corpus for aspect-based sentiment analysis SentiCoref
v1.0
http://hdl.handle.net/11356/1990
CC-BY-SA 4.0

This corpus is the CorefUD conversion of the SentiCoref corpus for coreference resolution in Slovene contained within the SUK 1.1 collection of corpora (http://hdl.handle.net/11356/1959).
The item contains 756 labeled training (senticoref_corefud.conllu) and 81 unlabeled test documents (senticoref_private_corefud_unlabeled.conllu) annotated with coreference information.

Coreference in Universal Dependencies (CorefUD) is an initiative to collect coreference corpora in various languages and harmonize them to the same scheme and data format (CoNLL-U).
The coreference information is stored in the MISC column. More concretely, the start and end of each coreference mention is marked with the "Entity=" attribute. For example, "Entity=(e0" marks the start of the entity e0 at the current token while "Entity=e0) marks the end of the entity e0 at the current tok . . .