Files in this item
Download all files in item (35.84 MB)This item is
Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
Publicly Available
and licensed under:Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)




- Name
- senticoref_private_corefud_unlabeled.conllu
- Size
- 2.77 MB
- Format
- Unknown
- Description
- Unlabeled SentiCoref test set in CoNLL-U format.
- MD5
- 3edae210479e8e79b1fef9f968cb91a4

- Name
- senticoref_corefud.conllu
- Size
- 33.06 MB
- Format
- Unknown
- Description
- Labeled SentiCoref training set in CoNLL-U format.
- MD5
- 0e9d5f3fbe4698cc96c3f92a2dd4f7fb

- Name
- README.txt
- Size
- 2.18 KB
- Format
- Text file
- Description
- Description of the resource.
- MD5
- e9d91eea5c52934731d0cb1d1f955832
CorefUD conversion of Slovene corpus for aspect-based sentiment analysis SentiCoref v1.0 http://hdl.handle.net/11356/1990 CC-BY-SA 4.0 This corpus is the CorefUD conversion of the SentiCoref corpus for coreference resolution in Slovene contained within the SUK 1.1 collection of corpora (http://hdl.handle.net/11356/1959). The item contains 756 labeled training (senticoref_corefud.conllu) and 81 unlabeled test documents (senticoref_private_corefud_unlabeled.conllu) annotated with coreference information. Coreference in Universal Dependencies (CorefUD) is an initiative to collect coreference corpora in various languages and harmonize them to the same scheme and data format (CoNLL-U). The coreference information is stored in the MISC column. More concretely, the start and end of each coreference mention is marked with the "Entity=" attribute. For example, "Entity=(e0" marks the start of the entity e0 at the current token while "Entity=e0) marks the end of the entity e0 at the current tok . . .