Datoteke v tem vnosu

 Prenesi vse datoteke v vnosu (228.99 KB)
To je vnos
Publicly Available
z licenco:
Creative Commons - Attribution 4.0 International (CC BY 4.0)
Distributed under Creative Commons Attribution Required
Icon
Ime
README.txt
Velikost
2.39 KB
Format
Besedilna datoteka
Opis
description
MD5
425c9b3f580725daba40877c2d73eecc
 Prenesi datoteko  Predogled
 Predogled datoteke  
TCMeta is a multilingual dataset of COVID tweets for relation-level metaphor analysis.

It contains 2,138 Slovene and 2,221 English noun phrase constructions extracted from COVID-related tweets that are annotated for relation-level metaphor.


The data is in tab-separated tabular format .tsv. Each line presents a unique phrase, extracted from a COVID-related tweet. 

The primary annotations can be found in the column "COVID metaphor label" (whether the phrase expresses a metaphor relating to COVID). Additional annotations can be found in the "Comments" column, and include annotations of idioms, metaphors not relating to COVID, and metaphors not evident on the relation-level.


The data contains the following columns:

Language		the language of the tweet, 'sl' (Slovene) or 'en' (English) 
Tweet ID		the unique identifier of the tweet, which can be used to retrieve the text of the post
Phrase			the phrase extracted from the tweet 
COVID metaphor label	'y' (Yes) or 'n' (No): whether it is . . .
                                            
Icon
Ime
TCMeta.v1.tsv
Velikost
226.6 KB
Format
Neznano
Opis
dataset
MD5
a61a7c336d66d87f3822be638a744cb9
 Prenesi datoteko