The dataset contains human similarity ratings for pairs of words in context. Annotators were shown contexts containing both words of a pair, and the dataset provides two different contexts per pair. The words were sourced from the English, Croatian, Finnish and Slovenian versions of the original SimLex dataset.
dc.language.iso
eng
dc.language.iso
hrv
dc.language.iso
fin
dc.language.iso
slv
dc.publisher
Queen Mary University of London
dc.relation
info:eu-repo/grantAgreement/EC/H2020/825153
dc.relation.isreferencedby
https://arxiv.org/abs/1912.05320
dc.relation.isreferencedby
https://www.aclweb.org/anthology/2020.lrec-1.720/
dc.rights
GNU General Public Licence, version 3
dc.rights.uri
https://opensource.org/licenses/GPL-3.0
dc.rights.label
PUB
dc.source.uri
http://embeddia.eu/
dc.subject
similarity
dc.subject
contextual embeddings
dc.subject
evaluation
dc.subject
context
dc.title
A Resource for Evaluating Graded Word Similarity in Context: CoSimLex
dc.type
lexicalConceptualResource
metashare.ResourceInfo#ContentInfo.detailedType
other
metashare.ResourceInfo#ContentInfo.mediaType
text
has.files
yes
branding
CLARIN.SI data & tools
contact.person
Carlos Armendariz carlos@santosarmendariz.com Queen Mary University of London
contact.person
Matthew Purver m.purver@qmul.ac.uk Queen Mary University of London
sponsor
European Union EC/H2020/825153 EMBEDDIA - Cross-Lingual Embeddings for Less-Represented Languages in European News Media euFunds info:eu-repo/grantAgreement/EC/H2020/825153