
 
dc.contributor.author Armendariz, Carlos
dc.contributor.author Purver, Matthew
dc.contributor.author Ulčar, Matej
dc.contributor.author Pollak, Senja
dc.contributor.author Ljubešić, Nikola
dc.contributor.author Robnik-Šikonja, Marko
dc.contributor.author Granroth-Wilding, Mark
dc.contributor.author Vaik, Kristiina
dc.date.accessioned 2020-10-30T08:31:25Z
dc.date.available 2020-10-30T08:31:25Z
dc.date.issued 2020
dc.identifier.uri http://hdl.handle.net/11356/1308
dc.description The dataset contains human similarity ratings for pairs of words. Annotators rated each pair within a context that contained both words, and each pair appears in two different contexts. The words were sourced from the English, Croatian, Finnish and Slovenian versions of the original SimLex dataset.
dc.language.iso eng
dc.language.iso hrv
dc.language.iso fin
dc.language.iso slv
dc.publisher Queen Mary University
dc.relation info:eu-repo/grantAgreement/EC/H2020/825153
dc.relation.isreferencedby https://arxiv.org/abs/1912.05320
dc.relation.isreferencedby https://www.aclweb.org/anthology/2020.lrec-1.720/
dc.rights GNU General Public Licence, version 3
dc.rights.uri https://opensource.org/licenses/GPL-3.0
dc.rights.label PUB
dc.source.uri http://embeddia.eu/
dc.subject similarity
dc.subject contextual embeddings
dc.subject evaluation
dc.subject context
dc.title A Resource for Evaluating Graded Word Similarity in Context: CoSimLex
dc.type lexicalConceptualResource
metashare.ResourceInfo#ContentInfo.detailedType other
metashare.ResourceInfo#ContentInfo.mediaType text
has.files yes
branding CLARIN.SI data & tools
contact.person Carlos Armendariz carlos@santosarmendariz.com Queen Mary University of London
contact.person Matthew Purver m.purver@qmul.ac.uk Queen Mary University
sponsor European Union EC/H2020/825153 EMBEDDIA - Cross-Lingual Embeddings for Less-Represented Languages in European News Media euFunds info:eu-repo/grantAgreement/EC/H2020/825153
size.info 580 entries
files.count 6
files.size 768067


 Files in this item

This item is publicly available and licensed under the GNU General Public Licence, version 3.
All files in item: 750.07 KB

Name                 Size       Format           Description                                         MD5
cosimlex_en.csv      284.95 KB  CSV file         English Dataset                                     481d4243b30a59acb1a1d778d08c7304
cosimlex_fi.csv      19.98 KB   CSV file         Finnish Dataset                                     c53e5591516e499f7fcac07bd2d0c589
cosimlex_hr.csv      92.46 KB   CSV file         Croatian Dataset                                    c565f15294762e1406a006fb7bd220c9
cosimlex_sl.csv      85.42 KB   CSV file         Slovene Dataset                                     ff53a542eb3ed416a15ef72fc3a16595
README.md            3.93 KB    Unknown          Readme file                                         1bdc84968bfc9fe27091fd6f454e827c
cosimlex_scores.zip  263.33 KB  application/zip  Raw individual annotator ratings for all languages  5a8a9343f8e47bc56aab072ef2ddbacb
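The MD5 values listed for each file can be used to confirm that a download is intact. The following is a minimal sketch, assuming the six files have been saved under their listed names in a single local directory; the function names `md5_of` and `verify_downloads` are illustrative and not part of the dataset.

```python
import hashlib
from pathlib import Path

# MD5 digests copied from the file listing in this record.
EXPECTED_MD5 = {
    "cosimlex_en.csv": "481d4243b30a59acb1a1d778d08c7304",
    "cosimlex_fi.csv": "c53e5591516e499f7fcac07bd2d0c589",
    "cosimlex_hr.csv": "c565f15294762e1406a006fb7bd220c9",
    "cosimlex_sl.csv": "ff53a542eb3ed416a15ef72fc3a16595",
    "README.md": "1bdc84968bfc9fe27091fd6f454e827c",
    "cosimlex_scores.zip": "5a8a9343f8e47bc56aab072ef2ddbacb",
}


def md5_of(path, chunk_size=65536):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_downloads(directory="."):
    """Map each listed file name to True (match), False (mismatch) or None (missing)."""
    results = {}
    for name, expected in EXPECTED_MD5.items():
        path = Path(directory) / name
        if path.is_file():
            results[name] = (md5_of(path) == expected)
        else:
            results[name] = None  # assumption: a missing file is reported, not raised
    return results


if __name__ == "__main__":
    for name, status in verify_downloads(".").items():
        label = {True: "OK", False: "MISMATCH", None: "missing"}[status]
        print(f"{name}: {label}")
```

Reading in chunks keeps memory use constant regardless of file size, although every file in this item is small enough to be read in one call.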
 File Preview (cosimlex_scores.zip)
  • cosimlex_scores
    • sl_context
      • sl_survey4_B_rows.csv (10 kB)
      • sl_survey2_A_rows.csv (10 kB)
      • sl_survey2_B_rows.csv (10 kB)
      • sl_survey3_A_rows.csv (10 kB)
      • sl_survey3_B_rows.csv (10 kB)
      • sl_survey1_A_rows.csv (10 kB)
      • sl_survey1_B_rows.csv (9 kB)
      • sl_survey4_A_rows.csv (10 kB)
    • README.md (2 kB)
    • en_scores
      • en_scores_7_B.csv (8 kB)
      • en_scores_6_B.csv (7 kB)
      • en_scores_4_A.csv (8 kB)
      • en_scores_5_B.csv (9 kB)
      • en_scores_3_A.csv (9 kB)
      • en_scores_10_B.csv (7 kB)
      • en_scores_2_B.csv (9 kB)
      • en_scores_9_B.csv (8 kB)
      • en_scores_1_B.csv (9 kB)
      • en_scores_7_A.csv (10 kB)
      • en_scores_8_B.csv (11 kB)
      • en_scores_6_A.csv (8 kB)
      • en_scores_5_A.csv (8 kB)
      • en_scores_10_A.csv (6 kB)
      • en_scores_4_B.csv (7 kB)
      • en_scores_2_A.csv (10 kB)
      • en_scores_3_B.csv (9 kB)
      • en_scores_9_A.csv (8 kB)
      • en_scores_1_A.csv (8 kB)
      • en_scores_8_A.csv (8 kB)
    • sl_scores
      • sl_scores_4_A.csv (4 kB)
      • sl_scores_2_B.csv (4 kB)
      • sl_scores_1_A.csv (4 kB)
      • sl_scores_3_B.csv (3 kB)
      • sl_scores_2_A.csv (3 kB)
      • sl_scores_4_B.csv (3 kB)
      • sl_scores_3_A.csv (3 kB)
      • sl_scores_1_B.csv (3 kB)
    • en_context
      • en_survey2_B_rows.csv (13 kB)
      • en_survey2_A_rows.csv (14 kB)
      • en_survey8_A_rows.csv (13 kB)
      • en_survey6_B_rows.csv (14 kB)
      • en_survey3_A_rows.csv (13 kB)
      • en_survey9_A_rows.csv (13 kB)
      • en_survey10_A_rows.csv (13 kB)
      • en_survey7_B_rows.csv (14 kB)
      • en_survey4_A_rows.csv (13 kB)
      • en_survey1_A_rows.csv (14 kB)
      • en_survey8_B_rows.csv (13 kB)
      • en_survey5_A_rows.csv (13 kB)
      • en_survey3_B_rows.csv (13 kB)
      • en_survey9_B_rows.csv (13 kB)
      • en_survey10_B_rows.csv (14 kB)
      • en_survey6_A_rows.csv (13 kB)
      • en_survey4_B_rows.csv (13 kB)
      • en_survey1_B_rows.csv (13 kB)
      • en_survey7_A_rows.csv (14 kB)
      • en_survey5_B_rows.csv (13 kB)
    • hr_context
      • hr_survey3_A_rows.csv (11 kB)
      • hr_survey3_B_rows.csv (10 kB)
      • hr_survey1_A_rows.csv (10 kB)
      • hr_survey4_A_rows.csv (11 kB)
      • hr_survey1_B_rows.csv (10 kB)
      • hr_survey4_B_rows.csv (10 kB)
      • hr_survey2_A_rows.csv (11 kB)
      • hr_survey2_B_rows.csv (11 kB)
    • hr_scores
      • hr_scores_1_A.csv (4 kB)
      • hr_scores_3_B.csv (3 kB)
      • hr_scores_2_A.csv (4 kB)
      • hr_scores_4_B.csv (3 kB)
      • hr_scores_1_B.csv (3 kB)
      • hr_scores_3_A.csv (4 kB)
      • hr_scores_2_B.csv (3 kB)
      • hr_scores_4_A.csv (3 kB)
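
The file names inside cosimlex_scores.zip follow a regular pattern, which makes it possible to line up each ratings file with a context file. The sketch below is only an illustration: the two patterns are inferred from the listing above, the assumption that a scores file and a context file sharing the same language, survey number and A/B letter belong together is not confirmed by this record, and the meaning of the A/B letter is not stated here (the README inside the archive may document it). The names `parse_name` and `pair_files` are hypothetical helpers.

```python
import re
from collections import defaultdict

# Patterns inferred from the archive listing (an assumption, not documented in this record):
#   <lang>_scores_<survey>_<A|B>.csv       inside <lang>_scores/
#   <lang>_survey<survey>_<A|B>_rows.csv   inside <lang>_context/
SCORES_RE = re.compile(r"^(?P<lang>[a-z]{2})_scores_(?P<survey>\d+)_(?P<group>[AB])\.csv$")
CONTEXT_RE = re.compile(r"^(?P<lang>[a-z]{2})_survey(?P<survey>\d+)_(?P<group>[AB])_rows\.csv$")


def parse_name(file_name):
    """Return (kind, lang, survey, group), or None if the name fits neither pattern."""
    for kind, pattern in (("scores", SCORES_RE), ("context", CONTEXT_RE)):
        match = pattern.match(file_name)
        if match:
            return kind, match["lang"], int(match["survey"]), match["group"]
    return None


def pair_files(file_names):
    """Group names by (lang, survey, group); each value maps 'scores'/'context' to a name."""
    pairs = defaultdict(dict)
    for name in file_names:
        parsed = parse_name(name)
        if parsed is None:
            continue  # e.g. README.md
        kind, lang, survey, group = parsed
        pairs[(lang, survey, group)][kind] = name
    return dict(pairs)


if __name__ == "__main__":
    sample = ["en_scores_7_B.csv", "en_survey7_B_rows.csv", "README.md"]
    for key, files in sorted(pair_files(sample).items()):
        print(key, files)
```

Converting the survey number to an integer lets the groups sort numerically, so survey 10 follows survey 9 rather than survey 1.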
