dc.contributor.author | Armendariz, Carlos |
dc.contributor.author | Matthew, Purver |
dc.contributor.author | Ulčar, Matej |
dc.contributor.author | Pollak, Senja |
dc.contributor.author | Ljubešić, Nikola |
dc.contributor.author | Robnik-Šikonja, Marko |
dc.contributor.author | Granroth-Wilding, Mark |
dc.contributor.author | Vaik, Kristiina |
dc.date.accessioned | 2020-10-30T08:31:25Z |
dc.date.available | 2020-10-30T08:31:25Z |
dc.date.issued | 2020 |
dc.identifier.uri | http://hdl.handle.net/11356/1308 |
dc.description | The dataset contains human similarity ratings for pairs of words. The annotators were presented with contexts that contained both of the words in the pair and the dataset features two different contexts per pair. The words were sourced from the English, Croatian, Finnish and Slovenian versions of the original Simlex dataset. |
dc.language.iso | eng |
dc.language.iso | hrv |
dc.language.iso | fin |
dc.language.iso | slv |
dc.publisher | Queen Mary University |
dc.relation | info:eu-repo/grantAgreement/EC/H2020/825153 |
dc.relation.isreferencedby | https://arxiv.org/abs/1912.05320 |
dc.relation.isreferencedby | https://www.aclweb.org/anthology/2020.lrec-1.720/ |
dc.rights | GNU General Public Licence, version 3 |
dc.rights.uri | https://opensource.org/licenses/GPL-3.0 |
dc.rights.label | PUB |
dc.source.uri | http://embeddia.eu/ |
dc.subject | similarity |
dc.subject | contextual embeddings |
dc.subject | evaluation |
dc.subject | context |
dc.title | A Resource for Evaluating Graded Word Similarity in Context: CoSimLex |
dc.type | lexicalConceptualResource |
metashare.ResourceInfo#ContentInfo.detailedType | other |
metashare.ResourceInfo#ContentInfo.mediaType | text |
has.files | yes |
branding | CLARIN.SI data & tools |
contact.person | Carlos Armendariz carlos@santosarmendariz.com Queen Mary University of London |
contact.person | Matthew Purver m.purver@qmul.ac.uk Queen Mary University |
sponsor | European Union EC/H2020/825153 EMBEDDIA - Cross-Lingual Embeddings for Less-Represented Languages in European News Media euFunds info:eu-repo/grantAgreement/EC/H2020/825153 |
size.info | 580 entries |
files.count | 6 |
files.size | 768067 |
Files in this item
Download all files in item (750.07 KB)
- Name
- cosimlex_en.csv
- Size
- 284.95 KB
- Format
- CSV file
- Description
- English Dataset
- MD5
- 481d4243b30a59acb1a1d778d08c7304

- Name
- cosimlex_fi.csv
- Size
- 19.98 KB
- Format
- CSV file
- Description
- Finnish Dataset
- MD5
- c53e5591516e499f7fcac07bd2d0c589

- Name
- cosimlex_hr.csv
- Size
- 92.46 KB
- Format
- CSV file
- Description
- Croatian Dataset
- MD5
- c565f15294762e1406a006fb7bd220c9

- Name
- cosimlex_sl.csv
- Size
- 85.42 KB
- Format
- CSV file
- Description
- Slovene Dataset
- MD5
- ff53a542eb3ed416a15ef72fc3a16595

- Name
- README.md
- Size
- 3.93 KB
- Format
- Unknown
- Description
- Readme file
- MD5
- 1bdc84968bfc9fe27091fd6f454e827c

- Name
- cosimlex_scores.zip
- Size
- 263.33 KB
- Format
- application/zip
- Description
- Raw individual annotator ratings for all languages
- MD5
- 5a8a9343f8e47bc56aab072ef2ddbacb
- cosimlex_scores
- sl_context
- sl_survey4_B_rows.csv10 kB
- sl_survey2_A_rows.csv10 kB
- sl_survey2_B_rows.csv10 kB
- sl_survey3_A_rows.csv10 kB
- sl_survey3_B_rows.csv10 kB
- sl_survey1_A_rows.csv10 kB
- sl_survey1_B_rows.csv9 kB
- sl_survey4_A_rows.csv10 kB
- README.md2 kB
- en_scores
- en_scores_7_B.csv8 kB
- en_scores_6_B.csv7 kB
- en_scores_4_A.csv8 kB
- en_scores_5_B.csv9 kB
- en_scores_3_A.csv9 kB
- en_scores_10_B.csv7 kB
- en_scores_2_B.csv9 kB
- en_scores_9_B.csv8 kB
- en_scores_1_B.csv9 kB
- en_scores_7_A.csv10 kB
- en_scores_8_B.csv11 kB
- en_scores_6_A.csv8 kB
- en_scores_5_A.csv8 kB
- en_scores_10_A.csv6 kB
- en_scores_4_B.csv7 kB
- en_scores_2_A.csv10 kB
- en_scores_3_B.csv9 kB
- en_scores_9_A.csv8 kB
- en_scores_1_A.csv8 kB
- en_scores_8_A.csv8 kB
- sl_scores
- sl_scores_4_A.csv4 kB
- sl_scores_2_B.csv4 kB
- sl_scores_1_A.csv4 kB
- sl_scores_3_B.csv3 kB
- sl_scores_2_A.csv3 kB
- sl_scores_4_B.csv3 kB
- sl_scores_3_A.csv3 kB
- sl_scores_1_B.csv3 kB
- en_context
- en_survey2_B_rows.csv13 kB
- en_survey2_A_rows.csv14 kB
- en_survey8_A_rows.csv13 kB
- en_survey6_B_rows.csv14 kB
- en_survey3_A_rows.csv13 kB
- en_survey9_A_rows.csv13 kB
- en_survey10_A_rows.csv13 kB
- en_survey7_B_rows.csv14 kB
- en_survey4_A_rows.csv13 kB
- en_survey1_A_rows.csv14 kB
- en_survey8_B_rows.csv13 kB
- en_survey5_A_rows.csv13 kB
- en_survey3_B_rows.csv13 kB
- en_survey9_B_rows.csv13 kB
- en_survey10_B_rows.csv14 kB
- en_survey6_A_rows.csv13 kB
- en_survey4_B_rows.csv13 kB
- en_survey1_B_rows.csv13 kB
- en_survey7_A_rows.csv14 kB
- en_survey5_B_rows.csv13 kB
- hr_context
- hr_survey3_A_rows.csv11 kB
- hr_survey3_B_rows.csv10 kB
- hr_survey1_A_rows.csv10 kB
- hr_survey4_A_rows.csv11 kB
- hr_survey1_B_rows.csv10 kB
- hr_survey4_B_rows.csv10 kB
- hr_survey2_A_rows.csv11 kB
- hr_survey2_B_rows.csv11 kB
- hr_scores
- hr_scores_1_A.csv4 kB
- hr_scores_3_B.csv3 kB
- hr_scores_2_A.csv4 kB
- hr_scores_4_B.csv3 kB
- hr_scores_1_B.csv3 kB
- hr_scores_3_A.csv4 kB
- hr_scores_2_B.csv3 kB
- hr_scores_4_A.csv3 kB
- sl_context