Show simple item record

 
dc.contributor.author Borovič, Mladen
dc.contributor.author Žagar, Kristjan
dc.contributor.author Ferme, Marko
dc.contributor.author Majninger, Sandi
dc.contributor.author Ojsteršek, Milan
dc.contributor.author Žagar, Aleš
dc.contributor.author Robnik-Šikonja, Marko
dc.date.accessioned 2022-11-07T15:09:19Z
dc.date.available 2022-11-07T15:09:19Z
dc.date.issued 2022-11-07
dc.identifier.uri http://hdl.handle.net/11356/1704
dc.description SuperGLUE is a benchmark styled after GLUE with a new set of more difficult language understanding tasks, improved resources, and a public leaderboard. It is comprised of 8 corpora (BoolQ, CB, COPA, MultiRC, ReCoRD, RTE, WiC, WSC), which cover 4 different types of tasks (QA, NLI, WSD, coref.). Slovene translation of SuperGLUE consists of machine and human translations of the benchmark. ReCoRD is completely translated by the Google Machine Translation service. Questions and answers from the project "Slovene in the Palm of your Hand (Slovenščina na dlani)" are also included for the BoolQ, MultiRC and ReCoRD tasks and are in form of extensions to the existing datasets. The data is provided in jsonl format.
dc.language.iso slv
dc.publisher Faculty of Electrical Engineering and Computer Science, University of Maribor
dc.relation.isreferencedby https://super.gluebenchmark.com/
dc.rights Creative Commons - Attribution 4.0 International (CC BY 4.0)
dc.rights.uri https://creativecommons.org/licenses/by/4.0/
dc.rights.label PUB
dc.source.uri https://rsdo.slovenscina.eu/en/semantic-resources-and-technologies
dc.subject dataset
dc.subject natural language processing
dc.subject Q&A
dc.subject SuperGLUE
dc.title Extensions to the Slovene translation of SuperGLUE
dc.type corpus
metashare.ResourceInfo#ContentInfo.mediaType text
has.files yes
branding CLARIN.SI data & tools
contact.person Mladen Borovič mladen.borovic@um.si Faculty of Electrical Engineering and Computer Science, University of Maribor
sponsor Ministry of Culture C3340-20-278001 Development of Slovene in a Digital Environment Other
sponsor Slovene Ministry of Culture and European Social Fund C3340-17-208002 Slovenščina na dlani Other
files.count 4
files.size 61359637


 Files in this item

 Download all files in item (58.52 MB)
This item is
Publicly Available
and licensed under:
Creative Commons - Attribution 4.0 International (CC BY 4.0)
Distributed under Creative Commons Attribution Required
Icon
Name
SuperGLUE-multirc-ext-SND.zip
Size
306.41 KB
Format
application/zip
Description
Questions and answers from the project Slovenščina na dlani for the MultiRC task
MD5
391ad993236c97ce33af893b1f300d33
 Download file  Preview
 File Preview  
  • multirc
    • ext-SND
      • val.jsonl82 kB
      • MultiRC-SLO-ext-SND.jsonl425 kB
      • train.jsonl276 kB
      • test.jsonl94 kB
Icon
Name
SuperGLUE-record-GoogleMT.zip
Size
50.21 MB
Format
application/zip
Description
Google translated dataset for the ReCoRD task
MD5
3f5deb474eb1cfa8ef392ea0f8049051
 Download file  Preview
 File Preview  
    • train.jsonl130 MB
    • dev.jsonl14 MB
Icon
Name
SuperGLUE-record-ext-SND.zip
Size
514.41 KB
Format
application/zip
Description
Questions and answers from the project Slovenščina na dlani for the ReCoRD task
MD5
f52dd6ecf27f0b117cc897e20f90bcb7
 Download file  Preview
 File Preview  
  • record
    • ext-SND
      • val.jsonl109 kB
      • train.jsonl517 kB
      • test.jsonl114 kB
      • ReCoRD-SLO-ext-SND.jsonl761 kB
      • dev.jsonl109 kB
Icon
Name
SuperGLUE-boolq-ext-SND.zip
Size
7.51 MB
Format
application/zip
Description
Questions and answers from the project Slovenščina na dlani for the BoolQ task
MD5
ef92b2a1a390ad84b805092b001128c5
 Download file  Preview
 File Preview  
  • boolq
    • ext-SND
      • val.jsonl2 MB
      • train.jsonl13 MB
      • test.jsonl2 MB
      • BoolQ-SLO-ext-SND.jsonl17 MB

Show simple item record