dc.contributor.author | Popović, Maja |
dc.contributor.author | Arčan, Mihael |
dc.date.accessioned | 2016-05-29T15:22:49Z |
dc.date.available | 2016-05-29T15:22:49Z |
dc.date.issued | 2016-05-24 |
dc.identifier.uri | http://hdl.handle.net/11356/1065 |
dc.description | The PE²rr corpus contains source language texts from different domains along with their automatically generated translations into several morphologically rich languages, their post-edited versions, and error annotations of the performed post-edit operations. The main advantage of the corpus is the fusion of post-editing and error classification tasks, which have usually been seen as two independent tasks, although naturally they are not. |
dc.language.iso | slv |
dc.language.iso | srp |
dc.language.iso | deu |
dc.language.iso | spa |
dc.language.iso | eng |
dc.publisher | Insight Centre for Data Analytics, National University of Ireland, Galway |
dc.relation | info:eu-repo/grantAgreement/EC/H2020/644333 |
dc.relation.isreferencedby | http://www.lrec-conf.org/proceedings/lrec2016/summaries/405.html |
dc.rights | Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) |
dc.rights.uri | https://creativecommons.org/licenses/by-sa/4.0/ |
dc.rights.label | PUB |
dc.subject | parallel corpus |
dc.subject | machine translation |
dc.subject | post-editing |
dc.subject | error annotation |
dc.subject | manual annotation |
dc.subject | multilingual |
dc.title | Post-edited and error annotated machine translation corpus PErr 1.0 |
dc.type | corpus |
metashare.ResourceInfo#ContentInfo.mediaType | text |
has.files | yes |
branding | CLARIN.SI data & tools |
contact.person | Maja Popović maja.popovic@hu-berlin.de Humboldt University of Berlin |
contact.person | Mihael Arčan mihael.arcan@insight-centre.org Insight Centre for Data Analytics, National University of Ireland, Galway |
sponsor | European Union EC/H2020/644333 TraMOOC - Translation for Massive Open Online Courses euFunds info:eu-repo/grantAgreement/EC/H2020/644333 |
sponsor | Science Foundation Ireland SFI/12/RC/2289 Insight nationalFunds |
size.info | 2896 units |
size.info | 43938 words |
files.count | 1 |
files.size | 373440 |
Files in this item
This item is
Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
Publicly Available
and licensed under:Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)




- Name
- pe2rr_dataset.tgz
- Size
- 364.69 KB
- Format
- Unknown
- Description
- 11 files (each for one source / language pair), tab-separated, with one translation unit per line.
- MD5
- cb5dd1552c5a3c28008a7ceb294aafb2