Post-edited and error annotated machine translation corpus PErr 1.0

Name: Post-edited and error annotated machine translation corpus PErr 1.0
License: https://creativecommons.org/licenses/by-sa/4.0/

Popović, Maja; Arčan, Mihael

Prikaži enostavni zapis vnosa

dc.contributor.author	Popović, Maja
dc.contributor.author	Arčan, Mihael
dc.date.accessioned	2016-05-29T15:22:49Z
dc.date.available	2016-05-29T15:22:49Z
dc.date.issued	2016-05-24
dc.identifier.uri	http://hdl.handle.net/11356/1065
dc.description	The PE²rr corpus contains source language texts from different domains along with their automatically generated translations into several morphologically rich languages, their post-edited versions, and error annotations of the performed post-edit operations. The main advantage of the corpus is the fusion of post-editing and error classification tasks, which have usually been seen as two independent tasks, although naturally they are not.
dc.language.iso	slv
dc.language.iso	srp
dc.language.iso	deu
dc.language.iso	spa
dc.language.iso	eng
dc.publisher	Insight Centre for Data Analytics, National University of Ireland, Galway
dc.relation	info:eu-repo/grantAgreement/EC/H2020/644333
dc.relation.isreferencedby	http://www.lrec-conf.org/proceedings/lrec2016/summaries/405.html
dc.rights	Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
dc.rights.uri	https://creativecommons.org/licenses/by-sa/4.0/
dc.rights.label	PUB
dc.subject	parallel corpus
dc.subject	machine translation
dc.subject	post-editing
dc.subject	error annotation
dc.subject	manual annotation
dc.subject	multilingual
dc.title	Post-edited and error annotated machine translation corpus PErr 1.0
dc.type	corpus
metashare.ResourceInfo#ContentInfo.mediaType	text
has.files	yes
branding	CLARIN.SI data & tools
contact.person	Maja Popović maja.popovic@hu-berlin.de Humboldt University of Berlin
contact.person	Mihael Arčan mihael.arcan@insight-centre.org Insight Centre for Data Analytics, National University of Ireland, Galway
sponsor	European Union EC/H2020/644333 TraMOOC - Translation for Massive Open Online Courses euFunds info:eu-repo/grantAgreement/EC/H2020/644333
sponsor	Science Foundation Ireland SFI/12/RC/2289 Insight nationalFunds
size.info	2896 units
size.info	43938 words
files.count	1
files.size	373440

Datoteke v tem vnosu

To je vnos

Publicly Available

z licenco:
Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)

Ime: pe2rr_dataset.tgz
Velikost: 364.69 KB
Format: Neznano
Opis: 11 files (each for one source / language pair), tab-separated, with one translation unit per line.
MD5: cb5dd1552c5a3c28008a7ceb294aafb2

Prenesi datoteko

Prikaži enostavni zapis vnosa

Datoteke v tem vnosu

Partnerji

Partnerji

Repozitorij