dc.contributor.author | Ljubešić, Nikola |
dc.date.accessioned | 2018-02-14T11:29:25Z |
dc.date.available | 2018-02-14T11:29:25Z |
dc.date.issued | 2015-10-16 |
dc.identifier.uri | http://hdl.handle.net/11356/1178 |
dc.description | The srMWELex lexicon is an automatically constructed lexicon of Serbian multiword expression candidates (mostly collocations) from the parsed srWaC 1.0 corpus by using the DepMWEx [depmueks] tool (https://github.com/nljubesi/depmwex). The tool extracts MWE candidates from parse trees by applying tree patterns and ranking by occurrence statistics. |
dc.language.iso | srp |
dc.publisher | Jožef Stefan Institute |
dc.relation.isreferencedby | http://www.informatica.si/index.php/informatica/article/view/985 |
dc.rights | Creative Commons - Attribution 4.0 International (CC BY 4.0) |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ |
dc.rights.label | PUB |
dc.source.uri | http://nlp.ffzg.hr/resources/lexicons/srmwelex/ |
dc.subject | multiword expressions |
dc.subject | collocations |
dc.title | Automatically constructed multiword lexicon srMWELex v0.5 |
dc.type | lexicalConceptualResource |
metashare.ResourceInfo#ContentInfo.detailedType | lexicon |
metashare.ResourceInfo#ContentInfo.mediaType | text |
has.files | yes |
branding | CLARIN.SI data & tools |
contact.person | Nikola Ljubešić nikola.ljubesic@ijs.si Jožef Stefan Institute |
sponsor | COST IC-1207 PARSEME Other |
size.info | 22290 entries |
size.info | 3273369 multiWordUnits |
files.count | 1 |
files.size | 42210880 |
Files in this item
This item is
Creative Commons - Attribution 4.0 International (CC BY 4.0)
Publicly Available
and licensed under:Creative Commons - Attribution 4.0 International (CC BY 4.0)



- Name
- srMWELex.zip
- Size
- 40.26 MB
- Format
- application/zip
- Description
- Lexicon in XML format with DTD
- MD5
- dd1ac09983d5de1b358c372207abfe26
- srMWELex
- README.txt703 B
- mwelex.dtd885 B
- srMWELex_v0.5.xml394 MB