Prikaži enostavni zapis vnosa
dc.contributor.author |
Krsnik, Luka |
dc.contributor.author |
Dobrovoljc, Kaja |
dc.contributor.author |
Robnik-Šikonja, Marko |
dc.date.accessioned |
2023-11-20T15:48:07Z |
dc.date.available |
2023-11-20T15:48:07Z |
dc.date.issued |
2023-11-17 |
dc.identifier.uri |
http://hdl.handle.net/11356/1899 |
dc.description |
STARK is a python-based command-line tool for extraction of dependency trees from parsed corpora, aimed at corpus-driven linguistic investigations of syntactic and lexical phenomena of various kinds. It takes a treebank in the CONLL-U format as input and returns a list of all relevant dependency trees with frequency information and other useful statistics, such as the strength of association between the nodes of a tree, or its significance in comparison to another treebank.
For installation, execution and the description of various user-defined parameter settings, see the official project page at: https://github.com/clarinsi/STARK
In comparison with v1, this version introduces several new features and improvements, such as the option to set parameters in the command line, compare treebanks or visualise results online. |
dc.publisher |
Faculty of Computer and Information Science, University of Ljubljana |
dc.publisher |
Centre for Language Resources and Technologies, University of Ljubljana |
dc.publisher |
Faculty of Arts, University of Ljubljana |
dc.publisher |
CLARIN.SI |
dc.relation.isreferencedby |
https://unidive.lisn.upsaclay.fr/lib/exe/fetch.php?media=meetings:2023-saclay:abstracts:62_dobrovoljc_et_al_stark_a_tool_for_dependency_tree.pdf |
dc.relation.replaces |
http://hdl.handle.net/11356/1284 |
dc.relation.isreplacedby |
http://hdl.handle.net/11356/1958 |
dc.rights |
Apache License 2.0 |
dc.rights.uri |
https://opensource.org/licenses/Apache-2.0 |
dc.rights.label |
PUB |
dc.source.uri |
https://github.com/clarinsi/STARK |
dc.subject |
corpus linguistics |
dc.subject |
text processing |
dc.subject |
dependency trees |
dc.subject |
extraction |
dc.subject |
n-grams |
dc.subject |
universal dependencies |
dc.subject |
syntax |
dc.title |
Dependency tree extraction tool STARK 2.0 |
dc.type |
toolService |
metashare.ResourceInfo#ContentInfo.detailedType |
tool |
metashare.ResourceInfo#ResourceComponentType#ToolServiceInfo.languageDependent |
false |
has.files |
yes |
branding |
CLARIN.SI data & tools |
contact.person |
Kaja Dobrovoljc kaja.dobrovoljc@ff.uni-lj.si Faculty of Arts, University of Ljubljana |
sponsor |
Jožef Stefan Institute CLARIN CLARIN.SI nationalFunds |
sponsor |
ARRS (Slovenian Research Agency) P6-0411 Language Resources and Technologies for Slovene nationalFunds |
sponsor |
ARRS (Slovenian Research Agency) Z6-4617 Treebank-Driven Approach to the Study of Spoken Slovenian nationalFunds |
files.count |
1 |
files.size |
1210101 |
Datoteke v tem vnosu
To je vnos
Publicly Available
z licenco:
Apache License 2.0
- Ime
- STARK-2.0.zip
- Velikost
- 1.15
MB
- Format
- application/zip
- Opis
- GitHub source code
- MD5
- b73bfc2c1fb9b519639499d555c6b9fb
Prenesi datoteko
Predogled
- STARK-2.0
- setup.py620 B
- settings.md12 kB
- README.md7 kB
- .gitignore133 B
- scripts
- grew_corpus_names.txt5 kB
- create_codes_mapper.py907 B
- codes_and_flags.yaml14 kB
- install.bat42 B
- logos
- ARRS.png13 kB
- FF.png43 kB
- CJVT.png76 kB
- CLARIN.png28 kB
- ARRS.svg139 kB
- FF.svg313 kB
- FRI.png128 kB
- stark
- __init__.py66 B
- Tree.py19 kB
- ResultNode.py1 kB
- codes_mapper.json11 kB
- Value.py700 B
- ResultTree.py8 kB
- stark.py32 kB
- _version.py78 B
- generic.py3 kB
- run.sh80 B
- requirements.txt15 B
- config.ini794 B
- run.bat49 B
- sample
- output.tsv64 kB
- sl_ssj-ud-dev.conllu1 MB
- en_ewt-ud-dev.conllu1 MB
- LICENSE.txt11 kB
- stark.py2 kB
- MANIFEST.in57 B
Prikaži enostavni zapis vnosa