dc.contributor.author |
Krsnik, Luka |
dc.contributor.author |
Dobrovoljc, Kaja |
dc.contributor.author |
Robnik-Šikonja, Marko |
dc.date.accessioned |
2023-11-20T15:48:07Z |
dc.date.available |
2023-11-20T15:48:07Z |
dc.date.issued |
2023-11-17 |
dc.identifier.uri |
http://hdl.handle.net/11356/1899 |
dc.description |
STARK is a python-based command-line tool for extraction of dependency trees from parsed corpora, aimed at corpus-driven linguistic investigations of syntactic and lexical phenomena of various kinds. It takes a treebank in the CONLL-U format as input and returns a list of all relevant dependency trees with frequency information and other useful statistics, such as the strength of association between the nodes of a tree, or its significance in comparison to another treebank.
For installation, execution and the description of various user-defined parameter settings, see the official project page at: https://github.com/clarinsi/STARK
In comparison with v1, this version introduces several new features and improvements, such as the option to set parameters in the command line, compare treebanks or visualise results online. |
dc.publisher |
Faculty of Computer and Information Science, University of Ljubljana |
dc.publisher |
Centre for Language Resources and Technologies, University of Ljubljana |
dc.publisher |
Faculty of Arts, University of Ljubljana |
dc.publisher |
CLARIN.SI |
dc.relation.isreferencedby |
https://unidive.lisn.upsaclay.fr/lib/exe/fetch.php?media=meetings:2023-saclay:abstracts:62_dobrovoljc_et_al_stark_a_tool_for_dependency_tree.pdf |
dc.relation.replaces |
http://hdl.handle.net/11356/1284 |
dc.relation.isreplacedby |
http://hdl.handle.net/11356/1958 |
dc.rights |
Apache License 2.0 |
dc.rights.uri |
https://opensource.org/licenses/Apache-2.0 |
dc.rights.label |
PUB |
dc.source.uri |
https://github.com/clarinsi/STARK |
dc.subject |
corpus linguistics |
dc.subject |
text processing |
dc.subject |
dependency trees |
dc.subject |
extraction |
dc.subject |
n-grams |
dc.subject |
universal dependencies |
dc.subject |
syntax |
dc.title |
Dependency tree extraction tool STARK 2.0 |
dc.type |
toolService |
metashare.ResourceInfo#ContentInfo.detailedType |
tool |
metashare.ResourceInfo#ResourceComponentType#ToolServiceInfo.languageDependent |
false |
has.files |
yes |
branding |
CLARIN.SI data & tools |
contact.person |
Kaja Dobrovoljc kaja.dobrovoljc@ff.uni-lj.si Faculty of Arts, University of Ljubljana |
sponsor |
Jožef Stefan Institute CLARIN CLARIN.SI nationalFunds |
sponsor |
ARRS (Slovenian Research Agency) P6-0411 Language Resources and Technologies for Slovene nationalFunds |
sponsor |
ARRS (Slovenian Research Agency) Z6-4617 Treebank-Driven Approach to the Study of Spoken Slovenian nationalFunds |
files.count |
1 |
files.size |
1210101 |