Show simple item record

 
dc.contributor.author Krsnik, Luka
dc.contributor.author Dobrovoljc, Kaja
dc.contributor.author Robnik-Šikonja, Marko
dc.date.accessioned 2023-11-20T15:48:07Z
dc.date.available 2023-11-20T15:48:07Z
dc.date.issued 2023-11-17
dc.identifier.uri http://hdl.handle.net/11356/1899
dc.description STARK is a python-based command-line tool for extraction of dependency trees from parsed corpora, aimed at corpus-driven linguistic investigations of syntactic and lexical phenomena of various kinds. It takes a treebank in the CONLL-U format as input and returns a list of all relevant dependency trees with frequency information and other useful statistics, such as the strength of association between the nodes of a tree, or its significance in comparison to another treebank. For installation, execution and the description of various user-defined parameter settings, see the official project page at: https://github.com/clarinsi/STARK In comparison with v1, this version introduces several new features and improvements, such as the option to set parameters in the command line, compare treebanks or visualise results online.
dc.publisher Faculty of Computer and Information Science, University of Ljubljana
dc.publisher Centre for Language Resources and Technologies, University of Ljubljana
dc.publisher Faculty of Arts, University of Ljubljana
dc.publisher CLARIN.SI
dc.relation.isreferencedby https://unidive.lisn.upsaclay.fr/lib/exe/fetch.php?media=meetings:2023-saclay:abstracts:62_dobrovoljc_et_al_stark_a_tool_for_dependency_tree.pdf
dc.relation.replaces http://hdl.handle.net/11356/1284
dc.relation.isreplacedby http://hdl.handle.net/11356/1958
dc.rights Apache License 2.0
dc.rights.uri https://opensource.org/licenses/Apache-2.0
dc.rights.label PUB
dc.source.uri https://github.com/clarinsi/STARK
dc.subject corpus linguistics
dc.subject text processing
dc.subject dependency trees
dc.subject extraction
dc.subject n-grams
dc.subject universal dependencies
dc.subject syntax
dc.title Dependency tree extraction tool STARK 2.0
dc.type toolService
metashare.ResourceInfo#ContentInfo.detailedType tool
metashare.ResourceInfo#ResourceComponentType#ToolServiceInfo.languageDependent false
has.files yes
branding CLARIN.SI data & tools
contact.person Kaja Dobrovoljc kaja.dobrovoljc@ff.uni-lj.si Faculty of Arts, University of Ljubljana
sponsor Jožef Stefan Institute CLARIN CLARIN.SI nationalFunds
sponsor ARRS (Slovenian Research Agency) P6-0411 Language Resources and Technologies for Slovene nationalFunds
sponsor ARRS (Slovenian Research Agency) Z6-4617 Treebank-Driven Approach to the Study of Spoken Slovenian nationalFunds
files.count 1
files.size 1210101


 Files in this item

This item is
Publicly Available
and licensed under:
Apache License 2.0
Icon
Name
STARK-2.0.zip
Size
1.15 MB
Format
application/zip
Description
GitHub source code
MD5
b73bfc2c1fb9b519639499d555c6b9fb
 Download file  Preview
 File Preview  
  • STARK-2.0
    • setup.py620 B
    • settings.md12 kB
    • README.md7 kB
    • .gitignore133 B
    • scripts
      • grew_corpus_names.txt5 kB
      • create_codes_mapper.py907 B
      • codes_and_flags.yaml14 kB
    • install.bat42 B
    • logos
      • ARRS.png13 kB
      • FF.png43 kB
      • CJVT.png76 kB
      • CLARIN.png28 kB
      • ARRS.svg139 kB
      • FF.svg313 kB
      • FRI.png128 kB
    • stark
      • __init__.py66 B
      • Tree.py19 kB
      • ResultNode.py1 kB
      • codes_mapper.json11 kB
      • Value.py700 B
      • ResultTree.py8 kB
      • stark.py32 kB
      • _version.py78 B
      • generic.py3 kB
    • run.sh80 B
    • requirements.txt15 B
    • config.ini794 B
    • run.bat49 B
    • sample
      • output.tsv64 kB
      • sl_ssj-ud-dev.conllu1 MB
      • en_ewt-ud-dev.conllu1 MB
    • LICENSE.txt11 kB
    • stark.py2 kB
    • MANIFEST.in57 B

Show simple item record