KAS-biterm is an automatically generated glossary of English terms with their translations into Slovene. The pairs, possibly with their English and Slovene acronyms, were extracted from the Corpus of Academic Slovene KAS 1.0 (http://hdl.handle.net/11356/1244), where they have been annotated with the kas-biterm tool (https://github.com/clarinsi/kas-biterm) trained on the Bilingual terminology extraction dataset KAS-biterm 1.0 (http://hdl.handle.net/11356/1199). Note that only Query 1 was used for pre-selection of the sentences and for training the tool, and that the bi-lingual terms from the KAS corpus have been filtered to remove noise.
The glossary is encoded in TEI-Lex0 (https://github.com/DARIAH-ERIC/lexicalresources) and gives, for each entry, also up to three examples of use, together with their bibliographic information. Various parts of the lexical entries also have links to the appropriate queries to CLARIN.SI noSketch Engine conconrdancer. The TEI encoded corpus is also available in a variant that is a much smaller document as it does not contain the examples of use and links.
ARRS (Slovenian Research Agency)J6-7094"Slovene scientific texts: resources and description"ARRS (Slovenian Research Agency)P2-103"Knowledge Technologies"ARRS (Slovenian Research Agency)P6-0411"Language Resources and Technologies for Slovene"