Show simple item record

 
dc.contributor.author Zwitter Vitez, Ana
dc.contributor.author Zemljarič Miklavčič, Jana
dc.contributor.author Krek, Simon
dc.contributor.author Stabej, Marko
dc.contributor.author Erjavec, Tomaž
dc.date.accessioned 2021-09-23T13:05:10Z
dc.date.available 2021-09-23T13:05:10Z
dc.date.issued 2021-09-23
dc.identifier.uri http://hdl.handle.net/11356/1438
dc.description Gos is a corpus of spoken Slovene that includes the transcripts of approximately 120 hours of speech recorded in various situations: radio and TV shows, school lessons and lectures, private conversations between friends or within the family, work meetings, consultations, conversations in buying and selling situations, etc. All speech is transcribed in two versions – with pronunciation-based spelling and with standardized spelling – and it comprises over one million words. The corpus can be searched by means of the web concordancer where it is also possible to listen to the corresponding recordings: http://www.korpus-gos.net. As opposed to the previous version, this one corrects some errors in the transcriptions and introduces various changes in the TEI and vertical encodings.
dc.language.iso slv
dc.publisher Centre for Language Resources and Technologies, University of Ljubljana
dc.relation.replaces http://hdl.handle.net/11356/1040
dc.relation.isreplacedby http://hdl.handle.net/11356/1771
dc.rights Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
dc.rights.uri https://creativecommons.org/licenses/by-nc-sa/4.0/
dc.rights.label PUB
dc.source.uri http://eng.slovenscina.eu/korpusi/gos
dc.subject speech transcription
dc.subject spoken corpus
dc.subject TEI
dc.title Spoken corpus Gos 1.1
dc.type corpus
metashare.ResourceInfo#ContentInfo.mediaType text
hidden false
hasMetadata false
has.files yes
branding CLARIN.SI data & tools
demo.uri http://www.korpus-gos.net/
contact.person Simon Krek simon.krek@guest.arnes.si Faculty of Arts, University of Ljubljana
sponsor Ministry of Education, Science and Sport 3311-08-986003 Communication in Slovene Other
sponsor Ministry of Culture C3340-20-278001 Development of Slovene in a Digital Environment Other
size.info 120 hours
size.info 1063936 tokens
size.info 1044301 words
files.count 2
files.size 23177137
featuredService.kontext search|https://www.clarin.si/kontext/first_form?corpname=gos11
featuredService.noske search|https://www.clarin.si/ske/#dashboard?corpname=gos11


 Files in this item

 Download all files in item (22.1 MB)
Icon
Name
Gos.TEI.zip
Size
15.74 MB
Format
application/zip
Description
Corpus in TEI format
MD5
f7d6429496a6a792ad879d2e85a0f3d5
 Download file  Preview
 File Preview  
  • Gos.TEI
    • gos043.xml474 kB
    • gos035.xml200 kB
    • gos027.xml173 kB
    • gos097.xml33 kB
    • gos121.xml390 kB
    • gos089.xml83 kB
    • gos113.xml248 kB
    • gos191.xml577 kB
    • gos105.xml36 kB
    • gos183.xml448 kB
    • gos175.xml615 kB
    • gos167.xml1012 kB
    • gos159.xml1 MB
    • gos261.xml217 kB
    • gos253.xml576 kB
    • gos245.xml786 kB
    • gos014.xml800 kB
    • gos237.xml211 kB
    • gos006.xml439 kB
    • gos229.xml536 kB
    • gos050.xml565 kB
    • gos042.xml341 kB
    • gos034.xml122 kB
    • gos026.xml177 kB
    • gos096.xml27 kB
    • gos088.xml29 kB
    • gos120.xml186 kB
    • gos112.xml42 kB
    • gos190.xml172 kB
    • gos104.xml24 kB
    • gos182.xml649 kB
    • gos174.xml755 kB
    • gos166.xml1 MB
    • gos158.xml1 MB
    • gos260.xml624 kB
    • gos252.xml531 kB
    • gos244.xml839 kB
    • gos013.xml862 kB
    • gos236.xml793 kB
    • gos228.xml227 kB
    • gos005.xml458 kB
    • 00README.txt611 B
    • gos041.xml508 kB
    • gos033.xml412 kB
    • gos025.xml197 kB
    • gos095.xml28 kB
    • gos087.xml27 kB
    • gos079.xml34 kB
    • gos111.xml48 kB
    • gos103.xml28 kB
    • gos181.xml686 kB
    • gos173.xml202 kB
    • gos165.xml1 MB
    • Gos-speakers.tsv60 kB
    • gos157.xml1 MB
    • gos149.xml100 kB
    • gos251.xml430 kB
    • gos243.xml599 kB
    • gos235.xml688 kB
    • gos012.xml466 kB
    • gos004.xml752 kB
    • gos227.xml324 kB
    • gos219.xml1 MB
    • gos040.xml481 kB
    • gos032.xml363 kB
    • gos024.xml231 kB
    • gos094.xml35 kB
    • gos086.xml19 kB
    • gos110.xml19 kB
    • gos078.xml40 kB
    • gos102.xml28 kB
    • gos180.xml771 kB
    • gos172.xml409 kB
    • gos164.xml2 MB
    • gos.xml499 kB
    • gos156.xml583 kB
    • gos148.xml197 kB
    • gos250.xml567 kB
    • gos242.xml737 kB
    • gos011.xml767 kB
    • gos234.xml288 kB
    • gos003.xml854 kB
    • gos226.xml692 kB
    • gos218.xml1 MB
    • gos031.xml290 kB
    • gos023.xml182 kB
    • gos093.xml33 kB
    • gos085.xml27 kB
    • gos077.xml30 kB
    • gos069.xml188 kB
    • gos101.xml92 kB
    • gos171.xml631 kB
    • gos163.xml539 kB
    • gos155.xml1 MB
    • gos147.xml359 kB
    • gos139.xml693 kB
    • gos241.xml610 kB
    • gos010.xml567 kB
    • gos233.xml487 kB
    • gos225.xml994 kB
    • gos002.xml406 kB
    • gos217.xml861 kB
    • gos209.xml313 kB
    • gos287.xml530 kB
    • gos279.xml216 kB
    • gos030.xml274 kB
    • gos022.xml552 kB
    • gos092.xml28 kB
    • gos084.xml50 kB
    • gos076.xml21 kB
    • gos068.xml121 kB
    • gos100.xml28 kB
    • gos170.xml664 kB
    • gos162.xml756 kB
    • gos154.xml413 kB
    • gos146.xml800 kB
    • gos138.xml138 kB
    • gos240.xml907 kB
    • gos232.xml480 kB
    • gos224.xml605 kB
    • gos001.xml395 kB
    • gos216.xml1014 kB
    • gos208.xml75 kB
    • gos286.xml682 kB
    • gos278.xml661 kB
    • gos021.xml471 kB
    • gos091.xml37 kB
    • gos083.xml33 kB
    • gos075.xml52 kB
    • gos067.xml156 kB
    • gos059.xml565 kB
    • gos161.xml379 kB
    • gos153.xml440 kB
    • gos145.xml1 MB
    • gos137.xml1 MB
    • gos129.xml210 kB
    • gos231.xml411 kB
    • gos199.xml403 kB
    • gos223.xml941 kB
    • gos215.xml34 kB
    • gos207.xml34 kB
    • gos285.xml681 kB
    • gos277.xml643 kB
    • gos269.xml149 kB
    • gos020.xml665 kB
    • gos090.xml23 kB
    • Gos-speeches.tsv90 kB
    • gos082.xml27 kB
    • gos074.xml605 kB
    • gos066.xml180 kB
    • gos058.xml675 kB
    • gos160.xml472 kB
    • gos152.xml348 kB
    • gos144.xml898 kB
    • gos136.xml1 MB
    • gos128.xml202 kB
    • gos198.xml427 kB
    • gos230.xml404 kB
    • gos222.xml722 kB
    • gos214.xml278 kB
    • gos206.xml405 kB
    • gos284.xml237 kB
    • gos276.xml308 kB
    • gos268.xml457 kB
    • gos081.xml31 kB
    • gos073.xml1 MB
    • gos065.xml125 kB
    • gos057.xml525 kB
    • schema
      • tei_clarin.rng667 kB
      • dcr.tmp1 kB
      • tei_clarin.sch501 B
      • tei_clarin.dtd247 kB
      • tei_clarin.rnc315 kB
      • trans-14.dtd2 kB
      • docs
        • tei_clarin_doc.xml8 MB
        • odd.css8 kB
        • index.html8 MB
        • tei-print.css2 kB
        • tei_clarin_doc.html8 MB
        • tei.css16 kB
      • xml.tmp2 kB
      • tei_clarin.xsd738 kB
    • gos049.xml319 kB
    • gos151.xml53 kB
    • gos143.xml452 kB
    • gos135.xml1 MB
    • gos127.xml191 kB
    • gos119.xml245 kB
    • gos197.xml389 kB
    • gos221.xml285 kB
    • gos189.xml423 kB
    • gos213.xml194 kB
    • gos205.xml274 kB
    • gos283.xml718 kB
    • gos275.xml248 kB
    • gos267.xml82 kB
    • gos259.xml415 kB
    • gos080.xml41 kB
    • gos072.xml1 MB
    • gos064.xml153 kB
    • gos056.xml668 kB
    • gos048.xml353 kB
    • gos150.xml183 kB
    • gos142.xml490 kB
    • gos134.xml364 kB
    • gos126.xml241 kB
    • gos118.xml198 kB
    • gos196.xml192 kB
    • gos188.xml239 kB
    • gos220.xml210 kB
    • gos212.xml195 kB
    • gos204.xml229 kB
    • gos282.xml440 kB
    • gos274.xml172 kB
    • gos266.xml351 kB
    • gos258.xml593 kB
    • gos019.xml591 kB
    • gos071.xml104 kB
    • gos063.xml657 kB
    • gos055.xml585 kB
    • gos047.xml471 kB
    • gos039.xml613 kB
    • gos141.xml852 kB
    • gos133.xml349 kB
    • gos125.xml226 kB
    • gos117.xml259 kB
    • gos195.xml407 kB
    • gos109.xml34 kB
    • gos187.xml308 kB
    • gos179.xml812 kB
    • gos211.xml91 kB
    • gos203.xml187 kB
    • gos281.xml815 kB
    • gos273.xml727 kB
    • gos265.xml260 kB
    • gos257.xml712 kB
    • gos249.xml556 kB
    • gos018.xml531 kB
    • gos070.xml101 kB
    • gos062.xml599 kB
    • gos054.xml422 kB
    • gos046.xml507 kB
    • gos038.xml830 kB
    • gos140.xml449 kB
    • gos132.xml344 kB
    • gos124.xml970 kB
    • gos116.xml459 kB
    • gos194.xml822 kB
    • gos108.xml29 kB
    • gos186.xml577 kB
    • gos210.xml66 kB
    • gos178.xml843 kB
    • gos202.xml124 kB
    • gos280.xml966 kB
    • gos272.xml69 kB
    • gos264.xml270 kB
    • gos256.xml745 kB
    • gos248.xml558 kB
    • gos017.xml732 kB
    • gos009.xml350 kB
    • gos061.xml606 kB
    • gos053.xml543 kB
    • gos045.xml771 kB
    • gos037.xml324 kB
    • gos029.xml218 kB
    • gos131.xml401 kB
    • gos099.xml24 kB
    • gos123.xml959 kB
    • gos115.xml245 kB
    • gos193.xml426 kB
    • gos107.xml33 kB
    • gos185.xml543 kB
    • gos177.xml496 kB
    • gos169.xml731 kB
    • gos201.xml132 kB
    • gos271.xml126 kB
    • gos263.xml415 kB
    • gos255.xml863 kB
    • gos247.xml743 kB
    • gos239.xml865 kB
    • gos016.xml596 kB
    • gos008.xml309 kB
    • gos060.xml697 kB
    • gos052.xml481 kB
    • gos044.xml152 kB
    • gos036.xml274 kB
    • gos028.xml187 kB
    • gos098.xml48 kB
    • gos130.xml342 kB
    • gos122.xml252 kB
    • gos114.xml275 kB
    • gos192.xml565 kB
    • gos106.xml16 kB
    • gos184.xml619 kB
    • gos176.xml1 MB
    • gos168.xml565 kB
    • gos200.xml191 kB
    • gos270.xml310 kB
    • gos262.xml386 kB
    • gos254.xml982 kB
    • gos246.xml907 kB
    • gos015.xml616 kB
    • gos238.xml440 kB
    • gos007.xml831 kB
    • gos051.xml672 kB
Icon
Name
Gos.vert.zip
Size
6.36 MB
Format
application/zip
Description
Corpus in derived vertical format
MD5
f59ee5af3f14570f6692bacfb0d62036
 Download file  Preview
 File Preview  
  • Gos.vert
    • gos11.vert48 MB
    • gos11.regi3 kB
    • Gos-speeches.tsv90 kB
    • Gos-speakers.tsv60 kB
    • 00README.txt461 B

Show simple item record