Prikaži enostavni zapis vnosa
dc.contributor.author |
Lebar Bajec, Iztok |
dc.contributor.author |
Bajec, Marko |
dc.contributor.author |
Bajec, Žan |
dc.date.accessioned |
2022-12-02T10:44:56Z |
dc.date.available |
2022-12-02T10:44:56Z |
dc.date.issued |
2022-12-01 |
dc.identifier.uri |
http://hdl.handle.net/11356/1738 |
dc.description |
Punctuation and Capitalisation service for NeMo models. For more details about building such models, see the official NVIDIA NeMo documentation (https://docs.nvidia.com/deeplearning/nemo/user-guide/docs/en/stable/nlp/punctuation_and_capitalization.html) and NVIDIA NeMo GitHub (https://github.com/NVIDIA/NeMo). A model for punctuation and capitalisation restoration in lowercased non-punctuated Slovene text can be downloaded from http://hdl.handle.net/11356/1735.
The service accepts as input either a single string or list of strings for which punctuation and capitalisation should be restored. The result will be in the same format as the request, either a single string or list of strings. The maximal accepted text length is 5000c. Note that punctuation and capitalization of one 5000c text block on cpu will take advantage of all available cores and may take ~30s (on a system with 24 vCPU). See the service README.md for further details. |
dc.publisher |
Faculty of Computer and Information Science, University of Ljubljana |
dc.relation.isreferencedby |
https://rsdo.slovenscina.eu/en/speech-technologies |
dc.rights |
Apache License 2.0 |
dc.rights.uri |
https://opensource.org/licenses/Apache-2.0 |
dc.rights.label |
PUB |
dc.source.uri |
https://github.com/clarinsi/Slovene_punctuator |
dc.subject |
punctuation |
dc.subject |
capitalisation |
dc.subject |
NeMo |
dc.subject |
service |
dc.title |
NeMo Punctuation and Capitalisation service RSDO-DS2-P&C-API 1.0 |
dc.type |
toolService |
metashare.ResourceInfo#ContentInfo.detailedType |
service |
metashare.ResourceInfo#ResourceComponentType#ToolServiceInfo.languageDependent |
false |
has.files |
yes |
branding |
CLARIN.SI data & tools |
contact.person |
Iztok Lebar Bajec ilb@fri.uni-lj.si Faculty of Computer and Information Science, University of Ljubljana |
sponsor |
Ministry of Culture C3340-20-278001 Development of Slovene in a Digital Environment Other |
files.count |
1 |
files.size |
40960 |
Datoteke v tem vnosu
To je vnos
Publicly Available
z licenco:
Apache License 2.0
- Ime
- Slovene_punctuator-1.0.tar
- Velikost
- 40
KB
- Format
- Neznano
- Opis
- RSDO DS2 P&C API 1.0
- MD5
- a4bf32082c16f2a7bc06e57bf4babb38
Prenesi datoteko
Prikaži enostavni zapis vnosa