What's New

 corpus 
corpus
Description:
The Greek web corpus MaCoCu-el 1.0 was built by crawling the ".gr", ".ελ", ".cy" and ".eu" internet top-level domains in 2023, extending the crawl dynamically to other domains as well. The crawler is available at ...
 Ta vnos vsebuje 2 datotek(e) (16.23 GB).
 
Publicly Available
 corpus 
corpus
Description:
The Catalan web corpus MaCoCu-ca 1.0 was built by crawling the ".cat", ".es", ".ad", ".fr", ".it" and ".eu" internet top-level domains in 2022, extending the crawl dynamically to other domains as well. The crawler is ...
 Ta vnos vsebuje 2 datotek(e) (4.72 GB).
 
Publicly Available
 corpus 
corpus
Description:
The Ukrainian web corpus MaCoCu-uk 1.0 was built by crawling the ".ua" and ".укр" internet top-level domains in 2022, extending the crawl dynamically to other domains as well. The crawler is available at https://github.c ...
 Ta vnos vsebuje 2 datotek(e) (24.58 GB).
 
Publicly Available

Največ ogledov

V preteklem tednu
 corpus 
corpus
Description:
The Catalan web corpus MaCoCu-ca 1.0 was built by crawling the ".cat", ".es", ".ad", ".fr", ".it" and ".eu" internet top-level domains in 2022, extending the crawl dynamically to other domains as well. The crawler is ...
 Ta vnos vsebuje 2 datotek(e) (4.72 GB).
 
Publicly Available