What's New

 corpus 
corpus
Description:
The Greek web corpus MaCoCu-el 1.0 was built by crawling the ".gr", ".ελ", ".cy" and ".eu" internet top-level domains in 2023, extending the crawl dynamically to other domains as well. The crawler is available at ...
 This item contains 2 files (16.23 GB).
 
Publicly Available
 corpus 
corpus
Description:
The Catalan web corpus MaCoCu-ca 1.0 was built by crawling the ".cat", ".es", ".ad", ".fr", ".it" and ".eu" internet top-level domains in 2022, extending the crawl dynamically to other domains as well. The crawler is ...
 This item contains 2 files (4.72 GB).
 
Publicly Available
 corpus 
corpus
Description:
The Ukrainian web corpus MaCoCu-uk 1.0 was built by crawling the ".ua" and ".укр" internet top-level domains in 2022, extending the crawl dynamically to other domains as well. The crawler is available at https://github.c ...
 This item contains 2 files (24.58 GB).
 
Publicly Available

Most Viewed Items

Top Last Week
 corpus 
corpus
Description:
The Catalan web corpus MaCoCu-ca 1.0 was built by crawling the ".cat", ".es", ".ad", ".fr", ".it" and ".eu" internet top-level domains in 2022, extending the crawl dynamically to other domains as well. The crawler is ...
 This item contains 2 files (4.72 GB).
 
Publicly Available