| dc.contributor.author | Cicola, Ilaria |
| dc.contributor.author | Pannitto, Ludovica |
| dc.contributor.author | Peta, Ines |
| dc.contributor.author | Fontana, Chiara |
| dc.contributor.author | Norozi, Nahid |
| dc.contributor.author | Demichelis, Marco |
| dc.contributor.author | Aiello, Giulia |
| dc.date.accessioned | 2026-04-17T09:25:37Z |
| dc.date.available | 2026-04-17T09:25:37Z |
| dc.date.issued | 2026-03-12 |
| dc.identifier.uri | http://hdl.handle.net/11356/2097 |
| dc.description | The Disasters corpus in classical Arabic sources (DiCCAS) is designed to allow historians to compare different accounts and narratives of disasters across a variety of classical Arabic sources. The corpus encompasses a diverse range of materials, including the Qur’an and the ḥadīth collections Ṣaḥīḥ al-Bukhārī and Ṣaḥīḥ Muslim, as well as several significant historical works, such as al-Ṭabarī’s Kitāb Tārīkh al-rusul wa-l-mulūk and Ibn Taghrībirdī’s Kitāb al-Nujūm al-zāhira fī mulūk Miṣr wa-l-Qāhira. The corpus also incorporates adab texts by al-Jāḥiẓ, notably his Rasāʾil, and Ibn al-Jawzī’s al-Mudhish. DiCCAS is encoded following the Text Encoding Initiative (TEI) Guidelines; the encoding captures the structure of the corpus and marks disaster-related words. The corpus is also available in vertical format, which adds linguistic annotation, i.e. tokenisation, lemmatisation and PoS tagging. |
| dc.language.iso | ara |
| dc.publisher | University of Bologna |
| dc.relation.isreferencedby | https://doi.org/10.5617/jais.12791 |
| dc.rights | Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) |
| dc.rights.uri | https://creativecommons.org/licenses/by-nc-sa/4.0/ |
| dc.rights.label | PUB |
| dc.source.uri | https://github.com/LaboratorioSperimentale/DiCCAS |
| dc.subject | Arabic history |
| dc.subject | Arabic text encoding |
| dc.subject | Islamicate history |
| dc.subject | environmental challenges |
| dc.subject | premodern Eurasia |
| dc.title | Disasters corpus in classical Arabic sources DiCCAS |
| dc.type | corpus |
| metashare.ResourceInfo#ContentInfo.mediaType | text |
| has.files | yes |
| branding | CLARIN.SI data & tools |
| contact.person | Ilaria Cicola ilaria.cicola@unibo.it University of Bologna |
| contact.person | Ludovica Pannitto ludovica.pannitto@unibo.it University of Bologna |
| contact.person | Ines Peta ines.peta@unibo.it University of Bologna |
| sponsor | NextGenerationEU - PRIN 2022 20225NNWWE_002 Environmental Anomalies & Political Legitimacy in Global Eurasia, 12th-14th century euFunds |
| size.info | 10 texts |
| size.info | 289480 tokens |
| files.count | 7 |
| files.size | 21991150 |
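
The description states that the vertical file adds tokenisation, lemmatisation and PoS tagging. The following is a minimal sketch of how such a file could be read, assuming the usual vertical layout of one token per line with tab-separated attributes and structure tags on their own lines. The column order (`word`, `lemma`, `pos`) and the sample tokens are assumptions made for illustration only; the actual columns and tagset should be checked against the README.

```python
from collections import Counter

def read_vertical(lines, columns=("word", "lemma", "pos")):
    """Yield one dict per token line, skipping structure tags.

    The column names are an assumption, not taken from the DiCCAS files.
    """
    for raw in lines:
        line = raw.rstrip("\n")
        if not line.strip():
            continue  # ignore blank lines
        if line.startswith("<") and line.endswith(">"):
            continue  # structure line such as <doc ...>, <s> or </s>
        fields = line.split("\t")
        yield dict(zip(columns, fields))

# Placeholder sample, not real corpus content.
SAMPLE = """<doc id="example">
<s>
w1\tl1\tT1
w2\tl2\tT2
w3\tl1\tT1
</s>
</doc>
"""

tokens = list(read_vertical(SAMPLE.splitlines()))
lemma_counts = Counter(token["lemma"] for token in tokens)
```

To read the downloaded file instead of the sample, one could pass an open file handle, for example `read_vertical(open("DiCCAS_vert.vert", encoding="utf-8"))`.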
Files in this item
This item is Publicly Available and licensed under: Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
| Name | Size | Format | Description | MD5 |
| --- | --- | --- | --- | --- |
| DiCCAS_README.md | 1.58 KB | Unknown | README file with description, pipeline, HTML generation and tagset information | 9782f36e053f8b7e729d863f18a48318 |
| DiCCAS-annotation.md | 5.13 KB | Unknown | Description of TEI XML annotation | 02821c821b1a2cc972c88bb7ca37bf38 |
| diccas_tei.rng | 10.7 KB | Unknown | TEI Relax NG schema for DiCCAS | 7db7156f484533baeab739bbe2121c3b |
| DiCCAS.tei.xml | 3.45 MB | XML | DiCCAS in TEI XML | a52cc9c62a7ba66c3c18c53a07449a33 |
| diccas.xsl | 20.17 KB | Unknown | XSLT script to convert DiCCAS TEI to HTML | 20142bbdff8d01b4c0a1266602ac9ff6 |
| DiCCAS_html.html | 3.45 MB | HTML | DiCCAS in derived HTML | b27ef4b15017d8655c470b2bb38755d2 |
| DiCCAS_vert.vert | 14.03 MB | Unknown | Vertical file for NoSketch Engine | 4b538ba1eacd541d7db360ea6c524be0 |
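
The MD5 values listed for each file above can be used to check that a download is complete and unmodified. The following is a minimal sketch using Python's standard `hashlib` module; the helper names `md5_of` and `verify` and the directory layout (all files saved in one folder under their listed names) are assumptions for illustration.

```python
import hashlib
from pathlib import Path

# MD5 values copied from the file listing of this item.
EXPECTED_MD5 = {
    "DiCCAS_README.md": "9782f36e053f8b7e729d863f18a48318",
    "DiCCAS-annotation.md": "02821c821b1a2cc972c88bb7ca37bf38",
    "diccas_tei.rng": "7db7156f484533baeab739bbe2121c3b",
    "DiCCAS.tei.xml": "a52cc9c62a7ba66c3c18c53a07449a33",
    "diccas.xsl": "20142bbdff8d01b4c0a1266602ac9ff6",
    "DiCCAS_html.html": "b27ef4b15017d8655c470b2bb38755d2",
    "DiCCAS_vert.vert": "4b538ba1eacd541d7db360ea6c524be0",
}

def md5_of(path, chunk_size=1 << 20):
    """Return the hexadecimal MD5 digest of a file, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()

def verify(directory, expected=EXPECTED_MD5):
    """Map each expected file name to 'ok', 'mismatch' or 'missing'."""
    results = {}
    for name, want in expected.items():
        path = Path(directory) / name
        if not path.is_file():
            results[name] = "missing"
        elif md5_of(path) == want:
            results[name] = "ok"
        else:
            results[name] = "mismatch"
    return results
```

Calling `verify("path/to/downloads")` returns a dictionary keyed by file name, so any entry that is not `"ok"` points to a file worth downloading again.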