| dc.contributor.author | Cano González, Pedro |
| dc.contributor.author | Carreras Riudavets, Francisco Javier |
| dc.contributor.author | Hernández Marrero, Mar |
| dc.contributor.author | Hernández Figueroa, Zenón José |
| dc.date.accessioned | 2025-11-21T11:32:16Z |
| dc.date.available | 2025-11-21T11:32:16Z |
| dc.date.issued | 2025-11-20 |
| dc.identifier.uri | http://hdl.handle.net/11356/2063 |
| dc.description | The ParlaMint-ES-CN corpus is the contribution of the Parliament of the Canary Islands (Parlamento de Canarias) to the ParlaMint collection of comparable parliamentary corpora (https://www.clarin.eu/parlamint). It contains transcriptions of parliamentary debates produced between 1991 and 2021, covering thirty years of legislative activity in the autonomous community of the Canary Islands. The corpus is encoded following the official ParlaMint TEI/XML guidelines and schema specifications. The transcriptions are organised by day and include detailed metadata on each parliamentary sitting, such as the legislative term, session and meeting. Speeches are marked with information about the speaker and their role in the debate. The corpus also includes transcriber-supplied comments, such as interruptions, lapses, or procedural notes. The dataset provides extensive metadata on speakers, including their full names and institutional roles within the Parliament of the Canary Islands. Additional metadata on parties, parliamentary groups and political affiliations are included where available. As in other ParlaMint corpora, the texts are divided into subcorpora according to the ParlaMint scheme. The corpus is distributed in two variants: - The canonical TEI/XML version, containing the full transcriptions and metadata. - The linguistically annotated version (.ana), which includes tokenization, lemmatisation, Universal Dependencies part-of-speech tags, morphological features, syntactic dependencies, and named entities. The ParlaMint-ES-CN corpus offers a comprehensive diachronic resource for the study of political discourse in the Canary Islands, enabling comparative research with other European and regional parliaments within the ParlaMint framework. |
| dc.language.iso | spa |
| dc.publisher | Instituto Universitario de Análisis y Aplicaciones Textuales |
| dc.rights | Creative Commons - Attribution 4.0 International (CC BY 4.0) |
| dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ |
| dc.rights.label | PUB |
| dc.subject | TEI |
| dc.subject | parliamentary debates |
| dc.subject | COVID-19 |
| dc.subject | Parla-CLARIN |
| dc.subject | Canary Islands Parliament |
| dc.title | Comparable corpus of parliamentary debates ParlaMint-ES-CN 1.0 |
| dc.type | corpus |
| metashare.ResourceInfo#ContentInfo.mediaType | text |
| has.files | yes |
| branding | CLARIN.SI data & tools |
| contact.person | Pedro Jesús Cano González pedro.cano@ulpgc.es Instituto Universitario de Análisis y Aplicaciones Textuales |
| contact.person | Francisco Javier Carreras Riudavets secretaria@ulpgc.es Instituto Universitario de Análisis y Aplicaciones Textuales |
| sponsor | Ministry of Science and Innovation of Spain - Adaptación de la tecnología InteLiText a ParlaMint nationalFunds |
| size.info | 39820063 words |
| size.info | 125192 utterances |
| size.info | 47051488 tokens |
| files.count | 2 |
| files.size | 2254917464 |
| featuredService.noske | search|https://www.clarin.si/ske/#dashboard?corpname=parlamint10_es_cn |
Files in this item
This item is
Creative Commons - Attribution 4.0 International (CC BY 4.0)
Publicly Available
and licensed under:Creative Commons - Attribution 4.0 International (CC BY 4.0)
- Name
- ParlaMint-ES-CN.tgz
- Size
- 161.89 MB
- Format
- Unknown
- Description
- "Plain text" corpus
- MD5
- e0b11cab0fa79012b816f2b7f4c0806b
- Name
- ParlaMint-ES-CN.ana.tgz
- Size
- 1.94 GB
- Format
- Unknown
- Description
- Linguistically annotated corpus
- MD5
- bf2b8a4825be469208a22ce016d6a5a8