Corpus of metre

Corpus of Castilian sonnets from the Golden Age (16th and 17th centuries), annotated with metrical information.

Poets

In order to develop a representative corpus, every poet from the 16th and 17th centuries with at least 10 digitized sonnets has been selected. The corpus currently contains more than 5,000 sonnets (more than 71,000 lines). It includes authors such as:

Agustin de Salazar y Torres, Antonio Enríquez Gómez, Bartolomé Leonardo de Argensola, Lupercio Leonardo de Argensola, Bernardino de Rebolledo, Bocángel y Unzueta, Miguel de Cervantes, Diego Hurtado de Mendoza, Diego Ximenez Ayllon, Fernando de Herrera, Francisco de Aldana, Francisco de Borja y Aragón, Francisco de la Torre, Francisco de Medrano, Garcilaso de la Vega, Luis de Góngora, Gutierre de Cetina, Hernando de Acuña, Joseph de Litala y Castelví, Juana Inés de la Cruz, Juan Boscán, Juan de Almeida, Juan de Arguijo, Juan de Tassis y Peralta, Juan de Timoneda, Lope de Vega, Lope de Zárate, López de Mendoza, Luis Carrillo y Sotomayor, Luis de Ulloa y Pereira, Antonio Mira de Amescúa, Pedro de Padilla, Pedro Espinosa, Pedro Soto de Rojas, Francisco de Quevedo, Tirso de Molina, Trillo y Figueroa, Baltasar de Alcázar, Fray Luis de León, Jerónimo Cáncer y Velasco, Francisco de Figueroa, Antonio Hurtado de Mendoza, Juan de Jauregui, Luis Martín de la Plaza, Gregorio de Matos, Anastasio Pantaleón de Ribera, Jacinto Polo de Medina, Diego Ramírez Pagán, Andrés Rey de Artieda, Juan de Salinas, Esteban Manuel de Villegas, Cristóbal de Virués, among others. See the complete list of poets on the download page.

All texts have been extracted from the Miguel de Cervantes Virtual Library.

Markup

Each sonnet is marked up with XML tags following the TEI standard. Besides the TEI header and structural information, a formal representation of the metre of each line is included as a metrical pattern. The pattern is the sequence of strong and weak syllables in the line, written with the symbols + and - respectively. For example:

    <l n="1" met="---+---+-+-">Cuando me paro a contemplar mi estado,</l>
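
As an illustration of how such a line could be read programmatically, here is a minimal Python sketch. It is not part of the corpus tooling: the function names are hypothetical, and it parses the bare fragment shown above. Complete TEI files normally declare the TEI namespace, so element lookups there would need to take that namespace into account.

```python
import xml.etree.ElementTree as ET

# The example line from the text above, used as a standalone XML fragment.
SAMPLE = '<l n="1" met="---+---+-+-">Cuando me paro a contemplar mi estado,</l>'


def stressed_positions(pattern):
    """Return the 1-based positions of strong syllables ('+') in a pattern."""
    return [i for i, symbol in enumerate(pattern, start=1) if symbol == "+"]


def describe_line(xml_fragment):
    """Parse one <l> element and summarize its metrical annotation."""
    line = ET.fromstring(xml_fragment)
    pattern = line.get("met", "")
    return {
        "n": line.get("n"),
        "text": (line.text or "").strip(),
        "pattern": pattern,
        "syllables": len(pattern),  # one symbol per metrical syllable
        "stressed": stressed_positions(pattern),
    }


info = describe_line(SAMPLE)
print(info["syllables"], info["stressed"])
```

For the example line this gives 11 metrical syllables with strong syllables in positions 4, 8 and 10.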

Annotation

The metrical patterns have been annotated following a semi-automatic process. First, all the sonnets were annotated with an automatic scansion system for Spanish verse, which assigns a metrical pattern to each line. The annotation is currently being manually validated. The TEI header of each sonnet declares whether its annotation has been manually validated.

The annotators of the ADSO corpus are María Ribes Lafoz, Mª Ángeles Herrero Herrero, Noelia Sánchez, Sara Trigueros and Borja Navarro Colorado.

For details, see the annotation guide (in Spanish) or our publications.

Cite the corpus

Please use the following reference when citing the corpus in academic publications or conference papers. Thanks.

Licence

The metrical annotation is licensed under the Creative Commons Attribution-NonCommercial 4.0 International licence.