Corpora and dataset

The guide for consulting the corpora of the project can be downloaded here.

The corpora are accessible at this link and must be cited as follows:

GRANDI, Nicola, BALLARÈ, Silvia, CHIUSAROLI, Francesca, GALLINA, Francesca, PASCOLI, Matteo, PISTOLESI, Elena; Corpus Univers-ITA. 2023, DOI: https://doi.org/10.60760/unibo/univers-ita

GRANDI, Nicola, BALLARÈ, Silvia, CHIUSAROLI, Francesca, GALLINA, Francesca, PASCOLI, Matteo, PISTOLESI, Elena; Corpus Univers-ITA-ProUniv. 2023, DOI: https://doi.org/10.60760/unibo/univers-ita-prouniv

GRANDI, Nicola, BALLARÈ, Silvia, CHIUSAROLI, Francesca, GALLINA, Francesca, PASCOLI, Matteo, PISTOLESI, Elena; Corpus Univers-ITA-ProGior. 2023, DOI: https://doi.org/10.60760/unibo/univers-ita-progior

Attention:

Due to technical issues, the corpus does not currently distinguish between words ending in an accented vowel and words ending in a vowel followed by an apostrophe. This means that, for example, a form that appears in the original text as po’ and the same form written with a final accented vowel are standardized to a single form. We are working on resolving the issue.