Databases

Arabic

- Semitischen Tonarchiv - The Semitic Languages Audio Archive, Heidelberg University

 

Catalan

- Computerized repertoire of ancient Catalan literature: http://www.rialc.unina.it/

  

French

-    http://www.projet-pfc.net/ressources-linguistiques/moteuref.html, with regional differences

-    http://www.projet-pfc.net/ressources-linguistiques/corpusthematique.html thematic corpus about conversational routines

 

Japan

-   Aozora Bunko, copyright free collection of verbatim/complete works: http://www.aozora.gr.jp/

-   Japanese Text Initiative (University of Virginia): http://etext.lib.virginia.edu/japanese/texts.euc.html

-   contemporary music databases: http://www.utamap.com/

 

English

-    The Internet Archive, digital library

-    Project Gutenberg: http://promo.net/pg/

-    International Computer Archive of Modern and Medieval Englishhttp://icame.uib.no/

-    The University of Oxford Text Archivehttp://ota.ahds.ac.uk/

 

Italian

-    The Corpus of Ancient Italian of the Work of the Italian Vocabulary includes about 22 million words from Vulgar texts before 1375: http://www.ovi.cnr.it/index.php?page=banchedati

 

Dutch

-  the website of the Dutch lexicology Institute, in which it is possible to find links to all free online databases and vocabularies http://www.inl.nl/

- website for general information about language, containing several useful links:http://taalunieversum.org/

- online version of Groete Boekje - dutch word list: http://woordenlijst.org/

 

Spanish

-   Archivo de Filología Española

-   Grupo de Estructuras de Datos y Lingüística Computacional, Departamento de Informática y Sistemas de la Universidad de Las Palmas de Gran Canaria