Institute of Linguistics
Faculty of Philosophy
University of Zagreb

Croatian National Corpus


Corpora are the basic form of language resource and are therefore an indispensable foundation for language technology research on any natural language.

Croatian corpora


WWW as corpus


Available corpora of other languages

Bosnian

Bulgarian

Czech

Danish

Dutch

English

Estonian

Ethiopian languages

French

Gaelic languages

German

Greek

Finnish

Hebrew

Hungarian

Indian languages

Italian

Lithuanian

Malayan

Norwegian

Polish

  • PELCRA (Polish and English Language Corpora for Research and Applications)
  • IPI PAN (Corpus of Polish)

Portuguese

Romanian

  • MULTEXT-East: multilingual text tools and corpora for Central and Eastern European Languages

Russian

Serbian

Slovak

Slovenian

Sorbian

Spanish

Swedish

Turkish

Other corpora


Other lists of corpora