Croatian National Corpus (HNK)
project of the Ministry of Science and Technology of the Republic of Croatia 130718, Computational processing of Croatian language, actually started at the end of 1998
theoretical foundations in 1995, published:
- Tadic (1996) Racunalna obradba hrvatskoga i nacionalni korpus, Suvremena lingvistika 41-42, 603-612
- Tadic (1998) Raspon, opseg i sastav korpusa suvremenoga hrvatskoga jezika, Filologija 30-31, 337-347
need for the reference corpus of Croatian (syn- and diachronical)
- 1st step: written text
- later: some 10% spoken text
a tentative solution for its composition
the size, time-span and structure was elaborated
accessibility via WWW service was suggested