| HOME
INSTITUTIONS/ASSOCIATIONS
PROJECTS
CORPORA
DICTIONARIES
SPEECH
CONFERENCES
GLOSSARY
MISCELLANEOUS
|
Language technologies
tools
Tools for Croatian
Other tools
Text editors and converters
- Emacs (fully programmable
text editor)
- 2XML (tool
for conversion of HTML/RTF texts into XML)
- Softleks (highly
specialized text editor for dictionary compiling/writing)
Language resources markup
- SGML
(Standard Generalized Markup Language)
- XML (Extended
Markup Language)
- TEI (Text
Encoding Initiative: home page)
- TEI recommendations
for corpus annotation
- TEI Pizza
Chef (on-line definition of DTDs)
- CES (Corpus
Encoding Standard, SMGL)
- XCES (Corpus
Encoding Standard, XML)
- TMX
format specification
- XT (SGML/XML
parser and validator by James Clark)
- O'Reilly XML.com
Word lists, frequency dictionaries, concordances,
text statistics and analysis
Taggers
- TNT
(trigram tagger)
- WinBrill
(rule-based tagger for Windows)
- QTAG (language
independent tagger)
- Czech
Morphology (Johns Hopkins University)
- CLAWS (tagger used
for tagging the British National Corpus)
Syntactic parsers
Tree-banks
Semantic networks
Aligners
Machine (aided) translation
Developers environments
|