The STO (SprogTeknologisk Ordbase) lexicon is a comprehensive computational lexicon of Danish developed for NLP/HLT applications. The syntax layer of the lexicon, presented here in Lexical Markup Format (LMF), contains a vocabulary of 84,159 entries (nouns, verbs and adjectives). The syntax is linked to the morphological layer through ID's.
The Lexical Markup Language is an internationally wellknown and accepted XML format and the ISO standard for Natural Language Processing (NLP) lexicons. See www.lexicalmarkupframework.org for more information on LMF and the attached documentation for the marke-up of STO.
Creators: Pedersen, Bolette Sandford; Olsen, Sussi; Nimb, Sanni; Hansen, Dorte Haltrup
License: http://creativecommons.org/licenses/by-sa/4.0/