CST STO

The STO (SprogTeknologisk Ordbase) lexicon is a comprehensive computational lexicon of Danish developed for NLP/HLT applications. The syntax layer of the lexicon, presented here in Lexical Markup Format (LMF), contains a vocabulary of 84,159 entries (nouns, verbs and adjectives). The syntax is linked to the morphological layer through ID's. The Lexical Markup Language is an internationally wellknown and accepted XML format and the ISO standard for Natural Language Processing (NLP) lexicons. See www.lexicalmarkupframework.org for more information on LMF and the attached documentation for the marke-up of STO.

Creators: Pedersen, Bolette Sandford; Olsen, Sussi; Nimb, Sanni; Hansen, Dorte Haltrup

License: http://creativecommons.org/licenses/by-sa/4.0/

Data og Distribution(er)

Yderligere info

Felt Værdi
Destinationsside https://cst.ku.dk/sto_ordbase/
Metadata sidst opdateret September 9, 2020, 08:25 (UTC)
Metadata oprettet Maj 13, 2020, 15:25 (UTC)
Emne Sprog og retskrivning Uddannelse, kultur og sport
GUID https://data.gov.dk/dataset/lang/4bed1a5b-f66e-433a-918b-9755c8ecb096
Kontaktemail info@clarin.dk
Kontaktnavn CLARIN-DK, Centre for Language Technology, NorS, University of Copenhagen
Sprog engelsk
URI https://data.gov.dk/dataset/lang/4bed1a5b-f66e-433a-918b-9755c8ecb096
Udgivelsesdato 2013
Udgivernavn CST, KU
Type Leksikalske ressourcer
Dokumentation
Kildedatasæt