Skip to main content

CST STO

The STO (SprogTeknologisk Ordbase) lexicon is a comprehensive computational lexicon of Danish developed for NLP/HLT applications. The syntax layer of the lexicon, presented here in Lexical Markup Format (LMF), contains a vocabulary of 84,159 entries (nouns, verbs and adjectives). The syntax is linked to the morphological layer through ID's. The Lexical Markup Language is an internationally wellknown and accepted XML format and the ISO standard for Natural Language Processing (NLP) lexicons. See www.lexicalmarkupframework.org for more information on LMF and the attached documentation for the marke-up of STO.

Data og ressourcer

Nøgleord

Yderligere info

URI https://data.gov.dk/dataset/lang/4bed1a5b-f66e-433a-918b-9755c8ecb096
Destinationsside https://cst.ku.dk/sto_ordbase/
Høstes af Datavejviser
Udgivelsesdato 01-01-2013
Seneste ændringsdato
Opdateringsfrekvens
Dækningsperiode  / 
Emne(r)
  • 16.05.07 Sprog og retskrivning
  • Uddannelse, kultur og sport
Adgangsrettigheder offentlig
Overholder
Proveniensudsagn

Links til kildedatasæt: https://repository.clarin.dk/repository/xmlui/handle/20.500.12115/22 https://repository.clarin.dk/repository/xmlui/handle/20.500.12115/23 https://repository.clarin.dk/repository/xmlui/handle/20.500.12115/26 https://repository.clarin.dk/repository/xmlui/handle/20.500.12115/21

Dokumentation