-
Dansk Wikisource
Maskinlæsbar version af dumps fra den danske wikipedia kilder. Se https://foundation.wikimedia.org/wiki/Terms_of_Use -
Dansk Wikiquote
Maskinlæsbar version af dumps fra den danske wikipedias citater. Se https://foundation.wikimedia.org/wiki/Terms_of_Use -
Dansk Wikipedia
Maskinlæsbar version af dumps fra den danske wikipedia. Se https://foundation.wikimedia.org/wiki/Terms_of_Use, da der kan forekommer forskellige licensvilkår afhængigt af... -
DanPASS-korpus (Danish Phonetically Annotated Spontaneous Speech)
The DanPASS corpus was developed for research and applied research purposes. It consists of of non-scripted monologues and dialogues, recorded by 27 speakers, comprising a total... -
Kommunal semantisk grundmodel 2
Kommunal Semantisk Grundmodel nr. 2 er en semantisk søgemodel der en finjusteret version af den Kommunale grundmodel nr.1 til at klassificere et givet KL-område baseret på en... -
Compilation of Danish-English parallel corpora resources used for training...
Dette tosproget korpora er bygget af en række forskellige korpusser fra udvalgte offentlige og private korpus og er blevet brugt til at træne NTEU (Neural Translation for the... -
Danish Dependency Treebank (DaNE)
DaNE adds NER annotations to the The Danish Universal Dependencies Treebank (UD-DDT). The Danish UD treebank (Johannsen et al., 2015, UD-DDT) is a conversion of the Danish... -
Bidirectional Long-Short Term Memory tagger
A toolkit for Part-of-Speech tagging and NER in DyNet. It has been tested on Danish, amongst other languages (for the UD POS tags in the UD_Danish-DDT version 1.1 and 2.3)... -
Bornholmsk (NLP tools / data for Bornholmsk)
Language processing resources and tools for Bornholmsk, a language spoken on the island of Bornholm, with roots in Danish and closely related to Scanian. Includes corpora, word... -
Danish Universal Dependencies DDT (UD_Danish-DDT)
The Danish Universal Dependencies treebank (Johannsen et al., 2015, UD-DDT) is a conversion of the Danish Dependency Treebank (Buch-Kromann et al. 2003) based on texts from... -
Danish Summarisation
Danish Summarisation er en model til automatisk opsummering af tekst (automatic abstrasctive text summarisation). Modellen er domæne specifik for danske nyhedsartikler. Modellen... -
Danish Named Entity Recognition data on top of the Danish Universal...
This resource is an annotation of four NER types (PER, ORG, LOC, MISC) on top of the UD_Danish-DDT data. Status: published and freely available since summer 2019 Reference:... -
Danish BERT
BERT (Bidirectional Encoder Representations from Transformers) is a deep neural network model used in Natural Language Processing. The network learns the grammar and semantics... -
CST's tokeniserings- og segmenteringsprogram
CST's tokeniserings- og segmenteringsprogram til tekst- og RTF-filer. Opdeler en tekst i ord og ordforbindelser -
CST STO
The STO (SprogTeknologisk Ordbase) lexicon is a comprehensive computational lexicon of Danish developed for NLP/HLT applications. The syntax layer of the lexicon, presented here... -
COVID-19 EUR-LEX dataset. Bilingual (EN-DA)
Bilingual (EN-DA) corpus acquired from website (https://eur-lex.europa.eu/legal-content) of the EU portal (9th July 2020). Contains 21238 translations units (DA-EN) -
COVID-19 EUROPARL dataset v2. Bilingual (EN-DA)
Bilingual (EN-DA) corpus acquired from the website (https://www.europarl.europa.eu/) of the European Parliament (9th May 2020). Contains 633 translation units (DA-EN). -
COVID-19 EU presscorner v2 dataset. Bilingual (EN-DA)
Bilingual (EN-DA) corpus acquired from website (https://ec.europa.eu/commission/presscorner/) of the EU portal (8th July 2020). Contains 6261 translation units (DA-EN). -
COVID-19 EC-EUROPA v1 dataset. Bilingual (EN-DA)
Bilingual (EN-DA) corpus acquired from website (https://ec.europa.eu/*coronavirus-response) of the EU portal (20th May 2020). Contains 2803 translation units (DA-EN). -
COVID-19 ANTIBIOTIC dataset. Bilingual (EN-DA)
This dataset has been generated out of public content available through the portal (https://antibiotic.ecdc.europa.eu/) of the European Centre for Disease Prevention and Control...
Du kan også tilgå dette register med API (se API-dokumenter).