Skip to main content

Danish Dynaword

The Danish dynaword is a collection of Danish free-form text datasets from various domains. All of the datasets in Danish Dynaword are openly licensed and deemed permissible for training large language models.

Danish Dynaword is continually developed, which means that the dataset will actively be updated as new datasets become available. The authors welcome contributions to the dataset, including new sources, improved data filtering, and other enhancements. Please consult the contribution guidelines beforehand.

Please note that the license varies from dataset to dataset in the ressource and we advice users to inform themselves about the license on the specific datasets they intend to use.

Data og ressourcer

Nøgleord

Yderligere info

URI https://data.gov.dk/dataset/lang/956719a6-e440-4d1c-908e-1144ee6e277e
Destinationsside https://huggingface.co/datasets/danish-foundation-models/danish-dynaword
Høstes af Datavejviser Nej
Udgivelsesdato 04-08-2025
Seneste ændringsdato 05-08-2025
Opdateringsfrekvens kontinuerlig
Dækningsperiode  / 
Emne(r)
  • Regeringen og den offentlige sektor
  • Uddannelse, kultur og sport
Adgangsrettigheder offentlig
Overholder
Proveniensudsagn
Dokumentation