Datasets - sprogteknologi.dk

Bilingual corpus made out of PDF documents from the European Medicines...

EN-DA bilingual corpus built from PDF documents from the European Medicines Agency (EMEA), https://www.ema.europa.eu (February 2020). Attribution details: This dataset has...

TMX

Bilingual English-Danish parallel corpus from the official Nordic cooperation website

Contents of the Nordic Co-operation website, http://www.norden.org, downloaded and converted into a parallel corpus. This dataset has been created within the framework of the...

TMX

Bilingual English-Danish parallel corpus from VisitDenmark - The official...

Contents of https://www.visitdenmark.dk were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the...

TMX

Bilingual English-Danish parallel corpus from Visit Vejle website

Contents of https://www.visitvejle.com were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the...

TMX

Bilingual English-Danish parallel corpus from The Viking Ship Museum website

Contents of https://www.vikingeskibsmuseet.dk were crawled, aligned on document and sentence level and converted into a parallel corpus. Contains 12403 translation units (EN-...

TMX

Bilingual English-Danish parallel corpus from The Geological Survey of...

Contents of http://www.geus.dk/ were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the framework of...

TMX

Bilingual English-Danish parallel corpus from The Danish Nature Agency website

Contents of https://naturstyrelsen.dk/ were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the...

TMX

Bilingual English-Danish parallel corpus from The Danish Medicines Agency

Contents of https://laegemiddelstyrelsen.dk were crawled, aligned on document and sentence level and converted into a parallel corpus. Contains 22699 translation units between...

TMX

Bilingual English-Danish parallel corpus from The Danish Gambling Authority website

Contents of https://spillemyndigheden.dk/ were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the...

TMX

Bilingual English-Danish parallel corpus from The Danish Environmental...

Contents of https://eng.mst.dk/ and https://mst.dk/ were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created...

TMX

Bilingual English-Danish parallel corpus from The Agency for Culture and...

Contents of https://slks.dk were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the framework of the...

TMX

Bilingual English-Danish parallel corpus from Odense Municipality website

Contents of https://www.odense.dk/ were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the framework...

TMX

Bilingual English-Danish parallel corpus from National Museum of Denmark website

Contents of https://natmus.dk/ were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the framework of...

TMX

Bilingual English-Danish parallel corpus from Denmark National Space...

Contents of https://www.vikingeskibsmuseet.dk were crawled, aligned on document and sentence level and converted into a parallel corpus. Contains 1939 translation units (EN-DA)....

TMX

Bilingual English-Danish parallel corpus from Danmarks Statistik website

Contents of https://www.dst.dk were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the framework of...

TMX

Bilingual English-Danish parallel corpus from Danish Ministry of Foreign...

Contents of http://um.dk were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the framework of the...

TMX

Bilingual English-Danish parallel corpus from Danish Ministry of Finance website

Contents of https://uk.fm.dk/ were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the framework of...

TMX

Bilingual English-Danish parallel corpus from Danish Maritime Authority website

Contents of https://www.dma.dk were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the framework of...

TMX

Bilingual English-Danish parallel corpus from Aarhus 2017 - European Capital...

Contents of http://www.aarhus2017.dk were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the...

TMX

Bilingual Danish-English parallel corpus from the State Audit Office...

Contents of the http://rigsrevisionen.dk/ website were downloaded, aligned and converted into a parallel corpus. This dataset has been created within the framework of the European Language...

TMX

214 language resources found
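
All of the corpora listed above are distributed as TMX, an XML format in which each translation unit (`tu`) holds one variant (`tuv`) per language, with the text in a `seg` element. As a rough sketch of how the English-Danish sentence pairs might be read from such a file, the following uses only the Python standard library. The function name `read_tmx_pairs`, the language codes `en` and `da`, and the sample data are assumptions made for illustration; the actual files may use other language tags or carry extra metadata.

```python
import io
import xml.etree.ElementTree as ET

# ElementTree exposes the xml:lang attribute under the XML namespace URI.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"


def read_tmx_pairs(source, src_lang="en", tgt_lang="da"):
    """Return (source, target) sentence pairs from a TMX file or file object.

    Hypothetical helper: language tags are lowercased and reduced to their
    primary subtag, so "EN", "en-GB" and "en" are treated alike.
    """
    pairs = []
    # iterparse streams the file, which helps with large corpora.
    for _, elem in ET.iterparse(source, events=("end",)):
        if elem.tag != "tu":
            continue
        segs = {}
        for tuv in elem.iter("tuv"):
            # Older TMX versions use a plain "lang" attribute.
            raw = tuv.get(XML_LANG) or tuv.get("lang") or ""
            lang = raw.lower().split("-")[0]
            seg = tuv.find("seg")
            if lang and seg is not None:
                # itertext() also collects text inside inline markup.
                segs[lang] = "".join(seg.itertext()).strip()
        if src_lang in segs and tgt_lang in segs:
            pairs.append((segs[src_lang], segs[tgt_lang]))
        elem.clear()  # release the processed unit's children
    return pairs


# Invented sample data, not taken from any of the listed corpora.
SAMPLE_TMX = """<tmx version="1.4">
  <header srclang="en" datatype="plaintext" segtype="sentence"/>
  <body>
    <tu>
      <tuv xml:lang="en"><seg>Opening hours</seg></tuv>
      <tuv xml:lang="da"><seg>Åbningstider</seg></tuv>
    </tu>
    <tu>
      <tuv xml:lang="EN-GB"><seg>Welcome to the museum</seg></tuv>
      <tuv xml:lang="DA"><seg>Velkommen til museet</seg></tuv>
    </tu>
    <tu>
      <tuv xml:lang="en"><seg>No Danish variant here</seg></tuv>
    </tu>
  </body>
</tmx>""".encode("utf-8")

pairs = read_tmx_pairs(io.BytesIO(SAMPLE_TMX))
for english, danish in pairs:
    print(f"{english}\t{danish}")
```

Passing a file path instead of a `BytesIO` object should work the same way, since `ET.iterparse` accepts either. Units that lack one of the two languages are skipped rather than reported, which is a simplification.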