A Danish semantic reasoning benchmark compiled from lexical semantic resources This benchmark is the first version of a semantic reasoning benchmark for Danish compiled semi-automatically from a number of human-curated lexical-semantic resources, which function as our gold standard. The datasets constitute a benchmark for assessing selected language understanding capacities of large language models (LLMs) for Danish.
Beta version
Current version is a beta version and is not yet fully curated. Compared to the first version, errors in the “lexical inference” dataset have been corrected.
License: Creative Commons Attribution-NoDerivs 4.0 International License