／var／log marcus chiu

❯

❯

Artificial Intelligence (AI) - Cognitive Computing - Machine Intelligence

❯

❯

Natural Language Processing (NLP) - Computational Linguistics

❯

Language Models

Measuring Massive Multitask Language Understanding (MMLU)

Created on Oct 11, 2025

Measuring Massive Multitask Language Understanding (MMLU)

is a popular benchmark for evaluating the capabilities of large language models