Exploring the (Lack of) Cultural Diversity in Multilingual Datasets for NLP
Lea Krause, PhD candidate at Vrije Universiteit Amsterdam

The project addresses the critical need for cultural diversity in multilingual datasets used to train and evaluate language models and conversational agents. Current practices often involve translating English-centric content, which limits the cultural authenticity and applicability of these datasets across different regions. For example, evaluating models using […]