Skip to content

Address Knowledge Gaps

We are developing systems that identify and address gaps across Wikimedia projects.
Swirling galactic dust clouds
Image by NASA

Overview

In 2030, the world's population is projected to be 8.6 billion, almost 80% of which will live in Africa and Asia. Latin America's population will continue to grow rapidly while population growth in Europe and Northern America—today's largest sources of contributors and readership to Wikimedia projects—will plateau. How can we help Wikimedia projects thrive in a world that is becoming increasingly different from the one we are building for today, both in terms of production and consumption of content?

The Wikimedia movement has identified as a strategic goal supporting "the knowledge and communities that have been left out by structures of power and privilege". In order to meet this goal, we need to understand how to serve audiences, groups, and cultures that today are underrepresented in Wikipedia, Wikidata, Commons and other Wikimedia projects—in terms of participation, access, representation, and coverage.

We have begun to advance knowledge equity with a research program to address knowledge gaps. This program aims to deliver citable, peer-reviewed knowledge and new technology in order to generate baseline data on the diversity of the Wikimedia contributor population, understand reader needs across languages, remove barriers for contribution by underrepresented groups, and help contributors identify and expand missing content across languages and topics.

More information can be found in our roadmap.

Recent updates

Entity Insertion in Multilingual Linked Corpora: The Case of Wikipedia

A model for automatically locating suitable positions for new links in Wikipedia articles, supporting editors in cases where a suitable anchor text does not yet exist.

Architectural styles of curiosity in global Wikipedia mobile app readership

Uncovers complex patterns of Wikipedia navigation and characterizes reader curiosity types in the mobile app.

An Open Multilingual System for Scoring Readability of Wikipedia

A multilingual model to score the readability of Wikipedia articles across languages.

Language-Agnostic Modeling of Wikipedia Articles for Content Quality Assessment across Languages

Applies a language-agnostic article quality model to millions of revisions from over 300 language editions of Wikipedia.

Leveraging Recommender Systems to Reduce Content Gaps on Peer Production Platforms

An experiment to explore the potential role of recommender systems in reducing content gaps on Wikipedia.

Orphan Articles: The Dark Matter of Wikipedia

A study of the surprisingly large number of orphan articles in Wikipedia and how to improve their visibility.

Curious Rhythms: Temporal Regularities of Wikipedia Consumption

A study of the temporal patterns in how Wikipedia articles are accessed by readers, helping understand the diversity of their information needs.

Increasing Participation in Peer Production Communities with the Newcomer Homepage

Shows how the Newcomer Homepage has increased participation amongst newcomers to Wikipedia.

Research pages

Slides

Videos

Publications