Overview
Wikimedia projects are created and maintained by a vast network of individual contributors and organizations with different roles and expertise. The Wikimedia Foundation, including Wikimedia Research, plays an important role in supporting these efforts, but our internal capacity and expertise will always be more limited than those of the Movement as a whole. Tackling the strategic challenges ahead requires an investment in foundational social and technical infrastructure that individuals, groups, and organizations across the Movement can use.
We see an urgent need to develop and disseminate more foundational resources that grow research capacity across the Movement. These foundational resources take many forms: new tools for developing scientific knowledge about projects and contributors; new open data resources and improved tools for working with them; new methods and guidance for mission-aligned research and technology development; and outreach activities designed to foster a healthy, diverse, and dynamic community of researchers within the Wikimedia Movement.
More information can be found in our white paper and research community mapping.
Resources and links
Research pages
- Language-Agnostic Topic Classification
- External Reuse of Wikimedia Content
- Preventing, identifying, and addressing bias in recommender systems
Publications
- Isaac Johnson, Lucie-Aimée Kaffee, and Miriam Redi. 2024. Wikimedia data for AI: A review of Wikimedia datasets for NLP tasks and AI-assisted editing. Proceedings of Advancing Natural Language Processing for Wikipedia Workshop (EMNLP '24).
- Temilola Adeleye, Skye Berghel, Damien Desfontaines, Michael Hay, Isaac Johnson, Cléo Lemoisson, Ashwin Machanavajjhala, Tom Magerlein, Gabriele Modena, David Pujol, Daniel Simmons-Marengo, and Hal Triedman. 2023. Publishing Wikipedia usage data with strong privacy guarantees. Theory and Practice of Differential Privacy (TPDP) 2023. https://doi.org/10.48550/arXiv.2308.16298
- Isaac Johnson and Emily Lescak. 2022. Considerations for Multilingual Wikipedia Research. Wiki-M3L: Wikipedia and Multi-Modal & Multi-Lingual Research (co-located with ICLR 2022).
- Tiziano Piccardi, Miriam Redi, Giovanni Colavizza, and Robert West. 2021. On the Value of Wikipedia as a Gateway to the Web. In Proceedings of The Web Conference 2021 (WWW '21). https://doi.org/10.1145/3442381.3450136
- Isaac Johnson, Martin Gerlach, and Diego Sáez-Trumper. 2021. Language-agnostic Topic Classification for Wikipedia. WikiWorkshop 2021: In Companion Proceedings of The Web Conference 2021 (WWW '21). https://doi.org/10.1145/3442442.3452347
- Isaac Johnson. 2020. Analyzing Wikidata Transclusion on English Wikipedia. 1st Wikidata Workshop: International Semantic Web Conference (ISWC '20).
- Swati Goel, Ashton Anderson, and Leila Zia. 2019. Thanks for Stopping By: A Study of "Thanks" Usage on Wikimedia. WikiWorkshop 2019: In Companion Proceedings of The Web Conference 2019 (WWW '19).
- Xiaoxi Chelsy Xie, Isaac Johnson, and Anne Gomez. 2019. Detecting and Gauging Impact on Wikipedia Page Views. WikiWorkshop 2019: In Companion Proceedings of The Web Conference 2019 (WWW '19).
