2025
2024
Entity Insertion in Multilingual Linked Corpora: The Case of Wikipedia
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP '24)
- https://doi.org/10.1126/sciadv.adn3268
Architectural styles of curiosity in global Wikipedia mobile app readership
Science Advances. 10, eadn3268
Wikimedia data for AI: A review of Wikimedia datasets for NLP tasks and AI-assisted editing
Proceedings of Advancing Natural Language Processing for Wikipedia Workshop (EMNLP '24)
An Open Multilingual System for Scoring Readability of Wikipedia
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL '24)
Language-Agnostic Modeling of Wikipedia Articles for Content Quality Assessment across Languages
Proceedings of the Eighteenth International AAAI Conference on Web and Social Media (ICWSM '24)
Leveraging Recommender Systems to Reduce Content Gaps on Peer Production Platforms
Proceedings of the Eighteenth International AAAI Conference on Web and Social Media (ICWSM '24)
Orphan Articles: The Dark Matter of Wikipedia
Proceedings of the Eighteenth International AAAI Conference on Web and Social Media (ICWSM '24)
Curious Rhythms: Temporal Regularities of Wikipedia Consumption
Proceedings of the Eighteenth International AAAI Conference on Web and Social Media (ICWSM '24)
2023
- https://doi.org/10.1145/3583780.3615254
A Comparative Study of Reference Reliability in Multiple Language Editions of Wikipedia
In Proceedings of the 32nd ACM International Conference on Information and Knowledge Management (CIKM '23)
Increasing Participation in Peer Production Communities with the Newcomer Homepage
Proc. ACM Hum.-Comput. Interact. (CSCW '23).
https://doi.org/10.1145/3610071- https://doi.org/10.48550/arXiv.2308.16298
Publishing Wikipedia usage data with strong privacy guarantees
Theory and Practice of Differential Privacy (TPDP) 2023
Fair multilingual vandalism detection system for Wikipedia
In Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD '23)
Longitudinal Assessment of Reference Quality on Wikipedia
In Proceedings of The Web Conference 2023 (WWW '23)
https://doi.org/10.1145/3543507.3583218Overview of the TREC 2022 Fair Ranking Track
The Thirty-First Text REtrieval Conference (TREC 2022) Proceedings
A Large-Scale Characterization of How Readers Browse Wikipedia
ACM Transactions on the Web
https://doi.org/10.1145/3580318
2022
Templates and Trust-o-meters: Towards a widely deployable indicator of trust in Wikipedia
CHI '22: CHI Conference on Human Factors in Computing Systems
https://doi.org/10.1145/3491102.3517523Crosslingual Section Title Alignment in Wikipedia
In Proceedings of the 2022 IEEE International Conference on Big Data (Big Data '22)
Wiki Loves Monuments: Crowdsourcing the Collective Image of the Worldwide Built Heritage
J. Comput. Cult. Herit. 16, 1, Article 20 (March 2023)
https://doi.org/10.1145/3569092"We Need a Woman in Music": Exploring Wikipedia's Values on Article Priority
25th ACM Conference On Computer-Supported Cooperative Work And Social Computing (CSCW '22)
https://doi.org/10.1145/3555156Considerations for Multilingual Wikipedia Research
Wiki-M3L: Wikipedia and Multi-Modal & Multi-Lingual Research (co-located with ICLR 2022)
Data Governance in the Age of Large-Scale Data-Driven Language Technology
Proceedings of FAccT 2022
Overview of the TREC 2021 Fair Ranking Track
The Thirtieth Text REtrieval Conference (TREC 2021) Proceedings
Going Down the Rabbit Hole: Characterizing the Long Tail of Wikipedia Reading Sessions
WikiWorkshop 2022: In Companion Proceedings of The Web Conference 2022 (WWW '22)
Wikipedia Reader Navigation: When Synthetic Data Is Enough
Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining (WSDM '22)
https://doi.org/10.1145/3488560.3498496Visual Gender Biases in Wikipedia: A Systematic Evaluation across the Ten Most Spoken Languages
Proceedings of the Sixteenth International AAAI Conference on Web and Social Media (ICWSM '22)
A Large Scale Study of Reader Interactions with Images on Wikipedia
EPJ Data Science
https://doi.org/10.1140/epjds/s13688-021-00312-8WikiContradiction: Detecting Self-Contradiction Articles on Wikipedia
2021 IEEE International Conference on Big Data (Big Data)
https://doi.org/10.1109/BigData52589.2021.9671319
2021
Wiki-Reliability: A Large Scale Dataset for Content Reliability on Wikipedia
SIGIR '21
https://doi.org/10.1145/3404835.3463253Tracking Knowledge Propagation Across Wikipedia Languages
Proceedings of the Fifteenth International AAAI Conference on Web and Social Media (ICWSM '21)
WikiCheck: An end-to-end open source Automatic Fact-Checking API based on Wikipedia
30th ACM International Conference on Information and Knowledge Management (CIKM '21)
A Multilingual Entity Linking System for Wikipedia with a Machine-in-the-Loop Approach
30th ACM International Conference on Information and Knowledge Management (CIKM '21)
A preliminary approach to knowledge integrity risk assessment in Wikipedia projects
MIS2'21: Misinformation and Misbehavior Mining on the Web Workshop held in conjunction with KDD 2021
Global gender differences in Wikipedia readership
Proceedings of the Fifteenth International AAAI Conference on Web and Social Media (ICWSM '21)
On the Value of Wikipedia as a Gateway to the Web
In Proceedings of The Web Conference 2021 (WWW '21)
https://doi.org/10.1145/3442381.3450136Language-agnostic Topic Classification for Wikipedia
WikiWorkshop 2021: In Companion Proceedings of The Web Conference 2021 (WWW '21)
https://doi.org/10.1145/3442442.3452347A Taxonomy of Knowledge Gaps for Wikimedia Projects (Second Draft)
2020
Scalable Recommendation of Wikipedia Articles to Editors Using Representation Learning
ComplexRec 2020, Workshop on Recommendation in Complex Scenarios at the ACM RecSys Conference on Recommender Systems (RecSys 2020)
Analyzing Wikidata Transclusion on English Wikipedia
1st Wikidata Workshop: International Semantic Web Conference (ISWC '20)
A Taxonomy of Knowledge Gaps for Wikimedia Projects (First Draft)
Uneven Coverage of Natural Disasters in Wikipedia: The Case of Floods
17th International Conference on Information Systems for Crisis Response and Management (ISCRAM 2020)
Matching Ukrainian Wikipedia Red Links with English Wikipedia's Articles
WikiWorkshop 2020: In Companion Proceedings of the Web Conference 2020 (WWW '20)
https://doi.org/10.1145/3366424.3383571Quantifying Engagement with Citations on Wikipedia
In Proceedings of The Web Conference 2020 (WWW '20)
https://doi.org/10.1145/3366423.3380300
2019
Online Disinformation and the Role of Wikipedia
Eliciting New Wikipedia Users' Interests via Automatically Mined Questionnaires: For a Warm Welcome, Not a Cold Start
In Proceedings of the Thirteenth International AAAI Conference on Web and Social Media (ICWSM '19)
Thanks for Stopping By: A Study of "Thanks" Usage on Wikimedia
WikiWorkshop 2019: In Companion Proceedings of The Web Conference 2019 (WWW '19)
Detecting and Gauging Impact on Wikipedia Page Views
WikiWorkshop 2019: In Companion Proceedings of The Web Conference 2019 (WWW '19)
Citation Needed: A Taxonomy and Algorithmic Assessment of Wikipedia's Verifiability
In Proceedings of The Web Conference 2019 (WWW '19)
https://doi.org/10.1145/3308558.3313618Why the World Reads Wikipedia: Beyond English Speakers
International ACM Conference on Web Search and Data Mining (WSDM '19)
https://doi.org/10.1145/3289600.3291021
2018
Reciprocity and Donation: How Article Topic, Quality and Dwell Time Predict Banner Donation on Wikipedia
Proceedings of the ACM on Human-Computer Interaction (CSCW '18)
https://doi.org/10.1145/3274360'Welcome' Changes?: Descriptive and Injunctive Norms in a Wikipedia Sub-Community
Proceedings of the ACM on Human-Computer Interaction (CSCW '18)
https://doi.org/10.1145/3274321With Few Eyes, All Hoaxes are Deep
Proceedings of the ACM on Human-Computer Interaction (CSCW '18)
https://doi.org/10.1145/3274290Bot Detection in Wikidata Using Behavioral and Other Informal Cues
Proceedings of the ACM on Human-Computer Interaction (CSCW '18)
https://doi.org/10.1145/3274333WikiConv: A Corpus of the Complete Conversational History of a Large Online Collaborative Community
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP '18)
Conversations Gone Awry: Detecting Early Signs of Conversational Failure
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL '18)
- https://doi.org/10.1145/3233391.3233544
Evaluating the impact of the Wikipedia Teahouse on newcomer socialization and retention
Proceedings of the 14th International Symposium on Open Collaboration (OpenSym '18)
Structuring Wikipedia Articles with Section Recommendations
Proceedings of the 41st International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '18)
Operationalizing conflict and cooperation between automated software agents in Wikipedia: A replication and expansion of Even Good Bots Fight
Proceedings of the ACM on Human-Computer Interaction (Nov 2017 issue, CSCW 2018 Online First)
https://doi.org/10.1145/3134684
2017
WikiCite 2017 Report
figshare
https://doi.org/10.6084/m9.figshare.5648233Democratizing Data Science: The Community Data Science Workshops and Classes.
Big Data Factories: Scientific Collaborative approaches for virtual community data collection, repurposing, recombining, and dissemination
Identifying Semantic Edit Intentions from Revisions in Wikipedia
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing
Interpolating Quality Dynamics in Wikipedia and Demonstrating the Keilana Effect
Proceedings of the 13th International Symposium on Open Collaboration (OpenSym '17)
https://doi.org/10.1145/3125433.3125475Building Automated Vandalism Detection Tools for Wikidata
Proceedings of the 26th International Conference on World Wide Web Companion (WWW '17 Companion)
https://doi.org/10.1145/3041021.3053366Why We Read Wikipedia
Proceedings of the 26th International Conference on World Wide Web (WWW '17)
https://doi.org/10.1145/3038912.3052716The Wikipedia Adventure: Field Evaluation of an Interactive Tutorial for New Users
Proceedings of the 2017 ACM Conference on Computer Supported Cooperative Work and Social Computing (CSCW '17)
https://doi.org/10.1145/2998181.2998307- https://doi.org/10.1145/3022198.3022661
Advancing the OCDX: Building Social Computing Infrastructure. In
Companion of the 2017 ACM Conference on Computer Supported Cooperative Work and Social Computing (CSCW '17 Companion)
Ex machina: Personal attacks seen at scale
Proceedings of the 26th International Conference on World Wide Web (WWW '17)
https://doi.org/10.1145/3038912.3052591
2016
WikiCite 2016 Report
figshare
https://doi.org/10.6084/m9.figshare.4042530Improving Website Hyperlink Structure Using Server Logs
Proceedings of the Ninth ACM International Conference on Web Search and Data Mining (WSDM '16)
https://doi.org/10.1145/2835776.2835832Not at Home on the Range: Peer Production and the Urban/Rural Divide
Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems (CHI '16)
https://doi.org/10.1145/2858036.2858123Who Did What: Editor Role Identification in Wikipedia.
Proceedings of the Tenth International AAAI Conference on Web and Social Media (ICWSM '16)
Growing Wikipedia Across Languages via Recommendation
Proceedings of the 25th International Conference on World Wide Web (WWW '16)
https://doi.org/10.1145/2872427.2883077LeadWise: Using Online Bots to Recruit and Guide Expert Volunteers
Proceedings of the 19th ACM Conference on Computer Supported Cooperative Work and Social Computing Companion (CSCW '16 Companion)
https://doi.org/10.1145/2818052.2869106- https://doi.org/10.1145/2818052.2855512
Breaking into new Data-Spaces: Infrastructure for Open Community Science. In
Proceedings of the 19th ACM Conference on Computer Supported Cooperative Work and Social Computing Companion (CSCW '16 Companion)
2015
Effects of a Wikipedia Orientation Game on New User Edits
Proceedings of the 18th ACM Conference Companion on Computer Supported Cooperative Work & Social Computing (CSCW'15 Companion)
https://doi.org/10.1145/2685553.2699022User Session Identification Based on Regularities in Inter-activity Time
Proceedings of the 24th International Conference on World Wide Web (WWW '15).
https://doi.org/10.1145/2736277.2741117The Success and Failure of Quality Improvement Projects in Peer Production Communities
Proceedings of the 18th ACM Conference on Computer Supported Cooperative Work & Social Computing (CSCW '15)
https://doi.org/10.1145/2675133.2675241MoodBar: Increasing New User Retention in Wikipedia through Lightweight Socialization
Proceedings of the 18th ACM Conference on Computer Supported Cooperative Work & Social Computing (CSCW '15)
https://doi.org/10.1145/2675133.2675181Advancing an Industry/Academic Partnership Model for Open Collaboration Research
Proceedings of the 18th ACM Conference Companion on Computer Supported Cooperative Work & Social Computing (CSCW'15 Companion)
https://doi.org/10.1145/2685553.2685559
2014
Editing beyond articles: Diversity & dynamics of teamwork in open collaborations
Proceedings of the 17th ACM conference on Computer supported cooperative work & social computing (CSCW '14)
https://doi.org/10.1145/2531602.2531654Accept, decline, postpone: How newcomer productivity is reduced in English Wikipedia by pre-publication review
Proceedings of The International Symposium on Open Collaboration (OpenSym '14)
https://doi.org/10.1145/2641580.2641614Snuggle: Designing for efficient socialization and ideological critique
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI '14)
https://doi.org/10.1145/2556288.2557313
2013
The Rise and Decline of an Open Collaboration System. How Wikipedia's Reaction to Popularity Is Causing Its Decline
American Behavioral Scientist
https://doi.org/10.1177/0002764212469365Tea and sympathy: Crafting positive new user experiences on Wikipedia
Proceedings of the 2013 conference on Computer supported cooperative work (CSCW '13)
https://doi.org/10.1145/2441776.2441871A content analysis of wikiproject discussions: Toward a typology of coordination language used by virtual teams
Proceedings of the 2013 conference on Computer supported cooperative work companion (CSCW '13)
https://doi.org/10.1145/2441955.2442011Are we there yet? The development of a corpus annotated for social acts in multilingual online discourse
Dialogue and Discourse
https://doi.org/10.5087/dad.2013.201Managing complexity: Strategies for group awareness and coordinated action in Wikipedia
Proceedings of the 9th International Symposium on Open Collaboration (WikiSym '13)
https://doi.org/10.1145/2491055.2491060When the levee breaks: Without bots, what happens to Wikipedia's quality control processes?
Proceedings of the 9th International Symposium on Open Collaboration (WikiSym '13)
https://doi.org/10.1145/2491055.2491061Using edit sessions to measure participation in Wikipedia
Proceedings of the 2013 conference on Computer supported cooperative work (CSCW '13).
https://doi.org/10.1145/2441776.2441873Making peripheral participation legitimate: Reader engagement experiments in Wikipedia
Proceedings of the 2013 conference on Computer supported cooperative work (CSCW '13)
https://doi.org/10.1145/2441776.2441872- https://doi.org/10.1145/2491055.2491093
Descending Mount Everest: Steps towards applied Wikipedia research. In
Proceedings of the 9th International Symposium on Open Collaboration (WikiSym '13)
2012
Etiquette in Wikipedia: Weaning new editors into productive ones
Proceedings of the Eighth Annual International Symposium on Wikis and Open Collaboration (WikiSym '12)
https://doi.org/10.1145/2462932.2462939- https://doi.org/10.1109/MC.2012.82
Bots and cyborgs: Wikipedia's immune system
Computer
- https://doi.org/10.1145/2462932.2462983
What aren't we measuring?: Methods for quantifying wiki-work. In
Proceedings of the Eighth Annual International Symposium on Wikis and Open Collaboration (WikiSym '12)
Defense Mechanism or Socialization Tactic? Improving Wikipedia's Notifications to Rejected Contributors
Proceedings of the Sixth International AAAI Conference on Weblogs and Social Media (ICWSM '12)
Negotiating Cultural Values in Social Media: A Case Study from Wikipedia
Proceedings of the 45th Hawaii International Conference on System Science (HICSS '12)
https://doi.org/10.1109/HICSS.2012.443