Print

Measuring Wikipedia


Speakers

Luca de Alfaro and Felipe Ortega.

Abstract

This tutorial is an introduction to the best methodologies, tools and practices for Wikipedia research.

The tutorial will be led by Luca de Alfaro (Wiki Lab at UCSC, California, USA) and Felipe Ortega (Libresoft, URJC, Madrid, Spain). Both cumulate several years of practical experience exploring and processing Wikipedia data [1 (external link)], [2 (external link)], [3 (external link)]. As well, their respective research groups have led the development of two cutting-edge software tools (WikiTrust (external link) and WikiXRay (external link)), for analyzing Wikipedia. WikiTrust (external link) implements an author reputation system, and a text trust system, for wikis. WikiXRay (external link) is a tool automating the quantitative analysis of any language version of Wikipedia (in general, any wiki based on MediaWiki).

After attending this session, attendees should be able to:

  • Outline a general picture of the different research perspectives currently applied to Wikipedia.
  • Easily find previous research works and information sources to contextualize their own work on Wikipedia.
  • Store, navigate and process information retrieved from the Wikipedia data jungle.
  • Discriminate the optimum set of available tools that fits their own Wikipedia research needs, as well as develop their own set of tools.
  • Create and refine their research roadmap, to achieve concrete goals and results.
  • Feel comfortable using and extending WikiTrust (external link) and WikiXRay (external link).

Speakers bios


Luca de Alfaro received a PhD. in Computer Science from Standford University (1998). He is an associate professor of Computer Engineering at the UC Santa Cruz. He leads the UCSC Wiki Lab, responsible for the development of the WikiTrust (external link) tool. Likewise, he also leads the Dvlab (Design and Verification laboratory) at UCSC. His research interests cover a wide range of topics, including applications (reputation systems, collaborative information creation, e-commerce), system design (embedded software design, formal methods for system design), system verification (discrete, real-time, embedded, and probabilistic systems) and foundations (game theory, concurrency theory, automata theory). WikiTrust (external link) is a MediaWiki (external link) extension that implements an author reputation system, and a text trust system, for wikis. WikiTrust (external link) adds to a wiki a check text tab that enables any visitor to check the author, origin, and reliability of wiki text. Thus, visitors can easily spot spam, surreptitious changes, and information tampering. The Wiki Lab members have produce a number of research papers on WikiTrust (external link), most notably [1 (external link)] and [2 (external link)]. WikiTrust was also recently highlighted, on August 2009, on Wired Magazine (external link).

Felipe Ortega is Researcher and Project Manager at Libresoft (external link) (Universidad Rey Juan Carlos), since November 2007. He received a Ph.D. in Computer Science from Universidad Rey Juan Carlos (2009). His main research line is the Wikipedia (external link) project and its community of authors/editors. His PhD. dissertation (external link), (available online (external link)), is the first one to provide a side-by-side empirical analysis of the top 10 language editions of Wikipedia, from different points of view. He also works to develop novel methodologies to analyze open collaborative projects (like FLOSS development projects, Wikipedia and social networking platforms) involving a very high number of participants. Felipe is the main developer behind WikiXRay (external link), an extensible tool supporting the automation of quantitative analyses on any language edition of Wikipedia. Packages with quantitative information from many Wikipedia language editions, extracted by WikiXRay (external link) and ready to be used for research purposes, are available on the new WikiResearch repository (external link), hosted by Spanish RedIRIS. He is (part-time) Associate Professor at Alfonso X El Sabio University (UAX), on Computer Architecture and Cryptography and Network Security. Before joining Libresoft, he was full-time Associate Professor at UAX, coordinating the Networking Lab and teaching more than 8 different courses (most of them related to Networking and Security) in 4 different Master Programs.