110 likes | 124 Views
Explore how collaborative editing, user contributions, and positive feedback loops in the Gene Wiki project facilitate large-scale gene annotation in biomedical research, showcasing the potential impact on data generation and scientific outreach.
E N D
The Gene Wiki: Community Intelligence Applied to Gene Annotation FaceBase Kick-off Meeting November 16, 2009 Andrew Su, Ph.D.
The biomedical literature is massive Centralized curation efforts do not scale with the rapid growth of the biomedical literature 811,214 articles in PubMed in 2008 Sooner or later, the research community will need to be involved in the annotation effort to scale up to the rate of data generation.
Wikipedia as a model • Wikipedia: “the free encyclopedia that anyone can edit.” • Contains a huge breadth of topics and volume of information • > 2 million articles, > 1 billion words • More accurate than one might think • comparable to Britannica Online • Epitomizes collaborative editing • 300K+ active editors • Displays both structured and unstructured data • figures, images, photos http://en.wikipedia.org/wiki/Wikipedia:Size_comparisons, July 2008
Gene “stubs” • Active MCB community at WP had already developed ~650 gene articles • Can we accelerate this process through stub creation? • In total, created 8000 new articles and edited 650 previously existing articles.
Positive feedback loops Gene wiki page utility 1 100 2 200 Number of contributors Number of users
25k gene-specific review articles? Goal: Create a continually-updated, collaboratively-written, and community-reviewed review article for every gene in the human genome. Figures and diagrams Inline PubMed citations Table of contents Hyperlinks to related concepts
Gene Wiki usage Utility Median: 303 views / page / month Total: 2.9 million views / month Users Contributors 85% of Gene Wiki pages are found on the first page of Google search results…
Gene Wiki editing activity Utility During Jan – Jun 2009… … 6848 edits were made by 1923 unique users of IP addresses … average of 1100 edits per month (SD=171) … additional 11,912 edits made by automated “bots” … total increase in text content by 2.28 megabytes, approximately equal to 19 research articles in PLoS Biology Users Contributors
Positive feedback loop initiated? Utility Monthly statistics Users Contributors There is substantial evidence that Wikipedia and the Gene Wiki are used by both scientists and the general public, so we as a community better make it good.
Dual FaceBase Wiki efforts • Direct participation in the Gene Wiki • Goal: Scientific outreach and education • Pro: • Existing critical mass of editors and content • Con: • Bureaucracy • Limited scope as a general encyclopedia • Standalone FaceBase Wiki • Goal: Creation of a useful research tool • Pro: • Greater editorial control • Inclusion of unpublished data and findings • Emphasis on content for craniofacial community • Con: • Difficult to create and maintain critical mass • Q: How should we seed content?
Acknowledgements Collaborators Current group members Past members John Hogenesch, UPenn Angel Pizzaro, UPenn Faramarz Valafar, SDSU Donabel Roberts, SDSU Pierre Lindenbaum, FondationJean Dausset Michael Martone, Rush Konrad Koehler, Karo Bio David Delano Jennifer Floyd James Goodale Phil McClurg Steve Su Richard Trager Julia Turner Serge Batalov Ghislain Bonamy Jason Boyer Jon Huss Yue Hu Jeff Janes Marc Leglise Camilo Orozco Chunlei Wu Funding and Support NIGMS, NIH Novartis Research Foundation To access the Gene Wiki, Google “gene wiki portal” (or your favorite gene’s symbol) for more info…