TWC LOGD: A Portal for Linking Open Government Data
Dominic DiFranzo, Li Ding, John S. Erickson, Xian Li, Tim Lebo, James Michaelis, Alvaro Graves, Gregory Todd Williams, Jin Guang Zheng, Johanna Flores, Zhenning Shangguan, Gino Gervasio, Deborah L. McGuinness, Jim Hendler
Tetherless World Constellation, Rensselaer Polytechnic Institute
Semantic Web Challenge 2010, Nov 10, 2010
The TWC LOGD Portal for SWC2010
Real World Data
• US, UK, China, …
• Health, energy, economy
Semantic Web in Gov Domain
• Major partner of Data.gov
• 8.5 billion triples in LOD
End User Applications
• Community Portal
• Fast, low-cost mashups
The “Semantic Web” label and the RDF logo appeared on the front page of the US Data.gov website
Major Partner of US Data.gov Project
• Semantic Web tech deployed in Data.gov
• RDF data, SPARQL endpoint, semantic mashups
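As a rough illustration of what consuming a SPARQL endpoint involves, the sketch below builds a simple query and encodes it into a GET request URL in the style of the SPARQL Protocol. The endpoint address and graph URI are hypothetical placeholders, not the project's actual addresses, and no network call is made.

```python
from urllib.parse import urlencode, urlparse, parse_qs

# Hypothetical endpoint, for illustration only (not taken from the slides).
ENDPOINT = "http://example.org/sparql"


def build_query(graph_uri: str, limit: int = 10) -> str:
    """Return a SPARQL SELECT query listing triples from one named graph."""
    if limit <= 0:
        raise ValueError("limit must be positive")
    return (
        "SELECT ?s ?p ?o\n"
        f"WHERE {{ GRAPH <{graph_uri}> {{ ?s ?p ?o }} }}\n"
        f"LIMIT {limit}"
    )


def build_request_url(endpoint: str, query: str) -> str:
    """Encode the query text as the `query` parameter of a GET request."""
    return endpoint + "?" + urlencode({"query": query})


# Hypothetical graph URI standing in for one converted dataset.
query = build_query("http://example.org/dataset/demo", limit=5)
url = build_request_url(ENDPOINT, query)
print(url)
```

Sending `url` with any HTTP client would return the first five triples of that graph, assuming the endpoint exists and accepts GET queries.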
Government Adoption Process
Data.gov timeline (2009-2010):
• May 21, 2009: data.gov online
• May 2010: SPARQL endpoint, RDF data, and demos replicated at Data.gov
• May 21, 2010: data.gov relaunch with semantic web featured
• Oct 2010: new application published by a team at DOE
TWC timeline (dates shown on the slide for the first three items: July 2009, 2009-2010, Aug 2010):
• Demos, tutorials, videos, SPARQL endpoint
• Data-gov Wiki @RPI online
• Two-day Mashathon in Washington DC
• Oct 2010: TWC LOGD Drupal site announced
The Largest Real World LOD Dataset
• 8.5+ billion triples from the real world
• 7500+ LOD links
• Accessible via a data browser, e.g. Tabulator
Smoking Prevalence vs. Tax, Policy …
Extensible and accountable mashups for an NIH project
Figure: Trends in Smoking Prevalence, Tobacco Policy Coverage and Tobacco Prices (1991-2007)
Extensible Mashups via Linked Data
• Diverse datasets from NIH
• Potentially linking to “unemployment rate”
Accountable Mashups via Provenance
• Annotate datasets used in demos
• Feed users’ comments back to the government contact
White House Visitor Search
Leveraging linked data (DBpedia & New York Times)
Diagram labels: NYTimes, Wikipedia, dbpedia:Barack_Obama, Semantic Wiki, “POTUS”, The White House
• [Person Mashup] Data.gov (statistics) + DBpedia (personal profiles) + NYTimes (news)
• [Technologies] Semantic MediaWiki, Google Visualization, iPad apps available in Apple Store
Created by Dominic DiFranzo, Evan Patton, RPI
http://data-gov.tw.rpi.edu/demo/stable/white-house-visitor/top100-visitees.php
Linking GDP of the US and China
Linking international government data meaningfully
Figure: GDP of the US (billion dollars) and GDP of China (billion Chinese yuan), 2000 to 2010
[Temporal Mashup] bea.gov + federalreserve.gov + stats.gov.cn
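A temporal mashup of this kind comes down to aligning series from different sources on a shared time key. The sketch below is a minimal illustration of that join, not the portal's actual code; the function name is hypothetical and the numbers are dummy placeholders rather than real GDP figures.

```python
from typing import Dict, List, Tuple


def join_by_year(
    a: Dict[int, float], b: Dict[int, float]
) -> List[Tuple[int, float, float]]:
    """Inner-join two yearly series, keeping only years present in both."""
    shared_years = sorted(a.keys() & b.keys())
    return [(year, a[year], b[year]) for year in shared_years]


# Dummy placeholder values, NOT real statistics.
series_a = {2000: 1.0, 2001: 2.0, 2002: 3.0}
series_b = {2001: 20.0, 2002: 30.0, 2003: 40.0}

rows = join_by_year(series_a, series_b)
for year, value_a, value_b in rows:
    print(year, value_a, value_b)
```

Series in different currencies would still need a conversion step before they can share one axis; the join only lines up the years.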
Reaching Open Source Communities
Linking semantic web with web developers
• Social Semantic Web extensions/modules for popular CMSs, e.g. Semantic Wiki, Drupal
• Process/consume integrated gov data in a number of different ways: social networks, natural language technologies, workflows, search, …
Web n-grams
TWC LOGD Status: Website Statistics
• 378,128 page hits
• 28,481 visits
• 16,041 visitors
• 4,126 cities
• 34 countries
Note: the statistics above cover http://data-gov.tw.rpi.edu only. Dataset access is not counted.
Summary of the TWC LOGD Portal
http://logd.tw.rpi.edu
Real World Data
• 8.5+ billion triples
• 400+ datasets
• 10+ sources
• Many domains
Semantic Web Technology
• Completely open source
• Demos/tutorials/videos
Community and Users
• Partner of US government
• Open source community
• Education in university
Linking Open Government Data Now!
The Team and Sponsors
Team: Jim Hendler, Deborah L. McGuinness, Li Ding, Dominic DiFranzo, Sarah Magidson, James Michaelis, Alvaro Graves, Jin Guang Zheng, Xian Li, Gregory Todd Williams, Tim Lebo, Zhenning Shangguan, Devin Gaffney, Peter Coons, Adam Bell, William Cooper, Brian Zaik, Johanna Flores
Government Sponsors: DARPA, NSF, NASA, IARPA, NIH/NCI, …
Data.gov and World-Wide Open Government Data Activities
“Openness will strengthen our democracy and promote efficiency and effectiveness in Government.” (President Obama)
Timeline (2009-2010):
• May 21, 2009: data.gov online
• May 21, 2010: data.gov relaunch with semantic web featured
• data.gov.uk online
Other dates marked on the timeline: January 1, 2009; June 30, 2009; January 19, 2010
Putting Government Data online
• Many countries
• US
• UK
• Australia
• New Zealand
• …
Data-gov Wiki: Innovations at RPI
The Data-gov Wiki explores the use of semantic web technologies, especially linked data, in producing, processing, and utilizing government data from data.gov, and educates users about them.
• 40+ Demos
• 400+ Datasets
• Tutorials & Videos
The Data-gov Wiki is run by the Tetherless World Constellation at RPI, headed by Professors Jim Hendler and Deborah McGuinness and led by Li Ding. Student team members include: Dominic DiFranzo, Sarah Magidson, James Michaelis, Alvaro Graves, Adam Bell, Jin Guang Zheng, Xian Li, Tim Lebo, Gregory Todd Williams, Peter Coons, Zhenning Shangguan, Devin Gaffney, William Cooper, Brian Zaik, and Johanna Flores.
Tech: Abstraction and Versioning
Diagram, data publishing stages: OGD (part1) Snapshot, OGD (part2) Snapshot, …; Version; Conversion Layer: LOGD (raw), LOGD (e1), …
Diagram, levels of structural data granularity (high to low): Source, Dataset, Table, Record, …
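One way to picture the abstraction is as a hierarchical name: a source publishes a dataset, each snapshot of it is a version, and each version can carry a raw conversion plus numbered enhancement layers. The sketch below encodes that idea. The class, field names, and URI pattern are assumptions made for illustration and are not claimed to match the portal's real naming scheme.

```python
from dataclasses import dataclass, replace

# Hypothetical base URI, for illustration only.
BASE = "http://example.org"


@dataclass(frozen=True)
class ConversionLayer:
    """One converted form of one version of one dataset from one source."""

    source: str   # who published the original data
    dataset: str  # dataset identifier within that source
    version: str  # label for one snapshot of the original data
    layer: str    # "raw", or an enhancement layer such as "e1"

    def uri(self) -> str:
        """Build a hierarchical URI, from coarse to fine granularity."""
        parts = [
            BASE,
            "source", self.source,
            "dataset", self.dataset,
            "version", self.version,
            "conversion", self.layer,
        ]
        return "/".join(parts)

    def enhanced(self, n: int) -> "ConversionLayer":
        """Return enhancement layer n over the same source, dataset, and version."""
        if n < 1:
            raise ValueError("enhancement number must be >= 1")
        return replace(self, layer=f"e{n}")


# Placeholder identifiers, not real dataset names.
raw = ConversionLayer("agency-x", "dataset-1", "2010-Nov-10", "raw")
e1 = raw.enhanced(1)
print(raw.uri())
print(e1.uri())
```

Keeping the version in the name means a new snapshot never overwrites an old conversion, and an enhancement layer can be regenerated without touching the raw layer.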
Tech: Provenance in LOGD data
Diagram: stages Access, Convert, Enhance, Version, SemDiff, connected by provenance relations labeled derive, create, and revision
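The value of recording "derive" links is that any published artifact can be traced back to the original government file. The sketch below shows that idea with plain tuples. The artifact names and the way the relations are attached to stages are assumptions for illustration, since the slide only gives the labels.

```python
from typing import Dict, List, Tuple

# (product, relation, source): "product was <relation>d from source".
Edge = Tuple[str, str, str]


def trace_lineage(edges: List[Edge], start: str) -> List[str]:
    """Follow 'derive' links from `start` back to the earliest known artifact."""
    parent: Dict[str, str] = {
        product: source for product, relation, source in edges
        if relation == "derive"
    }
    chain = [start]
    seen = {start}
    while chain[-1] in parent:
        nxt = parent[chain[-1]]
        if nxt in seen:
            raise ValueError("cycle in provenance links")
        chain.append(nxt)
        seen.add(nxt)
    return chain


# Hypothetical artifact names standing in for the outputs of each stage.
edges: List[Edge] = [
    ("snapshot", "derive", "ogd-file"),    # Access: local copy of the source file
    ("logd-raw", "derive", "snapshot"),    # Convert: raw RDF from the snapshot
    ("logd-e1", "derive", "logd-raw"),     # Enhance: enhanced RDF from the raw layer
    ("logd-e1", "revision", "logd-e0"),    # non-derive links are ignored by the trace
]

lineage = trace_lineage(edges, "logd-e1")
print(" <- ".join(lineage))
```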
Consume LOGD data in Semantic Search
Diagram labels: Data-gov Semantic Search, Web Search Results, http://data-gov.tw.rpi.edu/, HTML, XHTML+RDFa, RDFa Annotation, RDF, ARC2
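To give a feel for what RDFa annotation makes possible, the sketch below pulls simple property statements out of annotated markup. It uses Python's standard `html.parser` as a stand-in and is not ARC2 or a full RDFa processor: it only reads the `about`, `property`, and `content` attributes, does not resolve prefixes, and does not track the scope of `about`. The sample markup and URIs are invented for illustration.

```python
from html.parser import HTMLParser
from typing import List, Optional, Tuple

Triple = Tuple[Optional[str], str, str]


class RDFaPropertyCollector(HTMLParser):
    """Collect (subject, property, value) from a small subset of RDFa."""

    def __init__(self) -> None:
        super().__init__()
        self.triples: List[Triple] = []
        self._subject: Optional[str] = None
        self._pending: Optional[str] = None  # property waiting for element text
        self._text: List[str] = []

    def handle_starttag(self, tag, attrs):
        attr = dict(attrs)
        if "about" in attr:
            self._subject = attr["about"]
        prop = attr.get("property")
        if prop is None:
            return
        if attr.get("content") is not None:
            # Value given explicitly in the `content` attribute.
            self.triples.append((self._subject, prop, attr["content"]))
        else:
            # Value is the element's text; collect it until the next end tag.
            self._pending = prop
            self._text = []

    def handle_data(self, data):
        if self._pending is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if self._pending is not None:
            value = "".join(self._text).strip()
            self.triples.append((self._subject, self._pending, value))
            self._pending = None


# Invented sample markup with placeholder URIs.
sample = (
    '<div about="http://example.org/dataset/demo">'
    '<span property="dc:title">Demo dataset</span>'
    '<meta property="dc:creator" content="Example Agency">'
    "</div>"
)

collector = RDFaPropertyCollector()
collector.feed(sample)
for triple in collector.triples:
    print(triple)
```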