210 likes | 316 Views
Tom De Groeve. Quality from Quantity Deriving actionable knowledge from social media streams. Tom De Groeve, Beate Stollberg Frank Ostermann , Laura Spinsanti European Commission, Joint Research Centre, Ispra , Italy. Forest Fires Rapidly changing situation
E N D
Quality from Quantity Deriving actionable knowledge from social media streams Tom De Groeve, Beate Stollberg Frank Ostermann, Laura Spinsanti European Commission, Joint Research Centre, Ispra, Italy
Forest Fires Rapidly changing situation Limited capacity of observation Localized public warnings
Earthquakes Impact unknown Limited capacity of observation Local authorities may be affected Specific needs-driven response
Information gaps Who? • Public What? • What is happening? • Should I flee? • What can I do?
Information gaps Who? • Public What? • What is happening? • Should I flee? • What can I do? Who? • International response What? • What really happened? • Should I deploy? • What are the needs?
Information flow Information flow
Information flow Information flow
Situation assessment What kind of earthquake is it? 1 day 20min Deployment? Earthquake Magnitude 7 Tsunami 2m Population 1 million OSOCC LEMA in charge USAR teams Control centre
Situation assessment What kind of earthquake is it? Killed? Collapsed? Black out? Earthquake Magnitude 7 Tsunami 2m Population 1 million OSOCC LEMA in charge USAR teams Control centre Schools? Airports? Injured? Hospital?
Data curation: sustainable? • Full external curation: retrieve, rate/score, and publish • Some external curation: Retrieve, validate, publish
Automated data curation Filter relevant messages Content No rumours Statistics on relevant terms Time critical First 6h
Elements of quality assessment Source Credibility Context Location
Elements of quality assessment Source Credibility Context Location
Elements of quality assessment Source Content Credibility Relevance Context Location
Elements of quality assessment Source Content Credibility Relevance Context Location
VGI Topicality Geo-location Spatio-temp-oral Context Integrated Quality Score Implementation aspects • Sum of weighted scores: QS(VGIj) = ∑Ni=1wisji • Topicality: keyword-based • Proximity: next concurrent reported hotspot • Land cover: Forest, no-Forest, Built-up • Population Density: Risk factor • Information clusters: Similar messages or lone signal?
Conclusions • From Quantity to Quality • Integrated Quality Score • Credibility • Relevance • Build answers for information gaps • Build answers for decision processes • Who takes responsibility? Who is liable?