*metrics from a Technical Point of View
Julius Stropel, Verbundzentrale des GBV (VZG)
Main motivation behind this work package: Explore which practical challenges occur when crawling for *metrics data on the internet.
*metrics In Transition Workshop – Göttingen – 27.03.2019
How do we get the information that a person interacted with a certain scientific work online?
• "a person": Who?
• "interacted": How?
• "a certain work": Which one?
So who do we ask?
Gathering Information about Scientific Impact on Social Media / Online Platforms
[Diagram; surviving label: "our database"]
Current state of data gathering
• Crawling for ~225k works
• From repositories "GoeScholar", "EconStor", "SSOAR", …
• By DOI, handle, landing page URL, metadata
• Some results:
  • 17k tweets, 1.87 million Mendeley readers, 6.5k Wikipedia citations, …
What were the challenges?
• Services' API restrictions
• Services' API malfunctions
• *metrics manipulation?
• Some works did not have a unique identifier
• Collecting data by some identifiers does not yield many results (DOI works best)
• Data protection
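One common way to cope with the first challenge, API restrictions such as rate limits, is to retry a request with exponentially growing pauses. The following is a minimal, generic sketch in Python. It is not taken from the project's crawler; `RateLimited`, `call_with_backoff`, and all parameter names are hypothetical and chosen only for illustration.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


class RateLimited(Exception):
    """Hypothetical signal that a service refused a request due to a rate limit."""


def call_with_backoff(
    request: Callable[[], T],
    max_attempts: int = 5,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call `request`, waiting base_delay * 2**attempt seconds after each refusal."""
    for attempt in range(max_attempts):
        try:
            return request()
        except RateLimited:
            # Give up after the last allowed attempt and let the caller decide.
            if attempt == max_attempts - 1:
                raise
            sleep(base_delay * 2 ** attempt)
    raise ValueError("max_attempts must be at least 1")
```

The `sleep` parameter exists only so the waiting can be replaced in tests. Real services differ in how they signal a limit (status codes, headers, silent truncation), so the mapping from a response to `RateLimited` would have to be written per service.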
How is *metrics different from commercial providers?
• Our data is free.
• Our data is fully accessible.
• Our software is open-source, hence the algorithms are public.
• We have no conflict of interest when it comes to honesty about limitations of data quality.
How is the data of the *metrics project shared?
• Software: https://github.com/gbv/metrics-crawler
• Data:
  • API: http://api.metrics.gbv.de/v1/work/doi?v={doi}
  • Data dumps (ask us)
  • Web interface: http://explore.metrics.gbv.de/
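To illustrate the API entry, here is a minimal Python sketch that fills the `{doi}` placeholder of the URL template from the slide and decodes the JSON reply. Several things are assumptions: that the DOI should be percent-encoded as a query value, that the endpoint is still reachable, and that the reply is UTF-8 JSON. The response fields are not described in these slides, so the sketch returns the decoded object unchanged. The function names and the DOI `10.1000/xyz123` are hypothetical placeholders.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen

# URL template as given on the slide.
API_TEMPLATE = "http://api.metrics.gbv.de/v1/work/doi?v={doi}"


def build_work_url(doi: str) -> str:
    """Fill the {doi} placeholder, percent-encoding the DOI (assumption)."""
    return API_TEMPLATE.format(doi=quote(doi.strip(), safe=""))


def fetch_work(doi: str, timeout: float = 10.0):
    """Request the record for one DOI and decode the JSON body.

    The structure of the reply is not documented here, so the decoded
    object is returned as-is for the caller to inspect.
    """
    with urlopen(build_work_url(doi), timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Hypothetical DOI, used only to show the resulting URL.
    print(build_work_url("10.1000/xyz123"))
```

Calling `fetch_work` needs network access; `build_work_url` can be checked offline.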
What do I need to use your service or data?
• … to use our web interface:
  • Internet access and some DOIs
• … to use our API:
  • Internet access, DOIs, knowledge of consuming JSON data from an HTTP request
• … to use our software:
  • Internet access, DOIs or local handles (max. 300k), a server with certain software such as Node.js, MySQL, chromedriver, …, someone who is capable of managing the server and configuring the software (should only take 1 or 2 days)
Thank you / Vielen Dank!
• Web: metrics-project.net
• Email: metrics-project@sub.uni-goettingen.de
• Twitter: @metrics_project
• Facebook: @metricsproject