1 / 7

DATA IN THE DIGITAL HUMANITIES

DATA IN THE DIGITAL HUMANITIES. Michael Pidd 26 th November 2014, ICOSS, University of Sheffield. NatCen Seminar Series on Methodological Challenges http://methodologicalchallenges.group.shef.ac.uk/. Data in the humanities is usually: Small (discrete sources created by individuals).

Antony
Download Presentation

DATA IN THE DIGITAL HUMANITIES

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. DATA IN THE DIGITAL HUMANITIES Michael Pidd 26th November 2014, ICOSS, University of Sheffield. NatCen Seminar Series on Methodological Challenges http://methodologicalchallenges.group.shef.ac.uk/

  2. Data in the humanities is usually: • Small (discrete sources created by individuals). • Broad (many different types of sources have to be assembled). • Complex (because humans are not spreadsheets). • Rarely ‘Big’. http://hridigital.shef.ac.uk @hridigital • The data lifecycle in a typical digital humanities • project: • Acquisition (e.g. digitisation) • Processing (adding value) • Analysis (and dissemination)

  3. Data acquisition: • Most of the evidence base is pre-digital. Very little is ‘born digital’. • Data acquisition is a question of translation, representation and interpretation. • The methods we use either enable or inhibit research. • But, the process also develops intimate knowledge of the evidence.

  4. British Library Newspapers Keyword search for “pidd” gives 2,730 results…

  5. Data processing: • Metadata can be complex, reflecting the complexity of the data. • Metadata can be very specialised, limiting re-use. • When processed at scale, computational methods are a trade-off between through-put and accuracy.

  6. Nominal record linkage using computational means to trace the lives of 90,000 people. • Record linkage across 45 separate datasets (some public, some commercial, all in different formats and with different data models). • And most people have common names. • http://www.digitalpanopticon.org

  7. Analysing data: Do data visualisations tell us anything that we do not already know? Data visualisation is only as good as the data. Data visualisation should reveal trends and anomalies, directing us to deeper readings of the evidence.

More Related