240 likes | 383 Views
Data visualization and digital humanities research: a survey of available data sets and tools . LITA National Forum 2011 St. Louis, MO Friday, September 30, 2011 Erik Mitchell, University of Maryland Susan Sharpless Smith, Wake Forest University. Motivation.
E N D
Data visualization and digital humanities research: a survey of available data sets and tools LITA National Forum 2011 St. Louis, MO Friday, September 30, 2011 Erik Mitchell, University of Maryland Susan Sharpless Smith, Wake Forest University
Motivation “Digital humanities needs gateway drugs. Kudos to the pushers on the Google Books team.” - Dan Cohen http://www.dancohen.org/2010/12/19/ “Linked open data could have the same leveraging effect that the World Wide Web had on computing, said Micki McGee, an assistant professor of sociology at Fordham University” -Steve Kolowich, The Promise of Digital Humanities, Inside HigherEd
Birth of a word “Imagine if you could record your life, everything you said, everything you did available in a perfect memory store at your finger tips. “ - Deb Roy – The Birth of a Word http://www.ted.com/
Overview • Discuss examples of data-focused research tools • Explore tools • Consider roles for librarians • Wrap-up/Q & A
Searching and Discovery Examples: BYU Corpuahttp://corpus.byu.edu/ WOK Citation Mapping WOK
Visualization Free Visualization Tools
Analysis and publishing NodeXLhttp://nodexl.codeplex.com/
Tool exploration • Discover / Search • What kinds of discovery tools exist and how common are the discovery features across different datasets / systems? • Visualization • What visualization features exist, are there products that are easy to use, are the skills transferable? • Analysis / Annotation • What analytical tools are included, what analysis techniques are common?
Perseus http://www.perseus.tufts.edu
JSTOR Data For Research http://dfr.jstor.org
Wordseer AditiMuralidharan Marti Hearst http://bebop.berkeley.edu/wordseer
Google’s Ngram Viewerbooks.google.com/ngramsculturomics.org But here's the rub. Google Books, as others point out, wasn't really built for research. . . That means Google Books didn't come with the interfaces scholars need for vast data manipulation . . . http://chronicle.com/article/The-Humanities-Go-Google/65713/
Ted talk on Google NGRAM viewer http://www.ted.com/talks/what_we_learned_from_5_million_books.html
Concordancing Eric Lease Morgan - http://dh.crc.nd.edu/sandbox/cyl/catalog/
Google’s public data explorer http://www.google.com/publicdata/
Data analysis - NodeXL http://nodexl.codeplex.com/ Analyzing Social Media Networks with NodeXL: Insights from a Connected World
Data cleaning – Google Refine http://code.google.com/p/google-refine
Data visualization – Google Fusion Tables http://google.com/fusiontables http://www.google.com/fusiontables/DataSource?dsrcid=332788
Research/teaching need • Researcher needs vary from advanced linguistic analysis and IT support to need for basic digital content/infrastructure Corpus-based research
Librarian contributions • Domain specific, tool-type specific comparisons • IT and research support – data analysis, data curation, tool/data sources identification • Shift from “reference” to “research” in sync with move from resource discovery to thematic analysis
Next steps • Build new skills, develop new systems • Create tutorials guides • Explore connections between data/curation and publishing and these tools – so is there a connection • Explore role of library discovery systems and consider new feature implementation.