From web archiving to web collecting. The development of the KB’s web archive Anna Rademakers, May 21st 2014. Introduction. Collection policy of the KB in general The history of web archiving in the KB From web archiving to web collecting. Mission statement.
From web archiving to web collecting • The development of the KB’s web archive • Anna Rademakers, May 21st 2014
Introduction • Collection policy of the KB in general • The history of web archiving in the KB • From web archiving to web collecting
Mission statement • “We bring people and information together” • We offer everyone everywhere access to everything published in and about the Netherlands. • We play a central role in the (scientific) information infrastructure of the Netherlands. • We promote permanent access to digital information both nationally and internationally.
Web archiving in the KB • Corresponds with our general collection policy • Archiving & making permanently accessible • Since 2007 • Ca. 6000 websites • Using the Wayback Machine • At the moment accessible on site in the KB for general users • Datasets for academic research (e.g. Webart)
Limitations • No full .nl domain harvest • Dutch websites also in the .com and .net domains • No Dutch deposit law • Opt-out system: • Notice sent to website owners; they can object to being archived • Part of Dutch law, so only applicable to Dutch websites
Selection by subject librarians (1) • Selection made by subject librarians: • Focus on library collection profile • Dutch heritage, culture, language & history • Special collections • Event harvesting (national & international (IIPC)) • E.g. 200 years Kingdom of the Netherlands, • the Netherlands in World War I, • the Olympics,…
Selection by subject librarians (2) • Selection made by subject librarians: • Focus on library collection profile • Dutch heritage, culture, language & history • Special collections • Event harvesting (national & international (IIPC)) • Websites on special topics • E.g. embassies, • Sinterklaas (Saint Nicholas), • Product and trade associations,…
Selection by subject librarians (3) • Selection made by subject librarians: • Focus on library collection profile • Dutch heritage, culture, language & history • Special collections • Event harvesting (national & international (IIPC)) • Websites on special topics • Cancelled websites • Frisian websites (cooperation with Tresoar) • How can we make our selection more representative and efficient?
Selection by relevance ranking • Alexa: 500 most used websites in the Netherlands • Only 160 of them are Dutch websites • Many marketing websites • Technical issues • Wikipedia: 12,000 websites being used as a reference in the Dutch Wikipedia • More objective • Diversity • Wikipedia community
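One way to picture the Wikipedia-based ranking is to count how often each website is cited across a list of reference URLs. The sketch below is only an illustration under stated assumptions: the function name `rank_referenced_sites` and the sample URLs are hypothetical, and nothing here reflects how the KB actually compiled or processed its list.

```python
from collections import Counter
from urllib.parse import urlsplit


def rank_referenced_sites(reference_urls, top_n=None):
    """Rank host names by how often they occur in a list of reference URLs.

    Hypothetical helper for illustration; not the KB's actual method.
    """
    counts = Counter()
    for url in reference_urls:
        host = urlsplit(url).hostname  # None when the string has no host part
        if not host:
            continue  # skip entries that are not usable URLs
        if host.startswith("www."):
            host = host[4:]  # treat www.example.nl and example.nl as one site
        counts[host] += 1
    # most_common(None) returns every host, most frequently cited first
    return counts.most_common(top_n)


# Hypothetical sample data standing in for references taken from Wikipedia
sample_refs = [
    "https://www.example.nl/artikel/1",
    "http://example.nl/artikel/2",
    "https://voorbeeld.example.com/pagina",
    "not a url",
]
ranking = rank_referenced_sites(sample_refs)
```

A ranked list like this could serve as a candidate list for subject librarians to review rather than as an automatic selection; a Dutch website can live under .com or .net as well as .nl, so the top-level domain alone would not decide whether a site is in scope.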
What not to select? • Technical: databases, webshops,… Publications vs. services • Collections archived by other Dutch institutions, e.g. • Websites of political parties: Archipol • Although we do archive governmental websites! • Websites about or from Rotterdam: Municipal archive • Although we do archive websites of national interest! • Controversial websites, e.g. • Right- or left-wing extremists • Pedophilia • Motorcycle gangs • KB as quality mark?
Closing remarks • Future: • Website with search options • Online if possible • Further cooperation with researchers • Further cooperation in terms of collection development • No full archiving (because we have no legal framework for that), but building a permanent and accessible representative collection of Dutch websites
Thank you • Anna Rademakers • anna.rademakers@kb.nl