220 likes | 355 Views
Collaborative Filtering: Possibilities for Digital Libraries. Jon Herlocker Janet Webster Seikyung Jung Oregon State University. Current search engines are insufficient. Two important search engine problems. They don’t understand: Quality Context. But First: Our Context.
E N D
Collaborative Filtering:Possibilities for Digital Libraries Jon Herlocker Janet Webster Seikyung Jung Oregon State University CNI 2003/Herlocker, Jung, and Webster
Current search engines are insufficient. CNI 2003/Herlocker, Jung, and Webster
Two important search engine problems • They don’t understand: • Quality • Context CNI 2003/Herlocker, Jung, and Webster
But First: Our Context • Why are we standing up here? • We think we can improve the digital library experience. CNI 2003/Herlocker, Jung, and Webster
Today’s Context • Research questions & hypotheses • Collaborative filtering • Our approach to CF in the Library • Challenges of collaborative filtering for library search • Initial lessons learned CNI 2003/Herlocker, Jung, and Webster
The Librarian’s Questions • As electronic information increases in amount and value, how to provide access to it? • How to change digital libraries from disconnected collections to integrated systems? • How to integrate the expertise of librarians into the development process? • How to adapt traditional library values to new opportunities? CNI 2003/Herlocker, Jung, and Webster
The Computer Scientist’s Questions • What is the next big leap in document search technology? • How to overcome the limitations of software’s ability to understand language? • How can we build a search engine that learns by observing searchers? CNI 2003/Herlocker, Jung, and Webster
Our Research Hypotheses • Enabling the entire community to participate in organizing and recommending information will add value to the digital library • In other words: Collaborative Filtering will increase the value of a digital library CNI 2003/Herlocker, Jung, and Webster
What is Collaborative Filtering? • Communities of people sharing their evaluations of content • Recommendations are transferred between people of like interest • Examples: • MovieLens.org • Epinions.com • Launchcast (launch.yahoo.com) • Amazon.com CNI 2003/Herlocker, Jung, and Webster
CF and Libraries • Search is central to user experience of digital library • Collaborative Filtering: • Could overcome the limitations of current search technology • CF already exists in libraries. • Not search, but cataloguing (OCLC) • Adapting CF for document searching is not trivial. • Information needs are dynamic. CNI 2003/Herlocker, Jung, and Webster
Our Approach • OSU Libraries Recommender System • Perform at CF at query level • Match similar queries in addition to similar users • Generate results based on past user recommendations • Infer recommendations from user behavior • Integrate with existing library systems and traditions CNI 2003/Herlocker, Jung, and Webster
The Benefits of CF • Quality is considered. • Recommendations are based on human evaluations. • Context is considered. • The system gets better as it’s used. • Doesn’t require significant, centralized human resources CNI 2003/Herlocker, Jung, and Webster
CS Challenges • How to collect evaluations? • How to identify the “useful” element of recommendations? • How to represent the information needs of searchers? • How to rank results? • How to design the interface? CNI 2003/Herlocker, Jung, and Webster
Library Challenges • How to balance privacy with personalization & involvement? • How to maintain authority of recommended information? • How to deal with timeliness of information? • How to integrate with existing library systems? • How to fund research in the library setting? CNI 2003/Herlocker, Jung, and Webster
What We’ve Learned • Weakness of “old” search technology affects perception of new • Wrapper technology minimizes IT commitment • Existing internal data can be used to jumpstart system • Controlled experiments show • Increased performance • Increased perception of non-tangibles CNI 2003/Herlocker, Jung, and Webster
CF and Digital Libraries • Helps handle more electronic information • Improve search results • Shapes direction of digital libraries • Supports collaboration on many levels • Nothing ventured, nothing gained. CNI 2003/Herlocker, Jung, and Webster
Funding • OSU Libraries Gray Chair for Innovative Technologies • National Partnership for Advanced Computing Infrastructure (NSF) • Georgia Pacific HMSC internship CNI 2003/Herlocker, Jung, and Webster
More information • Silence of the Sleeper • Malcom Gladwell, The New Yorker, October 4th, 1999 (gladwell.com) • System for Electronic Recommendation Filtering Prototype (SERF) for OSU Libraries • http://dl.nacse.org/osu CNI 2003/Herlocker, Jung, and Webster
Contacts • Janet Webster • Oregon State University Libraries, Hatfield Marine Science Center • janet.webster@oregonstate.edu • Jon Herlocker • Oregon State University, School of Electrical Engineering & Computer Science • herlock@eecs.oregonstate.edu CNI 2003/Herlocker, Jung, and Webster