1 / 34

Improving Information Retrieval on the Web

This research project aims to understand how people manage information for re-access and re-use on the web, and find solutions to the problems they encounter. The study includes observations of participants' work-related web tasks and efforts to re-find web information. The results so far show various methods used, including sending emails to oneself, saving web pages as files, bookmarking, and more. Participants also expressed the need for better organization and ease of access.

wmcsherry
Download Presentation

Improving Information Retrieval on the Web

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. How Do People Get Back to Information on the Web? How Can They Do It Better? William Jones, Harry Bruce The Information SchoolUniversity of Washington Susan Dumais Microsoft Research

  2. The Problem • Finding things is a well-studied problem. • Keeping things found is not so well-studied but arises in many domains: • Everyday objects in our lives • Personal files – paper and electronic • Email • The Web

  3. Related Work • Organizing personal files • Files & “piles”, (Malone, 1983) • Location memory is limited, (Jones & Dumais, 1986) • Preference for browsing, (Barreau & Nardi, 1995); but see Fertig, Freeman & Gelernter (1996) for a rebuttal. • Organizing email • Similar use patterns, similar problems as for personal files, (Whittaker & Sidner, 1996)

  4. Related Work (cont.) • Organizing the Web • Widespread use of “Bookmarks”, (Pitkow & Kehoe, 1996) • Steady increase in number with time, (Abrams Baecker & Chignell, 1998) • Increasing use of folders, (Abrams et al., 1998) • Frequent use of “Back” button within a session; infrequent use of “History”, (Tauscher & Greenberg, 1997)

  5. Overall Research Objectives We’re looking for answers to the following questions: • How do people manage information for re-access and re-use? How do people “keep found things found”? • What problems do people encounter? • What can be done to help?

  6. The Research Project • Study 1: Observe “keeping” activities as participants complete work-related, web-intensive tasks in their workplace. • 25 participants in all. • Study 2: Observe efforts to “re-find” web information for a subset of these same participants. • 13 participants in all • Survey a larger group. • Initiated. • Prototype selectively.

  7. Study 1: The Participants • 6 Researchers. • 10 Information professionals -- including librarians and corporate information specialists. • 9 Managers.

  8. Study 1: The Procedure • Prior to the observation • Participants completed an email questionnaire… • and listed at least three work-related, web-intensive “free-time” tasks. • One task was selected for the observation. • During the observation • Participants were observed in their own workplace. • Sessions lasted about an hour. • An “over-the-shoulder” video recording was made of participants as they “thought-aloud” while working on the task. • Participants handled office interruptions (phone calls, visitors, etc.) as they normally would.

  9. Study 1: The Results Many “keeping” methods were observed: • Send email to self. • Send email to others. • Print out the web page. • Save the web page as a file. • Paste URL into a document. • Add hyperlink to a web site. • Do nothing (and enter URL directly later, search for or access from another web site). • Bookmark the page. • Write down the URL on paper.

  10. Survey results (so far)

  11. A Researcher • MC is a part-time lecturer and researcher at the University of Washington. Her task for the observation was to locate web materials that might relate to a lecture she was preparing on the use of Microsoft PowerPoint. MC made frequent use of email. She mailed several URLs to herself – each in a separate email along with comments. On two occasions MC also emailed URLs to colleagues along with comments regarding potential relevance. MC maintained an elaborate organization of folders and subfolders in her email application (Microsoft Outlook) and expressed confidence that she could quickly locate an old email when needed. MC uses Favorites from time to time but declared that “it is a mess” because it hadn’t been organized recently.

  12. A Manager • DH is a third-level manager at Boeing. He travels frequently and is rarely in his office for an hour at a time. DH was interviewed over the telephone. DH rarely accesses the Web directly for workplace matters. When he does use the Web, the task is nearly always limited in time and scope. For example, he may occasionally use the Web to look up contact information for someone or to confirm a flight. DH depends heavily on email – from colleagues, his subordinates and other managers to whom he is responsible in one way or another.

  13. A Functional Analysis Several functions appear to influence the choice of method: • Reminding • Context • Number of access points – home, work, road… • Ease of maintenance • …

  14. A Functional Analysis (cont.)

  15. A Functional Analysis (cont.)

  16. A Functional Analysis (cont.)

  17. Other Notables • Participants seemed to distinguish between three categories. • Web sites used repeatedly – make it easy to access. • Web sites used infrequently but important to be able to access. • Web sites to check out later to see if useful.

  18. Other Notables (cont.) • Some participants went to great lengths to maintain a single hierarchy. • Print web pages to file with other papers. • Save email documents to filing system for e-docs. • Work with assistant to establish consistent organizations across paper documents, e-docs, email & favorites. • Keeping practices appear to vary with a person’s job and relationship to information.

  19. Study 2: Delayed Cued Recall • A second study looks at how/how well people are able to get back to web sites. • Session 1: Participant describes each in a set of web sites they have visited recently – without including name or URL. • Session 2, 3-6 months later: Participants are cued with these descriptions and told to get back to the site as best they can. We observe methods used and problems encountered.

  20. Study 2 Methods • Focus on “useful” web sites – sites that a participant is very likely to want to re-access over the next 12 months (75% or greater). • Sample for different frequencies of access in a typical week. • High – daily access in a typical week. • Medium – 1 to 3 times in a typical week. • Low – not accessed in a typical week. • Measure cue effectiveness. Does the participant recall the task? The site?

  21. Study 2 Results • Success rate is high – 90% or better. • Roughly 2/3 of the re-finding methods require no “keeping” forethought. • Direct entry of URL • Access web site via another web site (such as a “hub”) • Search again

  22. When Participant Recalls Task and Site

  23. When Participant Recalls Task Only

  24. Prototyping … as driven by the data. Simple extensions to Add Favorites to support the following options: • Add a comment. • Save Favorite to filing system. • Email Favorite.

  25. The Prototype

  26. Conclusions • People use a diversity of methods to organize web information for re-access and re-use. • A functional analysis can help us to understand the diversity of methods observed and their relative popularity.

  27. Conclusions (cont.) • Methods differ in the functions they provide. • No single current method provides all the functions a user may need. • The relative importance of functions (and hence the choice of methods) depends upon the task at hand.

  28. Conclusions (cont.) • A “natural” study of people doing tasks in their workplace can be very useful.

  29. How Can We Do It Better? • Better reminding. • Better integration. • Fewer organizational schemes. • Further improvements in “do nothing” methods. • In-line matching for suggested completions • Factor history into search results? “Stuff I’ve Seen” searching.

  30. For More Information • http://kftf.ischool.washington.edu/

  31. Supplemental slides

  32. A Functional Analysis (cont.) Additional functions: • Persistence of information • Preservation of information in its current state • Currency of information • Ease of integration • Communication and information sharing • Portability of information

  33. Use of keeping methods by participant group

  34. A Functional Analysis (cont.)

More Related