Evidence from Behavior

Evidence from Behavior LBSC 796/INFM 719R Douglas W. Oard Session 7, October 22, 2007

Picture of Relevance Feedback [Figure: scatter of documents marked x (non-relevant) and o (relevant), with the positions of the initial query and the revised query]

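Rocchio Formula
q_m = \alpha q_0 + \beta \frac{1}{|D_r|} \sum_{d_j \in D_r} d_j - \gamma \frac{1}{|D_{nr}|} \sum_{d_j \in D_{nr}} d_j
q_m = modified query vector; q_0 = original query vector; α, β, γ = weights (hand-chosen or set empirically); D_r = set of known relevant document vectors; D_nr = set of known non-relevant document vectors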

Rocchio Example
Typically, γ < β
Original query:        0 4 0 8 0 0    weighted (× 1.0):   0 4 0 8 0 0
Positive feedback (+): 2 4 8 0 0 2    weighted (× 0.5):   1 2 4 0 0 1
Negative feedback (-): 8 0 4 4 0 16   weighted (× 0.25):  2 0 1 1 0 4
New query:             -1 6 3 7 0 -3
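The arithmetic of the example can be checked with a minimal sketch. It assumes the standard form of the Rocchio update (weighted original query, plus the weighted centroid of relevant documents, minus the weighted centroid of non-relevant documents) and the weights 1.0, 0.5, and 0.25, which are inferred from the numbers on the slide. The function name `rocchio` is illustrative, not from the lecture.

```python
def rocchio(q0, relevant, nonrelevant, alpha=1.0, beta=0.5, gamma=0.25):
    """Return alpha*q0 + beta*centroid(relevant) - gamma*centroid(nonrelevant).

    The default weights are an assumption inferred from the slide's numbers.
    """
    def centroid(docs):
        # Mean vector of a set of documents; zero vector if the set is empty.
        if not docs:
            return [0.0] * len(q0)
        return [sum(column) / len(docs) for column in zip(*docs)]

    pos = centroid(relevant)
    neg = centroid(nonrelevant)
    return [alpha * q + beta * p - gamma * n for q, p, n in zip(q0, pos, neg)]


# Vectors taken from the slide: one relevant and one non-relevant document.
new_query = rocchio(
    q0=[0, 4, 0, 8, 0, 0],
    relevant=[[2, 4, 8, 0, 0, 2]],
    nonrelevant=[[8, 0, 4, 4, 0, 16]],
)
print(new_query)  # [-1.0, 6.0, 3.0, 7.0, 0.0, -3.0]
```

The sketch keeps negative term weights, as the slide's new query does; some systems set them to zero instead.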

Using Positive Information Source: Jon Herlocker, SIGIR 1999

Using Negative Information Source: Jon Herlocker, SIGIR 1999

Rating-Based Recommendation
• Use ratings to describe objects
  • Personal recommendations, peer review, …
• Beyond topicality:
  • Accuracy, coherence, depth, novelty, style, …
• Has been applied to many modalities
  • Books, Usenet news, movies, music, jokes, beer, …

Motivations to Provide Ratings
• Self-interest
  • Use the ratings to improve system’s user model
• Economic benefit
  • If a market for ratings is created
• Altruism

Explicit Feedback: Assumptions
• A1: User has sufficient knowledge for a reasonable initial query
• A2: Selected examples are representative
• A3: The user will try giving feedback
• A4: The user will keep giving feedback

A1: Good Initial Query?
• Two problems:
  • User may not have sufficient initial knowledge
  • Few or no relevant documents may be retrieved
• Examples:
  • Misspellings (Brittany Speers)
  • Cross-language information retrieval
  • Vocabulary mismatch (e.g., cosmonaut/astronaut)

A2: Representative Examples?
• There may be several clusters of relevant documents
• Examples:
  • Burma/Myanmar
  • Contradictory government policies
  • Opinions

A3: Will People Use It?
• Efficiency
  • Longer queries require more processing time
• Understandability
  • Harder to see why subsequent documents are retrieved
• Risk
  • Users are reluctant to provide negative feedback

A4: Self-Interest Decreases! [Figure: value of ratings plotted against number of ratings (few to lots); curves labeled marginal value to community, marginal cost, and marginal value to rater]

Solving the Cost vs. Value Problem
• Maximize the value
  • Provide for continuous user model adaptation
• Minimize the costs
  • Use implicit feedback rather than explicit ratings
  • Minimize privacy concerns through encryption
  • Build an efficient scalable architecture
  • Limit the scope to noncompetitive activities

Solution: Reduce the Marginal Cost [Figure: same axes, number of ratings (few to lots); curves labeled marginal value to community, marginal cost, and marginal value to rater]

[Table built up over three slides; cell placement is not recoverable from the transcript. Axes: Behavior Category (Examine, Retain, Reference, Annotate, Create) and Minimum Scope (Segment, Object, Class). Cell entries: View, Select, Listen, Print, Bookmark, Save, Purchase, Subscribe, Delete, Copy / paste, Forward, Quote, Reply, Link, Cite, Mark up, Tag, Organize, Publish, Type, Edit]

Estimating Authority from Links [Figure: link diagram with nodes labeled Hub, Authority, Authority]
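One well-known way to estimate hub and authority scores from links is the iterative hubs-and-authorities (HITS) computation. The slide does not name an algorithm, so treating it as HITS is an assumption, and the page names in the toy graph below are hypothetical. A minimal sketch:

```python
import math


def hits(links, iterations=20):
    """Iteratively estimate hub and authority scores.

    links maps each page to the list of pages it links to.
    """
    nodes = set(links) | {t for targets in links.values() for t in targets}
    hub = {n: 1.0 for n in nodes}
    auth = {n: 1.0 for n in nodes}

    def normalize(scores):
        # Scale scores to unit Euclidean length so they stay bounded.
        norm = math.sqrt(sum(v * v for v in scores.values())) or 1.0
        return {n: v / norm for n, v in scores.items()}

    for _ in range(iterations):
        # A page's authority is the sum of the hub scores of pages linking to it.
        auth = {n: 0.0 for n in nodes}
        for source, targets in links.items():
            for target in targets:
                auth[target] += hub[source]
        auth = normalize(auth)
        # A page's hub score is the sum of the authority scores it links to.
        hub = normalize(
            {n: sum(auth[t] for t in links.get(n, ())) for n in nodes}
        )
    return hub, auth


# Hypothetical toy graph: one page links to two sites, another to only one.
toy_links = {"hub_page": ["site_a", "site_b"], "other_page": ["site_a"]}
hub, auth = hits(toy_links)
```

In this toy graph `site_a` receives the higher authority score because two pages link to it, and `hub_page` receives the higher hub score because it links to both sites.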

Anchor Text Indexing [Figure: three pages link to www.ibm.com, with link text "Armonk, NY-based computer giant IBM announced today", "Big Blue today announced record profits for the quarter", and "Joe’s computer hardware links: Compaq, HP, IBM"]
• Sometimes yields unexpected results:
  • Google ranked Microsoft #1 for “evil empire”
Adapted from: Manning and Raghavan
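The idea of anchor text indexing can be sketched as follows: terms that appear in the text of a link are indexed under the page the link points to, so a page can be found by words that other authors use to describe it. The function name, the tokenization, and the choice of which words count as anchor text are assumptions for illustration; only the example phrases and the target URL come from the slide.

```python
from collections import defaultdict


def index_anchor_text(links):
    """Build term -> {target_url: count} from (anchor_text, target_url) pairs."""
    index = defaultdict(lambda: defaultdict(int))
    for anchor_text, target_url in links:
        for token in anchor_text.lower().split():
            term = token.strip(",.")  # crude punctuation stripping
            if term:
                index[term][target_url] += 1
    return index


# Assumed anchor spans, based on the phrases shown on the slide.
links = [
    ("Armonk, NY-based computer giant IBM", "www.ibm.com"),
    ("Big Blue", "www.ibm.com"),
    ("IBM", "www.ibm.com"),
]
index = index_anchor_text(links)
print(dict(index["blue"]))  # {'www.ibm.com': 1}
```

A query for "big blue" now matches www.ibm.com even if the page itself never uses that phrase, which is also how a phrase such as “evil empire” can come to rank a page whose own text does not contain it.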

Collecting Click Streams
• Browsing histories are easily captured
  • Make all links initially point to a central site
  • Encode the desired URL as a parameter
  • Build a time-annotated transition graph for each user
  • Cookies identify users (when they use the same machine)
  • Redirect the browser to the desired page
• Reading time is correlated with interest
  • Can be used to build individual profiles
  • Used to target advertising by doubleclick.com
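The redirect-based capture described above can be sketched in a few lines. Everything named here is hypothetical: the tracker address `tracker.example`, the `url` parameter name, the cookie value used as a user ID, and the function names. The sketch estimates reading time as the gap between consecutive clicks, which is only a rough proxy.

```python
import time
from urllib.parse import parse_qs, urlencode, urlparse

REDIRECT_BASE = "http://tracker.example/r"  # hypothetical central site


def rewrite_link(target_url):
    """Point a link at the central site, with the desired URL as a parameter."""
    return REDIRECT_BASE + "?" + urlencode({"url": target_url})


def handle_click(request_url, user_id, log, now=None):
    """Record a time-stamped visit for the user; return the URL to redirect to."""
    target = parse_qs(urlparse(request_url).query)["url"][0]
    timestamp = time.time() if now is None else now
    log.setdefault(user_id, []).append((timestamp, target))
    return target


def reading_times(visits):
    """Seconds between consecutive clicks, attributed to the earlier page."""
    return [
        (page, t_next - t)
        for (t, page), (t_next, _) in zip(visits, visits[1:])
    ]


log = {}
handle_click(rewrite_link("http://example.org/a?x=1"), "cookie-123", log, now=0.0)
handle_click(rewrite_link("http://example.org/b"), "cookie-123", log, now=58.0)
print(reading_times(log["cookie-123"]))  # [('http://example.org/a?x=1', 58.0)]
```

The last page in a session has no following click, so this approach cannot estimate its reading time.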

Full Text Articles (Telecommunications) [Figure: bar chart of reading time in seconds (axis 0 to 180) by rating (No Interest, Low Interest, Moderate Interest, High Interest); bar labels 58, 50, 43, 32]

More Complete Observations
• User selects an article
  • Interpretation: Summary was interesting
• User quickly prints the article
  • Interpretation: They want to read it
• User selects a second article
  • Interpretation: Another interesting summary
• User scrolls around in the article
  • Interpretation: Parts with high dwell time and/or repeated revisits are interesting
• User stops scrolling for an extended period
  • Interpretation: User was interrupted

Abstracts (Pharmaceuticals) [Figure: bar chart by rating (No Interest, Low Interest, Moderate Interest, High Interest); bar labels 55, 42, 51, 52]

Recommending with Implicit Feedback [Figure: two block diagrams. First diagram labels: Estimate Rating, User Ratings, User Model, Predicted Ratings, User Observations, Community Ratings, User Ratings, Ratings Server. Second diagram labels: Predicted Observations, User Model, Estimate Ratings, Predicted Ratings, User Observations, Community Observations, Observations Server]

Critical Issues
• Protecting privacy
  • What absolute assurances can we provide?
  • How can we make remaining risks understood?
• Scalable rating servers
  • Is a fully distributed architecture practical?
• Non-cooperative users
  • How can the effect of spamming be limited?

Gaining Access to Observations
• Observe public behavior
  • Hypertext linking, publication, citing, …
• Policy protection
  • EU: Privacy laws
  • US: Privacy policies + FTC enforcement
• Statistical assurance of privacy
  • Distributed architecture
  • Model and mitigate privacy risks

Search Engine Query Logs [Figure labels: A: Southeast Asia (Dec 27, 2004); B: Indonesia (Mar 29, 2005); C: Pakistan (Oct 10, 2005); D: Hawaii (Oct 16, 2006); E: Indonesia (Aug 8, 2007); F: Peru (Aug 16, 2007)]

Search Engine Query Logs http://hannu.biz/aolsearch/

AOL User 4417749
