710 likes | 861 Views
The Wisdom of Crowds in the Aggregation of Rankings. Mark Steyvers Department of Cognitive Sciences University of California, Irvine. Joint work with: Michael Lee, Brent Miller, Pernille Hemmer. Rank aggregation problem.
E N D
The Wisdom of Crowds in the Aggregation of Rankings Mark Steyvers Department of Cognitive Sciences University of California, Irvine Joint work with: Michael Lee, Brent Miller, Pernille Hemmer
Rank aggregation problem • Goal is to combine many different rank orderings on the same set of items in order to obtain a “better” ordering • Example applications • Combining voters rankings: social choice theory • Information retrieval and meta-search* • *e.g. Lebanon & Mao (2008); Klementiev, Roth et al. (2008; 2009), Dworket al. (2001)
Example ranking problem in our research What is the correct chronological order? Abraham Lincoln Ulysses S. Grant time Ulysses S. Grant Rutherford B. Hayes Rutherford B. Hayes James Garfield Abraham Lincoln Andrew Johnson James Garfield Andrew Johnson
Aggregating ranking data ground truth group answer ? A B C D = A B C D Aggregation Algorithm A D B C D A B C B A D C A C B D A B D C
Generative Approach latent truth ? ? ? ? Generative Model A D B C D A B C B A D C A C B D A B D C
Wisdom of crowds phenomenon • Aggregating over individuals often leads to an estimate that is among the best individual estimates (or sometimes better) Galtons Ox (1907): Median of individual weight estimates came close to true answer
Approach • No communication between individuals • There is always a true answer (ground truth) • ground truth only used in evaluation • Unsupervised weighting of individuals* • exploit relationship between expertise and consensus • experts tend to be closer to the truth and therefore reach more similar judgments • Incorporate prior knowledge about latent truth • discount a priori bad rankings • * Klementiev, Roth et al. (2008, 2009); Dani, Madani, Pennock et al. (2006). Bayesian truth serum (Prelec et al., 2004); Cultural Consensus Theory (Batchelder and Romney, 1986)
Overview of talk • General knowledge tasks • reconstructing order of US presidents • Thurstonian models • Sports prediction • forecasting NBA and NCAA outcomes • Thurstonian models • Episodic memory • reconstructing order of personally experienced events • Mallows model
Measuring performance Kendall’s Tau: The number of adjacent pair-wise swaps = 1 = 1+1 = 2 Ordering by Individual A B E C D A B E CD E C D A B C D E A B True Order A B C D E
Empirical Results (random guessing) t
Unsupervised models for ranking data • Classic models: • Thurstone (1927) • Mallows (1957); Fligner and Verducci, 1986 • Diaconis (1989) • Voting methods: e.g. Borda count (1770) • We will focus on Thurstonian and Mallows models • implemented as graphical models • MCMC inference Many models were developed for preference rankings and voting situations no known ground truth
Thurstonian Model A. George Washington B. James Madison C. Andrew Jackson Each item has a true coordinate on some dimension
Thurstonian Model A. George Washington B. James Madison C. Andrew Jackson … but there is noise because of encoding errors
Thurstonian Model A. George Washington B. James Madison C. Andrew Jackson A B C Each persons mental encoding is based on a single sample from each distribution
Thurstonian Model A. George Washington B. James Madison C. Andrew Jackson A A < C < B B C The observed ordering is based on the ordering of the samples
Thurstonian Model A. George Washington B. James Madison C. Andrew Jackson A A < B < C B C The observed ordering is based on the ordering of the samples
Thurstonian Model A. George Washington B. James Madison C. Andrew Jackson Important assumption: across individuals, variance can vary but not the means
Graphical Model of Extended Thurstonian Model Latent truth Expertise of individual Mental samples Observed ordering j individuals
Inferred Distributions for 44 US Presidents George Washington (1) John Adams (2) Thomas Jefferson (3) James Madison (4) James Monroe (6) John Quincy Adams (5) Andrew Jackson (7) Martin Van Buren (8) William Henry Harrison (21) John Tyler (10) James Knox Polk (18) Zachary Taylor (16) Millard Fillmore (11) Franklin Pierce (19) James Buchanan (13) Abraham Lincoln (9) Andrew Johnson (12) Ulysses S. Grant (17) Rutherford B. Hayes (20) James Garfield (22) Chester Arthur (15) Grover Cleveland 1 (23) Benjamin Harrison (14) Grover Cleveland 2 (25) William McKinley (24) Theodore Roosevelt (29) William Howard Taft (27) Woodrow Wilson (30) Warren Harding (26) Calvin Coolidge (28) Herbert Hoover (31) Franklin D. Roosevelt (32) Harry S. Truman (33) Dwight Eisenhower (34) John F. Kennedy (37) Lyndon B. Johnson (36) Richard Nixon (39) Gerald Ford (35) James Carter (38) Ronald Reagan (40) George H.W. Bush (41) William Clinton (42) George W. Bush (43) Barack Obama (44) error bars = median and minimum sigma
Calibration of individuals t individual t distance to ground truth s inferred noise level for each individual
Heuristic Models • Many heuristic methods from voting theory • E.g., Borda count method • Suppose we have 10 items • assign a count of 10 to first item, 9 for second item, etc • add counts over individuals • order items by the Borda count • i.e., rank by average rank across people
Model Comparison t Borda
Other ordering tasks Ten Commandments Ten Amendments
Overview of talk • General knowledge tasks • reconstructing order of US presidents • Sports prediction • forecasting NBA and NCAA outcomes • Episodic memory • reconstructing order of personally experienced events • New directions
Human forecasting experiment • Forecast end-of-season rankings for 15 NBA teams • Eastern conference • Western conference • Participants were college undergraduates • heterogeneous population regarding basketball expertise • 172 individuals for Eastern conference • 156 individuals for Western conference • Experiment conducted Feb 2010 • teams have played about a bit over half of games in regular season
Model predictions for Eastern conference Borda Boston Cleveland Orlando Miami Detroit Chicago Philadelphia Atlanta New York New Jersey Indiana Washington Toronto Charlotte Milwaukee Actual outcome Cleveland Orlando Atlanta Boston Miami Milwaukee Charlotte Chicago Toronto Indiana New York Detroit Philadelphia Washington New Jersey Thurstonian Model Cleveland Boston Orlando Miami Atlanta Chicago Detroit Charlotte Toronto Philadelphia Washington Indiana New York Milwaukee New Jersey
East 73% t 93% West t 87% 94%
Calibration Results East West t t s s
Heuristics: who will win more games? Chicago Bulls Charlotte Bobcats vs Won 0 championships Team in existence for 6 years Won 6 championships Team in existence for 44 years Related to work on “fast and frugal heuristics” by Gigerenzer et al.
Heuristic ranking by #championships won Actual outcome Cleveland Orlando Atlanta Boston Miami Milwaukee Charlotte Chicago Toronto Indiana New York Detroit Philadelphia Washington New Jersey #championships Boston Chicago Philadelphia Detroit Indiana New York New Jersey Atlanta Washington Milwaukee Miami Orlando Cleveland Toronto Charlotte
Informative Priors on Expertise • Individuals who closely follow heuristic orderings are probably not experts • Set hyperparameters of variance prior based on distance to heuristic ordering prior for individual who closely follows heuristic ordering s
Graphical Model j individuals
East 73% 93% t 96% West t 87% 94% 96%
Forecasting NCAA tournament (March Madness) • 64 US college basketball teams are placed in a set of four seeded brackets, and play an elimination tournament. • Midwest bracket:
Data • Predictions from 16,718 Yahoo users • Each individual predicts the winner of all games • We use the predictions for the first four rounds (60 games total) • Two scoring systems • Number of correct predictions • Points: • 1 point per correct winner in 1st round • 2 points in 2nd • 4 points in 3rd • 8 points in 4rd
Data and Results of Heuristic Strategies majority rule 73% priorseeding 61% Obama 83% #correct predictions majority rule 71% priorseeding 66% Obama 47% points individuals
Thurstonian Model Team A Team B Team C • Each team has a mean on a single “strength” dimension • Each person has single variance
Thurstonian Model Team A Team B Team C A B wins over A B • The probability a person will choose team A over team B is the probability their strength for team A will be sampled above team B
Thurstonian Model Team A Team B Team C B C wins over B C • The probability a person will choose team A over team B is the probability their strength for team A will be sampled above team B
Modeling Results Thurstonian modelinform. priors 90% Thurst model 83% majority rule 73% priorseeding 61% #correct predictions Thurst. modelinform. priors 81% Thurst. model 78% majority rule 71% priorseeding 66% points individuals
Overview of talk • General knowledge tasks • reconstructing order of US presidents • Sports prediction • forecasting NBA and NCAA outcomes • Episodic memory • reconstructing order of personally experienced events
Recollecting Order from Episodic Memory Study this sequence of images
How good is your memory? Place the images in the correct sequence (by reading order) B C D A F H E G J I
Problem • What if we have only a small number of individuals? • How can we guard against individuals with poor memory? • Idea: “smooth” the inferred group ordering with a prior
Approach • Empirically measure the prior orderings over events • Experiment: a separate group of individuals orders the images without seeing original video • Use this data to construct a prior on the group ordering
Mallows Model Kendall tau distance ω latent truth θj expertise for person j observed ranking for person j yj (memory data)
Mallows Model with an informative prior on the latent truth ωo prior on orderings θ* ω latent truth θoj θj expertise for person j observed ranking for person j yoj yj (prior knowledge data) (memory data)
Results when picking K worst “witnesses” uniform prior informative prior t Number of “witnesses” (K)