140 likes | 157 Views
Predictive Analytics World. Predictive Keyword Scores to Optimize PPC Campaigns Vincent Granville, Ph.D. Click Forensics February 19, 2009. Problem. Advertisers bidding on keywords on search engines (PPC programs offered by Google, Yahoo, etc.)
E N D
Predictive Analytics World Predictive Keyword Scores to Optimize PPC Campaigns Vincent Granville, Ph.D. Click Forensics February 19, 2009 CONFIDENTIAL 1
Problem • Advertisers bidding on keywords on search engines (PPC programs offered by Google, Yahoo, etc.) • Bidding strategy should achieve some goals, typically • Profit optimization • ROA (revenue on ad spend) optimization • Minimization of cost of user acquisition • Maximization of user lifetime value
Short Term Goals • Short term ROA can be negative • Paid + organic search usually provides positive ROI • Organic search used as a leverage to buy traffic and increase reach • KPI’s: • Clicks per keyword • Conversions per keyword • Revenue, profit or return • Conversion rate
Issues • Keywords with few clicks (“long tail”): difficult to predict • Attaching a conversion to a click: data quality (cookies) • Revenue numbers not known until tomorrow • New bid => Google needs to “learn” how to handle it • Real time implementation of keyword bidding subject to high volatility • Focus on end-of-day or bi-weekly algorithm • Pitfall 1: if max bid is much higher than actual CPC => Google will eventually notice! • Pitfall 2: keyword performance can be impacted by “poor” keywords in same ad group, or by impression fraud / click fraud spikes • Match type
Keyword Scoring • Same as click scoring / credit card transaction scoring • Scores computed at the keyword / ad group level • Response: RPC, Conversion rate, etc. • Independent variables: binary rules • Actually, there are highly auto-corraleted • Model • Logistic regression (ridge or constrained regression) • Naïve Bayes (related to logistic regression via the odds ratio) • Decision trees or combo • Score is predictor of RPC, return or conversion rate, etc. • Conversion blending
Bidding Strategy • Goals • Be able to predict response for keywords with very little historical data • Be able to predict response for new keywords • Conversion rate = f(score) • New bid = g(previous bid, keyword score, ROI, RPC, …) • Methodology • Permanent multivariate (A/B/C) testing • g is a parametric function • A/B/C: each case corresponds to a particular parameter set • Moves in parameter space driven by a simulated annealing algorithm
Examples of Rules • Text mining rules used in the keyword scoring engine • Length of keyword • Number of terms • Keyword contains “free”, “new”, “2009” • Keyword contains digits • Keyword contain top 1-term word with known response • Keyword contains 2-term word with known response • Example • Keyword “used car Honda 2000” contains the 2-term word “used car” • All keywords containing “used car” have on average a 5% conversion rate
Issues • Keyword cleaning • Each keyword contains multiple 1-term and 2-term words • Scalability • Most keywords contain at least one top term • With 50MM keywords and 25K top terms, 95% of the keywords contain one top term (at least) • Response is not known for most of the 50MM keywords (too granular), but it is known for each of the 25K top terms (aggregate level) • Works with new keywords
Results: Interpretation • Keyword score is a good predictor of conversion rate • Bids are too high on good keywords, too low on poor keywords • Simple corrective action suggested • A/B/C parametric bidding strategy not discussed here • Cross validation: see next slide
Results: Cross-Validation • Process 15 days worth of data using score lookup tables based on training set • No time period overlap, between training and test • Keyword overlap • Large volume of new keywords (“new” means a KW not found in training set) • Robustness against missing data / new keywords • Predictive power somewhat reduced, but still good