360 likes | 518 Views
Alternatives to Credit Scoring in Insurance. James Guszcza, FCAS, MAAA Cheng-Sheng Peter Wu, FCAS, ASA, MAAA CAS 2004 Ratemaking Seminar Philadelphia March 12-13, 2004. Agenda. Introduction The credit scoring revolution From credit scoring to predictive modeling
E N D
Alternatives to Credit Scoring in Insurance James Guszcza, FCAS, MAAA Cheng-Sheng Peter Wu, FCAS, ASA, MAAA CAS 2004 Ratemaking Seminar Philadelphia March 12-13, 2004
Agenda • Introduction • The credit scoring revolution • From credit scoring to predictive modeling • The big idea: credit scoring is just one kind of insurance predictive (“data mining”) model… many other predictive models can be built • Conclusions • What it means to actuaries
Introduction – Our Efforts • Is credit for real? • “Does Credit Score Really Explain Insurance Losses? Multivariate Analysis from a Data Mining Point of View” • Going beyond credit • “Mining the Most from Credit and Non-Credit Data” • How do credit scoring models work? • “A View Inside the “Black Box”: Review and Analysis of Personal Lines Insurance Credit Scoring Models Filed in the State of VA”
Why? Multiple Choice • Progressive provided foosball tables and free snacks to their trendy, 20-something workforce • Progressive built a compound Gamma-Poisson GLM model to design their class plan • Progressive pioneered the use of credit in pricing/underwriting
Personal Lines Pricing and Class Plans – History • Few rating factors before World War II • Explosion of class plan factors after the War • Auto class plans: • Territory, driver, vehicle, coverage, loss and violation, others, tiers/company… • Homeowners class plans: • Territory, construction class, protection class, coverage, prior loss, others, tiers/company... • Credit scoring introduced in late 80s and early 90s
Personal Lines Credit Scoring – History • First important factor identified over the past 2 decades • Composite multivariate score vs. raw credit information • Introduced in late 80s and early 90s • Viewed at first as a “secret weapon” • Quiet, confidential, controversial, black box, …etc “Early believers and users have gained significant competitive advantage!”
The Current Environment • Now everyone is using it: • Marketing and direct solicitation • New business and renewal business pricing and underwriting • How to stay competitive if everyone is using it? • Regulatory constraints: • Many states have conducted studies on the true correlation with loss ratio and potential discrimination issues - WA study, TX study, MO study • Many states have/are considering restricting the use of credit scores or certain types of credit information • More states want the “black box” filed and opened
Some Facts About Credit Scores • A composite score that usually contains 10 to 40 pieces of credit information • Payment pattern information, bankruptcies/liens, collections, inquiries, bad debt/defaults… • Loss ratio lift is significant – a powerful class plan factor or rate tiering factor • Benefits/ROI are measurable • Lift curve can be translated into bottom-line benefit • Blind test and independent validation can be done to verify the benefit
120 90 82 78 74 70 66 62 58 50 Loss Ratio Lift Curve Loss Ratio Credit Score Decile
From Credit Scores to Predictive Models • What is a predictive model? • A multivariate scoring formula (linear or non-linear) that combines the values of several predictivevariables to estimate the value of a target variable • What is a credit score? • A multivariate scoring formula (linear or non-linear) that combines the values of several credit variables to estimate the value of a target variable
From Credit Scores to Predictive Models • A credit score is just one example of an insurance predictive model • A credit scoring project is a first approximation to a full insurance data mining project. • The same methods used to build credit scores are used in data mining to build insurance predictive models. • The primary difference is in the predictive information used.
From Credit Scores to Predictive Models • Credit scores are PMs that use only credit-related variables to predict relative profitability. • Payment pattern information, bankruptcies/liens, collections, inquiries, bad debt/defaults… • But PMs can also be built using • Both credit and non-credit information (preferred) • Only non-credit information (perfectly feasible)
Non-Credit PMs • Why would we want to build a purely non-credit PM? • Competitive advantages – e.g. matrix with credit • State-specific regulatory constraints • Expense of ordering credit reports • Thin files/no-hits • Public relations • But from a purely actuarial POV, credit is predictive should be used as part of the PM!
PMs: Considerations • The key is to use as much information as possible • in a multivariate way • Choice of statistical techniques is important, but the real key is the quality and breadth of predictive variables used. • GIGO • Actuarial/insurance knowledge is critical • Untapped riches reside in many companies’ transactional records.
PMs: Data Sources • We classify possible data sources into two groups • Internal data sources: predictive information gleaned from the company’s own systems • Regardless of how or whether it is currently used • External data sources: predictive information available from 3rd parties. • Both credit and non-credit
Internal Data Sources • Policy information • Limits, Deductibles, Measure of exposure (# cars, #houses, #employees, $sales, premium size… • Line-Specific information • Driver, Vehicle, Business Class … • Policyholder information • Age, gender, marital status …
Internal Data Sources • Customer-level information • Transactional data • Coverage, premium and loss transactions • Billing information • Correlation with credit • Agent information A little creativity in using these data sources will go a long way!
External Data Sources • Credit • Predictive both for commercial and personal lines • MVR – CLUE • Zipcode/geographic information • Rating territory • Many different sources available • The sky is the limit but • Consider cost, hit rate, implementation, …etc
Types of Variables Generated • Territory-level • Demographic, weather, crime, ...etc • Policy / policyholder-specific • Many traditional rating variables fall into this category • Behavioral • Less traditional – fits more neatly into data mining paradigm than classification ratemaking • Credit, billing, prior claims, cancel-reinstatements…
How Many Variables? • It is possible to generate literally hundreds of predictive variables • Some will be redundant • Some will not be very predictive • Some will be somewhat predictive • Some will be “killer” • A good model can contain as few as 15-20 or as many as 60-70 variables • Usually no single “ideal” model
Which Variables to Use? • Choosing is a major part of the data mining process • Use variety of exploratory statistical techniques • Use prior modeling experience / actuarial knowledge • Several considerations • Actuarial / underwriting knowledge • Client’s business needs • Legal / regulatory considerations • Data availability / cost • Systems implementation considerations
In Our Experience…. Do non-credit PMs work? • YES: non-credit predictive models are • Valuable alternative to credit scores • Flexible • Tailored to individual companies • Leverage company’s untapped internal data • Comparable predictive power to credit scores • And mixed credit / non-credit PMs can be even stronger
…But It’s Not a Walk Through the Park Challenges for PMs: • IT resources constraints • Project management • Business process buy-in • Success of system and business implementation • Training and organizational change
Industry Trends • How do companies try to stay competitive regarding the use of credit? • How do companies prepare for increasing regulatory constraints? • Industry trends • Companies are developing modeling capabilities and pursuing various applications • Companies are developing proprietary credit scoring models rather than buying “off-the-shelf” credit scores. • Companies are also going beyond credit, to build scoring models that don’t rely solely on credit
Keys to Building PMs • Fully utilize all sources of information • Leverage company’s internal data sources • Enriched with other external data sources • Use large amount of data • Employ systematic analytical process • Use state-of-the-art modeling tools • Apply multivariate methodology • Disciplined project management
Implications for Actuarial and Ratemaking Practice • Opportunities for out-of-the-box thinking (who thought of credit a decade ago?) • Increased multivariate analytic projects in the future • On-going search for new predictive data sources, new modeling techniques, and new applications • LTV, fraud, cross sell, retention, ..etc. • Next generation of pricing – more segmentation • A price for every risk • New methodologies • Statistical computing • Lift curve concept • Blind test / model validation methodology • ROI benefit calculation • …etc
Implications for Ratemaking Principles • Actuarial Ratemaking Principle #1: “A rate is an estimate of the expected value of future costs” • Actuarial Ratemaking Principle #4: ” A rate is reasonable and • Not excessive • Not inadequate • Not unfairly discriminatory • But is that really the way profit-seeking companies price their products? • Are rates ultimately based on costs or on what the market will bear?
Implications for Ratemaking Principles • How do you measure the ROI of a traditional ratemaking/class plan exercise? • Why do ratemaking principles not mention blind tests of pricing algorithms? • “Unfairly discriminatory”: • If we develop a powerful new segmentation model, is it discriminatory to certain risks? • If we don’t introduce it, is it discriminatory to other risks? • How do we know if we don’t do the analysis?