STATISTISTICAL METHODS FOR RANKING

STATISTISTICAL METHODS FOR RANKING Sture Holm FMS Jubileumskonferens på Utö 25 oktober 2012

A COMMON TYPE OF PRESENTATIONComparisonbetweencare centers

Here about 200 care centers • 40 000 involved patients, 56 percent answers • In central part often 1 unit differences • Weighting of 6 ”items” med 50-steps (from 0 to 600 ”points” for the individuals) • NO MEASURES OF DISPERSION GIVEN !! • ”Manufactured” by consultants ”Indikator”

A typical exempleSuccess rates in units

Howcanwe make correctstatisticalstatements on the ranks ? • As I see it there are two steps: • A suitablestatisticalmethod for pairwisecomparisonbetweencases. This is standard technique i statistics. • To useresults in this first step for making a proper statisticalstatement on the rank of a unit with a confidence interval.

Step 1. Comparison within pairs • Different problems requires different basic methods, for example: • Wilcoxon test methods • Binomial comparison • Normal approximation • OBSERVE: MOSTLY THERE ARE DIFFERENT SAMPLE SIZES IN THE UNITS

Step 2: • Principle: To use one-sided p values for the pairs in order to get the number of pairs, that can be considered ”significant” on each side. (within the half over all risk). • But how ? • To use mutiple test methods • Will give choosen confidence degree

Simplest: Holm-Bonferroni • Bonferroni: Devide the risk (e.g. 2.5 %) with the number of cases, and use it individually. • + Holm (SJS 1979): Continue with new steps where the number is the remaining non-rejected. Go on as long as there are new rejections. • ”Translate” to intervals for rank.

Illustration of multiple test

The simple binomial example • Consider for instance unit E. Test hypothesis that all other units are at least as good with multiple level 2.5 % and find the ones which can be declared worse than E. • Repeat on other side

The estimates

Estimated order: • I, A, H, B, E, G, F, J, D, C • The ”central” B and E have no or almost no information on rank • I, H, A are positioned low [1;4] • G, F, J, D, C are positioned high [4;10]

An exemple with scales • A: 35, 44, 51, 88, 46 • B: 19, 29, 34, 41, 30 • C: 43, 52, 54, 99, 56 • D: 42, 50, 70, 117, 61 • E: 13, 37, 49, 85, 45 • F: 15, 21, 28, 71, 35 • G: 16, 40, 46, 86, 34 • H: 19, 36, 39, 73, 31 • I: 17, 37, 55, 126, 65 • J: 16, 15, 18, 50, 20

Confidence intervals for ranks

Estimated order • B, C, A, H, D, G, J, E, F, I • For the extremes B and I (153 resp. 300 obs.):

Advantages and disadvantages • Correct general method for different situations with chosen confidence degree. • A little inefficient since it is based on the Boole inequality, which is very good for negatively dependent and independent cases, but not so good for positively dependent cases. We have positive dependence here since there is comparison with the same case.

Is there a big loss ? • Here follows a number of cases with 50 % correlation exact normal bound correspoding Bonferroni bound. Total risk 1 %. • Is it worthwhile to try to be exact ?

STATISTISTICAL METHODS FOR RANKING

STATISTISTICAL METHODS FOR RANKING

Presentation Transcript

Ranking

Ranking Definitions with Supervised Learning Methods

Ranking

Ranking

Ranking

Adaptive Ranking Model for Ranking Code-Based Static Analysis Alerts

Mutual Information Scheduling for Ranking

Social Networking Techniques for Ranking

Ranking

Ranking

first a digression The POC Ranking the Methods

RANKING CRITERIA FOR UNIVERSITIES

Ranking services for composition

SEO strategy for ranking on Search Enginefor ranking

Methods for Ranking your New Website.

Seo Services For Google Ranking

Methods of SEO to increase ranking

Adaptive Ranking Model for Ranking Code-Based Static Analysis Alerts