Parametric measures to estimate and predict performance of identification techniques

Amos Y. Johnson & Aaron Bobick STATISTICAL METHODS FOR COMPUTATIONAL EXPERIMENTS IN VISUAL PROCESSING & COMPUTER VISION NIPS 2002 Parametric measures to estimate and predict performance of identification techniques

Setup – for example • Given a particular human identification technique

x Setup – for example • Given a particular human identification technique • This technique measures 1 feature (q) from n individuals - 1D Feature Space -

x Setup – for example • Given a particular human identification technique • This technique measures 1 feature (q) from n individuals • Measure the feature again - 1D Feature Space -

x Setup – for example • Given a particular human identification technique • This technique measures 1 feature (q) from n individuals • Measure the feature again Probe Gallery - 1D Feature Space -

x Setup – for example • Given a particular human identification technique • This technique measures 1 feature (q) from n individuals • Measure the feature again Target Probe Gallery For template - 1D Feature Space -

x Setup – for example • Given a particular human identification technique • This technique measures 1 feature (q) from n individuals • Measure the feature again Target Imposters Probe Gallery For template - 1D Feature Space -

Target Imposters Probe x Gallery For template - 1D Feature Space - Question • For a given human identification technique, how should identification performance be evaluated?

Target Imposters Probe x Gallery For template - 1D Feature Space - Possible ways to evaluate performance • For a given classification threshold, compute • False accept rate (FAR) of impostors • Correct accept rate (HIT) of genuine targets

Possible ways to evaluate performance • For various classification thresholds, plot • MultipleFAR and HIT rates (ROC curve)

Possible ways to evaluate performance • For various classification thresholds, plot • MultipleFAR and HIT rates (ROC curve) • Compute area under a ROC curve (AUROC) Probability of correct classification

Probability of incorrect classification Possible ways to evaluate performance • For various classification thresholds, plot • MultipleFAR and HIT rates (ROC curve) • Compute 1 - area under a ROC curve (1 -AUROC)

Problem • Database size • If the database is not of sufficient size, then results may not estimate or predict performance on a larger population of people. 1 - AUROC

Our Goal • To estimate and predict identification performance with a small number subjects 1 - AUROC

Our Solution • Derive two parametric measures • Expected Confusion (EC) • Transformed Expected-Confusion (EC*)

Our Solution • Derive two parametric measures • Expected Confusion (EC) • Transformed Expected-Confusion (EC*) Probability that an imposter’s feature vector is within the measurement variation of a target’s template

Our Solution • Derive two parametric measures • Expected Confusion (EC) • Transformed Expected-Confusion (EC*) Probability that an imposter’s feature vector is closer to a target’s template, than the target’s feature vector

Our Solution • Derive two parametric measures • Expected Confusion (EC) • Transformed Expected-Confusion (EC*) EC* = 1 - AUROC

Target Imposters Probe x Gallery For template - 1D Feature Space - Expected Confusion • Probability that an imposter’s feature vector is within the measurement variation of a target’s template

Expected Confusion - Uniform • The templates of the n individuals, are from an uniform density • Pp(x) = 1/n P(x) Pp(x) 1/n x - 1D Feature Space -

Expected Confusion - Uniform • The measurement variation of a template is also uniform • Pi(x) = 1/m P(x) Pi(x) 1/m Pp(x) 1/n x - 1D Feature Space -

Expected Confusion - Uniform • The probability that an imposter’s feature vector is within the measurement variation of template q3 is the area of overlap • True if m << n P(x) Pi(x) 1/m Pp(x) 1/n x - 1D Feature Space -

Expected Confusion - Uniform • The probability that an imposter’s feature vector is within the measurement variation of any template q • True if m << n P(x) Pi(x) 1/m Pp(x) 1/n x

: Population density : Measurement variation Expected Confusion - Gaussian • Following the same analysis, for the multidimensional Gaussian case

Expected Confusion - Gaussian • Following the same analysis, for the multidimensional Gaussian case • True if the measurement variation is significantly less then the population variation Probability that an imposter’s feature vector is within the measurement variation of a target’s template

Expected Confusion - Gaussian • Relationship to other metrics • Mutual Information • The negative natural log of the EC is the mutual information of two Gaussian densities

Target Imposters Probe x Gallery For template - 1D Feature Space - Transformed Expected-Confusion • Probability that an imposter’s feature vector is closer to a target’s template, than the target’s feature vector

Target Imposters x k For template Transformed Expected-Confusion • First: We find the probability that a target’s feature vector is some distance k away from its template

Target Imposters x k For template Transformed Expected-Confusion • Second: We find the probability that an imposter’s feature vector is less than or equal to that distance k

x Transformed Expected-Confusion • Therefore: The probability that an imposter’s feature is closer to the target’s template, than the target’s feature (for a distance k) is Target Imposters k

x Transformed Expected-Confusion • Therefore: The probability that an imposter’s feature is closer to the target’s template, than the target’s feature (for any distance k) is Target Imposters k

Transformed Expected-Confusion • Therefore: The expected value of this probability over all target’s templates is x

Transformed Expected-Confusion • Next: Replace the density of the distance between a target’s feature-vectors and its template q

Transformed Expected-Confusion • Answer: Probability that an imposter’s feature vector is closer to a target’s template, than the target’s feature vector

Transformed Expected-Confusion • This probability can be shown to be one minus the area under a ROC curve • Following the analysis of Green and Swets (1966)

Transformed Expected-Confusion • Integrate: With these assumptions

Transformed Expected-Confusion • Integrate: Probability that an imposter’s feature vector is closer to a target’s template, than the target’s feature vector

Transformed Expected-Confusion • Compare: EC* with 1 - AUROC EC* = 1 - AUROC

Conclusion • Derive two parametric measures • Expected Confusion (EC) • Transformed Expected-Confusion (EC*) Probability that an imposter’s feature vector is closer to a target’s template, than the target’s feature vector

Conclusion • Derive two parametric measures • Expected Confusion (EC) • Transformed Expected-Confusion (EC*) Probability that an imposter’s feature vector is within the measurement variation of a target’s template Probability that an imposter’s feature vector is closer to a target’s template, than the target’s feature vector

Future Work • Developing a mathematical model of the cumulative match characteristic (CMC) curve • Benefit: To predict how the CMC curve changes as more subjects are added

Parametric measures to estimate and predict performance of identification techniques

Parametric measures to estimate and predict performance of identification techniques

Presentation Transcript

Performance Measures

Performance Measures

Performance Measures

Introduction to Performance Measures

Performance measures

Performance Measures

Performance measures

Non-parametric Measures of Association

Measures of Performance

Performance measures

Measures of Performance

PERFORMANCE MEASURES

PERFORMANCE MEASURES

Performance measures

Performance measures

Performance Measures

Performance Measures

Performance Measures

Performance Measures

Performance Measures

Performance Measures

Parametric Study of Turbofan Performance