190 likes | 400 Views
Ford Classification Challenge Results. Mahmoud Abou-Nasr. The Challenge. Data samples from an automotive subsystem were collected in batches of 500 samples per diagnostic session.
E N D
Ford Classification Challenge Results Mahmoud Abou-Nasr WCCI2008: Ford Classification Challenge
TheChallenge • Data samples from an automotive subsystem were collected in batches of 500 samples per diagnostic session. • The objective is a classifier that will determine whether a certain symptom exists or does not exist after examining the samples. • To this end, batches of 500 samples were collected when the symptom exists and batches of 500 samples were collected when the symptom does not exist. WCCI2008: Ford Classification Challenge
About the Data • The 500 samples collected in each diagnostic session represent a set of sequential values of the measured variable, where sample n+1 occurs after sample n. • The beginning of the sampling process is not aligned with any external circumstance or any aspect of the observed pattern. WCCI2008: Ford Classification Challenge
The Data Sets • Ford_A Data samples of known and hidden classification were collected in typical operating conditions, with minimal noise contamination. • Ford_B Data samples of known classification were collected in typical operating conditions, while datasamples of hidden classification were collected under noisy conditions. WCCI2008: Ford Classification Challenge
Example of a Symptom Free Pattern WCCI2008: Ford Classification Challenge
A Pattern that Exhibits the Symptom WCCI2008: Ford Classification Challenge
What was provided to the Contestants? WCCI2008: Ford Classification Challenge
Evaluation • The classification performance used in the evaluation is the accuracy of the classifier and in case of a tie the false positive rate of the classifier is also used. WCCI2008: Ford Classification Challenge
Definitions Accuracy = (a + d) / (a + b + c +d) (1) False positive rate = b / (a + b) (2) WCCI2008: Ford Classification Challenge
Ford_A Results WCCI2008: Ford Classification Challenge
Ford_A Results WCCI2008: Ford Classification Challenge
Ford_B Results WCCI2008: Ford Classification Challenge
Ford_B Results WCCI2008: Ford Classification Challenge
Results Both Data Sets: Ford_A and Ford_B WCCI2008: Ford Classification Challenge
Results Both Data Sets: Ford_A and Ford_B WCCI2008: Ford Classification Challenge
In Conclusion • This challenge problem was motivated by a potential automotive application. Abstractly, this problem amounts to classification of finite data sequences, in contrast to the more commonly encountered problem of classification based on feature vectors. • The length of the sequences reflects the time available for making the classification decision. Presumably, the task would be easier if the sequence length were increased, but this would violate the requirements of the application. • This problem does not appear to have a simple solution that emerges from visual inspection of the sequences. This distinguishes it from others in our experience where, at least in some range of operation, examples from opposite classes are readily differentiated. WCCI2008: Ford Classification Challenge