Component-specific usability testing

Component-specific usability testing Dr Willem-Paul Brinkman Lecturer Department of Information Systems and Computing Brunel University (willem.brinkman@brunel.ac.uk)

Topics • Introduction • Whether and how the usability of components can be tested empirically. • Testing different versions of component • Testing different components • Whether and how the usability of components can be affected by other components. • Consistency • Memory load

Introduction Component-Based Software Engineering Empirical Usability Testing

Layered communication

Editor Processor 38 15 + 23 15 + 15 38 15 + 23 15 + 15 Layered Protocol Theory(Taylor, 1988) Control results 01111 10111 Add 15 + 23 = 100110 38 Control equation 15 = 23 + User Calculator

Usability Testing Aim to evaluate the usability of a component based on the message exchange between a user and a specific component

Two paradigms • Multiple versions testing paradigm • Single version testing paradigm Manage Create Support Re-use

Test Procedure • Normal procedures of a usability test • User task which requires interaction with components under investigation • Users must complete the task successfully

Component Component-specific component measures Number of messages received The effort users put into the interaction Objective performance Perceived ease-of-use Perceived satisfaction Control process Control loop

y Component-specific component measures Increasing the statistical power Objective performance Perceived ease-of-use Perceived satisfaction y1 = xk+ k y2 = xm+ m k = k component + k rest m = m component + m rest Assumption k restm rest keys messages

Component-specific component measures Component-specific questionnaire increase the statistical power because they help help the users to remember their control experience with a particular interaction component Objective performance Perceived ease-of-use Perceived satisfaction

Component-specific component measures Perceived Usefulness and Ease-of-use questionnaire (David, 1989), 6 questions, e.g. • Learning to operate [name] would be easy for me. • I would find it easy to get [name] to do what I want it to do. Unlikely Likely Objective performance Perceived ease-of-use Perceived satisfaction

Component-specific component measures Post-Study System Usability Questionnaire (Lewis, 1995) • The interface of [name] was pleasant. • I like using the interface of [name]. Strongly Strongly disagree agree Objective performance Perceived ease-of-use Perceived satisfaction

Experimental validation 80 users 8 mobile telephones 3 components were manipulated according to Cognitive Complexity Theory (Kieras & Polson, 1985) • Function Selector • Keypad • Short Text Messages

Architecture Mobile telephone Send Text Message Function Selector Keypad

Experimental validation • Functions Selector • Broad/shallow • Narrow/deep

Repeated-Key Method “L” Modified-Model-Position method “J” Experimental validation Keypad

Experimental validation Simple Send Text Message Complex

Results Average probability that a measure finds a significant (α = 0.05) effect for the usability difference between the two versions of FS, STM, or the Keypad components

Results Wilcoxon Matched-Pairs Signed-Ranks Tests between the number of correct classification made by discriminant analyses on overall and component-specific measures

Two paradigms • Multiple versions testing paradigm • Single version testing paradigm Manage Create Support Re-use

Testing Different Components • Component specific objective performance measure: • Messages received + Weight factor A common currency • Compare with ideal user A common point of reference • Usability of individual components in a single device can be compared with each other and prioritized on potential improvements

Right Mouse Button Menu Properties Assigning weight factors to represent the user’s effort in the case of ideal user {7} Set <Fill colour red, no border> {2} Call <> {1} Click <left on Fill tab> {1} Click <left on on colour red> {1} Click <left on Outline tab> {1} Click <left No Line button> {1} Click <right> {1} Click <left no Ok button> {1} Click <left on Properties option>

Properties Right Mouse Button Menu Total effort value • Total effort =  MRi.W • MRi.W : Message received. Weight factor {2} Call <> {1} Click <left on Fill tab> {1} Click <left on on colour red> {1} Click <left on Outline tab> {1} Click <left No Line button> {1} Click <right> {1} Click <left no Ok button> {1} Click <left on Properties option> 5 + 2 = 7 2

Visual Drawing Objects Properties Right Mouse Button Menu Assigning weight factors in case of real user Correction for inefficiency of higher and lower components

Visual Drawing Objects Properties Right Mouse Button Menu Assigning weight factors in case of real user Inefficiency of lower level components: need more messages to pass on a message upwards than ideally required Assign weight factors as if lower components operate optimal

Visual Drawing Objects Properties Right Mouse Button Menu  MRi.W #MSU ideal UE = #MSU real Assigning weight factors in case of real user Inefficiency of higher level components: more messages are requested than ideally required • UE : User effort • MRi.W : Message received. Weight factor • #MSUreal :Number of messages sent upward by real user • #MSUideal :Number of messages sent upward by ideal user

The total effort an ideal user would make The total effort a real user made The extra effort a real user made Prioritize Ideal User versus Real User • Extra User Effort = User Effort - Total effort Calculate for each component:

Experimental validation 40 users 40 mobile telephones 2 components were manipulated (Keypad only Repeated-Key Method) • Function Selector • Short Text Messages

Results Extra User Effort Mobile phones

Results Partial correlation between extra user effort regarding the two components and other usability measures *p. < .05. **p. < .01.

Comparison with other evaluation methods Overall measures Sequential Data analysis GOMS Thinking-aloud, Cognitive Walkthrough and heuristic evaluation Example: Keystrokes, task duration, overall perceived usability Relatively easy to obtain Unsuitable to evaluate components

Comparison with other evaluation methods Overall measures Sequential Data analysis GOMS Thinking-aloud, Cognitive Walkthrough and heuristic evaluation Based only on lower-level events Pre-processing: selection, abstraction, and re-coding Relation between higher-level component and compound message less direct Components’ status not recorded

Comparison with other evaluation methods Overall measures Sequential Data analysis GOMS Thinking-aloud, Cognitive Walkthrough and heuristic evaluation Help to understand the problem Only looking at error-free task execution Considers the system only at the lowest-level layer

Comparison with other evaluation methods Overall measures Sequential Data analysis GOMS Thinking-aloud, Cognitive Walkthrough and heuristic evaluation Quicker Evaluator effect (reliability)

Consistency problems

Consistency Activation of the wrong mental model

Consistency experiments • 48 Users • Used 3 applications: • 4 Room Thermostats • 4 (2 Web-Enabled TV sets 2 Web Page Layouts) • 4 Applications (2 Timers  2 Application domains)

Within one layer

Within one layer – Experimental Design Day time Temperature Moving Pointer Moving Scale Moving Pointer Night time Temperature Moving Scale

Within on layer - Results

Between layers Web-enable TV set Browser versus Web pages

Between layers - Page Layout Matrix layout List layout

Between layers - Browser

Between layers – Experimental Design Web Page Version List Matrix Linear Browser Plane

Between layers - Results

Application domain

Between Application domain – Experimental Design Application Alarm radio Microwave Mechanical alarm Timer Hot dish

Component-specific usability testing

Component-specific usability testing

Presentation Transcript

Usability Testing

Usability Testing

Usability Testing

Usability Testing

Usability testing

Usability Testing

Usability testing

Usability Testing

Usability Testing

Component Testing

Usability testing

Usability Testing

Usability Testing

Usability Testing

Usability Testing

Usability Testing

Usability Testing

Usability Testing

Usability Testing

Usability Testing

Usability Testing

Usability testing