Comparable Evidence: subjects, objects and context?

Comparable Evidence:subjects, objects and context? Claes WohlinBlekinge Institute of TechnologySweden

Session outline • Introduction: Comparable Evidence across Experiments, Claes Wohlin • Readers Project: Quality Aspects in Empirical Studies, José Maldonado • Witch Doctors and Sugar Pills, Barbara Kitchenham Presentation specific discussions after the two latter presentations and then a general discussion at the end.

Evidence? Individual studies produce: • Evidence Admissible evidence • Circumstantial • evidence • Can we make a case? • Sufficient evidence? • Trustworthy evidence? Screening What makes evidence admissible?

Why this session? Concerns when answering the two questions! • What makes evidence admissible? • Can we make a case? Experimentation in software engineering: We are trying to use experiments as a means for change, for example as part of technology transfer. This means that evidence from experimental studies must be comparable, but not too similar.

Subjects • What is the sample? • What is the population? • Are the subjects sufficiently motivated? • Are the subjects capable of performing the task as would be expected by the intended population? • What does it for example mean to assign roles in PBR in comparison to actually using subjects having the roles?

Objects • How do we measure that objects are of the same difficulty for the subjects? • Do any of the treatments require more skills and experience than the other? • Are the treatments (methods/techniques/tools) described to the same level of detail? • Is the education into the treatments fair (on the same level)?

Context • Does the context of the objects resemble the true use? • What does it mean to run experiments off-line? • How do we create a sufficiently realistic context?

Research • How do we ensure that all of the above is similar enough when replicating experiments? • How much replication? By whom? • Are lab packages a good idea? Too similar experiments? • How do we ensure that different studies do not intentionally or unintentionally favor one of the treatments over the other? Especially inventor evaluate her/his method against something else.

Presentations • José Maldonado • Barbara Kitchenham We will hopefully have some time to return to the questions above at the end of the session.

Comparable Evidence: subjects, objects and context?