Test Selection Practices in a Large IT Company

Test Selection Practices in a Large IT Company Vincent Blondeau

Agenda • Presentation of the industrial context • Identifying root causes of project failures • Improving testing habits of Worldline developers • Conclusion and future work

Industrial Context

Industrial Context: Worldline

Global Business Units Industrial Context: Worldline Merchant Services Financial Services Mobility & eTransactionalServices Benelux Germany & CEE France RegionalBusiness Units UK APAC LATAM & Iberia Transversal OperationalUnits Production Services Software DevelopmentCommunityOffice

How to Predict the Health of a Project?

Project Health: Definition Scope Resources Schedule

Systematic Literature Review • Structure defined by [Staff Keeke et al., 2007] • Population: Software projects • Intervention: Predict health (success or failure) • Comparison: Cross projects • Outcome: Model based on metrics • Context: Industrial What metrics could help predict the success or failure of a project?

Systematic Literature Review Keywords: Predict & Software & Project & Metric & (success | failure)

Systematic Literature Review • Individually, metrics to make predictions exist • No unique metric for all the projects • Predictions retrospectively

Data Mining • 19 projects • Average 1400 man-days • Between 900 to 2300 man-days • 91 excel files • 2012 to 2015 • Correlation of project metrics • Budget: 2 • Bugs: 8 • Slippage: 3 • Project Name (placebo metric) • From managers Excel files

Data Mining Correlation method: Spearman Bugs Budget Slippage 12

Data Mining Wewere not able to predictprojecthealth

Pilot Study • Delay at the start of the project • Lack of team cohesion • Lack of collaboration between team and client • Misunderstanding of requirements • Misunderstanding of application domain concepts • Lack of experience with the used frameworks • Change of the framework during the project • Bypass qualification tests [Blondeau et al., IWST’15] [Blondeau et al., SATToSE’15]

Questionnaire

Questionnaire “In order to improve project health, what tool could help you?” …

Summary • As far as we know: • Literaturedid not apply • Data miningdid not work • Interviews identifiedroot causes of failure • Not easy to measureit • Questionnaire confirmed the results Give up the projecthealthpredictionproblem

Thesis Can we improve the testing habits of Worldline developers? (by giving them faster feedback)

New Problem

Test Selection • Example

Test Selection Automatic Test Selection Select only the tests related to the changes and run only those ones 3 hours 5 minutes

Strategy Can we improve the testing habits of Worldline developers? (by giving them faster feedback) • Choose a test selection approach • Establish current testing behavior of developers • Create a test selection tool

Agenda • Presentation of the industrialcontext • Identifyingroot causes of projectfailures • ImprovingTesting Habits of Worldline developers • Choose a test selection approach • Establish current testing behavior of developers • Create a test selection tool • Conclusion and Future work

Choose a Test Selection Approach

Literature Review • Kazmiet al. “Effective regression test case selection: A systematic literature review”, ACM Comput. Surv., 2017 • Mining and Learning • Model Based Testing • Program Slicing • Oracle • Control Flow Graph

Two Approaches T T T T T T T Dynamic Static

Feasability Study • Which approach can we use at Worldline? • Dynamic more accurate • But should be kept up to date • Dynamic problem of failing tests • Static faster in our case • Feasability study on 3 sample projects

Feasability Study

Breaks • Issues to retrieve the tests from the changed code • Identified and classified as: • Dynamic breaks • Third Party breaks • Multi-program breaks • Polymorphism breaks [Blondeau et al., SoftwareQuality Journal, 2016] [Blondeau et al., IWST’16], [Blondeau et al., BENEVOL’15]

Dynamic breaks: Attribute Initialization

Third Party breaks: Anonymous Classes

Third Party breaks: Delayed Execution

Polymorphism breaks: Interfaces

Research Questions • What is the impact on test selection of: • Granularity (class, method)? • Third-party breaks issue resolution? • Dynamic breaks issue resolution? • Polymorphism breaks issue resolution? • Combined solutions?

Case Study • Ratio of the total test suite selected • Precision • How many selected tests are relevant? • Recall • How many relevant tests are selected?

Case Study • Compared several static approaches resolving the issues • Dynamic approach is the oracle • Considered each covered method as changed T M T M M T M T M

Tooling

Results

Results: Finer granularity? Number of Selected Tests Precision Recall

Results: Third-party break issue? Number of Selected Tests Precision Recall

Results: Polymorphism break issue? Number of Selected Tests Precision Recall

Results: Dynamic break issue? Number of Selected Tests Precision Recall

Results: Combined solutions? Number of Selected Tests Precision Recall

Results • Breaks • are intertwined • impact differentlyeachproject • have different solutions for a given break class • Good final result: 3% of tests selected

Results

Current Testing Behavior

When, How, and Why Developers TestBeller et al. (2015) • Open-source + industry + students • Monitor 416 software engineers Main conclusions • Majority of the developers rarely test in their IDE • Quick tests do not lead to more test executions • Some failing tests are fixed later: 50% of the test repairs happen within 10 minutes whereas 75% within 25 minutes

Comparison of Manual and Automated Test Selection – Gligoric et al. (2014) • How developers manually select tests • Compare it to an automatic selection • Academic experiment: 14 developers: 5 professionals + 9 students Main conclusions • Need for better automated test selection techniques • Better integration with developer IDEs

Field Study • Literature conclusions are not trivially transposable to our industrial case • Open source projects • Students • Need another study • Based on the replication of the 2 literature studies • In our context

Field Study • Research Questions: • Do developers testtheir code changes? • Do quick tests lead to more test executions? • Do developers practice test selection? • Does manual test selection depend on sizeof code changes? • How manual test selection compares to automatic test selection?

Test Selection Practices in a Large IT Company

Test Selection Practices in a Large IT Company

Presentation Transcript

Me Create a Selection Test

This is a Test It is ONLY a Test

IT Company in Jaipur

Innovative Test Practices

Test it in Service

Working With A Large Company

Best Practices in Loading Large Datasets

Best practices for technology selection

Insurance Company Selection

IT Company in Atlanta

IT company in India

IT Company in Australia

IT Company in Lucknow

IT Company in Dallas

IT Company in Oklahoma

Test Automation Best Practices

Large Format Printing Company in Singapore

IT Company in Madurai

EVOLUTION IN LARGE POPULATIONS: SELECTION

IT COMPANY IN KOCHI

IT Company In Dallas

IQ test: Should it be used in a selection process