Improving Structural Testing of Object-Oriented Programs via Integrating Evolutionary Testing and Symbolic Execution
Kobi Inkumsah, Tao Xie
Dept. of Computer Science, North Carolina State University, Raleigh, NC
Motivation
• Generating OO unit tests involves two tasks:
  • Task 1: generating relevant method sequences
  • Task 2: generating relevant method arguments
• Symbolic execution is good at Task 2 but not Task 1
  • bounded-exhaustive sequence generation, but only up to a low bound
• Evolutionary testing is good at Task 1 but not Task 2
  • concrete argument values can be evolved, but are largely randomly picked
• Contribution: integrate the two techniques to address their respective weaknesses
Evolutionary Testing – Chromosome
• Encode a method sequence together with its argument values as a chromosome: one part of the chromosome encodes the method sequence, the other the method arguments (a sketch follows below)
• E.g., eToc [Tonella 04]
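For illustration only (not eToc's actual genome encoding, which differs in detail), a minimal Java sketch of such a chromosome; all class and field names here are hypothetical:

    import java.util.List;

    // Hypothetical chromosome for evolutionary test generation: a method
    // sequence plus the concrete arguments used at each call.
    class Chromosome {
        List<String> methodSequence;   // e.g. ["BankAccount(double)", "withdraw(double)"]
        List<List<Object>> arguments;  // e.g. [[100.00], [20.00]], one argument list per call
        double fitness;                // filled in during fitness evaluation
    }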
Evolutionary Testing - Algorithm
• Start with a random population of chromosomes
• Evolve the population towards an optimal solution by recombination and mutation

    P ← generateRandomPopulation()
    while P is not optimal
        CS ← evaluateFitness(chromosomes[P])
        P´ ← performCrossOverOnPairsOf(CS)
        P ← mutate(P´)
    end while

E.g., eToc [Tonella 04]
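The same loop rendered as a hedged Java sketch, reusing the hypothetical Chromosome above; the operators are simplified stand-ins, not eToc's implementation:

    import java.util.ArrayList;
    import java.util.Comparator;
    import java.util.List;
    import java.util.Random;

    // Hypothetical skeleton mirroring the pseudocode above.
    class EvolutionLoop {
        private final Random rnd = new Random();

        List<Chromosome> evolve(int populationSize, int maxGenerations) {
            List<Chromosome> population = randomPopulation(populationSize);
            int generation = 0;
            while (!isOptimal(population) && generation++ < maxGenerations) {
                // evaluate fitness: run each chromosome as a test and score it
                population.forEach(c -> c.fitness = evaluateFitness(c));
                population.sort(Comparator.comparingDouble((Chromosome c) -> c.fitness).reversed());
                // recombination: cross over pairs of the fitter chromosomes
                List<Chromosome> offspring = new ArrayList<>();
                for (int i = 0; i + 1 < population.size(); i += 2) {
                    offspring.addAll(crossOver(population.get(i), population.get(i + 1)));
                }
                // mutation: randomly perturb sequences or argument values
                offspring.replaceAll(this::mutate);
                population = offspring;
            }
            return population;
        }

        // Stubs standing in for the real operators.
        List<Chromosome> randomPopulation(int size) { return new ArrayList<>(); }
        boolean isOptimal(List<Chromosome> p) { return false; }
        double evaluateFitness(Chromosome c) { return rnd.nextDouble(); }
        List<Chromosome> crossOver(Chromosome a, Chromosome b) { return List.of(a, b); }
        Chromosome mutate(Chromosome c) { return c; }
    }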
Fitness Calculation eToc [Tonella 04]

    public void withdraw(double amount) {
        if (amount > balance) {
            printError();
            return;
        }
        if (numberOfWithdrawals >= 10)
            if (amount > 500.00) {
                printError();
                return;
            }
        dispense(amount);
        balance -= amount;
        numberOfWithdrawals++;
    }

(The slide marks one branch of this method as the coverage target; fitness measures how close an execution comes to reaching it.)
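eToc's exact fitness function is not shown here; as a generic illustration of the branch-distance idea used in search-based testing (an assumption on our part, not Evacon's or eToc's formula), reaching the true branch of a comparison could be scored like this:

    // Assumed stand-in, not eToc's fitness: distance 0 means the true branch
    // of "left > right" is taken; otherwise the distance shrinks as the
    // inputs get closer to taking it.
    class FitnessSketch {
        static double distanceToTrueBranch(double left, double right) {
            return left > right ? 0.0 : (right - left) + 1.0; // +1.0 penalizes equality
        }
    }

For example, to reach the branch guarded by "amount > balance" with balance 100.00, an execution with amount 20.00 gets distance 81.0, so evolving amount upward improves fitness and guides selection.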
Re-combination eToc [Tonella 04]
(Figure: two parent chromosomes are cut and recombined, exchanging parts of their method sequences and the associated argument values.)

Mutation
(Figure: one argument value in a chromosome is randomly mutated, e.g. to 80.00. A hedged sketch of both operators follows below.)
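A hedged sketch of what these two operators might do, again over the hypothetical Chromosome from above (eToc's real operators differ in detail):

    import java.util.ArrayList;
    import java.util.Random;

    // Hypothetical genetic operators for evolutionary test generation.
    class Operators {
        // One-point crossover: the child takes the head of a's method sequence
        // (with its arguments) and the tail of b's.
        static Chromosome crossOver(Chromosome a, Chromosome b, int cut) {
            Chromosome child = new Chromosome();
            child.methodSequence = new ArrayList<>(a.methodSequence.subList(0, cut));
            child.methodSequence.addAll(b.methodSequence.subList(cut, b.methodSequence.size()));
            child.arguments = new ArrayList<>(a.arguments.subList(0, cut));
            child.arguments.addAll(b.arguments.subList(cut, b.arguments.size()));
            return child;
        }

        // Mutation: overwrite one primitive argument with a random value
        // (the slide's example mutates an argument to 80.00).
        static void mutateArgument(Chromosome c, int call, int arg, Random rnd) {
            c.arguments.get(call).set(arg, rnd.nextDouble() * 100.0);
        }
    }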
Evolutionary Testing - Summary
+ Method sequences are evolved, guided by fitness
- Method arguments are largely randomly picked
Dynamic Symbolic Execution
• Also called concolic testing; e.g., DART, jCUTE, Pex
• Execute the code with a concrete argument (e.g., amount: 20.0) while collecting the symbolic constraints along the executed path (e.g., amount > 1.0)
• Negate a collected constraint (amount > 1.0 becomes the new constraint amount <= 1.0) and solve it to obtain a new concrete value (amount: 1.0) that steers execution down a not-yet-covered branch
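To make the negate-and-solve step concrete, a toy sketch follows. Real concolic tools instrument the program and call a constraint solver; everything below, including the hard-coded "solved" value, only illustrates the idea on the slide's numbers:

    import java.util.ArrayList;
    import java.util.List;

    // Toy concolic step for a single branch "amount > 1.0".
    class ToyConcolic {
        // Code under test: takes a branch and records the path constraint.
        static boolean underTest(double amount, List<String> pathConstraint) {
            if (amount > 1.0) { pathConstraint.add("amount > 1.0"); return true; }
            pathConstraint.add("amount <= 1.0");
            return false;
        }

        public static void main(String[] args) {
            double amount = 20.0;  // initial concrete input, as on the slide
            for (int run = 1; run <= 2; run++) {
                List<String> pc = new ArrayList<>();
                underTest(amount, pc);
                System.out.println("run " + run + ": amount = " + amount + ", path constraint: " + pc);
                // "Negate and solve": flip "amount > 1.0" into "amount <= 1.0" and let
                // the (here trivial) solver pick the boundary value 1.0 as the next input.
                amount = 1.0;
            }
        }
    }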
Summary: Evolutionary Testing vs. Dynamic Symbolic Execution
Evolutionary testing:
+ Method sequences are evolved, guided by fitness
- Method arguments are largely randomly picked
Dynamic symbolic execution:
+ Method arguments are solved
- Method sequences are fixed (or bounded)
Evacon
(Figure: Evacon's architecture. Evolutionary testing is not that good at argument generation; symbolic execution is not that good at sequence generation. Evacon connects the two through argument transformation and chromosome construction, described next.)
Argument Transformation
• Transform primitive arguments of method sequences (produced by evolutionary testing) into symbolic arguments
• Benefits:
  • Allow a symbolic execution tester (e.g., jCUTE) to do concrete and symbolic execution
  • Transform any JUnit method sequence into a symbolic test driver
Argument Transformation - Example
(Figure: a generated JUnit test with concrete primitive arguments is transformed into a symbolic test driver and handed to symbolic-execution test generation. A hedged sketch follows.)
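A hedged before/after sketch of the transformation; the class name, the SymbolicInput helper, and the test bodies are all hypothetical illustrations, not Evacon's or jCUTE's actual API:

    // Before: a JUnit-style test produced by evolutionary testing,
    // with concrete primitive arguments.
    public void testWithdraw() {
        BankAccount account = new BankAccount(100.00); // hypothetical class
        account.withdraw(20.00);
    }

    // After: the same method sequence, but primitive arguments are replaced
    // by symbolic inputs so the concolic engine can solve for values that
    // reach uncovered branches. SymbolicInput stands in for the tool's
    // input-marking API (jCUTE's real API differs).
    public void testWithdrawSymbolic() {
        double initialBalance = SymbolicInput.nextDouble();
        double amount = SymbolicInput.nextDouble();
        BankAccount account = new BankAccount(initialBalance);
        account.withdraw(amount);
    }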
Chromosome Construction
• Construct chromosomes out of the method sequences generated using symbolic execution, and feed them back into evolutionary test generation
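For illustration only (not Evacon's implementation), building such a chromosome from a test produced by the symbolic-execution phase, using the hypothetical Chromosome sketch from earlier:

    import java.util.ArrayList;
    import java.util.List;

    // Hypothetical: turn a generated test back into a chromosome so that
    // evolutionary testing can continue evolving it.
    class ChromosomeConstruction {
        static Chromosome fromGeneratedTest() {
            Chromosome c = new Chromosome();
            c.methodSequence = new ArrayList<>(List.of("BankAccount(double)", "withdraw(double)"));
            c.arguments = new ArrayList<>();
            c.arguments.add(new ArrayList<>(List.of(1.0))); // constructor argument solved by the concolic engine (e.g.)
            c.arguments.add(new ArrayList<>(List.of(1.0))); // withdraw argument solved by the concolic engine (e.g.)
            return c;
        }
    }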
Evacon - Summary
(Figure: the full Evacon loop. Evolutionary testing, which is not that good at argument generation, and symbolic execution, which is not that good at sequence generation, feed each other through argument transformation and chromosome construction.)
Evaluation
• We compare Evacon's achieved branch coverage with four publicly available test generation tools:
  • eToc [Tonella 04] alone
  • jCUTE [Sen&Agha 06] alone
  • JUnit Factory [Agitar Labs 07]
  • Randoop [Pacheco&Ernst 07]
Branch Coverage
• Evacon-A achieves the highest branch coverage for 10 out of 13 subjects
• The best branch coverage can be achieved by tool combination for 8 out of 13 subjects
Required Length of Method Sequences
The length of the longest method sequence generated by Evacon-A or Randoop that achieves new branch coverage:
- The required length reaches up to 23
- Bounded exhaustive testing may not be feasible
Branch Ranking
• What does the branch coverage of two tools, e.g. 85% vs. 75%, really tell us?
  • The tool with 75% may be better at covering difficult-to-cover branches when used in a tool combination
• Need to take into account the difficulty of the branches being covered (esp. when using tools in combination)
• Proposed metric: #branches categorized into:
  • Branch-1: covered by only 1 tool under comparison
  • …
  • Branch-n: covered by only n tools under comparison
• Covering more branches in Branch-1 means uniquely covering more branches not covered by the other tools under comparison (a sketch of the categorization follows)
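A small sketch of how the proposed categorization could be computed from per-tool sets of covered branches; hypothetical code, not the paper's actual scripts:

    import java.util.HashMap;
    import java.util.Map;
    import java.util.Set;

    // Hypothetical Branch-k categorization: for each branch, count how many
    // compared tools cover it, then group branches by that count.
    // Branch-1 holds branches covered by exactly one tool.
    class BranchRanking {
        static Map<Integer, Integer> categorize(Map<String, Set<String>> coveredByTool) {
            Map<String, Integer> toolsPerBranch = new HashMap<>();
            for (Set<String> covered : coveredByTool.values()) {
                for (String branch : covered) {
                    toolsPerBranch.merge(branch, 1, Integer::sum);
                }
            }
            Map<Integer, Integer> sizeOfCategory = new HashMap<>(); // k -> #branches in Branch-k
            for (int count : toolsPerBranch.values()) {
                sizeOfCategory.merge(count, 1, Integer::sum);
            }
            return sizeOfCategory;
        }
    }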
Branch Ranking
(Figure: #covered branches / #branches in each category Branch-n, per tool.)
• Evacon-A is best in terms of uniquely covering branches in Branch-1
• Using Evacon-A + JUnit Factory is the best choice if only two tools are to be used (not necessarily Evacon-A + Randoop!)
Conclusion
• A new integration of evolutionary testing and symbolic execution to achieve the two tasks of OO test generation
• An empirical comparison of our integration with state-of-the-art representative testing tools
• A detailed comparison of the strengths and weaknesses of tools w.r.t. achieving high structural coverage (e.g., branch ranking)
Questions? THANK YOU!
Experiments
• We applied two types of Evacon integrations: Evacon-A and Evacon-B
• We measure branch coverage achieved by the tests generated by all six tools within the same period of runtime, except for JUnit Factory
• For the remaining tools we use Evacon-A's runtime as the common runtime
Coverage Subsumption
• Branch coverage of Evacon-A subsumed:
  • Evacon-B (in 12 of 13 PsUT)
  • eToc (in 7 of 13 PsUT)
  • jCUTE (in 3 of 13 PsUT)
  • JUnit Factory (in 1 of 13 PsUT)
  • Randoop (in 4 of 13 PsUT)
• Branch coverage of Randoop subsumed:
  • Evacon-A (in 1 of 13 PsUT)
• Overall, branch coverage of Evacon-A plus branch coverage of JUnit Factory subsumed the branch coverage achieved by all tools in 5 of 13 PsUT