370 likes | 425 Views
Genetic Process Mining. Wil van der Aalst Ana Karla Medeiros Ton Weijters Eindhoven University of Technology D epartment of Information Systems a.k.medeiros@tm.tue.nl. Outline. Process Mining Genetic Algorithms Genetic Process Mining Internal Representation Fitness measure
E N D
Genetic Process Mining Wil van der Aalst Ana Karla Medeiros Ton Weijters Eindhoven University of Technology Department of Information Systems a.k.medeiros@tm.tue.nl
Outline • Process Mining • Genetic Algorithms • Genetic Process Mining • Internal Representation • Fitness measure • Genetic Operators • Experiments and Results • Conclusion and Future Work
Outline • Process Mining • Genetic Algorithms • Genetic Process Mining • Internal Representation • Fitness measure • Genetic Operators • Experiments and Results • Conclusion and Future Work
Process Mining X = apply for license A = classes motobike B = classes car C = theoretical exam C = theoretical exam D = practical motorbike exam E = practical car exam Y = get result
Process Mining (cont.) • Most of the current techniques cannot handle • Structural constructs: non-free choice, duplicate tasks and invisible tasks • Noisy logs • Reason: local approach
Outline • Process Mining • Genetic Algorithms • Genetic Process Mining • Internal Representation • Fitness measure • Genetic Operators • Experiments and Results • Conclusion and Future Work
local optimum global optimum Genetic Algorithms • Global approach
Outline • Process Mining • Genetic Algorithms • Genetic Process Mining • Internal Representation • Fitness measure • Genetic Operators • Experiments and Results • Conclusion and Future Work
Genetic Process Mining (GPM) Aim: Use genetic algorithm to tackle noise, duplicate activities, non-free choice and invisible tasks Internal Representation Fitness Measure Genetic Operators
GPM – Internal Representation • Causal Matrix
GPM – Internal Representation • Causal Matrix
GPM – Internal Representation • Causal Matrix
GPM – Internal Representation • Causal Matrix
GPM – Internal Representation • Causal Matrix • Compact representation
GPM – Internal Representation • Causal Matrix • Semantics Invisible tasks only fire to enable visible tasks!
Deadlock! GPM – Internal Representation • Causal Matrix • Semantics Invisible tasks only fire to enable visible tasks!
GPM – Internal Representation • Causal Matrix • Mappings
GPM – Internal Representation • Causal Matrix • Mappings
GPM – Fitness Measure • Main idea • Benefit the individuals that can parse more frequent material in the log • Challenges • How to assess an individual’s fitness? • How to punish individuals that allow for undesired extra behavior?
L = log and CM = causal matrix Fitness - How to assess an individual’s fitness? - Use continuous semantics parser and register problems
Trace: SS,A,B,C,D,EE For noise-free, fitness punishes: OR-split AND-split AND-join OR-join
Trace: SS,A,B,C,D,EE For noise-free, fitness punishes: OR-join AND-join AND-split OR-split
Fitness - How to punish individuals that allow for undesired extra behavior? Fitness = 1
Fitness - How to punish individuals that allow for undesired extra behavior? • - Count the amount of enabled tasks at every reachable marking
Fitness Measure L = log and CM = causal matrix and CM[] = population • where
Genetic Operators • Crossover • Recombines existing material in the population • Crossover probability • Crossover point = task • Subsets are swapped • Mutation • Introduce new material in the population • Mutation probability • Every task of a individual can be mutated
Outline • Process Mining • Genetic Algorithms • Genetic Process Mining • Internal Representation • Fitness measure • Genetic Operators • Experiments and Results • Conclusion and Future Work
Experiments and Results • Experiments • ProM framework • Genetic Algorithm Plug-in • http://www.processmining.org • Simulated data • Results • The genetic algorihm found models that could parse all the traces in the log
Outline • Process Mining • Genetic Algorithms • Genetic Process Mining • Internal Representation • Fitness measure • Genetic Operators • Experiments and Results • Conclusion and Future Work
Conclusion and Future Work • Conclusion • Genetic algorithms can be used to mine process models • Future Work • Tackle duplicate tasks • Apply the genetic process mining to "real-life" logs