280 likes | 416 Views
Brazilian Symposium on Neural Networks (SBRN’2012), October 20-25 2012, Curitiba, Parana, Brazil. PARTICLE SWARM OPTIMIZATION APPLIED TO THE DINAMYC ALLOCATION PROBLEM. Jean L. Pierobom Myriam R. Delgado, PhD Celso A. A. Kaestner, PhD. Introduction.
E N D
Brazilian Symposium on Neural Networks (SBRN’2012), October 20-25 2012, Curitiba, Parana, Brazil PARTICLE SWARM OPTIMIZATION APPLIED TO THE DINAMYC ALLOCATION PROBLEM Jean L. Pierobom Myriam R. Delgado, PhD Celso A. A. Kaestner, PhD
Introduction • This paper proposes a modification of the Particle Swarm Optimization (PSO) algorithm to solve a real-world application of the Dynamic Task Assignment Problem (DTAP). • In a previous work, non-dynamic instances of Task Assignment Problem (TAP) were solved using the Particle Swarm Optimization with Permutations (PSO-P). • In this paper, the algorithm is improved by adding some methods that enable it to deal with dynamic fitness landscapes. The new approach is called Dynamic Particle Swarm Optimization with Permutations (DPSO-P). 2
Problem description • The problem considered in this paper is the Cab-Customer Allocation Problem (CCAP). • CCAP can be categorized as a Dynamic TAP. • CREIO QUE UMA DEFINIÇÃO RÁPIDA DO TAP SERIA INTERESSANTE • The problem consists in allocating N cabs (service offer agents) to M customers (demand service agents) in a way that the total distance traveled by the cabs to get the customers is minimal. 3
Problem formulation • Let A be the allocation function that maps a set V of service offer agents (taxis) to a set P of demand service agents (customers): where A(i) = j if the offer agent i is allocated to demand agent j. 5
Problem formulation • Let C(A) be the cost function of a solution A: where distance(i; j) is the geographic distance between two points in the city; in this case, the distance between the agents i and j, and j = A(i): • In the previous work, we use the Euclidean distance between geographical locations. In this paper we employ the minimal paths over the Open Street Map using the Dijkstra algorithm. • The problem is to find the optimal solution with minimal cost in the set , i.e.: 6
Real scenario changes (1 of 4) • The decision variables of CCAP/TAP are subject to changes during the optimization. (a) a new customer comes (the customer is “waiting”); 7
Real scenario changes (2 of 4) (b) the cab positions change because they are moving to the allocated customer; 8
Real scenario changes (3 of 4) (c) when the cab arrives to the customer position a pair (cab, customer) is created, and these elements must be eliminated from the optimization scenario (the cab is “occupied”); 9
Real scenario changes (4 of 4) (d) when the pair (cab, customer) arrives to its destination the service ends, and a new service offer agent appears (the cab becomes “free”). 10
The proposed approach • In this paper, the allocation function A for the Dynamic TAP considers |V| = |P| = N. • Two FIFO queues are then used; one for service offer agents (O) and other for service demand agents (D). This approach minimizes the agent waiting time as it avoids undesirable conditions. • As the offer agents can meet only one demand agent at a time, and a demand agent must be served by one (and only one) offer agent, N is calculated after each detected change, as follows: where and are the total of elements at episode in queues and , respectively. 11
DPSO-P principles Particle encoding • The DPSO-P encodes particles as position permutations. • The particle’s position is represented by a vector of integer numbers whose indexes identify the service offer agents and whose values represent the demand agents. • In the example below, cab 1 would be allocated to customer 4, cab 2 would be allocated to customer 3, and so on. 12
DPSO-P principles Particle velocity • The particle velocity is normalized into [0, 1], with each element indicating the probability of a swap operation. The particle movement is illustrated below: 13
DPSO-P principles Inertia adjustment • In the DPSO-P algorithm, the inertia suffers a decay given by: where T is the average number of iterations between detected changes. At the first step of simulation, T is fixed according to the problem size, because there are no changes yet; after this, T is adjusted at each occurred change. The inertia decay thus occurs along time windows. 14
Change detection • A random monitored particle solution is created to support the change detection; • The method used to detect changes is that suggested in Carlisle and Dozier (2000), it chooses a random particle and checks if its fitness has changed even if the particle position has not changed. • Experiments have shown that this method is able to detect any change to the addressed problem, including those considered smooth. 15
Response to changes • The proposed algorithm tries to detect changes and apply the approaches A and B to react to them. Additionally, the approach C is applied to maintain diversity in the swarm. • Approach A: restart to random positions a portion of the swarm, whenever a change is detected (Eberhart and Shi, 2000) (Hu and Eberhart, 2002). The particles chosen to be reset are those with the worst fitness value; • Approach B: reset pbest memory of all particles in the swarm, whenever a change is detected(Eberhart and Shi, 2000); • Approach C: uses a perturbation operator on particles position (Esquivel and Coello, 2004). At each iteration there is a chance of applying randomly swap operations. 16
Experiments Test set • Experiments are conducted to simulate the CCAP/TAP dynamics; • The simulation process considers a series of episode transitions; • Each instance is generated with changes being produced from initial to final episodes. 17
Experiments Running information • The program was coded in Java; • Employs 100 particles; • Run up along transitions of all successor episodes for each instance considered; • The inertia adjustment from 0.9 to 0.4(Eberhart and Shi, 2000); • The values of c1 = c2 = 1.49445 (Eberhart and Shi, 2000); • AQUI ESTÁ CONFUSO ESTE c1, c2 com C1, C2 e C3 18
Experiments Comparison approach – robust PSO • For small instances (N = 10), the optimal solution for each episode is identified by the exhaustive search described in [11]. • For other cases (N = 100), used a robust PSO to identify a reference solution for each tested instance. • In this work, the robust PSO runs over 10,000 iterations and has swarm composed of 10,000 particles, allowing a better exploitation of the search space of the problem. 19
Experiments Performance measures • Fitness error (Blackwell, 2007): • Average accuracy (Weicker, 2002): • Average stability (Weicker, 2002): 20
Obtained results • The algorithm was executed only once for each tested approach; • However, the algorithm runs for a considerable number of episodes and the performance indicators represent averages of values obtained at the end of each time window. 22
Conclusions • The proposed DPSO-P algorithm used permutations to encode particles and adopted some approaches to respond to changes, namely: • (A) randomizes a portion of the swarm whenever any change is detected; • (B) resets all particles memory; • (C) applies a perturbation operator with a certain probability. • These approaches were also combined, to improve the results. 25
Conclusions • The experiments show that, in the case of smaller instances, diversifying a portion (10%) of the swarm can be efficient. • In the case of larger instances, this approach is also efficient, but it can be improved using a perturbation operator on particle positions, to maintain diversity in the swarm; • The obtained results allow us to conclude that the combination of traditional methods enables the PSO to track the optimal solution after the occurrence of changes in decision variables of the problem; • The information about the problem size and change scale can be used to determine the appropriate approach to be used. 26
Future works • The approach can be extended to consider |V| ≠ |P| (different number of cabs and customer, which is now treated with simple queues). This extension can be treated internally by the model - using non square matrices - or adding a filter based on queues priority. • Other similar applications can be handled in the same way, for example: • optimization times of busses and subways; • allocation of police vehicles for the care of occurrences; • allocation of technical assistance for technical support. 27
Brazilian Symposium on Neural Networks (SBRN’2012), October 20 to 25, 2012, Curitiba, Parana, Brazil PARTICLE SWARM OPTIMIZATION APPLIED TO THE DINAMYC ALLOCATION PROBLEM Jean L. Pierobom Prof.ª Dra. Myriam R. Delgado Prof. Dr. Celso A. A. Kaestner