160 likes | 376 Views
OPERATIONS RESEARCH AND HIGH ENERGY PHYSICS. Alberto De Min Politecnico di Milano and INFN. INTRODUCTION. Operations Research (OR) is the discipline of applying advanced analytical methods to help make better decisions
E N D
OPERATIONS RESEARCH AND HIGH ENERGY PHYSICS Alberto De Min Politecnico di Milano and INFN CHEP06 Mumbai, India 13-17 February 2006
INTRODUCTION • Operations Research (OR) is the discipline of applying advanced analytical methods to help make better decisions • Applications concern an uncountable number of fields: from economics to finance, from logistics to supply chain management, from social sciences to politics, from engineering to physics • The aim of this talk is to illustrate possible applications to high energy physics and review the existingmathematicaltools, restricting the scope to linear and mixed-integer programming applications CHEP06 Mumbai, India 13-17 February 2006
LAYOUT • Basics of operations research Two practical examples: Resource optimization, Set covering Linear programming: • Simplex method • Interior-point method Mixed-integer programming • Applications to high energy physics Two pattern recognition examples: • Vertex tracking • Muon tracking Other detector issues (calibration, alignment, design, logistics) • Mathematical libraries Commercial software packages Open source packages • Conclusions CHEP06 Mumbai, India 13-17 February 2006
EXAMPLE 1 (resource optimisation) • A firm produces tables and chairs in 3 phases: cutting, assemblying and finishing. The firm wants to maximise total profit. Solution: x = 35.0 chairs and y = 30.0 tables should be produced per day CHEP06 Mumbai, India 13-17 February 2006
C A B B C A A B B A A C C A EXAMPLE 2 (set covering) • An airline needs to cover its flights with crew (based in city A). • Build all possible flight sequences (rotations) leaving and returning to A • Choose the subset of rotations (j) covering all flights (i) with minimum cost (Cj) • Solution is a set of integer (binary) values for xj • xj = 0.5 cannot be approximated to 0 (no covering) or 1 (no optimum) CHEP06 Mumbai, India 13-17 February 2006
LINEAR PROGRAMMING • Solution is a point in n-dim space • Linear functions are (hyper)planes in that space • Linear constraints () define the half-space above (below) a plane • The space of feasible solutions is all points inside the convex polyedrum defined by the constraint-planes (a «politope » or « simplex ») • The objective function is an infinity of parallel planes of which we need the uppermost intersecting the politope • The optimal solution is the point of such intersection CHEP06 Mumbai, India 13-17 February 2006
THE SIMPLEX METHOD • The optimal solution cannot be an internal point (as it would always be possible to find a « more optimal » point nearby!) • The optimal solution needs to be on a corner (or an edge if degenerate) • Only the (few) corners need to be analysed to reach optimality • The objective function determines the direction to a better corner • A complex minimization task has been reduced to an easy combinatorial problem • The discovery of the simplex method in 1947 gave Dantzig the Nobel Price • This is an « external point method » CHEP06 Mumbai, India 13-17 February 2006
INTERIOR POINT METHODS • Take the politope of feasible solutions • Build an ellipsoid contained in the politope • Find the tangence point A with the objective plane • Build a new ellipsoid centered in A contained in the politope • Repeat until size of ellipsoid is small enough • Easy when matrix of coefficients is sparse (many zeros!) • It works also for non-linear cases • For some problems faster than the simplex • Developed in the last few decades • This is an « internal point method » Objective function A B CHEP06 Mumbai, India 13-17 February 2006
Xi= 1.3 Xi= 1 Xi= 2 (MIXED) INTEGER PROGRAMMING • Suppose some variables should be integer but non-integer values are found at optimum • Store corresponding value of objective function (OF) as a lower bound (LB) on the solution • Take one such variables, say Xi=1.3, and branch into two separate problems (nodes or trees) imposing the additional constraints Xi= 1 or Xi=2 • Solve the corresponding LP problem for each node: • If the solution is infeasible or OF is larger than UB -> prune the node • If in the solution all required variables are integer -> : • If no upper bound (UB) exists on the solution -> store the corresponding OF as an initial upper bound (UB) • If the OF is smaller than an already existing UB -> update the upper bound • If the OF is larger than an already existing UB -> prune the node • If in the solution some required variables are still non-integer -> branch further the problem • Continue until all required variables are integer. This method is called «branch & bound », alternative methods are: « branch & cut », « branch & price », « implicit enumeration », etc. CHEP06 Mumbai, India 13-17 February 2006
TPC 1 2 VDET 3 4 HEP APPLICATIONS: EXAMPLE 1 • Global VDET pattern recognition for ALEPH, P.Rensing, CHEP95 • Extrapolating tracks from TPC to the silicon vertex detector (VDET), featuring two double-sided lawyers (4 readout layers), presents ambiguities in hit assignment process. How to assign VDET hits to tracks optimally? • Minimize z = ijklm Cijklm xijklm, with xijklm=0,1 • subject to: (1 configuration per track) (1 or 2 tracks per hit, depending on hit signal amplitude) Cijklm = 2 + 11NNon associated + 4N1 dimension (i=0 means no assigned track; j,k,l,m=0 means no hit in VDET layer 1,2,3,4) CHEP06 Mumbai, India 13-17 February 2006
HEP APPLICATIONS: EXAMPLE 2 • Linear pattern recognition algorithm for the CMS barrel muon detector, • A. De Min, CMS Note, Feb 2006, being submitted to CMS • In CMS tracks are straight lines in a noisy environment. • Linear programming can be used to remove outliers: • Pi =(Xi,Zi) should be associated to straight line x=mz+q if |Xi-mZi+q| i (i is the « resolution » of Pi) • This is not a linear constraint! But it can be linearized: • Best (linear) fit is min i (i / i ) subject to: • provided that only points with i i are considered (pattern recognition) • Try combining pattern recognition and fit in a single problem by adding the new integer (binary) variables i = {0,1} and new constraints CHEP06 Mumbai, India 13-17 February 2006
HEP APPLICATIONS: EXAMPLE 2 • The problem can be formulated as follows (M is a big, fixed number): (i=0) (i i) hit Piin the fit (i=1) (i i) hit Pi out of the fit The solution will provide a fast, unbiased pattern recognition and a stable (linear) fit in one go! It also provides the list of included points (i=0) and their residuals i CHEP06 Mumbai, India 13-17 February 2006
OTHER HEP APPLICATIONS • Apart from pattern recognition problems, LP may be useful in several other applications related to HEP. A few examples: • Detector design (ex. to optimize cabling network in order to minimize the material in front of the calorimeters) • Detector construction (ex. optimal assembly schedule, optimal use of resources, supply chain management and spare parts procurement) • Detector alignment (ex. tracker subparts alignment with charged tracks) • Detector calibration (ex. optimal shower clustering in test-beams for calorimeter) • Data analysis (ex. optimal association of calorimeter showers to charged tracks) • Logistics in HEP laboratories (ex. shift roster of minimum cost, optimal test-beam period allocation) CHEP06 Mumbai, India 13-17 February 2006
AVAILABLE LIBRARIES • Two kinds of packages: • Modeling systems: • Designed to help people formulate LP problems and analyze their solutions. • In practice they are modeling languages. • Output is in standard format to feed any LP solver. • Not of interest here, more dedicated to consultants who need to quickly apply LP techniques to solve very different problems. • An old but still widely used standard format is the MPS format (originally from IBM). Essentially a matrixdescription convention. • LP solvers: • Algorithmic codes devoted to finding optimal solutions to specific linear problems. • A code takes as input a the LP numerical coefficients in standard format and produces as output a compact listing of optimal solution values. • LP solvers are available as commercial packages and as open source CHEP06 Mumbai, India 13-17 February 2006
AVAILABLE LIBRARIES • Most used commercial packages: • ILOG/CPLEX:Simplex,Interior,Integer (www.ilog.com) • Xpress-MP: Simplex,Interior,Integer (www.dashoptimization.com) • IBM/OSL: Simplex,Interior,Integer (www.optimize.com) • In general rather sophisticated and expensive, but special agreements possible for research use. Some versions are parallelizable for GRID platforms. • Best open source packages: • COIN-OR (COmputational INfrastructure for Operations Research): Simplex,Integer.Originally from IBM and now available under Common Public Licence (www.coin-or.org) • GLPK (GNU Linear Programming Kit): Simplex,Interior,Integer. Available under the GNU General Public Licence (www.gnu.org/software/glpk/glpk.html) • Lp_solve: Simplex,Integer. Available, under the GNU Lesser Public Licence (www.coin-or.org) • Not as performing as commercial codes, but free and available in source code. See licence for permitted uses. • For additional information see also: • http://www-unix.mcs.anl.gov/otc/Guide/faq/linear-programming-faq.html CHEP06 Mumbai, India 13-17 February 2006
CONCLUSIONS • A wide class of problems encountered in HEP can be formulated similarly to problems related to other fields of human activities, normally covered by Operations Research • It is advisable that the HEP community is kept informed of the latest OR achievements for what concerns both mathematical algoritms and software packages and has quick access to all available tools • Limited to (mixed-integer) linear programming, this talk has provided a brief overview of the existing tools and of their possible applications to HEP CHEP06 Mumbai, India 13-17 February 2006