220 likes | 362 Views
A Polyhedral Approach to Cardinality Constrained Optimization. Ismael Regis de Farias Jr. and Ming Zhao University at Buffalo, SUNY. Summary. Problem definition Relation to previous work Simple bound inequalities Further research. Problem Definition.
E N D
A Polyhedral Approach to Cardinality Constrained Optimization Ismael Regis de Farias Jr. and Ming Zhao University at Buffalo, SUNY
Summary • Problem definition • Relation to previous work • Simple bound inequalities • Further research
Problem Definition • Given c1n , Amn ,bm1, and ln1, un1≥ 0,find xn1that: • maximizes • cx • subject to • Ax b, • −l≤x ≤u, • and at most k variables are nonzero
Motivation • Portfolio selection • Feature selection in data mining
Polyhedral approach Derive within a branch-and-cut scheme strong inequalities valid for: Pi= conv {x Rn : jN aij xjbi , −l≤x ≤u, and at most k variables are nonzero}, i{1, …, m}, to use as cutting planes in the branch-and-cut
Previous work • Bienstock (1996): critical set inequalities • de Farias and Nemhauser (2003): cover inequalities. However, the present case is more general and the polyhedral structure is much richer …
Example Let P = conv {x [−1, 1]6 : 6 x1 + 4 x2 + 3 x3 + 2 x4 + x5 + x6 • 6 and at most 3 variables are nonzero}. The following inequalities define facets of P: • 6 x1 + 4 x2 + 3 x3 + 2 x4 + x5 + x66 • 4 x2 + 3 x3 + 2 x4 + x5 + x66 • 4 x2 + 3 x3 + 2 x4 + x56 • 4 x2 + 3 x3 + 2 x4 + x66 • 4 x2 + 2 x4 + x5 + x66 • 4 x2 + 2 x4 + x66 • 4 x2 + 2 x4 + x56
To take advantage of previous work … first, we scale and translate the variables, i.e. P = conv {x [0, 1]n : jN aj xjb and xj • βj , j N, for at most k variables}, and second, we consider the pieces of P
The pieces are defined as follows … • Proposition Let W N, XW= {x Rn : xjβjj W and xj≤βjj N− W}, and PW= P∩XW . Then, PW = conv (S∩XW), where S = {x [0, 1]n : jN aj xjb and xj βj , j N, for at most k variables}.
For each piece … i.e. for a given W, we change the variables as: • yj← (xj –βj) / (1 – βj), j W • yj← (βj – xj) / βj , j N− W
Example P = conv {x [0, 1]2: 6 x1 + 4 x27, and x1 = ½ or x2= ½}. x2 1 ½ 6 x1 + 4 x2=7 x1 ½ 1
Example P = conv {x [0, 1]2: 6 x1 + 4 x27, and x1 = ½ or x2= ½}. x2 1 P{2} PN ½ P P{1} x1 ½ 1
Example P = conv {x [0, 1]2: 6 x1 + 4 x27, and x1 = ½ or x2= ½}. x2 1 −3 y1 + 2 y22 3 y1 + 2 y22 at most 1 nonzero at most 1 nonzero ½ −3 y1 − 2 y22 3 y1 − 2 y22 at most 1 nonzero at most 1 nonzero x1 ½ 1
When aj 0 and b > 0 … • Proposition The inequality jN xj k is facet-defining iff an−k+ …+ an−1 b and a1+ an−k+2+ …+ an b. • Proposition Whenan−k+ …+ an−1 b and a1+ an−k+2+ …+ an> b, the inequality a1x1+2≤j≤n−k−1max {aj ,Δ} xj +Δn−k≤j≤n xj ≤ k Δ defines a facet of P, where Δ = (b − n−k−2≤i≤n ai).
It then follows that … • Proposition The inequality: • jW (xj–βj)/(1 – βj)–jN−W (xj–βj)/βj k is valid W N, and it is facet-defining “under certain conditions”.
In the same way … • Proposition The inequality: • a1(x1–β1)/(1 – β1)+2≤j≤n−k−1, jWmax {aj ,Δ}(xj–βj)/(1 – βj)+2≤j≤n−k−1, jN−Wmax {aj,Δ} (xj–βj)/βj + Δn−k≤j≤n , jW(xj–βj)/(1 – βj) + Δn−k≤j≤n, jN−W(xj–βj)/βj ≤k Δ defines a facet of P “under certain conditions”.
Example P = conv {x [0, 1]2: 6 x1 + 4 x27, and x1 = ½ or x2= ½}. x2 1 −x1 + x21/2 x1 + x23/2 (y1 + y21 and −3 y1 + 2 y22) (y1 + y21 and 3 y1 + 2 y22) x1 − x21/2 x1 + x2≥1/2 (y1 + y21 and 3 y1 − 2 y22) (y1 + y21 and −3 y1 − 2 y22) x1 ½ 1
Critical sets and covers • By fixing, at 0 or 1, variables with positive or negative coefficients, we can obtain implied critical sets or cover inequalities that define facets in the projected polytope. • Then, by lifting the fixed variables, we obtain strong inequalities valid for P
Example Let P = conv {x [0,1]5: 6x1+4x2−3x3−2x4 +x56 and at most 2 variables are positive}. Fix x3 = 1 and x4 = 0. The inequality: • 6x1+4x2+ 3x59 defines a facet of P∩ {x [0,1]5:x3 = 1 and x4 = 0}.
Simple bound inequalities Let P = conv {x [0,1]4:6x1−4x2+3x3−x4 • 3 and at most 2 variables are positive}. Fix x3 = x4 = 0. Then, x11 defines a facet of P∩ {x [0,1]4: x3 = x4 = 0}. Lifting with respect to x4, we obtain x1+ αx4≤ 1, which gives α= ⅓. Lifting now with respect to x3, we obtain 3x1+αx3+x4≤ 3, which gives α= 2, and so 3x1+2x3+x4≤ 3.
Additional results • Two families of lifted cover inequalities • Two families of inequalities derived from simple bounds • Necessary and sufficient condition for “pieces of a facet” to be a facet
Further Research • Separation routines and computational testing • Inequalities derived from intersection of knapsacks • Special results for feature selection in data mining