Here $f_0: \mathbb{R}^n \mapsto \mathbb{R}$ is the objective function and $f_i:\mathbb{R}^n \mapsto \mathbb{R}$, $i=1,\ldots,m$, are the constraint functions.
An equality constraint $f_i(\mathbf{x}) = b_i$ can be absorbed into inequality constraints $f_i(\mathbf{x}) \le b_i$ and $- f_i(\mathbf{x}) \le - b_i$.
If the objective and constraint functions are convex, then it is called a convex optimization problem.
In a convex optimization problem, only linear equality constraint of form $\mathbf{A} \mathbf{x} = \mathbf{b}$ is allowed (why?).
Mosek
, Gurobi
, Cplex
, cvx
, Convex.jl
, ...), just like numerical linear algebra libraries BLAS and LAPACK. Current technology can solve convex problems with up to thousands of variables and constraints.Like numerical linear algebra, getting familiar with good optimization softwares broadens the scope and scale of problems we are able to solve in statistics.
Following table lists some of the best convex optimization softwares. Use of modeling tools cvx
(for Matlab) or Convex.jl
(for Julia), coupled with solvers Mosek
and/or Gurobi
, is highly recommended.
LP | MILP | SOCP | MISOCP | SDP | GP | NLP | MINLP | R | Matlab | Julia | Python | Cost | ||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
modeling tools | ||||||||||||||||
MathProgBase.jl | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | O | ||||||||
Convex.jl | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | O | |||||||||
cvx | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | A | |||||||
convex solvers | ||||||||||||||||
Mosek | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | A | ||||
Gurobi | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | A | |||||||
CPLEX | ✔️ | ✔️ | ✔️ | ✔️ | ? | ✔️ | ✔️ | ✔️ | A | |||||||
SCS | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | O | |||||||||
SeDuMi | ✔️ | ✔️ | ✔️ | ? | ✔️ | O | ||||||||||
SDPT3 | ✔️ | ✔️ | ✔️ | ? | ✔️ | O | ||||||||||
NLP solvers | ||||||||||||||||
KNITRO | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | $ | |||||||
NLopt | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | O | ||||||||||
Ipopt | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | O |
O: open source
A: free academic license
$: commercial
Difference between modeling tool and solvers.
Solvers (Mosek
, Gurobi
, Cplex
, ...) are concrete software implementation of optimization algorithms.
Modeling tools such as cvx
(for Matlab) and Convex.jl
(Julia analog of cvx) implement the disciplined convex programming (DCP) paradigm proposed by Grant and Boyd (2008) http://stanford.edu/~boyd/papers/disc_cvx_prog.html. DCP prescribes a set of simple rules from which users can construct convex optimization problems easily.
Modeling tools usually have the capability to use a variety of solvers. But modeling tools are solver agnostic so users do not have to worry about specific solver interface.
http://hua-zhou.github.io/teaching/biostatm280-2016winter/cs.html
http://hua-zhou.github.io/teaching/st790-2015spr/ST790-2015-HW4.pdf
Nonnegative least squares (NNLS), Lasso regression, Elastic net, generalized lasso, support vector machine (svm), Huber loss regression.
Group lasso, square root lasso, image denoising.
http://hua-zhou.github.io/teaching/biostatm280-2016winter/lasso.html
http://hua-zhou.github.io/teaching/st790-2015spr/ST790-2015-HW5.pdf
http://hua-zhou.github.io/teaching/st790-2015spr/ST790-2015-HW6.pdf