Output

This section documents Sleipnir's diagnostic output when the diagnostics option is set to true. We'll show the diagnostics for the following problem:

       max xy
       x,y
subject to x + 3y = 36

from sleipnir.optimization import Problem
 
problem = Problem()
 
x, y = problem.decision_variable(2)
 
problem.maximize(x * y)
problem.subject_to(x + 3 * y == 36)
 
problem.solve(diagnostics=True)

Exit conditions

First, Sleipnir prints the user-configured exit conditions.

User-configured exit conditions:
  ↳ error below 1e-08
  ↳ iteration callback requested stop
  ↳ executed 5000 iterations

The user-configurable exit conditions include the error tolerance, maximum iterations, and timeout passed to the solve() call; and iteration callbacks added to the Problem returning true.

Problem size and structure

Then, Sleipnir prints the problem's size and structure.

Problem structure:
  ↳ quadratic cost function
  ↳ linear equality constraints
  ↳ no inequality constraints
 
2 decision variables
1 equality constraint
  ↳ 1 linear
0 inequality constraints

Then, Sleipnir prints the solver selected based on that information:

Invoking SQP solver

Available solvers include:

No-op for trivial problems
Newton for unconstrained problems
Sequential Quadratic Programming (SQP) for equality-constrained problems
Interior-point method (IPM) for inequality-constrained problems

Setup time trace

Then, Sleipnir prints a time trace of its autodiff setup.

┏━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┓
┃     time trace           percentage     duration ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┩
│setup                 100.00%▕█████████▏     0.011│
│↳ ∇f(x)                 0.00%▕         ▏     0.000│
│↳ ∇²ₓₓL                81.82%▕███████▎ ▏     0.009│
│  ↳ ∇²ₓₓL_f            36.36%▕███▎     ▏     0.004│
│  ↳ ∇²ₓₓL_c            36.36%▕███▎     ▏     0.004│
│↳ ∂cₑ/∂x                9.09%▕▊        ▏     0.001│
└──────────────────────────────────────────────────┘

The headings are defined as follows:

Heading	Description
time trace	Tree of setup steps
percentage	Percentage of setup time
duration	Duration of setup step in milliseconds

Iterations

After the solver takes each step, it prints a row of iteration diagnostics in a table format.

┏━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┓
┃iter   duration    error       cost      infeas.   complem.    μ       δ     γ    |p_pr|   |p_du|    α_pr     α_du   ↩ ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┩
│   0       0.016 1.7998e-03 -1.0800e+02 6.0167e-10 0.00e+00 0.00e+00 10⁻⁴  10⁻¹⁰ 1.80e+01 6.00e+00 1.00e+00 1.00e+00  0│
│   1       0.003 1.1997e-07 -1.0800e+02 9.9476e-14 0.00e+00 0.00e+00 10⁻⁴  10⁻¹⁰ 2.40e-03 1.00e-03 1.00e+00 1.00e+00  0│
│   2       0.002 4.9987e-12 -1.0800e+02 0.0000e+00 0.00e+00 0.00e+00 10⁻⁴  10⁻¹⁰ 2.00e-07 5.33e-08 1.00e+00 1.00e+00  0│
└───────────────────────────────────────────────────────────────────────────────────────────────────────────────────────┘

The headings are defined as follows:

Heading	Description
iter	Iteration number with one of the suffixes below denoting the iteration type = normal `s` = second-order correction `r` = feasibility restoration
duration	Duration of iteration in milliseconds
error	Infinity norm of scaled KKT condition errors
cost	Cost function value at current iterate
infeas.	Constraint infeasibility at current iterate
complem.	Complementary slackness at current iterate (sᵀz)
μ	Barrier parameter
δ	Iteration matrix Hessian regularization factor (δ in lhs + [δI, 0; 0, -γI])
γ	Iteration matrix constraint Jacobian regularization factor (γ in lhs + [δI, 0; 0, -γI])
\|p_pr\|	Infinity norm of full primal step
\|p_du\|	Infinity norm of full dual step
α_pr	Primal step size α_pr ∈ [0, 1] that scales down the full primal step
α_du	Dual step size α_du ∈ [0, 1] that scales down the full dual step
↩	Number of line search backtracks

Solver time trace

At the end of the solve, the solver prints a time trace of itself.

┏━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┓
┃     time trace           percentage       total      each    runs┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┩
│solver                100.00%▕█████████▏      0.055     0.055    1│
│↳ setup                 9.09%▕▊        ▏      0.005     0.005    1│
│↳ iteration            38.18%▕███▍     ▏      0.021     0.007    3│
│  ↳ feasibility check   0.00%▕         ▏      0.000     0.000    3│
│  ↳ callbacks           0.00%▕         ▏      0.000     0.000    3│
│  ↳ KKT matrix build    3.64%▕▎        ▏      0.002     0.000    3│
│  ↳ KKT matrix decomp   7.27%▕▋        ▏      0.004     0.001    3│
│  ↳ KKT system solve    1.82%▕▏        ▏      0.001     0.000    3│
│  ↳ line search        14.55%▕█▎       ▏      0.008     0.002    3│
│    ↳ SOC               0.00%▕         ▏      0.000     0.000    0│
│  ↳ feas. restoration   0.00%▕         ▏      0.000     0.000    0│
│  ↳ f(x)                0.00%▕         ▏      0.000     0.000    4│
│  ↳ ∇f(x)               5.45%▕▍        ▏      0.003     0.000    4│
│  ↳ ∇²ₓₓL               1.82%▕▏        ▏      0.001     0.000    4│
│  ↳ ∇²ₓₓL_c             0.00%▕         ▏      0.000     0.000    0│
│  ↳ cₑ(x)               1.82%▕▏        ▏      0.001     0.000    4│
│  ↳ ∂cₑ/∂x              0.00%▕         ▏      0.000     0.000    4│
└──────────────────────────────────────────────────────────────────┘

The headings are defined as follows:

Heading	Description
time trace	Tree of solver steps
percentage	Percentage of solve time
total	Total time across all runs in milliseconds
each	Average time for each run in milliseconds
runs	Number of runs

The function evaluations are defined as follows:

Function	Description
f(x)	Cost function value
∇f(x)	Cost function gradient
∇²ₓₓL	Lagrangian Hessian
∇²ₓₓL_f	Cost part of Lagrangian Hessian
∇²ₓₓL_c	Constraint part of Lagrangian Hessian
cₑ(x)	Equality constraint value
∂cₑ/∂x	Equality constraint Jacobian
cᵢ(x)	Inequality constraint value
∂cᵢ/∂x	Inequality constraint Jacobian

Exit status

Finally, the solver prints its exit status.

Exit: success

Possible exit statuses include:

Status	Value	Description
SUCCESS	0	Solved the problem to the desired tolerance.
CALLBACK_REQUESTED_STOP	1	The solver returned its solution so far after the user requested a stop.
TOO_FEW_DOFS	-1	The solver determined the problem to be overconstrained and gave up.
LOCALLY_INFEASIBLE	-2	The solver determined the problem to be locally infeasible and gave up.
GLOBALLY_INFEASIBLE	-3	The problem setup frontend determined the problem to have an empty feasible region.
FACTORIZATION_FAILED	-4	The linear system factorization failed.
FEASIBILITY_RESTORATION_FAILED	-5	The solver failed to reach the desired tolerance, and feasibility restoration failed to converge.
NONFINITE_INITIAL_GUESS	-6	The solver encountered nonfinite initial cost, constraints, or derivatives and gave up.
DIVERGING_ITERATES	-7	The solver encountered diverging primal iterates xₖ and/or sₖ and gave up.
MAX_ITERATIONS_EXCEEDED	-8	The solver returned its solution so far after exceeding the maximum number of iterations.
TIMEOUT	-9	The solver returned its solution so far after exceeding the maximum elapsed wall clock time.

Negative values indicate errors.

Problem formulation tips

Optimizing the problem formulation

Cost functions and constraints can have the following orders:

none (i.e., there is no cost function or are no constraints)
constant
linear
quadratic
nonlinear

For nonlinear problems, the solver calculates the Hessian of the cost function and the Jacobians of the constraints at each iteration. However, problems with lower order cost functions and constraints can be solved faster. For example, the following only need to be computed once because they're constant:

the Hessian of a quadratic or lower cost function
the Jacobian of linear or lower constraints

A problem is constant if:

the cost function is constant or lower
the equality constraints are constant or lower
the inequality constraints are constant or lower

A problem is a linear program (LP) if:

the cost function is linear
the equality constraints are linear or lower
the inequality constraints are linear or lower

A problem is a quadratic program (QP) if:

the cost function is quadratic
the equality constraints are linear or lower
the inequality constraints are linear or lower

All other problems are nonlinear programs (NLPs).

Suppressing constant pruning

Sleipnir aggressively simplifies constant expressions in the expression graph. This includes:

multiplication by 0 or 1
division by 1
addition or subtraction by 0
math functions evaluating constants

If the user intends to reuse the problem formulation with different constants, the expression pruning can be suppressed in two ways.

Make an empty (linear) Variable, then set its value. This can negatively impact autodiff caching.
a = Variable()

a.set_value(0)
Create a constant Variable with a value that won't be pruned (expression-dependent).
a = Variable(2)

Avoiding numerical issues

Instead of using distance (2-norm) for the cost function, use sum-of-squares. The distance calculation's square root is nonlinear and has a limited domain, whereas sum-of-squares has the same minimum, is quadratic, and has no domain restriction. In other words, use minimize(x ** 2 + y ** 2 + z ** 2) instead of minimize(hypot(x, y, z)).

Deduplicating autodiff work

Store common subexpressions in intermediate variables and reuse them instead of writing out the subexpressions each time. This ensures common subexpressions in the expression tree are only traversed and updated once.

Minimum-time problems

The obvious problem formulation for minimum-time problems uses one dt shared across all timesteps.

import sleipnir as slp
 
N = 100
T_max = 5.0
 
problem = slp.optimization.Problem()
 
x = problem.decision_variable(N + 1)
v = problem.decision_variable(N)
 
dt = problem.decision_variable()
dt.set_value(T_max / N)
problem.subject_to(dt > 0)
problem.subject_to(dt < T_max / N)
 
for k in range(N):
    problem.subject_to(x[k + 1] == x[k] + v[k] * dt)
 
problem.minimize(dt)
 
problem.solve()

The nonzero initial value for dt avoids a degenerate case, and the upper bound prevents the solver exploiting discretization artifacts.

This formulation can have feasibility issues though per section 15.3 "Elimination of variables" of "Numerical Optimization, 2nd Ed.". Instead, we recommend using a separate dt for each timestep, with them all equality-constrained.

import sleipnir as slp
 
N = 100
T_max = 5.0
 
problem = slp.optimization.Problem()
 
x = problem.decision_variable(N + 1)
v = problem.decision_variable(N)
 
dt = problem.decision_variable(N)
problem.subject_to(dt > 0)
problem.subject_to(dt < T_max / N)
for k in range(N - 1):
    problem.subject_to(dt[k] == dt[k + 1])
 
for k in range(N):
    dt[k].set_value(T_max / N)
    problem.subject_to(x[k + 1] == x[k] + v[k] * dt[k])
 
problem.minimize(sum(dt))
 
problem.solve()