PSDP Problem

Definitions

\[ PSDP: \min_{P\in S^n_{\geq}} ||F - PG|| \]

\[ SP: \min_{S\in S^n} ||F - SG|| \]

Here, \(F, G \in R^{\ n\times m}\).

Background

\(\tilde{S}\) denotes a minimizer for the SP problem.
\(\tilde{P}\) denotes a minimizer for the PSDP problem.
\(\tilde{S}GG'+ GG'\tilde{S} = FG' + GF' = Q\)
When \(rank(G) = n\), a sufficient condition for \(\tilde{S}\) to be in \(S^n_{\geq}\) is that \(Q \in S^n_{\geq}\).
A sufficient condition for \(\tilde{S}\) to be in \(S^n_{\geq}\) is that \(FG' \in S^m_{\geq}\).
\(rank(G) = n\) is also necessary and sufficient for uniqueness of minimizers in case of PSDP.

Previous Approches

The objective function and the feasible set in case of PSDP are both convex.
We can use convex optimization which directly exploits the convexity of \(S^n_{\geq}\).
Solutions involve finite convergence towards global minimas bringing the approximate minimizer arbitrarily close to the unique global minimizer.

Approach of this paper

\[ PSDP^*: \min_{E \in R^{n\times n}} ||F - E'EG|| \]

\[ \tilde{P}_{\text{global}} = \tilde{E}_{\text{local}}'\tilde{E}_{\text{local}} \]

Algorithm ideas:

Steepest descent (doesn’t converge, proven)
Newton’s algorithm (modified, superior than known convex programs of that time)
Where do we obtain the data? Take stuff, perturb by errors to make sure that \(\tilde{S} \neq \tilde{P}\).

Newton’s method:

\[ x_{k + 1} = x_k - \frac{f'(x_k)}{f''(x_k)} \]

Has quadratic convergence.
Can find complex zeroes.
If you have \(n\) variables, then \(n^2\) operations is needed to find \(\nabla^2 f\) (Hessian) and \(n^3\) operations to find the inverse of the Hessian.

Quasi Newtown method:

\[ GD:\ \ \ \ x_{k + 1} = x_k - \eta \cdot I\cdot \nabla f(x_k) \]

\[ NM:\ \ \ \ x_{k + 1} = x_k - \eta \cdot \frac{1}{\nabla^{2}f(x_k)}\cdot \nabla f(x_k) \]

\[ QNM:\ \ \ \ x_{k + 1} = x_k - \eta \cdot H_k\cdot \nabla f(x_k) \]

A good choice for \(H\) can be the following. This is the BFGS method and \(H_k\) holds curvature information. Key idea is to match curvature along trajectory.

\[ H = \begin{bmatrix} 1& 1\\ \end{bmatrix} \]