Scaled Gradient Methods for Ill-conditioned Low-rank Matrix and Tensor Estimation

Tong, Tian

doi:10.1184/R1/21514656.v1

Tong_cmu_0041E_10781.pdf (3.89 MB)

Scaled Gradient Methods for Ill-conditioned Low-rank Matrix and Tensor Estimation

thesis

posted on 2022-11-16, 22:50 authored by Tian TongTian Tong

Many problems encountered in machine learning and signal processing can be formulated as estimating a low-rank object from incomplete, and possibly corrupted, linear measurements; prominent examples include matrix completion and tensor completion. Through the lens of matrix and tensor factorization, one of the most popular approaches is to employ simple iterative algorithms such as gradient descent to recover the low-rank factors directly, which allow for small memory and computation footprints. However, the convergence rate of gradient descent depends linearly, and sometimes even quadratically, on the condition number of the low-rank object, and therefore, slows down painstakingly when the problem is ill-conditioned. This thesis introduces a new algorithm: scaled gradient descent (ScaledGD), which provably converges linearly at a constant rate independent of the condition number of the low-rank object, while maintaining the low per-iteration cost of gradient descent. In addition, a nonsmooth variant of ScaledGD provides further robustness to corruptions by optimizing the least absolute deviation loss. In total, ScaledGD highlights the power of appropriate preconditioning in accelerating nonconvex statistical estimation, where the iterationvarying preconditioners promote desirable invariance properties of the trajectory with respect to the symmetry in low-rank factorization.

History

Date

2022-02-18

Degree Type

Dissertation

Department

Electrical and Computer Engineering

Degree Name

Doctor of Philosophy (PhD)

Advisor(s)

Yuejie Chi

Usage metrics

Keywords

machine learning signal processing scaled gradient descent Low-rank Matrix

Licence

In Copyright

Exports

RefWorks

BibTeX

Ref. manager

Endnote

DataCite

NLM

DC

Scaled Gradient Methods for Ill-conditioned Low-rank Matrix and Tensor Estimation

History

Date

Degree Type

Department

Degree Name

Advisor(s)

Usage metrics

Categories

Keywords

Licence

Exports