Algorithms for Learning Latent Models: From Establishing Tractability to Approaching Optimality
Modern machine learning relies on algorithms that fit expressive latent models to large datasets. While such tasks are easy in low dimensions, real-world datasets are inherently high-dimensional, often leading to computational intractability. Additionally, a prerequisite to deploying models in real-world systems is to ensure that their behavior degrades gracefully when the modeling assumptions no longer hold. Therefore, there is a growing need for efficient algorithms that fit reliable and robust models to data and are accompanied by provable guarantees.
In this thesis, we focus on designing such efficient and robust algorithms for learning latent variable models. In particular, we investigate two complementary regimes arising in learning latent models: establishing computational tractability and approaching computational optimality. The first regime considers learning high-dimensional latent models for which no efficient algorithms were known. We resolve several central open questions in this regime by providing the first polynomial-time algorithms for robustly learning a mixture of Gaussians, robust linear regression, and learning two-layer neural networks. The second regime considers models for which polynomial-time algorithms were already well-established. Here, we show that we can obtain algorithms with information-theoretically minimal running time and sample complexity. In particular, we show that for several low-rank models there is no statistical vs. computational trade-off.
- Computer Science
- Doctor of Philosophy (PhD)