Carnegie Mellon University
Browse

Representing Time: Towards Pragmatic Multivariate Time Series Modeling

Download (9.34 MB)
thesis
posted on 2024-07-22, 19:45 authored by Cristian Ignacio Challu

 Time series models are specialized in learning temporal dependencies among observations and interactions between multiple features in a data stream. During  the last decade, the unprecedented success of Deep Learning (DL) models in Computer Vision and Natural Language Processing has steadily permeated to time series tasks. From Recurrent Neural Networks to Transformers, new advancements in  architectural design improved capabilities and performance. Despite this success,  I identify several challenges to adopting current state-of-the-art (SoTA) methods,  including handling distribution shifts and missing data, computational complexity,  and interpretability.  

The success of DL models is usually attributed to their ability to discover helpful  data representations automatically. Multivariate time series models involve high dimensional objects with numerous time series and temporal observations. However,  they often exhibit strong temporal dependencies and inter-feature relations. In  this thesis, I propose to design DL architectures and algorithms for forecasting  and anomaly detection tasks that leverage these dependencies to induce efficient  learning of representations that satisfy desirable properties that can (i) improve the  models’ performance, (ii) improve robustness by favoring domain adaptation, and  (iii) reduce over-parameterization to improve scalability. The completed work is  organized in three parts, presenting seven novel model types and algorithms that  achieve state of the art performance in various tasks while addressing key adoption  challenges.  

In the first part, I explore the dynamic latent space principle and design latent  temporal representations to make robust anomaly detection and forecasting models.  In the second part, I present a novel scalable and interpretable model for multi-step  forecasting based on a novel non-linear frequency decomposition with connections  to Wavelet theory. It also features two extensions on using multivariate exogenous  covariates for high-impact domains, including energy and healthcare. Finally, in  the third part, I present a large-scale study on enabling conditions, on both model  design and data characteristics, for transferability of pre-trained models on time  series tasks 

History

Date

2024-04-02

Degree Type

  • Dissertation

Department

  • Machine Learning

Degree Name

  • Doctor of Philosophy (PhD)

Advisor(s)

Artur Dubrawski

Usage metrics

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC