Carnegie Mellon University
Browse
kim_sds_2018.pdf (5.09 MB)

Local Structure and Inference in Large Random Graphs

Download (5.09 MB)
thesis
posted on 2020-12-18, 18:51 authored by Nicolas KimNicolas Kim

Modern social networks now contain on the order of 100 million to 1 billion active profiles. The graph structure of these networks contains an unprecedented amount of information about the social connectivity of the world; however, contemporary algorithms used to estimate random graph models from data are computationally intractable at this scale. To address this, we turn to the graphon random graph model. Graphons are suited to modeling data with a large number of nodes, and where exchangeability of the nodes can be assumed.

Various estimation procedures for graphons exist in the literature. However, accurate estimation requires either an unbiased sample or some way to account for potential bias in the sampling scheme. We present a framework for working with small edge-induced samples of graphons. To account for the sampling bias, we first establish theory that describes the sampling distribution of these subgraphs. Building on this, we propose a procedure for undoing the sampling bias. This allows for estimation of whole-network structure from only an ego network. An interesting consequence of these results is that the mutual friend counts found on most social media platforms contain a great deal of information about the local community structure. Finally, we characterize the asymptotic behavior of a different network model, the Edge-Degeneracy exponential random graph model.

History

Date

2018-05-03

Degree Type

  • Dissertation

Department

  • Statistics

Degree Name

  • Doctor of Philosophy (PhD)

Advisor(s)

Alessandro Rinaldo

Usage metrics

    Categories

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC