Taylor & Francis Group
Browse
uasa_a_1778482_sm5880.pdf (129.65 kB)

Estimation of Subgraph Densities in Noisy Networks

Download (129.65 kB)
Version 2 2020-08-24, 09:28
Version 1 2020-06-09, 17:55
journal contribution
posted on 2020-08-24, 09:28 authored by Jinyuan Chang, Eric D. Kolaczyk, Qiwei Yao

While it is common practice in applied network analysis to report various standard network summary statistics, these numbers are rarely accompanied by uncertainty quantification. Yet any error inherent in the measurements underlying the construction of the network, or in the network construction procedure itself, necessarily must propagate to any summary statistics reported. Here we study the problem of estimating the density of an arbitrary subgraph, given a noisy version of some underlying network as data. Under a simple model of network error, we show that consistent estimation of such densities is impossible when the rates of error are unknown and only a single network is observed. Accordingly, we develop method-of-moment estimators of network subgraph densities and error rates for the case where a minimal number of network replicates are available. These estimators are shown to be asymptotically normal as the number of vertices increases to infinity. We also provide confidence intervals for quantifying the uncertainty in these estimates based on the asymptotic normality. To construct the confidence intervals, a new and nonstandard bootstrap method is proposed to compute asymptotic variances, which is infeasible otherwise. We illustrate the proposed methods in the context of gene coexpression networks. Supplementary materials for this article are available online.

Funding

Chang was supported in part by the Fundamental Research Funds for the Central Universities of China, the National Natural Science Foundation of China (grant nos. 11871401 and 71991472), the funds of Fok Ying-Tong Education Foundation for Young Teachers in the Higher Education Institutions of China, and the Center of Statistical Research and the Joint Lab of Data Science and Business Intelligence at SWUFE. Kolaczyk was supported in part by the US Air Force Office of Scientific Research.

History

Usage metrics

    Journal of the American Statistical Association

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC