Bayesian Dynamic Modeling for Streaming Network Data
Streaming network data of various forms arises in many applications, raising interest in research to model and quantify the nature of stochasticity and structure in dynamics underlying such data. One example context is that of traffic flow count data in networks, such as in automobile or aviation transportation, certain directed social network contexts, and Internet studies. Using an example of Internet browser traffic flows through site-segments of an international news website, I present Bayesian analyses of two new, linked classes of models which, in tandem, allow fast, scalable and interpretable Bayesian inference on dynamic patterns over time underlying flows. I develop two kinds of flexible state-space models for streaming count data, able to adaptively characterize and quantify network dynamics efficiently in real-time. These models are then used as emulators of more structured, time-varying gravity models that allow formal dissection of network dynamics. This yields interpretable inferences on traffic flow characteristics, and on dynamics in interactions among network nodes. Bayesian monitoring theory defines a strategy for sequential model assessment and adaptation in cases when network flow data deviates from model-based predictions. Exploratory and sequential monitoring analyses of evolving traffic on a network of web site-segments in e-commerce demonstrate the utility of this coupled Bayesian emulation approach to analysis of streaming network count data.
A second, different dynamic network context is that involving relational data. Examples include contexts of binary network data indicating communications or relationships between pairs of network nodes over time. Some popular examples include friendships over social networks and communications between different functional zones in brain. Using an example of co-movements of company stock indices, I develop and compare two different approaches. One involves latent threshold models mapping latent processes to binary entries via a probabilistic link function, a second involves dynamic generalized linear models for binary outcomes. Analyses implement using Markov chain Monte Carlo methods are available for these models, but naturally computationally demanding and not scalable to relevant network dimensions for many contexts. In contrast, dynamic generalized linear models can implemented using fast, effective approximate Bayesian computations for both sequential and retrospective analyses to enable linear-time computations. I also demonstrate the use of a model decoupling/recoupling strategy to enable scaling in network size.
Bayesian model emulation
Decouple/Recouple
Dynamic gravity model
Dynamic network flow model
Local linear growth model
Monitoring and anomaly detection

This work is licensed under a Creative Commons Attribution-Noncommercial-No Derivative Works 3.0 United States License.
Rights for Collection: Duke Dissertations
Works are deposited here by their authors, and represent their research and opinions, not that of Duke University. Some materials and descriptions may include offensive content. More info
Related items
Showing items related by title, author, creator, and subject.
-
Voter Models On Graphs
Huo, Ran (2019)The voter model which describes the flow of information through interactions between neighbors has been widely studied in the field of probability. In this paper we study two variations of the voter model, one is called ... -
A Survey of Users of Corporate Planning Models
Naylor, Thomas; Schauland, Horst (1976-05)Why are over 2,000 corporations either using, developing, or planning to develop some form of corporate simulation model? What types of companies are using corporate planning models? How are they being used? Which resources ... -
A Mean Field Approach to Watershed Hydrology
Bartlett Jr., Mark Stephan (2016)Society-induced changes to the environment are altering the effectiveness of existing management strategies for sustaining natural and agricultural ecosystem productivity. At the watershed scale, natural and agro-ecosystems ...