Researchers across a wide array of disciplines are interested in finding the most influential subjects in a network. In a network setting, intervention effects and health outcomes can spill over from one node to another through network ties, and influential subjects are expected to have a greater impact than others. For this reason, network research in public health has attempted to maximize health and behavioral changes by intervening on a subset of influential subjects. Although influence is often defined only implicitly in most of the literature, the operative notion of influence is inherently causal in many cases: influential subjects are those we should intervene on to achieve the greatest overall effect across the entire network. In this work, we define a causal notion of influence using potential outcomes. We review existing influence measures, such as node centrality, that largely rely on the particular features of the network structure and/or on certain diffusion models that predict the pattern of information or diseases spreads through network ties. We provide simulation studies to demonstrate when popular centrality measures can agree with our causal measure of influence. As an illustrative example, we apply several popular centrality measures to the HIV risk network in the Transmission Reduction Intervention Project and demonstrate the assumptions under which each centrality can represent the causal influence of each participant in the study.
Finding influential subjects in a network using a causal framework
Biometrics, 79 (4), 3715-3727. doi: 10.1111/biom.13841. PMCID: PMC10423748.