Hello. Sign in to personalize your visit. New user? Register now.  
Journal of Computational and Graphical Statistics
Building an Effective Representation for Dynamic Networks

To cite this paper:
Shawndra Hill, Deepak K Agarwal, Robert Bell, Chris Volinsky. Journal of Computational and Graphical Statistics. September 1, 2006, 15(3): 584-608. doi:10.1198/106186006X139162.

Shawndra Hill,Deepak K. Agarwal,Robert Bell,Chris Volinsky

Shawndra Hill has accepted a position as Assistant Professor, Operations and Information Management Department, Wharton School of the University of Pennsylvania, 3730 Walnut Street, Suite 500, Philadelphia, PA 19104. Deepak K. Agarwal is Senior Technical Specialist, Robert Bell is Senior Technical Specialist, and Chris Volinsky is Director, Statistics Research Department, AT&T Labs–Research, 180 Park Avenue, Florham Park, NJ 07932.



A dynamic network is a special type of network composed of connected transactors which have repeated evolving interaction. Data on large dynamic networks such as telecommunications networks and the Internet are pervasive. However, representing dynamic networks in a manner that is conducive to efficient large-scale analysis is a challenge. In this article, we represent dynamic graphs using a data structure introduced in an earlier article. We advocate their representation because it accounts for the evolution of relationships between transactors through time, mitigates noise at the local transactor level, and allows for the removal of stale relationships. Our work improves on their heuristic arguments by formalizing the representation with three tunable parameters. In doing this, we develop a generic framework for evaluating and tuning any dynamic graph. We show that the storage saving approximations involved in the representation do not affect predictive performance, and typically improve it. We motivate our approach using a fraud detection example from the telecommunications industry, and demonstrate that we can outperform published results on the fraud detection task. In addition, we present a preliminary analysis on Web logs and e-mail networks.

All papers
Previous Next