TY - GEN
T1 - A dynamic nonparametric model for characterizing the topical communities in social streams
AU - Liu, Ziqi
AU - Zheng, Qinghua
AU - Wang, Fei
AU - Tian, Zhenhua
AU - Li, Bo
N1 - Publisher Copyright:
Copyright © SIAM.
PY - 2014
Y1 - 2014
N2 - Latent variable models have proven to be a useful tool for discovering latent structures from observational data. However, the data in social networks often come as streams, i.e., both text content (e.g., emails, user postings) and network structure (e.g., user friendship) evolve over time. To capture the time-evolving latent structures in such social streams, we propose a fully nonparametric Dynamic Topical Community Model (nDTCM), where infinite latent community variables coupled with infinite latent topic variables in each epoch, and the temporal dependencies between variables across epochs are modeled via the rich-gets-richer scheme. We focus on characterizing three dynamic aspects in social streams: the number of communities or topics changes (e.g., new communities or topics are born and old ones die out); the popularity of communities or topics evolves; the semantics such as community topic distribution, community participant distribution and topic word distribution drift. Furthermore, we develop an effective online posterior inference algorithm for nDTCM, which is concordant with the online nature of social streams. Experiments using real-world data show the effectiveness of our model at discovering the dynamic topical communities in social streams.
AB - Latent variable models have proven to be a useful tool for discovering latent structures from observational data. However, the data in social networks often come as streams, i.e., both text content (e.g., emails, user postings) and network structure (e.g., user friendship) evolve over time. To capture the time-evolving latent structures in such social streams, we propose a fully nonparametric Dynamic Topical Community Model (nDTCM), where infinite latent community variables coupled with infinite latent topic variables in each epoch, and the temporal dependencies between variables across epochs are modeled via the rich-gets-richer scheme. We focus on characterizing three dynamic aspects in social streams: the number of communities or topics changes (e.g., new communities or topics are born and old ones die out); the popularity of communities or topics evolves; the semantics such as community topic distribution, community participant distribution and topic word distribution drift. Furthermore, we develop an effective online posterior inference algorithm for nDTCM, which is concordant with the online nature of social streams. Experiments using real-world data show the effectiveness of our model at discovering the dynamic topical communities in social streams.
KW - Bayesian nonparametric models
KW - Social streams
KW - Topical Community
UR - https://www.scopus.com/pages/publications/84959887789
U2 - 10.1137/1.9781611973440.44
DO - 10.1137/1.9781611973440.44
M3 - 会议稿件
AN - SCOPUS:84959887789
T3 - SIAM International Conference on Data Mining 2014, SDM 2014
SP - 379
EP - 387
BT - SIAM International Conference on Data Mining 2014, SDM 2014
A2 - Zaki, Mohammed J.
A2 - Banerjee, Arindam
A2 - Parthasarathy, Srinivasan
A2 - Ning-Tan, Pang
A2 - Obradovic, Zoran
A2 - Kamath, Chandrika
PB - Society for Industrial and Applied Mathematics Publications
T2 - 14th SIAM International Conference on Data Mining, SDM 2014
Y2 - 24 April 2014 through 26 April 2014
ER -