Open Access Issue
Event Detection and Identification of Influential Spreaders in Social Media Data Streams
Big Data Mining and Analytics 2018, 1 (1): 34-46
Published: 25 January 2018

Microblogging, a popular social media service platform, has become a new information channel for users to receive and exchange the most up-to-date information on current events. Consequently, it is a crucial platform for detecting newly emerging events and for identifying influential spreaders who have the potential to actively disseminate knowledge about events through microblogs. However, traditional event detection models require human intervention to detect the number of topics to be explored, which significantly reduces the efficiency and accuracy of event detection. In addition, most existing methods focus only on event detection and are unable to identify either influential spreaders or key event-related posts, thus making it challenging to track momentous events in a timely manner. To address these problems, we propose a Hypertext-Induced Topic Search (HITS) based Topic-Decision method (TD-HITS), and a Latent Dirichlet Allocation (LDA) based Three-Step model (TS-LDA). TD-HITS can automatically detect the number of topics as well as identify associated key posts in a large number of posts. TS-LDA can identify influential spreaders of hot event topics based on both post and user information. The experimental results, using a Twitter dataset, demonstrate the effectiveness of our proposed methods for both detecting events and identifying influential spreaders.

Open Access Issue
A Novel Multilevel Index Model for Distributed Service Repositories
Tsinghua Science and Technology 2017, 22 (3): 273-281
Published: 04 May 2017

This study, based on the theory of equivalence relations, proposes a novel multilevel index model for decentralized service repositories to eliminate redundant information and enhance the time-management quality of the service retrieval process of the service repository architecture. An efficient resource discovery algorithm based on Discrete Hash Tables is presented to enable efficient and effective retrieval services among different distributed repositories. The performance of the proposed model and the supporting algorithms have been evaluated in a distributed environment. Experimental results validate the effectiveness of our proposed indexing model and search algorithm.

total 2