Identifying semantically meaningful sub-communities within twitter blogosphere

Dionisios N. Sotiropoulos, Chris D. Kounavis, George M. Giaglis

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    Abstract

    This paper addresses the problem of semantically meaningful group detection within a sub-community of twitter micro-bloggers by utilizing a topic modeling, multi-objective clustering approach. The proposed group detection method is anchored on the Latent Dirichlet Allocation (LDA) topic modeling technique, aiming at identifying clusters of twitter users that are optimal in terms of both spatial and topical compactness. Specifically, the group detection problem is formulated as a multi-objective optimization problem taking into consideration two complementary cluster formation directives. The first objective, related to spatial compactness, is achieved by minimizing the overall deviation from the corresponding cluster centers. The second, related to topical compactness, is achieved by minimizing the portion of probability mass assigned to low probability topics for the corresponding cluster centroids. In our approach, optimization is performed by employing a multi-objective genetic algorithm, which results in a variety of cluster structures that are significantly more interpretable than cluster assignments obtained with traditional single-objective clustering algorithms.

    Original languageEnglish
    Title of host publicationIISA 2013 - 4th International Conference on Information, Intelligence, Systems and Applications
    Pages275-280
    Number of pages6
    DOIs
    Publication statusPublished - 2013
    Event4th International Conference on Information, Intelligence, Systems and Applications, IISA 2013 - Piraeus-Athens, Greece
    Duration: 10 Jul 201312 Jul 2013

    Other

    Other4th International Conference on Information, Intelligence, Systems and Applications, IISA 2013
    Country/TerritoryGreece
    CityPiraeus-Athens
    Period10/07/1312/07/13

    Fingerprint

    Dive into the research topics of 'Identifying semantically meaningful sub-communities within twitter blogosphere'. Together they form a unique fingerprint.

    Cite this