Hua-Ping Zhang, Rui-Qi Zhang, Yan-Ping Zhao and Bao-Jun Ma. Big Data Modeling and Analysis of Microblog Ecosystem. International Journal of Automation and Computing, vol. 11, no. 2, pp. 119-127, 2014. https://doi.org/10.1007/s11633-014-0774-9
Citation: Hua-Ping Zhang, Rui-Qi Zhang, Yan-Ping Zhao and Bao-Jun Ma. Big Data Modeling and Analysis of Microblog Ecosystem. International Journal of Automation and Computing, vol. 11, no. 2, pp. 119-127, 2014. https://doi.org/10.1007/s11633-014-0774-9

Big Data Modeling and Analysis of Microblog Ecosystem

doi: 10.1007/s11633-014-0774-9
Funds:

This work was partly supported by National Natural Science Foundation of China (No.61272362), National Basic Research Program of China (973 Program) (No.2013CB329606), and High-Tech Development Plan of Xinjiang (No.201212124).

  • Received Date: 2013-07-31
  • Rev Recd Date: 2013-11-21
  • Publish Date: 2014-04-01
  • Recent progress of Web 2.0 applications has witnessed the rapid development of microblog in China, which has already been one of the most important ways for online communications, especially on sharing information. This paper tries to make an in-depth investigation on the big data modeling and analysis of microblog ecosystem in China by using a real dataset containing over 17 million records of SinaWeibo users. First, we present the detailed geography, gender, authentication, education and age analysis of microblog users in this dataset. Then we conduct the numerical features distribution analysis, propose the user influence formula and calculate the influences for different kinds of microblog users. Finally, user content intention analysis is performed to reveal users' most concerns in their daily life.

     

  • loading
  • [1]
    Q. Gao, F. Abel, G. J. Houben, Y. Yu. A comparative study of users' microblogging behavior on Sina Weibo and Twitter. User Modeling, Adaptation, and Personalization. Springer, Berlin Heidelberg, vol.7379, pp.88-101, 2012.
    [2]
    P. Bao, H. W. Shen, J. Huang, X. Q. Cheng. Popularity prediction in microblogging network: A case study on Sina Weibo. In Proceedings of the 22nd International Conference on World Wide Web, Republic and Canton of Geneva, Switzerland, pp.177-178, 2013.
    [3]
    Z. B. Guo, Z. T. Li, H. Tu, L. Li. Characterizing user behavior in Weibo. In Proceedings of the 3rd FTRA International Conference on Mobile, Ubiquitous, and Intelligent Computing (MUSIC), IEEE, Vancouver, BC, USA, pp.60-65, 2012.
    [4]
    A. Java, X. D. Song, T. Finin, B. Tseng. Why we twitter: Understanding microblogging usage and communities. In Proceedings of the 9th WebKDD and the 1st SNA-KDD 2007 Workshop on Web Mining and Social Network Analysis, University of Maryland, San Jose, CA, USA, pp.56-65, 2007.
    [5]
    A. Mislove, S. Lehmann, Y. Ahn, J. Onnela, J. NielsRosenquist. Understanding the demographics of twitter users. In Proceedings of the 5th International AAAI Conference on Weblogs and Social Media, AAAI, Menlo Park, pp.554-557, 2009.
    [6]
    H. Kwak, C. Lee, H. Park, S. Moon. What is Twitter, a social network or a news media? In Proceedings of the 19th International Conference on World Wide Web, ACM, New York, USA, pp.591-600, 2010.
    [7]
    Q. Yan, L. Wu, L. Zheng. Social network based microblog user behavior analysis. Physica A: Statistical Mechanics and Its Applications, vol.392, no.7, pp.1712-1723, 2013.
    [8]
    A. L. Barabási. The origin of bursts and heavy tails in human dynamics. Nature, vol.435, no.7039, pp.207-211, 2005.
    [9]
    O. Tsur, A. Rappoport. What's in a hashtag? Content based prediction of the spread of ideas in microblogging communities. In Proceedings of the 5th ACM International Conference on Web Search and Data Mining (WSDM'12), ACM, Seattle, USA, pp.643-652, 2012.
    [10]
    J. A. Mangai, V. S. Kumar, S. A. A. Balamurugan. A novel feature selection framework for automatic web page classification. International Journal of Automation and Computing, vol.9, no.4, pp.442-448, 2012.
    [11]
    S. A. A. Balamurugan, R. Rajaram. Effective and efficient feature selection for large-scale data using Bayes' theorem. International Journal of Automation and Computing, vol.6, no.1, pp.62-71, 2009.
    [12]
    E. Bakshy, J. Hofman, W. Mason, D. Watts. Everyone's an influencer: Quantifying influence on twitter. In Proceedings of the 4th ACM International Conference on Web Search and Data Mining, ACM, Hong Kong, China, pp.65-74, 2011.
    [13]
    M. Cha, H. Haddadi, F. Benevenuto, P. K. Gummadi. Measuring user influence in Twitter: The million follower fallacy. In Proceedings of the 4th International AAAI Conference on Weblogs and Social Media (ICWSM'10), AAAI, Washington, USA, pp.10-17, 2010.
    [14]
    D. Quercia, L. Capra, J. Crowcroft. The social world of Twitter: Topics, geography, and emotions. In Proceedings of the 6th International AAAI Conference on Weblogs and Social Media (ICWSM'12), AAAI, Dublin, Ireland, pp.298-305, 2012.
    [15]
    D. M. Romero, W. Galuba, S. Asur, B. A. Huberman. Influence and passivity in social media. In Proceedings of the 20th International Conference Companion on World Wide Web (WWW'11), ACM, Hyderabad, India, pp.113-114, 2011.
    [16]
    Z. Yang, J. Y. Guo, K. K. Cai, J. Tang, J. Z. Li, L. Zhang, Z. Su. Understanding retweeting behaviors in social networks. In Proceedings of the 19th ACM International Conference on Information and Knowledge Management, ACM, New York, USA, pp.1633-1636, 2010.
    [17]
    R. Lee, S. Wakamiya, K. Sumiya. Discovery of unusual regional social activities using geo-tagged microblogs. World Wide Web, vol.14, no.4, pp.321-349, 2011.
    [18]
    J. Lingad, S. Karimi, J. Yin. Location extraction from disaster-related microblogs. In Proceedings of the 22nd International Conference on World Wide Web Companion, Republic and Canton of Geneva, Switzerland, pp.1017-1020, 2013.
    [19]
    H. Kwak, H. Chun, S. Moon Fragile. Online relationship: A first look at unfollow dynamics in Twitter. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, pp.1091-1100, 2011.
    [20]
    R. Wang, Y. S. Jin. An empirical study on the relationship between the followers' number and influence of microblogging. In Proceedings of the 2010 International Conference on E-Business and E-Government (ICEE), IEEE, Guangzhou, China, pp.2014-2017, 2010.
    [21]
    W. L. Chen, S. Y. Cheng, X. He, F. Jiang. Influence rank: An efficient social influence measurement for millions of users in microblog. In Proceedings of the 2nd International Conference on Cloud and Green Computing (CGC), IEEE, Xiangtan, China, pp.563-570, 2012.
    [22]
    H. P. Zhang, S. L. Wang. Management and Index Method of Perfect Double Array TRIE Tree Dictionary, Patent No. 200510130690. 3. 2005-12-21, China.
    [23]
    H. P. Zhang, Y. L. Huang, C. C. Gong. An Extraction Method and System of Terminology, Patent No. 200710121839.0, China.
  • 加载中

Catalog

    通讯作者: 陈斌, bchen63@163.com
    • 1. 

      沈阳化工大学材料科学与工程学院 沈阳 110142

    1. 本站搜索
    2. 百度学术搜索
    3. 万方数据库搜索
    4. CNKI搜索

    用微信扫码二维码

    分享至好友和朋友圈

    Article Metrics

    Article views (5764) PDF downloads(3575) Cited by()
    Proportional views
    Related

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return