Willingness Maximization for Ego Network Data Extraction in Multiple Online Social Networks

被引:1
|
作者
Hsu, Bay-Yuan [1 ]
Yeh, Lo-Yao [2 ]
Chang, Ming-Yi [3 ]
Shen, Chih-Ya [4 ]
机构
[1] Natl Tsing Hua Univ, Dept Ind Engn & Engn Management, Hsinchu 300, Taiwan
[2] Natl Cent Univ, Dept Informat Management, Taoyuan City 320, Taiwan
[3] Fu Jen Catholic Univ, Dept Sociol, New Taipei City 242, Taiwan
[4] Natl Tsing Hua Univ, Dept Comp Sci, Hsinchu 300, Taiwan
关键词
Willingness; approximation algorithm; crawling; ego networks; online social networks; COMMUNITY SEARCH;
D O I
10.1109/TKDE.2022.3207150
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
network (ego network) data are very important for evaluating algorithms and machine learning approaches in Online Social Networks (OSNs). Nevertheless, obtaining the ego network data from OSNs is not a trivial task. Conventional manual approaches are time-consuming, and sometimes the ego network data are quite incomplete because only a small number of users would agree to provide their data. This is because there are two important factors that should be considered simultaneously for this data acquisition task: i) users' willingness to provide their data, and ii) the structure of the ego network. However, addressing the above two factors to obtain the more complete ego network data has not received much research attention. Therefore, in this paper, we address this issue by proposing a family of new research problems. The first proposed problem, named Willingness Maximization for Ego Network Extraction in Online Social Networks (WMEgo), identifies a set of ego networks from a single OSN, such that the willingness of the users to provide their data is maximized. We prove that WMEgo is NP-hard and propose a 1/2 (1- 1/e)-approximation algorithm, named Ego Network Identification with Maximum Willingness (EIMW). Furthermore, we extend the idea of WMEgo to multiple social networks and formulate a new research problem, named Willingness Maximization on Multiple Social Networks for Ego Network Extraction (WM2Ego), which is able to effectively obtain ego network data from multiple social networks simultaneously. We propose a 1/2-approximation algorithm, named Maximum Expansion for UNified EXpenses (MUNEX) for a special case of WM(2)Ego and then design a constant-ratio approximation algorithm to the general WM(2)Ego problem, named Maximum Expansion with Expense Examination (M3E). We conduct two evaluation studies with 672 and 1,052 volunteers to validate the proposed WMEgo and WM2Ego problems, respectively, and show that they are able to obtain much more complete ego network data compared to other baselines. We also perform extensive experiments on multiple real datasets to demonstrate that the proposed approaches significantly outperform the other baselines.
引用
收藏
页码:8672 / 8686
页数:15
相关论文
共 50 条
  • [21] Distributed Influence Maximization for Large-Scale Online Social Networks
    Tang, Jing
    Zhu, Yuqing
    Tang, Xueyan
    Han, Kai
    2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022), 2022, : 81 - 95
  • [22] Credit distribution for influence maximization in online social networks with node features
    Deng, Xiaoheng
    Pan, Yan
    Shen, Hailan
    Gui, Jingsong
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2016, 31 (02) : 979 - 990
  • [23] An Effective Simulated Annealing for Influence Maximization Problem of Online Social Networks
    Liu, Shi-Jui
    Chen, Chi-Yuan
    Tsai, Chun-Wei
    8TH INTERNATIONAL CONFERENCE ON EMERGING UBIQUITOUS SYSTEMS AND PERVASIVE NETWORKS (EUSPN 2017) / 7TH INTERNATIONAL CONFERENCE ON CURRENT AND FUTURE TRENDS OF INFORMATION AND COMMUNICATION TECHNOLOGIES IN HEALTHCARE (ICTH-2017) / AFFILIATED WORKSHOPS, 2017, 113 : 478 - 483
  • [24] ELP: Link prediction in social networks based on ego network perspective
    Mishra, Shivansh
    Singh, Shashank Sheshar
    Kumar, Ajay
    Biswas, Bhaskar
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2022, 605
  • [25] Credit Distribution and Influence Maximization in Online Social Networks Using Node Features
    Deng, Xiaoheng
    Pan, Yan
    Wu, You
    Gui, Jingsong
    2015 12TH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (FSKD), 2015, : 2093 - 2100
  • [26] Personality assessment using multiple online social networks
    Shally Bhardwaj
    Pradeep K. Atrey
    Mukesh K. Saini
    Abdulmotaleb El Saddik
    Multimedia Tools and Applications, 2016, 75 : 13237 - 13269
  • [27] Personality assessment using multiple online social networks
    Bhardwaj, Shally
    Atrey, Pradeep K.
    Saini, Mukesh K.
    El Saddik, Abdulmotaleb
    MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (21) : 13237 - 13269
  • [28] Towards establishing the effect of self-similarity on influence maximization in online social networks
    Saxena, Bhawna
    Saxena, Vikas
    SOCIAL NETWORK ANALYSIS AND MINING, 2020, 10 (01)
  • [29] Online multicasting for network capacity maximization in energy-constrained ad hoc networks
    Liang, Weifa
    Guo, Xiaoxing
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2006, 5 (09) : 1215 - 1227
  • [30] Modeling Data Dissemination in Online Social Networks: A Geographical Perspective on Bounding Network Traffic Load
    Wang, Cheng
    Tang, Shaojie
    Yang, Lei
    Guo, Yi
    Li, Fan
    Jiang, Changjun
    MOBIHOC'14: PROCEEDINGS OF THE 15TH ACM INTERNATIONAL SYMPOSIUM ON MOBILE AD HOC NETWORKING AND COMPUTING, 2014, : 53 - 62