999精品在线视频,手机成人午夜在线视频,久久不卡国产精品无码,中日无码在线观看,成人av手机在线观看,日韩精品亚洲一区中文字幕,亚洲av无码人妻,四虎国产在线观看 ?

Mining potential social relationship with active learning in LBSN①

2017-06-27 08:09:23WangHaiping王海平ZhangHongWangYongBingJia
High Technology Letters 2017年2期

Wang Haiping (王海平), Zhang Hong, Wang Yong, Bing Jia

(*Institute of Information Engineering, Chinese Academy of Sciences, Beijing 100093, P.R.China) (**National Computer Network Emergency Response Technical Team/Coordination Center of China, Beijing 100029, P.R. China) (***Henan Worker’s Cultural Palace, Zhengzhou 450007, P.R.China)

Mining potential social relationship with active learning in LBSN①

Wang Haiping (王海平)*, Zhang Hong②**, Wang Yong②**, Bing Jia***

(*Institute of Information Engineering, Chinese Academy of Sciences, Beijing 100093, P.R.China) (**National Computer Network Emergency Response Technical Team/Coordination Center of China, Beijing 100029, P.R. China) (***Henan Worker’s Cultural Palace, Zhengzhou 450007, P.R.China)

Rapid development of local-based social network (LBSN) makes it more convenient for researchers to carry out studies related to social network. Mining potential social relationship in LBSN is the most important one. Traditionally, researchers use topological relation of social network or telecommunication network to mine potential social relationship. But the effect is unsatisfactory as the network can not provide complete information of topological relation. In this work, a new model called PSRMAL is proposed for mining potential social relationships with LBSN. With the model, better performance is obtained and guaranteed, and experiments verify the effectiveness.

data preprocessing, feature fusion, active learning

0 Introduction

Local-based social network (LBSN) is a new kind of social network where people could mark their positions information, and it developed rapidly in recent years. But LBSN differs from traditional social network as people could mark their positions information in the network. With the extensive use of smart phones, a large number of local-based social networks like Foursquares and Gowalla have emerged and have been drawing people’s attention. Besides, traditional social networks like Facebook and Twitter also add the position information in their products to improve their popularity. In this way, people could publish their statuses in the form of text or picture marked with geographical information in LBSN.

Nowadays, millions of check-ins appear in LBSN every day, which provide sufficient information for the study of social network, including social relationship mining, recommendation of goods and services, community detection, etc. As mining of social relationship is the basis of many studies, it has been drawing wide attention of researchers. Traditionally, researchers use topological relation of social network or telecommunication network to mine potential social relationship[1]. As LBSN could be viewed as the combination of traditional social relationship and marks with position information, potential social relationships could also be mined by traditional methods, however, in which obvious disadvantage exists. As people do not always use a certain local-based social network to communicate with their friends, the relational network extracted from the LBSN could not fully cover their relationships. In other word, features extracted from the existing relational network could not describe the attribute completely. Therefore, researchers have studied mining the potential social relationships by using geographical position information.

Ref.[2] discovered the relation between relationships and geographical position information, and verified the effectiveness to infer potential relationships with geographical information. Ref.[3] defined computation methods to extract features from LBSN and mined potential relationships later. Ref.[4] also studied the problem of inferring links with geographical information to be proved the effective Ref.[5] proposed an entropy-based model (EBM) to infer social connections,and further estimate the strength of social connections with spatial information. However, the studies mentioned above mainly focused on designing features, and less considered about preprocessing data and improving the prediction model. In this paper, a new model called PSRMAL is brought out for mining potential social relationships to predict people’s potential social relational network, combined with geographical information extracted from LBSN.

The rest of the paper is organized as follows: In Section 1, a method for designing PSRMAL is explained. The experiment is described in Section 2. Finally, conclusion is given in Section 3.

1 Design of PSRMAL

In this section, a method for devising model PSRMAL will be introduced in detail. The model can be viewed as two parts. The first is to extract features from LBSN, and the second is to train the model and further improve its performance. Fig.1 shows the structure for mining potential social relationships from LBSN, and the process could be divided into four steps, i.e. region partition, feature computation, feature fusion and active learning. The detailed description is as follows.

Fig.1 The structure for mining potential social relationships

1.1 Region partition

As each check-in corresponds to a GPS record, regions could be partitioned by clustering people’s GPS records of check-in. Generally, there are four methods for clustering, i.e. partitioning methods, hierarchical methods, density-based methods and grid-based methods. Partitioning, hierarchical and grid-based methods are designed to find spherical-shaped clusters, while positions where people check in are not always in regular shapes. Therefore, density-based methods are used to obtain segment regions in this work.Although density-based cluster is suitable for partitioning regions, it is still insufficient to ensure the rationality. In this paper, partition regions are brought out with three density-based cluster methods, i.e. DBSCAN (density-based spatial clustering of applications with noise), OPTICS (ordering points to identify the clustering structure), and DENCLUE (clustering based on density distribution functions). After region partition, each check-in record corresponds to three region marks to guarantee the rationality of partition.

1.2 Feature computation

Generally, people appearing at common positions are likely to be friends[6]. The more frequently they do, the more likely to be. Therefore, three methods that have usually been used for similarity computation in social network are applied, i.e common neighbors, Jaccard index and Cosine index[7]. Here positions are used that people check in to replace the nodes in network. Then features could be computed with three methods and features ComP, JacP and CosP could be obtained respectively. Here ComP denotes the common positions that two persons have checked in, JacP denotes the value that computed with Jaccard index for two persons, and CosP denotes values computed with cosine index. The computational formulas are shown as follows.

ComPij=φi∩φj

(1)

JacPij=φi∩φj/φi∪φj

(2)

(3)

1.3Featurefusionwithlogistic

Withthefeaturedefinitionmethodsmentionedinsubsection1.2,threedifferentkindsoffeaturesetsXN, XS, XE∈n×dcould be extracted, and XN, XS, XEdenote the feature sets extracted from LBSN with DBSCAN, OPTICS and DENCLUE respectively. Then letxNijdenotes theipair of persons’ position features extracted withjfeature computation method, while positions are obtained by DBSCAN. Similarly, letxSijandxEijdenote features when positions are obtained by OPTICS and DENCLUE correspondingly.

Then the fusion feature could be computed as

xij=αjxNij+βjxSij+γjxEij

(4)

whereαj,βjandγjrepresenttheweightsofdifferentfeatureswithcomputationj,and3nvalues will be obtained. To calculate the weighted values, the feature sets are united as

(5)

where R=[1,1…1]T∈1×d, and each column of XUis

xUi=(xNi1,xNi2…,xNin,xSi1,xSi2,…,xSin,xEi1,xEi2, …,xEin, 1)T∈(3n+1)×1

Correspondingly, W can be expressed as W=[αT,βT,γT,c]T∈(3n+1)×1,whereα=[α1,α2…αn]T∈n×1, β=[β1,β2,βn]T∈n×1andγ=[γ1,γ2…γn]T∈n×1are the weight sets of different features, andcisaconstantargument.

Forconvenience,formW=(w1,w2…w3n,c)T∈(3n+1)×1isusedtodenotetheweightset,andtheweightsetsofdifferentfeaturesareα=[w1,w2…wn]T∈n×1, β=[wn+1,wn+2…w2n]T∈n×1andγ=[w1,w2…wn]T∈n×1.

Inthiswork,logisticregressionisappliedtocalculatetheparameters.Theprobabilitythattwopersonshaverelationshipornotcanbeexpressedas

(6)

(7)

LethW(xU)=g(WTxU), and combine the two equations above to obtainp(y|xU, W)=hW(xU)y(1-hW(xU)1-y)

The likelihood function could be expressed as

(8)

And the log-likelihood is

(9)

Whenl(W) reaches the maximum, W is the weight set. In this study, gradient decent is used to solve it. The elements of W can be got as

(10)

Thenwjcouldbegotwhenthepartialderivateconverges,andtheresultis

(11)

1.4Activelearning

Inthiswork,activelearningisusedtoimprovetheperformanceforminingofpotentialsocialrelationship.Activelearningisproposedrelativetopassivelearning[8-13].Passivelearningreferstoselectingsamplesrandomlyfromthedatasetandlabelthem,thentrainsamodelwiththelabeledsamplesandclassifyunlabeledoneswiththemodel.However,theremayexistproblemssuchasinformationredundancy,excessivenoise,asthesamplesfortrainingmodelsarefetchedrandomly,whichwouldseriouslyaffecttheeffectivenessofclassification[14-19].Activelearningistodividethesamplelabelingworkintotwosteps.Firstly,itistolabelafewsamplesastheinitialtrainingset,fortrainingabasicclassifier,andlabeltheothersampleswiththeclassifier.Secondly,itistoselectacertainnumberofsamplesthatarehardtoconfirmtheirclassesaccordingtotheresult,andlabeltheseonesmanually.Thenewlabeledsamplestotheinitialtrainingsetareadded,andthefinalclassifierwiththenewtrainingsetistrained[20-25].Asthenewtrainingsetfetchedinthiswaycontainsmorecomprehensiveinformationofthedataset,amorerobustmodelwouldbeobtained.

2 Experiment

Inthissection,theperformanceofPSRMALisevaluatedwithaclassicalgorithm,supportvectormachine(SVM).Besides,socialrelationshipsarealsominedwiththreesinglefeaturesascontrastexperiments.Firstly,thefeaturesshouldbeextractedwiththreedifferentmethods.Secondly,thefeaturesarefusedwithlogisticregression,whiletheparameterW for fusion is shown in Table1.

Table 1 weight set

With the value of weight set W, each member of fusion feature X could be obtained as

Xij=αjxNij+βjxsij

(12)

Lastly,thepotentialsocialrelationshipswillbeminedbyusingXN, XS, XEand X respectively. In Fig.2(a)~(c), it is to express the experiment results of different methods for dividing positions, including DBSCAN, OPTICS, DENCLUE and the fusion. Let N denote the positions divided with DBSCAN,S denote OPTICS, E denote DENCLUE and F denote the fusion. As can be seen in the figures, the performance of fusion feature outperforms single features in almost all cases. The fusion is crucial to guarantee the stability of model and achieve high performance. Besides, active learning also contributes to enhance the performance. As the comprehensive assessment, F-measure is more persuasive. The values obtained by using XN, XS, XEand X are 42.18%, 41.08%, 38.43% and 44.34%, and the F-measure of fusion feature is boosted by 2.16%, 3.26% and 5.92% respectively.

(a) Index of precision

(b) Index of recall

(c) Index of F-measure

3 Conclusion and future work

In this study, a new model PSRMAL is proposed for mining potential social relationships with geographical information in LBSN. The importance of region partition is emphasized and the region is segmented with three different cluster methods, in which the rationality of partition is ensured. Then the features are fused with logistical and the performance of model is further improved with active learning mothed. Experiments prove the effectiveness of PSRMAL. In the future, more energy will be put to the efficiency of model to make it more suitable for real-time processing.

Reference

[ 1] Adamic L A, Adar E. Friends and neighbors on the web.SocialNetworks, 2001, 25(3): 211-230

[ 2] Cho E, Myers S A, Leskovec J. Friendship and mobility user movement in location-based social networks. In: Proceeding of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, New York, USA, 2011. 1082-1090

[ 3] Wang D, Pedreschi D, Song C, et al. Human mobility, social ties, and link prediction. In: Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, New York, USA, 2011. 1100-1108

[ 4] Scellato S, Noulas A, Mascolo C. Exploiting place features in link prediction on location-based social networks. In: Proceedings the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, New York, USA, 2011. 1046-1054

[ 5] Pham H, Shahabi C, Liu Y. EBM: an entropy-based model to infer social strength from spatiotemporal data. In: Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data, New York, USA, 2013. 265-276

[ 6] Pham H, Hu L, Shahabi C. Towards integrating real-world spatiotemporal data with social networks. In: Proceedings of the 19th ACM SIGSPATIAL, New York, USA, 2011. 453-457

[ 7] Moyano L G, Thomae O R M, Frias-Martinez E. Uncovering the spatio-temporal structure of social networks using cell phone records. In: Proceedings the 12th International Conference on Data Mining Workshops (ICDMW 2012), Brussels, Belgium, 2012. 242-249

[ 8] Zhang X Y, Wang S, Yun X. Bidirectional active learning: a two-way exploration into unlabeled and labeled dataset.IEEETransactionsonNeuralNetworksandLearningSystems, 2015, 26(12): 3034-3044

[ 9] Zhang X Y, Wang S, Zhu X, et al. Update vs. upgrade: modeling with indeterminate multi-class active learning.Neurocomputing, 2015, 162: 163-170

[10] Zhang X. Interactive patent classification based on multi-classifier fusion and active learning.Neurocomputing, 2014, 127(3): 200-205

[11] Zhang X Y, Cheng J, Xu C, et al. Multi-view multi-label active learning for image classification. In: Proceedings of the IEEE International Conference on Multimedia and Expo (ICME), Cancun, Mexico, 2009. 258-261

[12] Zhang X Y, Xu C, Cheng J, et al. Automatic semantic annotation for video blogs. In: Proceedings of the IEEE International Conference on Multimedia and Expo (ICME), Hannover, Germany, 2008. 121-124

[13] Zhang X Y, Cheng J, Lu H, et al. Selective sampling based on dynamic certainty propagation for image retrieval. In: Proceedings of the Advances in Multimedia Modeling (MMM), Kyoto, Japan, 2008. 425-435

[14] Zhang X Y, Cheng J, Lu H, et al. Weighted co-SVM for image retrieval with MVB strategy. In: Proceedings of the IEEE International Conference on Image Processing (ICIP), San Antonio, USA, 2007. 517-520

[15] Wang S, Zhang X Y, Yun X, et al. Joint recovery and representation learning for robust correlation estimation based on partially observed data. In: Proceedings of the IEEE International Conference on Data Mining Workshop, Atlantic City, USA, 2015. 1-7

[16] Zhang X Y. Preference modeling for personalized retrieval based on browsing history analysis.IEEJTransactionsonElectricalandElectronicEngineering, 2013, 8 (S1): 81-87

[17] Zhu X B, Jin X, Zhang X Y, et al. Context-aware local abnormality detection in crowded scene.ScienceChinaInformationSciences(SCIS), 2015, 58(5): 1-11

[18] Zhu G, Wang J, Wu Y, et al. MC-HOG correlation tracking with saliency proposal. In: Proceedings of the AAAI Conference on Artificial Intelligence, Phoenix, USA, 2016.1-7

[19] Zhang Y, Xu C, Zhang X, et al. Personalized retrieval of sports video based on multi-modal analysis and user preference acquisition.MultimediaToolsandApplications, 2009, 44(2): 305-330

[20] Zhang X Y, Hou Z, Zhu X, et al. Robust malware detection with dual-lane AdaBoost. In: Proceedings of the IEEE International Conference on Computer Communications, San Francisco, USA, 2016. 1051-1052

[21] Zhang X Y, Zhang K, Yun X, et al. Location-based correlation estimation in social network via collaborative learning. In: Proceedings of the IEEE International Conference on Computer Communications, San Francisco, USA, 2016. 1073-1074

[22] Zhang X Y, Wang S, Zhang L, et al. Ensemble feature selection with discriminative and representative properties for malware detection. In: Proceedings of the IEEE International Conference on Computer Communications, San Francisco, USA, 2016. 674-675

[23] Zhang Y, Zhang X, Xu C, et al. Personalized retrieval of sports video. In: Proceedings of the ACM Multimedia Workshop, Augsburg, Germany, 2007. 313-322

[24] Zhang X Y. Effective search with saliency-based matching and cluster-based browsing.HighTechnologyLetters, 2013, 19(1): 105-109

[25] Zhang X Y. Dynamic batch selective sampling based on version space analysis.HighTechnologyLetters, 2012, 18(2): 208-213

Wang Haiping, born in 1987. He is in pursuit of Ph.D degree, and is currently an engineer in Institute of Information Engineering, Chinese Academy of Sciences. He received his Master degree from College of Information of Renmin University of China in 2012. He also received his B.S. degree from Beijing Technology and Business University in 2009. His research interests include the design of algorithms for parallel processing, big data analysis and text mining.

10.3772/j.issn.1006-6748.2017.02.012

①Supported by the National Natural Science Foundation of China (No. 61501457).

②To whom correspondence should be addressed. E-mail: zhangh@isc.org.cn, wangyong@cert.org.cn

on Apr. 20, 2016

主站蜘蛛池模板: 国产精品55夜色66夜色| 亚洲αv毛片| 91区国产福利在线观看午夜 | 亚洲视频欧美不卡| 国产精品亚洲一区二区在线观看| 国产欧美在线| 亚洲综合极品香蕉久久网| 免费观看男人免费桶女人视频| 在线永久免费观看的毛片| 国产精品区视频中文字幕| 亚洲国产成人精品无码区性色| 国产欧美精品一区二区| 久久精品女人天堂aaa| 五月激情综合网| 囯产av无码片毛片一级| 欧美日韩中文字幕在线| 欧美一级大片在线观看| 亚洲欧美另类专区| 亚洲av片在线免费观看| 美女无遮挡免费视频网站| 日韩精品一区二区三区视频免费看| 美女被操91视频| 国产真实乱了在线播放| 国内精自视频品线一二区| 午夜不卡视频| 超清人妻系列无码专区| 国内99精品激情视频精品| 97超碰精品成人国产| 性喷潮久久久久久久久| 欧美亚洲国产一区| 精品一区二区三区自慰喷水| 免费国产无遮挡又黄又爽| 久久精品国产999大香线焦| 欧美色香蕉| AV熟女乱| 亚洲动漫h| 一级毛片无毒不卡直接观看| 日本黄色a视频| 五月综合色婷婷| 国产欧美专区在线观看| 亚洲AV无码乱码在线观看裸奔| 精品人妻AV区| 国产门事件在线| 六月婷婷激情综合| 三级国产在线观看| 999福利激情视频| 欧美成人国产| 色哟哟国产成人精品| 久久精品中文字幕少妇| 国产成人精品三级| aaa国产一级毛片| 中文字幕人妻av一区二区| 久久精品人人做人人爽电影蜜月| 久久福利片| 无码精品福利一区二区三区| P尤物久久99国产综合精品| 国产97公开成人免费视频| 国产亚洲精品在天天在线麻豆 | 亚洲国产综合精品一区| 亚洲中文字幕无码mv| 国产精品刺激对白在线| 一本综合久久| 一本大道香蕉久中文在线播放| 在线精品视频成人网| 国产色爱av资源综合区| 国产第三区| 精品国产黑色丝袜高跟鞋| 婷婷色中文网| 亚洲人成网址| 欧美性爱精品一区二区三区 | www.狠狠| av在线5g无码天天| 波多野结衣第一页| 欧美精品v欧洲精品| 最新国产精品鲁鲁免费视频| 九九久久99精品| 国产高潮流白浆视频| 亚洲一级无毛片无码在线免费视频| 久久婷婷人人澡人人爱91| 天堂岛国av无码免费无禁网站| 亚洲一区毛片| 一区二区在线视频免费观看|