基于用户主题模型的微博用户兴趣挖掘(英文)

来源 :中国通信 | 被引量 : 0次 | 上传用户:yaleqd
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
Microblogs have become an important platform for people to publish,transform information and acquire knowledge.This paper focuses on the problem of discovering user interest in microblogs.In this paper,we propose a topic mining model based on Latent Dirichlet Allocation(LDA) named user-topic model.For each user,the interests are divided into two parts by different ways to generate the microblogs:original interest and retweet interest.We represent a Gibbs sampling implementation for inference the parameters of our model,and discover not only user’s original interest,but also retweet interest.Then we combine original interest and retweet interest to compute interest words for users.Experiments on a dataset of Sina microblogs demonstrate that our model is able to discover user interest effectively and outperforms existing topic models in this task.And we find that original interest and retweet interest are similar and the topics of interest contain user labels.The interest words discovered by our model reflect user labels,but range is much broader. Microblogs have become an important platform for people to publish, transform information and acquire knowledge. This paper focuses on the problem of discovering user interest in microblogs.In this paper, we propose a topic mining model based on Latent Dirichlet Allocation (LDA) named user -topic model.For each user, the interests are divided into two parts by different ways to generate the microblogs: original interest and retweetinterest.We represent a Gibbs sampling implementation for inference the parameters of our model, and discover not only user’s original interest , but also retweet interest.Then we worked original interest and retweet interest to compute interest words for users.Experiments on a dataset of Sina microblogs demonstrate that our model is able to discover user interest effectively and outperforms existing topic models in this task.And we find that original interest and retweet interest are similar and the topics of interest contain user labels.The interest words discovere d by our model reflect user labels, but range is much broader.
其他文献
从90年代起,我校学生进行研究性学习的主要教学载体,是以学生自主选修和课外活动小组为教学组织主要形态,之后开发了部分学生参加的课题研究的教学载体。但在进一步深刻领会
美国MotivePower业公司最近收到了来自南非、韩国、西班牙、印度、巴西和德国铁路客户关于购买机车零部件的订货,订货总额为500万美元。另外,该公司最近还收购了两家子公司,它们
<正>春节快要到了,春联是中国人过春节必不可少的一样东西。说到春联,大家就再熟悉不过了,但"立体发光镭射系列春联"却与众不同。"立体发光镭射系列春联"不仅是春联产品的重
我曾与《名作欣赏》有过不少联系,所以也比较注意它。记得《名作欣赏》2000年第2期的内封上,登过一幅美国当代画家安德鲁斯·怀斯的名画,题为《炒栗子》。这是20世纪中叶的画
用人首先要求用人者先要使用好自己,创人先创己,用人先自用。用人活动的成败,取决于用人者的素质,取决于用人者谋略和决策;用人者也有一个被用的问题,这是因为领导者一般都有两重属
在县城里,许多人都知道我们局办公室的赵主任是个秀才。赵主任痴爱写小说,可是趴了几年桌子,却没一篇变成铅字。但他不灰心,仍乐此不疲。一天,赵主任来到刘局长的办公室,说:
本文论述了对高校学生数字化档案进行渗透入侵测试的必要性,分析了高校学生数字化档案渗透入侵测试现状及其产生的原因,并针对高校学生数字化档案的服务特点和位于高技术人群
铁十四局隧道工程处成立于1994年10月。4年多来,走出了一条以质量、效益为中心的发展之路。经济效益从负债2000多万元发展到积累资产1.7亿元。施工质量获2项鲁班金像奖(1项
北大附中副校长程翔是位语文老师。一次在批改学生作文时,发现一篇题为《一块手帕》的文章构思精巧,立意新颖,读后回味无穷。程老师被深深吸引住了。他决定利用作文评讲课朗
由包头市和呼和浩特铁路局共同投资的包头铁路据客站日前止式开且_。新客站投资9000万元,预计将J2000年底建成投人使用。据呼铁局有关负责人介绍,包头新客站占地面调155万m’,可