摘 要: 通过挖掘商品评论中的评价对象,可以得知用户更关心商品哪些方面的属性,从而帮助企业改进商品, 帮助用户选择商品。因此,商品评价对象的挖掘具有重要的意义。本文提出了一种用于商品评价对象挖掘的领域词典构 建方法:首先基于LDA模型,提出了一种领域基础词典的构建方法;然后,分别提出了基于词汇之间的PMI值和基于依 存句法分析的领域词典扩充方法。本文基于京东商城的洗衣液产品真实评论数据集,使用构建的词典分别进行了一级标 签评价对象挖掘和二级标签评价对象挖掘的实验。实验结果表明,本文提出的方法在进行评价对象挖掘时具有良好的性 能;相比一级标签评价对象,扩充后的词典对二级标签评价对象挖掘的效果有更好的提升。 |
关键词: 领域词典;对象挖掘;商品评论;LDA;PMI |
中图分类号: TP391
文献标识码: A
|
基金项目: 本文受the National Key R&D Program of China under grant(2018YFB1004700)资助. |
|
A Method on Domain Dictionary Construction for Object Mining on Commodity Comments |
SHI Yuxin,YANG Zeqing,ZHAO Zhibin,YAO Lan
|
( School of Computer Science and Engineering, Northeastern University, Shenyang 110819, China)
|
Abstract: Enterprises hope to be aided by object mining on comments of their products,which reveals the clients' concerns,to improve their manufacturing.This object mining also makes sense to subsequent consumers while they are making their choice.Therefore,it is significant to mine objects of a comment.This paper proposes a method on domain dictionary construction for object mining on comments of commodity:Firstly,a method based on the LDA model,a basic domain dictionary is proposed;then,the domain dictionary expansion methods based on the PMI value of words and dependency parsing are proposed respectively.Data applied for experiments in this paper is from detergent sale data of JD.COM.The dictionaries are applied on this data set for the first-level and second-level label object mining.The experimental results prove the proposed method’s great potential in object mining.Compared with the first-level label object mining,the extensive dictionary has improved the second-level label object mining. |
Keywords: domain dictionary;object mining;commodity comment;LDA;PMI |