摘要
针对通用搜索引擎信息量大、查询不准确、深度不够等问题,提出了基于Web的产品属性抽取这一新的搜索引擎服务模式。基于Web的产品属性抽取实际就是一个自动分类问题,其任务是:在给定的分类体系下,根据相关产品模板自动地判断属性的是非。完成此任务的关键在于寻找有效的特征值;确定相关分类规则,最终通过P、R和F指标来评价分类算法。
Carrying out Web-based product attribute extraction is one of the new search engine service patterns, it is put forward in relation that the general search engine is informative, inquiries inaccurate and not enough depth. Web-based product attribute extraction is a actual automatic classification problem, the task is: In a given classification system, in accordance with the relevant product template carry automatically attribute judge of right and wrong. Currently, the key is to search the effective feature value, determine the relevant classification rules, through P, R and F indicators assess the classification algorithm finally.
出处
《上海第二工业大学学报》
2008年第1期29-34,共6页
Journal of Shanghai Polytechnic University
关键词
属性抽取
分类规则
特征值
最大熵
attribute extraction
classification rule
feature value
maximum entropy