事件抽取是自然语言处理(Natural Language Processing,NLP)领域的一个研究热点。现有的事件抽取模型大多基于小规模训练集,无法应用于大规模开放领域。针对大规模开放域事件抽取中事件表征困难的问题,提出了一种基于Zipf’s共生矩阵分...事件抽取是自然语言处理(Natural Language Processing,NLP)领域的一个研究热点。现有的事件抽取模型大多基于小规模训练集,无法应用于大规模开放领域。针对大规模开放域事件抽取中事件表征困难的问题,提出了一种基于Zipf’s共生矩阵分解的事件向量计算方法。首先,从开放语料中提取事件元组作为事件标签,并对事件元组进行抽象、剪枝和消歧。然后,利用Zipf’s共生矩阵表示事件的上下文分布,利用主成分分析(Principal Component Analysis,PCA)对共生矩阵进行分解,得到初始事件向量,并利用自编码器对初始事件向量进行非线性变换。采用最近邻检测和事件检测两种任务对事件向量的性能进行测试,结果表明,基于Zipf’s共生矩阵分解得到的事件向量能够对事件之间的相似性和相关性信息进行全局性表征,避免编码过细而造成语义偏移。展开更多
In recent years,context aware technology has been widely used in many fields,such as internet of vehicles(IoV).Consistent context information plays a vital role in adapting a system to rapidly changing situations.Howe...In recent years,context aware technology has been widely used in many fields,such as internet of vehicles(IoV).Consistent context information plays a vital role in adapting a system to rapidly changing situations.However,sensor's precision variance,equipment heterogeneity,network delay and the difference of statistical algorithms can lead to inconsistency context and inappropriate services.In this paper,we present an effective algorithm of context inconsistent elimination which is based on feedback and adjusted basic reliability distribution.Through feedback,each sensor's perception precision can be obtained,and with the adjusted basic reliability distribution scheme,we can make full use of all context information by adjusting the influence of every context on whole judgment based on sensor's perception precision and threshold of sensor's perception precision,and then eliminate context inconsistency.In order to evaluate the performance of the proposed context inconsistency elimination algorithm,context aware rate is defined.The simulation results show that the proposed context inconsistency elimination algorithm can obtain the best context aware rate in most cases for the varied error rates of sensors.展开更多
文摘事件抽取是自然语言处理(Natural Language Processing,NLP)领域的一个研究热点。现有的事件抽取模型大多基于小规模训练集,无法应用于大规模开放领域。针对大规模开放域事件抽取中事件表征困难的问题,提出了一种基于Zipf’s共生矩阵分解的事件向量计算方法。首先,从开放语料中提取事件元组作为事件标签,并对事件元组进行抽象、剪枝和消歧。然后,利用Zipf’s共生矩阵表示事件的上下文分布,利用主成分分析(Principal Component Analysis,PCA)对共生矩阵进行分解,得到初始事件向量,并利用自编码器对初始事件向量进行非线性变换。采用最近邻检测和事件检测两种任务对事件向量的性能进行测试,结果表明,基于Zipf’s共生矩阵分解得到的事件向量能够对事件之间的相似性和相关性信息进行全局性表征,避免编码过细而造成语义偏移。
基金supported by Scientific Research Foundation for the Excellent Young and Middle-aged Scientists of Shandong Province(No.BS2012DX024)Independent Innovation Foundation of Shandong University(No.2012ZD035)Technical Innovative Project of Shandong Province(No.201230201031,No.201320201024)
文摘In recent years,context aware technology has been widely used in many fields,such as internet of vehicles(IoV).Consistent context information plays a vital role in adapting a system to rapidly changing situations.However,sensor's precision variance,equipment heterogeneity,network delay and the difference of statistical algorithms can lead to inconsistency context and inappropriate services.In this paper,we present an effective algorithm of context inconsistent elimination which is based on feedback and adjusted basic reliability distribution.Through feedback,each sensor's perception precision can be obtained,and with the adjusted basic reliability distribution scheme,we can make full use of all context information by adjusting the influence of every context on whole judgment based on sensor's perception precision and threshold of sensor's perception precision,and then eliminate context inconsistency.In order to evaluate the performance of the proposed context inconsistency elimination algorithm,context aware rate is defined.The simulation results show that the proposed context inconsistency elimination algorithm can obtain the best context aware rate in most cases for the varied error rates of sensors.