A semantics-based model is proposed to enable weakened hedges, such as "more or less" and "roughly" in the context of linguistic multi-criteria decision making. First, the resemblance relations are defined based o...A semantics-based model is proposed to enable weakened hedges, such as "more or less" and "roughly" in the context of linguistic multi-criteria decision making. First, the resemblance relations are defined based on the semantics of terms on the domain. Then, the hedges can be represented after the upper and loose upper approximations of a linguistic term are derived. Accordingly, some compact formulae can be derived for the semantics of linguistic expressions with hedges. Parameters in these formulae are objectively determined according to the semantics of original terms. The proposed model presents a more natural way to express the decision information under uncertainties and its semantics is clear. The proposed model is clarified by solving the problem of evaluation and selection of sustainable innovative energy technologies. Computational results demonstrate that the model can deal with various uncertainties of the problem. Finally, the model is compared with existing techniques and extended to the case when the semantics of terms are represented by trapezoidal fuzzy numbers.展开更多
Category-based statistic language model is an important method to solve the problem of sparse data.But there are two bottlenecks:1) The problem of word clustering.It is hard to find a suitable clustering method with g...Category-based statistic language model is an important method to solve the problem of sparse data.But there are two bottlenecks:1) The problem of word clustering.It is hard to find a suitable clustering method with good performance and less computation.2) Class-based method always loses the prediction ability to adapt the text in different domains.In order to solve above problems,a definition of word similarity by utilizing mutual information was presented.Based on word similarity,the definition of word set similarity was given.Experiments show that word clustering algorithm based on similarity is better than conventional greedy clustering method in speed and performance,and the perplexity is reduced from 283 to 218.At the same time,an absolute weighted difference method was presented and was used to construct vari-gram language model which has good prediction ability.The perplexity of vari-gram model is reduced from 234.65 to 219.14 on Chinese corpora,and is reduced from 195.56 to 184.25 on English corpora compared with category-based model.展开更多
基金The National Natural Science Foundation of China(No.61273209)the Scientific Research Foundation of Graduate School of Southeast University(No.YBJJ1528)the Scientific Innovation Research of College Graduates in Jiangsu Province(No.KYLX15-0191)
文摘A semantics-based model is proposed to enable weakened hedges, such as "more or less" and "roughly" in the context of linguistic multi-criteria decision making. First, the resemblance relations are defined based on the semantics of terms on the domain. Then, the hedges can be represented after the upper and loose upper approximations of a linguistic term are derived. Accordingly, some compact formulae can be derived for the semantics of linguistic expressions with hedges. Parameters in these formulae are objectively determined according to the semantics of original terms. The proposed model presents a more natural way to express the decision information under uncertainties and its semantics is clear. The proposed model is clarified by solving the problem of evaluation and selection of sustainable innovative energy technologies. Computational results demonstrate that the model can deal with various uncertainties of the problem. Finally, the model is compared with existing techniques and extended to the case when the semantics of terms are represented by trapezoidal fuzzy numbers.
基金Project(60763001) supported by the National Natural Science Foundation of ChinaProject(2010GZS0072) supported by the Natural Science Foundation of Jiangxi Province,ChinaProject(GJJ12271) supported by the Science and Technology Foundation of Provincial Education Department of Jiangxi Province,China
文摘Category-based statistic language model is an important method to solve the problem of sparse data.But there are two bottlenecks:1) The problem of word clustering.It is hard to find a suitable clustering method with good performance and less computation.2) Class-based method always loses the prediction ability to adapt the text in different domains.In order to solve above problems,a definition of word similarity by utilizing mutual information was presented.Based on word similarity,the definition of word set similarity was given.Experiments show that word clustering algorithm based on similarity is better than conventional greedy clustering method in speed and performance,and the perplexity is reduced from 283 to 218.At the same time,an absolute weighted difference method was presented and was used to construct vari-gram language model which has good prediction ability.The perplexity of vari-gram model is reduced from 234.65 to 219.14 on Chinese corpora,and is reduced from 195.56 to 184.25 on English corpora compared with category-based model.