[目的/意义]研究“睡美人”文献的识别方法,对尽早发现重要科技成就及其发明人、加快科技成果转化以及完善学术评价方法等均具有重要意义。[方法/过程]针对高校学术论文成果评价这一特定场景,提出“先客观指标粗筛、后多维参数细选”的...[目的/意义]研究“睡美人”文献的识别方法,对尽早发现重要科技成就及其发明人、加快科技成果转化以及完善学术评价方法等均具有重要意义。[方法/过程]针对高校学术论文成果评价这一特定场景,提出“先客观指标粗筛、后多维参数细选”的研究思路,组合使用K值算法和三指标法,对东北大学发表于Web of Science核心合集的论文样本集进行了“睡美人”文献挖掘的实证研究。[结果/结论]该方法共识别出12篇“睡美人”文献,并对其被引特征、期刊特征、睡眠特征、内容特征等因素进行了分析。实获数据处理结果表明了该方法的有效性,相关研究方法和结果可对东北大学学术论文评价提供重要参考。展开更多
For photovoltaic power prediction,a kind of sparse representation modeling method using feature extraction techniques is proposed.Firstly,all these factors affecting the photovoltaic power output are regarded as the i...For photovoltaic power prediction,a kind of sparse representation modeling method using feature extraction techniques is proposed.Firstly,all these factors affecting the photovoltaic power output are regarded as the input data of the model.Next,the dictionary learning techniques using the K-mean singular value decomposition(K-SVD)algorithm and the orthogonal matching pursuit(OMP)algorithm are used to obtain the corresponding sparse encoding based on all the input data,i.e.the initial dictionary.Then,to build the global prediction model,the sparse coding vectors are used as the input of the model of the kernel extreme learning machine(KELM).Finally,to verify the effectiveness of the combined K-SVD-OMP and KELM method,the proposed method is applied to a instance of the photovoltaic power prediction.Compared with KELM,SVM and ELM under the same conditions,experimental results show that different combined sparse representation methods achieve better prediction results,among which the combined K-SVD-OMP and KELM method shows better prediction results and modeling accuracy.展开更多
The K-multiple-means(KMM)retains the simple and efficient advantages of the K-means algorithm by setting multiple subclasses,and improves its effect on non-convex data sets.And aiming at the problem that it cannot be ...The K-multiple-means(KMM)retains the simple and efficient advantages of the K-means algorithm by setting multiple subclasses,and improves its effect on non-convex data sets.And aiming at the problem that it cannot be applied to the Internet on a multi-view data set,a multi-view K-multiple-means(MKMM)clustering method is proposed in this paper.The new algorithm introduces view weight parameter,reserves the design of setting multiple subclasses,makes the number of clusters as constraint and obtains clusters by solving optimization problem.The new algorithm is compared with some popular multi-view clustering algorithms.The effectiveness of the new algorithm is proved through the analysis of the experimental results.展开更多
In 5 G Ultra-dense Network(UDN), resource allocation is an efficient method to manage inter-small-cell interference. In this paper, a two-stage resource allocation scheme is proposed to supervise interference and reso...In 5 G Ultra-dense Network(UDN), resource allocation is an efficient method to manage inter-small-cell interference. In this paper, a two-stage resource allocation scheme is proposed to supervise interference and resource allocation while establishing a realistic scenario of three-tier heterogeneous network architecture. The scheme consists of two stages: in stage I, a two-level sub-channel allocation algorithm and a power control method based on the logarithmic function are applied to allocate resource for Macrocell and Picocells, guaranteeing the minimum system capacity by considering the power limitation and interference coordination; in stage II, an interference management approach based on K-means clustering is introduced to divide Femtocells into different clusters. Then, a prior sub-channel allocation algorithm is employed for Femtocells in diverse clusters to mitigate the interference and promote system performance. Simulation results show that the proposed scheme contributes to the enhancement of system throughput and spectrum efficiency while ensuring the system energy efficiency.展开更多
Classification systems such as Slope Mass Rating(SMR) are currently being used to undertake slope stability analysis. In SMR classification system, data is allocated to certain classes based on linguistic and experien...Classification systems such as Slope Mass Rating(SMR) are currently being used to undertake slope stability analysis. In SMR classification system, data is allocated to certain classes based on linguistic and experience-based criteria. In order to eliminate linguistic criteria resulted from experience-based judgments and account for uncertainties in determining class boundaries developed by SMR system,the system classification results were corrected using two clustering algorithms, namely K-means and fuzzy c-means(FCM), for the ratings obtained via continuous and discrete functions. By applying clustering algorithms in SMR classification system, no in-advance experience-based judgment was made on the number of extracted classes in this system, and it was only after all steps of the clustering algorithms were accomplished that new classification scheme was proposed for SMR system under different failure modes based on the ratings obtained via continuous and discrete functions. The results of this study showed that, engineers can achieve more reliable and objective evaluations over slope stability by using SMR system based on the ratings calculated via continuous and discrete functions.展开更多
针对电力巡线无人机检测绝缘子缺陷,具有缺陷绝缘子样本数据不均衡、采集难度大等问题,提出一种基于YOLOV5(you only look once V5)算法的绝缘子异常检测模型。首先借助YOLOV5目标检测算法定位绝缘子位置,再把绝缘子图像输入到残差网络...针对电力巡线无人机检测绝缘子缺陷,具有缺陷绝缘子样本数据不均衡、采集难度大等问题,提出一种基于YOLOV5(you only look once V5)算法的绝缘子异常检测模型。首先借助YOLOV5目标检测算法定位绝缘子位置,再把绝缘子图像输入到残差网络提取多层金字塔特征,然后通过K邻近值算法判断特征层像素是否为离群点,由此可判断绝缘子是否存在缺陷。所提方法无须负样本绝缘子图像,仅通过正样本即可训练网络;与常用方法相比,所提算法的准确率和召回率均为最高,表明所提方法泛化性和稳定性较好。展开更多
Recently clustering techniques have been used to automatically discover typical user profiles. In general, it is a challenging problem to design effective similarity measure between the session vectors which are usual...Recently clustering techniques have been used to automatically discover typical user profiles. In general, it is a challenging problem to design effective similarity measure between the session vectors which are usually high-dimensional and sparse. Two approaches for mining typical user profiles, based on matrix dimensionality reduction, are presented. In these approaches, non-negative matrix factorization is applied to reduce dimensionality of the session-URL matrix, and the projecting vectors of the user-session vectors are clustered into typical user-session profiles using the spherical k -means algorithm. The results show that two algorithms are successful in mining many typical user profiles in the user sessions.展开更多
高速铁路隧道环境中采用LTE-R(long term evolution-railway)无线通信位置指纹定位解算时,针对加权K值邻近位置指纹解算精度低的问题,利用混沌粒子群算法优化权值的良好性能,提出基于混沌粒子群优化的加权K值邻近算法对列车位置指纹定...高速铁路隧道环境中采用LTE-R(long term evolution-railway)无线通信位置指纹定位解算时,针对加权K值邻近位置指纹解算精度低的问题,利用混沌粒子群算法优化权值的良好性能,提出基于混沌粒子群优化的加权K值邻近算法对列车位置指纹定位在线阶段进行指纹匹配解算,分别讨论了指纹间距取25 m、50 m、100 m时混沌粒子群优化加权K值邻近算法的收敛性和精确性.仿真结果表明:经混沌粒子群优化的加权K值邻近算法收敛速度更快,定位解算结果精度更高;在提高列车位置指纹定位精度方面,比单纯的加权K值邻近算法以及经粒子群优化的加权K值邻近算法更具优越性,当指纹间距取25 m时,列车定位误差小于25 m的概率高达96%,使隧道环境中列车位置指纹定位精度得到有效改善.展开更多
Sample entropy can reflect the change of level of new information in signal sequence as well as the size of the new information. Based on the sample entropy as the features of speech classification, the paper firstly ...Sample entropy can reflect the change of level of new information in signal sequence as well as the size of the new information. Based on the sample entropy as the features of speech classification, the paper firstly extract the sample entropy of mixed signal, mean and variance to calculate each signal sample entropy, finally uses the K mean clustering to recognize. The simulation results show that: the recognition rate can be increased to 89.2% based on sample entropy.展开更多
The impulsive components induced by bearing faults are key features for assessing gear-box bearing faults.However,because of heavy background noise and the interferences of other vibrations,it is difficult to extract ...The impulsive components induced by bearing faults are key features for assessing gear-box bearing faults.However,because of heavy background noise and the interferences of other vibrations,it is difficult to extract these impulsive components caused by faults,particularly early faults,from the measured vibration signals.To capture the high-level structure of impulsive components embedded in measured vibration signals,a dictionary learning method called shift-invariant K-means singular value decomposition(SI-K-SVD)dictionary learning is used to detect the early faults of gear-box bearings.Although SI-K-SVD is more flexible and adaptable than existing methods,the improper selection of two SI-K-SVD-related parameters,namely,the number of iterations and the pattern lengths,has an adverse influence on fault detection performance.Therefore,the sparsity of the envelope spectrum(SES)and the kurtosis of the envelope spectrum(KES)are used to select these two key parameters,respectively.SI-K-SVD with the two selected optimal parameter values,referred to as optimal parameter SI-K-SVD(OP-SI-K-SVD),is proposed to detect gear-box bearing faults.The proposed method is verified by both simulations and an experiment.Compared to the state-of-the-art methods,namely,empirical model decomposition,wavelet transform and K-SVD,OP-SI-K-SVD has better performance in diagnosing the early faults of a gear-box bearing.展开更多
Customers are of great importance to E-commerce in intense competition.It is known that twenty percent customers produce eighty percent profiles.Thus,how to find these customers is very critical.Customer lifetime valu...Customers are of great importance to E-commerce in intense competition.It is known that twenty percent customers produce eighty percent profiles.Thus,how to find these customers is very critical.Customer lifetime value(CLV) is presented to evaluate customers in terms of recency,frequency and monetary(RFM) variables.A novel model is proposed to analyze customers purchase data and RFM variables based on ordered weighting averaging(OWA) and K-Means cluster algorithm.OWA is employed to determine the weights of RFM variables in evaluating customer lifetime value or loyalty.K-Means algorithm is used to cluster customers according to RFM values.Churn customers could be found out by comparing RFM values of every cluster group with average RFM.Questionnaire is conducted to investigate which reasons cause customers dissatisfaction.Rank these reasons to help E-commerce improve services.The experimental results have demonstrated that the model is effective and reasonable.展开更多
文摘[目的/意义]研究“睡美人”文献的识别方法,对尽早发现重要科技成就及其发明人、加快科技成果转化以及完善学术评价方法等均具有重要意义。[方法/过程]针对高校学术论文成果评价这一特定场景,提出“先客观指标粗筛、后多维参数细选”的研究思路,组合使用K值算法和三指标法,对东北大学发表于Web of Science核心合集的论文样本集进行了“睡美人”文献挖掘的实证研究。[结果/结论]该方法共识别出12篇“睡美人”文献,并对其被引特征、期刊特征、睡眠特征、内容特征等因素进行了分析。实获数据处理结果表明了该方法的有效性,相关研究方法和结果可对东北大学学术论文评价提供重要参考。
基金National Natural Science Foundation of China(No.51467008)。
文摘For photovoltaic power prediction,a kind of sparse representation modeling method using feature extraction techniques is proposed.Firstly,all these factors affecting the photovoltaic power output are regarded as the input data of the model.Next,the dictionary learning techniques using the K-mean singular value decomposition(K-SVD)algorithm and the orthogonal matching pursuit(OMP)algorithm are used to obtain the corresponding sparse encoding based on all the input data,i.e.the initial dictionary.Then,to build the global prediction model,the sparse coding vectors are used as the input of the model of the kernel extreme learning machine(KELM).Finally,to verify the effectiveness of the combined K-SVD-OMP and KELM method,the proposed method is applied to a instance of the photovoltaic power prediction.Compared with KELM,SVM and ELM under the same conditions,experimental results show that different combined sparse representation methods achieve better prediction results,among which the combined K-SVD-OMP and KELM method shows better prediction results and modeling accuracy.
基金National Youth Natural Science Foundationof China(No.61806006)Innovation Program for Graduate of Jiangsu Province(No.KYLX160-781)Project Supported by Jiangsu University Superior Discipline Construction Project。
文摘The K-multiple-means(KMM)retains the simple and efficient advantages of the K-means algorithm by setting multiple subclasses,and improves its effect on non-convex data sets.And aiming at the problem that it cannot be applied to the Internet on a multi-view data set,a multi-view K-multiple-means(MKMM)clustering method is proposed in this paper.The new algorithm introduces view weight parameter,reserves the design of setting multiple subclasses,makes the number of clusters as constraint and obtains clusters by solving optimization problem.The new algorithm is compared with some popular multi-view clustering algorithms.The effectiveness of the new algorithm is proved through the analysis of the experimental results.
基金partially supported by the Major Project of National Science and Technology of China under Grants No. 2016ZX03002010003 and No. 2015ZX03001033-002
文摘In 5 G Ultra-dense Network(UDN), resource allocation is an efficient method to manage inter-small-cell interference. In this paper, a two-stage resource allocation scheme is proposed to supervise interference and resource allocation while establishing a realistic scenario of three-tier heterogeneous network architecture. The scheme consists of two stages: in stage I, a two-level sub-channel allocation algorithm and a power control method based on the logarithmic function are applied to allocate resource for Macrocell and Picocells, guaranteeing the minimum system capacity by considering the power limitation and interference coordination; in stage II, an interference management approach based on K-means clustering is introduced to divide Femtocells into different clusters. Then, a prior sub-channel allocation algorithm is employed for Femtocells in diverse clusters to mitigate the interference and promote system performance. Simulation results show that the proposed scheme contributes to the enhancement of system throughput and spectrum efficiency while ensuring the system energy efficiency.
文摘Classification systems such as Slope Mass Rating(SMR) are currently being used to undertake slope stability analysis. In SMR classification system, data is allocated to certain classes based on linguistic and experience-based criteria. In order to eliminate linguistic criteria resulted from experience-based judgments and account for uncertainties in determining class boundaries developed by SMR system,the system classification results were corrected using two clustering algorithms, namely K-means and fuzzy c-means(FCM), for the ratings obtained via continuous and discrete functions. By applying clustering algorithms in SMR classification system, no in-advance experience-based judgment was made on the number of extracted classes in this system, and it was only after all steps of the clustering algorithms were accomplished that new classification scheme was proposed for SMR system under different failure modes based on the ratings obtained via continuous and discrete functions. The results of this study showed that, engineers can achieve more reliable and objective evaluations over slope stability by using SMR system based on the ratings calculated via continuous and discrete functions.
文摘针对电力巡线无人机检测绝缘子缺陷,具有缺陷绝缘子样本数据不均衡、采集难度大等问题,提出一种基于YOLOV5(you only look once V5)算法的绝缘子异常检测模型。首先借助YOLOV5目标检测算法定位绝缘子位置,再把绝缘子图像输入到残差网络提取多层金字塔特征,然后通过K邻近值算法判断特征层像素是否为离群点,由此可判断绝缘子是否存在缺陷。所提方法无须负样本绝缘子图像,仅通过正样本即可训练网络;与常用方法相比,所提算法的准确率和召回率均为最高,表明所提方法泛化性和稳定性较好。
文摘Recently clustering techniques have been used to automatically discover typical user profiles. In general, it is a challenging problem to design effective similarity measure between the session vectors which are usually high-dimensional and sparse. Two approaches for mining typical user profiles, based on matrix dimensionality reduction, are presented. In these approaches, non-negative matrix factorization is applied to reduce dimensionality of the session-URL matrix, and the projecting vectors of the user-session vectors are clustered into typical user-session profiles using the spherical k -means algorithm. The results show that two algorithms are successful in mining many typical user profiles in the user sessions.
文摘Sample entropy can reflect the change of level of new information in signal sequence as well as the size of the new information. Based on the sample entropy as the features of speech classification, the paper firstly extract the sample entropy of mixed signal, mean and variance to calculate each signal sample entropy, finally uses the K mean clustering to recognize. The simulation results show that: the recognition rate can be increased to 89.2% based on sample entropy.
基金Project(51875481) supported by the National Natural Science Foundation of ChinaProject(2682017CX011) supported by the Fundamental Research Foundations for the Central Universities,China+2 种基金Project(2017M623009) supported by the China Postdoctoral Science FoundationProject(2017YFB1201004) supported by the National Key Research and Development Plan for Advanced Rail Transit,ChinaProject(2019TPL_T08) supported by the Research Fund of the State Key Laboratory of Traction Power,China
文摘The impulsive components induced by bearing faults are key features for assessing gear-box bearing faults.However,because of heavy background noise and the interferences of other vibrations,it is difficult to extract these impulsive components caused by faults,particularly early faults,from the measured vibration signals.To capture the high-level structure of impulsive components embedded in measured vibration signals,a dictionary learning method called shift-invariant K-means singular value decomposition(SI-K-SVD)dictionary learning is used to detect the early faults of gear-box bearings.Although SI-K-SVD is more flexible and adaptable than existing methods,the improper selection of two SI-K-SVD-related parameters,namely,the number of iterations and the pattern lengths,has an adverse influence on fault detection performance.Therefore,the sparsity of the envelope spectrum(SES)and the kurtosis of the envelope spectrum(KES)are used to select these two key parameters,respectively.SI-K-SVD with the two selected optimal parameter values,referred to as optimal parameter SI-K-SVD(OP-SI-K-SVD),is proposed to detect gear-box bearing faults.The proposed method is verified by both simulations and an experiment.Compared to the state-of-the-art methods,namely,empirical model decomposition,wavelet transform and K-SVD,OP-SI-K-SVD has better performance in diagnosing the early faults of a gear-box bearing.
基金supported by the Natural Science Foundation under Grant Nos.71273139,60804047the Social Science Foundation of Chinese Ministry of Education under Grant No.12YJC630271
文摘Customers are of great importance to E-commerce in intense competition.It is known that twenty percent customers produce eighty percent profiles.Thus,how to find these customers is very critical.Customer lifetime value(CLV) is presented to evaluate customers in terms of recency,frequency and monetary(RFM) variables.A novel model is proposed to analyze customers purchase data and RFM variables based on ordered weighting averaging(OWA) and K-Means cluster algorithm.OWA is employed to determine the weights of RFM variables in evaluating customer lifetime value or loyalty.K-Means algorithm is used to cluster customers according to RFM values.Churn customers could be found out by comparing RFM values of every cluster group with average RFM.Questionnaire is conducted to investigate which reasons cause customers dissatisfaction.Rank these reasons to help E-commerce improve services.The experimental results have demonstrated that the model is effective and reasonable.