k-means is a popular clustering algorithm because of its simplicity and scalability to handle large datasets.However,one of its setbacks is the challenge of identifying the correct k-hyperparameter value.Tuning this v...k-means is a popular clustering algorithm because of its simplicity and scalability to handle large datasets.However,one of its setbacks is the challenge of identifying the correct k-hyperparameter value.Tuning this value correctly is critical for building effective k-means models.The use of the traditional elbow method to help identify this value has a long-standing literature.However,when using this method with certain datasets,smooth curves may appear,making it challenging to identify the k-value due to its unclear nature.On the other hand,various internal validation indexes,which are proposed as a solution to this issue,may be inconsistent.Although various techniques for solving smooth elbow challenges exist,k-hyperparameter tuning in high-dimensional spaces still remains intractable and an open research issue.In this paper,we have first reviewed the existing techniques for solving smooth elbow challenges.The identified research gaps are then utilized in the development of the new technique.The new technique,referred to as the ensemble-based technique of a self-adapting autoencoder and internal validation indexes,is then validated in high-dimensional space clustering.The optimal k-value,tuned by this technique using a voting scheme,is a trade-off between the number of clusters visualized in the autoencoder’s latent space,k-value from the ensemble internal validation index score and one that generates a value of 0 or close to 0 on the derivative f″′(k)(1+f′(k)^(2))−3 f″(k)^(2)f″((k)2f′(k),at the elbow.Experimental results based on the Cochran’s Q test,ANOVA,and McNemar’s score indicate a relatively good performance of the newly developed technique in k-hyperparameter tuning.展开更多
Various index structures have recently been proposed to facilitate high-dimensional KNN queries, among which the techniques of approximate vector presentation and one-dimensional (1D) transformation can break the curs...Various index structures have recently been proposed to facilitate high-dimensional KNN queries, among which the techniques of approximate vector presentation and one-dimensional (1D) transformation can break the curse of dimensionality. Based on the two techniques above, a novel high-dimensional index is proposed, called Bit-code and Distance based index (BD). BD is based on a special partitioning strategy which is optimized for high-dimensional data. By the definitions of bit code and transformation function, a high-dimensional vector can be first approximately represented and then transformed into a 1D vector, the key managed by a B+-tree. A new KNN search algorithm is also proposed that exploits the bit code and distance to prune the search space more effectively. Results of extensive experiments using both synthetic and real data demonstrated that BD out- performs the existing index structures for KNN search in high-dimensional spaces.展开更多
When atoms are accelerated in the vacuum,entanglement among atoms will degrade compared with the initial situation before the acceleration.In this study,we propose a novel and interesting view that the lost entangleme...When atoms are accelerated in the vacuum,entanglement among atoms will degrade compared with the initial situation before the acceleration.In this study,we propose a novel and interesting view that the lost entanglement can be recovered completely when the high-dimensional spacetime is exploited,in the case that the acceleration is not too large,since the entanglement loss rate caused by the large acceleration is faster than the recovery process.We also calculate the entanglement change caused by the anti-Unruh effect and found that the lost entanglement could just be recovered part by the anti-Unruh effect,and the anti-Unruh effect could only appear for a finite range of acceleration when the interaction time scale is approximately shorter than the reciprocal of the energy gap in two dimensional spacetime.The limit case of zero acceleration is also investigated,which gives an analytical interpretation for the increase or recovery of entanglement.展开更多
This paper studies the target controllability of multilayer complex networked systems,in which the nodes are highdimensional linear time invariant(LTI)dynamical systems,and the network topology is directed and weighte...This paper studies the target controllability of multilayer complex networked systems,in which the nodes are highdimensional linear time invariant(LTI)dynamical systems,and the network topology is directed and weighted.The influence of inter-layer couplings on the target controllability of multi-layer networks is discussed.It is found that even if there exists a layer which is not target controllable,the entire multi-layer network can still be target controllable due to the inter-layer couplings.For the multi-layer networks with general structure,a necessary and sufficient condition for target controllability is given by establishing the relationship between uncontrollable subspace and output matrix.By the derived condition,it can be found that the system may be target controllable even if it is not state controllable.On this basis,two corollaries are derived,which clarify the relationship between target controllability,state controllability and output controllability.For the multi-layer networks where the inter-layer couplings are directed chains and directed stars,sufficient conditions for target controllability of networked systems are given,respectively.These conditions are easier to verify than the classic criterion.展开更多
Speech emotion recognition(SER)uses acoustic analysis to find features for emotion recognition and examines variations in voice that are caused by emotions.The number of features acquired with acoustic analysis is ext...Speech emotion recognition(SER)uses acoustic analysis to find features for emotion recognition and examines variations in voice that are caused by emotions.The number of features acquired with acoustic analysis is extremely high,so we introduce a hybrid filter-wrapper feature selection algorithm based on an improved equilibrium optimizer for constructing an emotion recognition system.The proposed algorithm implements multi-objective emotion recognition with the minimum number of selected features and maximum accuracy.First,we use the information gain and Fisher Score to sort the features extracted from signals.Then,we employ a multi-objective ranking method to evaluate these features and assign different importance to them.Features with high rankings have a large probability of being selected.Finally,we propose a repair strategy to address the problem of duplicate solutions in multi-objective feature selection,which can improve the diversity of solutions and avoid falling into local traps.Using random forest and K-nearest neighbor classifiers,four English speech emotion datasets are employed to test the proposed algorithm(MBEO)as well as other multi-objective emotion identification techniques.The results illustrate that it performs well in inverted generational distance,hypervolume,Pareto solutions,and execution time,and MBEO is appropriate for high-dimensional English SER.展开更多
The objective of reliability-based design optimization(RBDO)is to minimize the optimization objective while satisfying the corresponding reliability requirements.However,the nested loop characteristic reduces the effi...The objective of reliability-based design optimization(RBDO)is to minimize the optimization objective while satisfying the corresponding reliability requirements.However,the nested loop characteristic reduces the efficiency of RBDO algorithm,which hinders their application to high-dimensional engineering problems.To address these issues,this paper proposes an efficient decoupled RBDO method combining high dimensional model representation(HDMR)and the weight-point estimation method(WPEM).First,we decouple the RBDO model using HDMR and WPEM.Second,Lagrange interpolation is used to approximate a univariate function.Finally,based on the results of the first two steps,the original nested loop reliability optimization model is completely transformed into a deterministic design optimization model that can be solved by a series of mature constrained optimization methods without any additional calculations.Two numerical examples of a planar 10-bar structure and an aviation hydraulic piping system with 28 design variables are analyzed to illustrate the performance and practicability of the proposed method.展开更多
The estimation of covariance matrices is very important in many fields, such as statistics. In real applications, data are frequently influenced by high dimensions and noise. However, most relevant studies are based o...The estimation of covariance matrices is very important in many fields, such as statistics. In real applications, data are frequently influenced by high dimensions and noise. However, most relevant studies are based on complete data. This paper studies the optimal estimation of high-dimensional covariance matrices based on missing and noisy sample under the norm. First, the model with sub-Gaussian additive noise is presented. The generalized sample covariance is then modified to define a hard thresholding estimator , and the minimax upper bound is derived. After that, the minimax lower bound is derived, and it is concluded that the estimator presented in this article is rate-optimal. Finally, numerical simulation analysis is performed. The result shows that for missing samples with sub-Gaussian noise, if the true covariance matrix is sparse, the hard thresholding estimator outperforms the traditional estimate method.展开更多
目的:对比三维多回波恢复梯度回波(3D MERGE)、三维可变反转角快速自旋回波(3D SPACE STIR)序列在腰椎间盘突出症(LDH)检查中的应用效果。方法:选择2020年1月~2022年11月收治的135例LDH患者,回顾性分析患者临床和磁共振成像(MRI)资料,...目的:对比三维多回波恢复梯度回波(3D MERGE)、三维可变反转角快速自旋回波(3D SPACE STIR)序列在腰椎间盘突出症(LDH)检查中的应用效果。方法:选择2020年1月~2022年11月收治的135例LDH患者,回顾性分析患者临床和磁共振成像(MRI)资料,所有患者均接受常规MRI扫描及3D MERGE、3D SPACE STIR序列扫描,对比3D MERGE、3D SPACE STIR序列测量神经根直径的一致性,评价两种序列的图像质量参数[信噪比(SNR)、对比噪声比(CNR)]、图像清晰度评分。结果:3D MERGE和3D SPACE STIR序列测量的L3~S1神经根直径比较差异无统计学意义(P>0.05),且两组序列测量的L3、L4、L5和S1直径均显示出较高相关性(r=0.957,0.986,0.975,0.972,P<0.05);3D MERGE序列的SNR及CNR均高于3D SPACE STIR序列,神经根显示分级、图像清晰度评分优于3D SPACE STIR序列,差异有统计学意义(P<0.05)。结论:3D MERGE、3D SPACE STIR序列在LDH神经根直径测量中具有极高一致性,3D MERGE序列较3D SPACE STIR序列能够更清晰显示神经跟的解剖形态,图像质量更好。展开更多
In this paper,an Observation Points Classifier Ensemble(OPCE)algorithm is proposed to deal with High-Dimensional Imbalanced Classification(HDIC)problems based on data processed using the Multi-Dimensional Scaling(MDS)...In this paper,an Observation Points Classifier Ensemble(OPCE)algorithm is proposed to deal with High-Dimensional Imbalanced Classification(HDIC)problems based on data processed using the Multi-Dimensional Scaling(MDS)feature extraction technique.First,dimensionality of the original imbalanced data is reduced using MDS so that distances between any two different samples are preserved as well as possible.Second,a novel OPCE algorithm is applied to classify imbalanced samples by placing optimised observation points in a low-dimensional data space.Third,optimization of the observation point mappings is carried out to obtain a reliable assessment of the unknown samples.Exhaustive experiments have been conducted to evaluate the feasibility,rationality,and effectiveness of the proposed OPCE algorithm using seven benchmark HDIC data sets.Experimental results show that(1)the OPCE algorithm can be trained faster on low-dimensional imbalanced data than on high-dimensional data;(2)the OPCE algorithm can correctly identify samples as the number of optimised observation points is increased;and(3)statistical analysis reveals that OPCE yields better HDIC performances on the selected data sets in comparison with eight other HDIC algorithms.This demonstrates that OPCE is a viable algorithm to deal with HDIC problems.展开更多
螺旋藻(Spirulina)藻蓝蛋白具有独特的理化特性及生理功能,是药物、食品和化妆品的天然原料,具有较大的开发潜力。为探讨螺旋藻藻蓝蛋白的研究现状与发展前景,对中国知网和Web of Science数据库中1990—2023年发表的文献进行检索并筛选...螺旋藻(Spirulina)藻蓝蛋白具有独特的理化特性及生理功能,是药物、食品和化妆品的天然原料,具有较大的开发潜力。为探讨螺旋藻藻蓝蛋白的研究现状与发展前景,对中国知网和Web of Science数据库中1990—2023年发表的文献进行检索并筛选,使用Cite Space软件对文章发文量、研究团队及研究热点进行图谱分析。综合分析可知,国内年发文量偏少,呈平稳趋势;国外年发文量持续上升,尤其近几年发文量迅速增长,且发文量超过了100篇;国外研究热点集中于藻蓝蛋白在食品、医药行业的应用方面,而国内研究热点集中在提取纯化、稳定性、功能活性的研究与应用,下一步应结合研究现状开发适合规模化生产的提取纯化工艺,进一步加强藻蓝蛋白研究的广度与深度;国内外研究群体主要是高校的相关生物技术学院或研究机构等,总体来讲,学者间存在较为密切的合作,但研究机构间尚未形成紧密的合作关系,在地域上比较分散,各大高校和研究机构应突破地区或机构间的各种限制,促进该研究领域的深度融合和快速发展,深入挖掘藻蓝蛋白在各个领域的潜在应用。展开更多
目的梳理国内多发伤急救相关研究文献,分析研究现状、热点和趋势,为我国多发伤急救研究提供借鉴和指导。方法检索中国知网数据库中2011—2021年关于多发伤急救的相关文献,使用Cite Space 6.1.R3可视化软件对该领域的年发文量、机构、作...目的梳理国内多发伤急救相关研究文献,分析研究现状、热点和趋势,为我国多发伤急救研究提供借鉴和指导。方法检索中国知网数据库中2011—2021年关于多发伤急救的相关文献,使用Cite Space 6.1.R3可视化软件对该领域的年发文量、机构、作者、关键词进行分析。结果最终纳入多发伤急救研究文献2519篇,整体发文数量较平稳,以2016年为小高峰;发文量最高的机构是华中科技大学附属同济医院。多发伤急救研究热点包括院前急救、并发症护理、风险因素分析和预后效果评估,研究前沿包括不同多发伤人群的诊断、治疗、手术和护理体会等方面。结论本文通过可视化分析国内多发伤急救研究的热点及趋势,指明了多发伤目前研究存在的问题和未来研究发展的方向,为进一步完善多发伤急救卫生服务和管理体系提供指导。展开更多
The performance of conventional similarity measurement methods is affected seriously by the curse of dimensionality of high-dimensional data.The reason is that data difference between sparse and noisy dimensionalities...The performance of conventional similarity measurement methods is affected seriously by the curse of dimensionality of high-dimensional data.The reason is that data difference between sparse and noisy dimensionalities occupies a large proportion of the similarity,leading to the dissimilarities between any results.A similarity measurement method of high-dimensional data based on normalized net lattice subspace is proposed.The data range of each dimension is divided into several intervals,and the components in different dimensions are mapped onto the corresponding interval.Only the component in the same or adjacent interval is used to calculate the similarity.To validate this method,three data types are used,and seven common similarity measurement methods are compared.The experimental result indicates that the relative difference of the method is increasing with the dimensionality and is approximately two or three orders of magnitude higher than the conventional method.In addition,the similarity range of this method in different dimensions is [0,1],which is fit for similarity analysis after dimensionality reduction.展开更多
As a crucial data preprocessing method in data mining,feature selection(FS)can be regarded as a bi-objective optimization problem that aims to maximize classification accuracy and minimize the number of selected featu...As a crucial data preprocessing method in data mining,feature selection(FS)can be regarded as a bi-objective optimization problem that aims to maximize classification accuracy and minimize the number of selected features.Evolutionary computing(EC)is promising for FS owing to its powerful search capability.However,in traditional EC-based methods,feature subsets are represented via a length-fixed individual encoding.It is ineffective for high-dimensional data,because it results in a huge search space and prohibitive training time.This work proposes a length-adaptive non-dominated sorting genetic algorithm(LA-NSGA)with a length-variable individual encoding and a length-adaptive evolution mechanism for bi-objective highdimensional FS.In LA-NSGA,an initialization method based on correlation and redundancy is devised to initialize individuals of diverse lengths,and a Pareto dominance-based length change operator is introduced to guide individuals to explore in promising search space adaptively.Moreover,a dominance-based local search method is employed for further improvement.The experimental results based on 12 high-dimensional gene datasets show that the Pareto front of feature subsets produced by LA-NSGA is superior to those of existing algorithms.展开更多
高质量教师是高质量教育发展的中坚力量。教师信念作为教师专业素养构成的关键要素,对促进教师专业发展、提升教师质量具有重要作用与影响。为借鉴国际体育教师信念研究的成果与经验,促进国内对体育教师信念的研究,研究利用CiteSpace软...高质量教师是高质量教育发展的中坚力量。教师信念作为教师专业素养构成的关键要素,对促进教师专业发展、提升教师质量具有重要作用与影响。为借鉴国际体育教师信念研究的成果与经验,促进国内对体育教师信念的研究,研究利用CiteSpace软件,对Web of Science核心合集数据库中1960—2022年的英文文献进行可视化研究。发现:体育教师信念研究高潮出现于2021年,载文数量最多的期刊是Journal of Teaching in Physical Education;研究中心度最高的国家是美国,核心圈层的代表学者是Richards KAR、Kulinna PH和Curtner-smith MD等人;研究热点趋势集中于体力活动促进、职业社会化、批判性教学法、职前体育教师、专业发展等方面。启示:国内未来研究应重点关注体育教师信念对课程改革的影响以及促进职前、职后阶段体育教师信念的发展。展开更多
文摘k-means is a popular clustering algorithm because of its simplicity and scalability to handle large datasets.However,one of its setbacks is the challenge of identifying the correct k-hyperparameter value.Tuning this value correctly is critical for building effective k-means models.The use of the traditional elbow method to help identify this value has a long-standing literature.However,when using this method with certain datasets,smooth curves may appear,making it challenging to identify the k-value due to its unclear nature.On the other hand,various internal validation indexes,which are proposed as a solution to this issue,may be inconsistent.Although various techniques for solving smooth elbow challenges exist,k-hyperparameter tuning in high-dimensional spaces still remains intractable and an open research issue.In this paper,we have first reviewed the existing techniques for solving smooth elbow challenges.The identified research gaps are then utilized in the development of the new technique.The new technique,referred to as the ensemble-based technique of a self-adapting autoencoder and internal validation indexes,is then validated in high-dimensional space clustering.The optimal k-value,tuned by this technique using a voting scheme,is a trade-off between the number of clusters visualized in the autoencoder’s latent space,k-value from the ensemble internal validation index score and one that generates a value of 0 or close to 0 on the derivative f″′(k)(1+f′(k)^(2))−3 f″(k)^(2)f″((k)2f′(k),at the elbow.Experimental results based on the Cochran’s Q test,ANOVA,and McNemar’s score indicate a relatively good performance of the newly developed technique in k-hyperparameter tuning.
基金Project (No. [2005]555) supported by the Hi-Tech Research and De-velopment Program (863) of China
文摘Various index structures have recently been proposed to facilitate high-dimensional KNN queries, among which the techniques of approximate vector presentation and one-dimensional (1D) transformation can break the curse of dimensionality. Based on the two techniques above, a novel high-dimensional index is proposed, called Bit-code and Distance based index (BD). BD is based on a special partitioning strategy which is optimized for high-dimensional data. By the definitions of bit code and transformation function, a high-dimensional vector can be first approximately represented and then transformed into a 1D vector, the key managed by a B+-tree. A new KNN search algorithm is also proposed that exploits the bit code and distance to prune the search space more effectively. Results of extensive experiments using both synthetic and real data demonstrated that BD out- performs the existing index structures for KNN search in high-dimensional spaces.
基金supported by the National Natural Science Foundation of China(Grant Nos.12375057,11947301,and 12047502)the Fundamental Research Funds for the Central UniversitiesChina University of Geosciences(Wuhan)(Grant No.G1323523064)。
文摘When atoms are accelerated in the vacuum,entanglement among atoms will degrade compared with the initial situation before the acceleration.In this study,we propose a novel and interesting view that the lost entanglement can be recovered completely when the high-dimensional spacetime is exploited,in the case that the acceleration is not too large,since the entanglement loss rate caused by the large acceleration is faster than the recovery process.We also calculate the entanglement change caused by the anti-Unruh effect and found that the lost entanglement could just be recovered part by the anti-Unruh effect,and the anti-Unruh effect could only appear for a finite range of acceleration when the interaction time scale is approximately shorter than the reciprocal of the energy gap in two dimensional spacetime.The limit case of zero acceleration is also investigated,which gives an analytical interpretation for the increase or recovery of entanglement.
基金supported by the National Natural Science Foundation of China (U1808205)Hebei Natural Science Foundation (F2000501005)。
文摘This paper studies the target controllability of multilayer complex networked systems,in which the nodes are highdimensional linear time invariant(LTI)dynamical systems,and the network topology is directed and weighted.The influence of inter-layer couplings on the target controllability of multi-layer networks is discussed.It is found that even if there exists a layer which is not target controllable,the entire multi-layer network can still be target controllable due to the inter-layer couplings.For the multi-layer networks with general structure,a necessary and sufficient condition for target controllability is given by establishing the relationship between uncontrollable subspace and output matrix.By the derived condition,it can be found that the system may be target controllable even if it is not state controllable.On this basis,two corollaries are derived,which clarify the relationship between target controllability,state controllability and output controllability.For the multi-layer networks where the inter-layer couplings are directed chains and directed stars,sufficient conditions for target controllability of networked systems are given,respectively.These conditions are easier to verify than the classic criterion.
文摘Speech emotion recognition(SER)uses acoustic analysis to find features for emotion recognition and examines variations in voice that are caused by emotions.The number of features acquired with acoustic analysis is extremely high,so we introduce a hybrid filter-wrapper feature selection algorithm based on an improved equilibrium optimizer for constructing an emotion recognition system.The proposed algorithm implements multi-objective emotion recognition with the minimum number of selected features and maximum accuracy.First,we use the information gain and Fisher Score to sort the features extracted from signals.Then,we employ a multi-objective ranking method to evaluate these features and assign different importance to them.Features with high rankings have a large probability of being selected.Finally,we propose a repair strategy to address the problem of duplicate solutions in multi-objective feature selection,which can improve the diversity of solutions and avoid falling into local traps.Using random forest and K-nearest neighbor classifiers,four English speech emotion datasets are employed to test the proposed algorithm(MBEO)as well as other multi-objective emotion identification techniques.The results illustrate that it performs well in inverted generational distance,hypervolume,Pareto solutions,and execution time,and MBEO is appropriate for high-dimensional English SER.
基金supported by the Innovation Fund Project of the Gansu Education Department(Grant No.2021B-099).
文摘The objective of reliability-based design optimization(RBDO)is to minimize the optimization objective while satisfying the corresponding reliability requirements.However,the nested loop characteristic reduces the efficiency of RBDO algorithm,which hinders their application to high-dimensional engineering problems.To address these issues,this paper proposes an efficient decoupled RBDO method combining high dimensional model representation(HDMR)and the weight-point estimation method(WPEM).First,we decouple the RBDO model using HDMR and WPEM.Second,Lagrange interpolation is used to approximate a univariate function.Finally,based on the results of the first two steps,the original nested loop reliability optimization model is completely transformed into a deterministic design optimization model that can be solved by a series of mature constrained optimization methods without any additional calculations.Two numerical examples of a planar 10-bar structure and an aviation hydraulic piping system with 28 design variables are analyzed to illustrate the performance and practicability of the proposed method.
文摘The estimation of covariance matrices is very important in many fields, such as statistics. In real applications, data are frequently influenced by high dimensions and noise. However, most relevant studies are based on complete data. This paper studies the optimal estimation of high-dimensional covariance matrices based on missing and noisy sample under the norm. First, the model with sub-Gaussian additive noise is presented. The generalized sample covariance is then modified to define a hard thresholding estimator , and the minimax upper bound is derived. After that, the minimax lower bound is derived, and it is concluded that the estimator presented in this article is rate-optimal. Finally, numerical simulation analysis is performed. The result shows that for missing samples with sub-Gaussian noise, if the true covariance matrix is sparse, the hard thresholding estimator outperforms the traditional estimate method.
文摘目的:对比三维多回波恢复梯度回波(3D MERGE)、三维可变反转角快速自旋回波(3D SPACE STIR)序列在腰椎间盘突出症(LDH)检查中的应用效果。方法:选择2020年1月~2022年11月收治的135例LDH患者,回顾性分析患者临床和磁共振成像(MRI)资料,所有患者均接受常规MRI扫描及3D MERGE、3D SPACE STIR序列扫描,对比3D MERGE、3D SPACE STIR序列测量神经根直径的一致性,评价两种序列的图像质量参数[信噪比(SNR)、对比噪声比(CNR)]、图像清晰度评分。结果:3D MERGE和3D SPACE STIR序列测量的L3~S1神经根直径比较差异无统计学意义(P>0.05),且两组序列测量的L3、L4、L5和S1直径均显示出较高相关性(r=0.957,0.986,0.975,0.972,P<0.05);3D MERGE序列的SNR及CNR均高于3D SPACE STIR序列,神经根显示分级、图像清晰度评分优于3D SPACE STIR序列,差异有统计学意义(P<0.05)。结论:3D MERGE、3D SPACE STIR序列在LDH神经根直径测量中具有极高一致性,3D MERGE序列较3D SPACE STIR序列能够更清晰显示神经跟的解剖形态,图像质量更好。
基金National Natural Science Foundation of China,Grant/Award Number:61972261Basic Research Foundations of Shenzhen,Grant/Award Numbers:JCYJ20210324093609026,JCYJ20200813091134001。
文摘In this paper,an Observation Points Classifier Ensemble(OPCE)algorithm is proposed to deal with High-Dimensional Imbalanced Classification(HDIC)problems based on data processed using the Multi-Dimensional Scaling(MDS)feature extraction technique.First,dimensionality of the original imbalanced data is reduced using MDS so that distances between any two different samples are preserved as well as possible.Second,a novel OPCE algorithm is applied to classify imbalanced samples by placing optimised observation points in a low-dimensional data space.Third,optimization of the observation point mappings is carried out to obtain a reliable assessment of the unknown samples.Exhaustive experiments have been conducted to evaluate the feasibility,rationality,and effectiveness of the proposed OPCE algorithm using seven benchmark HDIC data sets.Experimental results show that(1)the OPCE algorithm can be trained faster on low-dimensional imbalanced data than on high-dimensional data;(2)the OPCE algorithm can correctly identify samples as the number of optimised observation points is increased;and(3)statistical analysis reveals that OPCE yields better HDIC performances on the selected data sets in comparison with eight other HDIC algorithms.This demonstrates that OPCE is a viable algorithm to deal with HDIC problems.
文摘螺旋藻(Spirulina)藻蓝蛋白具有独特的理化特性及生理功能,是药物、食品和化妆品的天然原料,具有较大的开发潜力。为探讨螺旋藻藻蓝蛋白的研究现状与发展前景,对中国知网和Web of Science数据库中1990—2023年发表的文献进行检索并筛选,使用Cite Space软件对文章发文量、研究团队及研究热点进行图谱分析。综合分析可知,国内年发文量偏少,呈平稳趋势;国外年发文量持续上升,尤其近几年发文量迅速增长,且发文量超过了100篇;国外研究热点集中于藻蓝蛋白在食品、医药行业的应用方面,而国内研究热点集中在提取纯化、稳定性、功能活性的研究与应用,下一步应结合研究现状开发适合规模化生产的提取纯化工艺,进一步加强藻蓝蛋白研究的广度与深度;国内外研究群体主要是高校的相关生物技术学院或研究机构等,总体来讲,学者间存在较为密切的合作,但研究机构间尚未形成紧密的合作关系,在地域上比较分散,各大高校和研究机构应突破地区或机构间的各种限制,促进该研究领域的深度融合和快速发展,深入挖掘藻蓝蛋白在各个领域的潜在应用。
文摘目的梳理国内多发伤急救相关研究文献,分析研究现状、热点和趋势,为我国多发伤急救研究提供借鉴和指导。方法检索中国知网数据库中2011—2021年关于多发伤急救的相关文献,使用Cite Space 6.1.R3可视化软件对该领域的年发文量、机构、作者、关键词进行分析。结果最终纳入多发伤急救研究文献2519篇,整体发文数量较平稳,以2016年为小高峰;发文量最高的机构是华中科技大学附属同济医院。多发伤急救研究热点包括院前急救、并发症护理、风险因素分析和预后效果评估,研究前沿包括不同多发伤人群的诊断、治疗、手术和护理体会等方面。结论本文通过可视化分析国内多发伤急救研究的热点及趋势,指明了多发伤目前研究存在的问题和未来研究发展的方向,为进一步完善多发伤急救卫生服务和管理体系提供指导。
基金Supported by the National Natural Science Foundation of China(No.61502475)the Importation and Development of High-Caliber Talents Project of the Beijing Municipal Institutions(No.CIT&TCD201504039)
文摘The performance of conventional similarity measurement methods is affected seriously by the curse of dimensionality of high-dimensional data.The reason is that data difference between sparse and noisy dimensionalities occupies a large proportion of the similarity,leading to the dissimilarities between any results.A similarity measurement method of high-dimensional data based on normalized net lattice subspace is proposed.The data range of each dimension is divided into several intervals,and the components in different dimensions are mapped onto the corresponding interval.Only the component in the same or adjacent interval is used to calculate the similarity.To validate this method,three data types are used,and seven common similarity measurement methods are compared.The experimental result indicates that the relative difference of the method is increasing with the dimensionality and is approximately two or three orders of magnitude higher than the conventional method.In addition,the similarity range of this method in different dimensions is [0,1],which is fit for similarity analysis after dimensionality reduction.
基金supported in part by the National Natural Science Foundation of China(62172065,62072060)。
文摘As a crucial data preprocessing method in data mining,feature selection(FS)can be regarded as a bi-objective optimization problem that aims to maximize classification accuracy and minimize the number of selected features.Evolutionary computing(EC)is promising for FS owing to its powerful search capability.However,in traditional EC-based methods,feature subsets are represented via a length-fixed individual encoding.It is ineffective for high-dimensional data,because it results in a huge search space and prohibitive training time.This work proposes a length-adaptive non-dominated sorting genetic algorithm(LA-NSGA)with a length-variable individual encoding and a length-adaptive evolution mechanism for bi-objective highdimensional FS.In LA-NSGA,an initialization method based on correlation and redundancy is devised to initialize individuals of diverse lengths,and a Pareto dominance-based length change operator is introduced to guide individuals to explore in promising search space adaptively.Moreover,a dominance-based local search method is employed for further improvement.The experimental results based on 12 high-dimensional gene datasets show that the Pareto front of feature subsets produced by LA-NSGA is superior to those of existing algorithms.
文摘高质量教师是高质量教育发展的中坚力量。教师信念作为教师专业素养构成的关键要素,对促进教师专业发展、提升教师质量具有重要作用与影响。为借鉴国际体育教师信念研究的成果与经验,促进国内对体育教师信念的研究,研究利用CiteSpace软件,对Web of Science核心合集数据库中1960—2022年的英文文献进行可视化研究。发现:体育教师信念研究高潮出现于2021年,载文数量最多的期刊是Journal of Teaching in Physical Education;研究中心度最高的国家是美国,核心圈层的代表学者是Richards KAR、Kulinna PH和Curtner-smith MD等人;研究热点趋势集中于体力活动促进、职业社会化、批判性教学法、职前体育教师、专业发展等方面。启示:国内未来研究应重点关注体育教师信念对课程改革的影响以及促进职前、职后阶段体育教师信念的发展。