期刊文献+
共找到30,176篇文章
< 1 2 250 >
每页显示 20 50 100
ARHCS (Automatic Rainfall Half-Life Cluster System): A Landslides Early Warning System (LEWS) Using Cluster Analysis and Automatic Threshold Definition
1
作者 Cassiano Antonio Bortolozo Luana Albertani Pampuch +8 位作者 Marcio Roberto Magalhães De Andrade Daniel Metodiev Adenilson Roberto Carvalho Tatiana Sussel Gonçalves Mendes Tristan Pryer Harideva Marturano Egas Rodolfo Moreda Mendes Isadora Araújo Sousa Jenny Power 《International Journal of Geosciences》 CAS 2024年第1期54-69,共16页
A significant portion of Landslide Early Warning Systems (LEWS) relies on the definition of operational thresholds and the monitoring of cumulative rainfall for alert issuance. These thresholds can be obtained in vari... A significant portion of Landslide Early Warning Systems (LEWS) relies on the definition of operational thresholds and the monitoring of cumulative rainfall for alert issuance. These thresholds can be obtained in various ways, but most often they are based on previous landslide data. This approach introduces several limitations. For instance, there is a requirement for the location to have been previously monitored in some way to have this type of information recorded. Another significant limitation is the need for information regarding the location and timing of incidents. Despite the current ease of obtaining location information (GPS, drone images, etc.), the timing of the event remains challenging to ascertain for a considerable portion of landslide data. Concerning rainfall monitoring, there are multiple ways to consider it, for instance, examining accumulations over various intervals (1 h, 6 h, 24 h, 72 h), as well as in the calculation of effective rainfall, which represents the precipitation that actually infiltrates the soil. However, in the vast majority of cases, both the thresholds and the rain monitoring approach are defined manually and subjectively, relying on the operators’ experience. This makes the process labor-intensive and time-consuming, hindering the establishment of a truly standardized and rapidly scalable methodology on a large scale. In this work, we propose a Landslides Early Warning System (LEWS) based on the concept of rainfall half-life and the determination of thresholds using Cluster Analysis and data inversion. The system is designed to be applied in extensive monitoring networks, such as the one utilized by Cemaden, Brazil’s National Center for Monitoring and Early Warning of Natural Disasters. 展开更多
关键词 Landslides Early Warning System (LEWS) cluster analysis LANDSLIDES Brazil
下载PDF
Comparative Analysis of Differences among Northern,Jiangnan,and Lingnan Classical Private Gardens Using Principal Component Cluster Method
2
作者 Lijuan Sun Hui Wang 《Journal of Architectural Research and Development》 2024年第5期20-29,共10页
This paper investigates the design essence of Chinese classical private gardens,integrating their design elements and fundamental principles.It systematically analyzes the unique characteristics and differences among ... This paper investigates the design essence of Chinese classical private gardens,integrating their design elements and fundamental principles.It systematically analyzes the unique characteristics and differences among classical private gardens in the Northern,Jiangnan,and Lingnan regions.The study examines nine classical private gardens from Northern China,Jiangnan,and Lingnan by utilizing the advanced tool of principal component cluster analysis.Based on literature analysis and field research,273 variables were selected for principal component analysis,from which four components with higher contribution rates were chosen for further study.Subsequently,we employed clustering analysis techniques to compare the differences among the three types of gardens.The results reveal that the first principal component effectively highlights the differences between Jiangnan and Lingnan private gardens.The second principal component serves as the key to defining the types of Northern private gardens and distinguishing them from the other two types,and the third principal component indicates that Lingnan private gardens can be categorized into two distinct types as well. 展开更多
关键词 Classical gardens Private gardens DIFFERENCES Principal component analysis cluster analysis
下载PDF
Comprehensive K-Means Clustering
3
作者 Ethan Xiao 《Journal of Computer and Communications》 2024年第3期146-159,共14页
The k-means algorithm is a popular data clustering technique due to its speed and simplicity. However, it is susceptible to issues such as sensitivity to the chosen seeds, and inaccurate clusters due to poor initial s... The k-means algorithm is a popular data clustering technique due to its speed and simplicity. However, it is susceptible to issues such as sensitivity to the chosen seeds, and inaccurate clusters due to poor initial seeds, particularly in complex datasets or datasets with non-spherical clusters. In this paper, a Comprehensive K-Means Clustering algorithm is presented, in which multiple trials of k-means are performed on a given dataset. The clustering results from each trial are transformed into a five-dimensional data point, containing the scope values of the x and y coordinates of the clusters along with the number of points within that cluster. A graph is then generated displaying the configuration of these points using Principal Component Analysis (PCA), from which we can observe and determine the common clustering patterns in the dataset. The robustness and strength of these patterns are then examined by observing the variance of the results of each trial, wherein a different subset of the data keeping a certain percentage of original data points is clustered. By aggregating information from multiple trials, we can distinguish clusters that consistently emerge across different runs from those that are more sensitive or unlikely, hence deriving more reliable conclusions about the underlying structure of complex datasets. Our experiments show that our algorithm is able to find the most common associations between different dimensions of data over multiple trials, often more accurately than other algorithms, as well as measure stability of these clusters, an ability that other k-means algorithms lack. 展开更多
关键词 k-means clustering
下载PDF
Optimization of constitutive parameters of foundation soils k-means clustering analysis 被引量:7
4
作者 Muge Elif Orakoglu Cevdet Emin Ekinci 《Research in Cold and Arid Regions》 CSCD 2013年第5期626-636,共11页
The goal of this study was to optimize the constitutive parameters of foundation soils using a k-means algorithm with clustering analysis. A database was collected from unconfined compression tests, Proctor tests and ... The goal of this study was to optimize the constitutive parameters of foundation soils using a k-means algorithm with clustering analysis. A database was collected from unconfined compression tests, Proctor tests and grain distribution tests of soils taken from three different types of foundation pits: raft foundations, partial raft foundations and strip foundations. k-means algorithm with clustering analysis was applied to determine the most appropriate foundation type given the un- confined compression strengths and other parameters of the different soils. 展开更多
关键词 foundation soil regression model k-means clustering analysis
下载PDF
基于改进K-means聚类的轨道交通基础设施分布式光伏发电典型场景生成及出力特性分析
5
作者 陈凯 雷琪 李豆萌 《电气工程学报》 CSCD 北大核心 2024年第2期364-372,共9页
受限于自然条件,光伏出力具有很强的随机性。为准确评估轨道交通基础设施分布式光伏发电的光伏出力特性,提出一种基于改进K-means聚类算法的轨道交通基础设施分布式光伏发电典型场景生成方法,并基于此进行光伏出力特性分析。首先,基于... 受限于自然条件,光伏出力具有很强的随机性。为准确评估轨道交通基础设施分布式光伏发电的光伏出力特性,提出一种基于改进K-means聚类算法的轨道交通基础设施分布式光伏发电典型场景生成方法,并基于此进行光伏出力特性分析。首先,基于分布式光伏发电设施以及气象数据,利用PVsyst软件模拟光伏发电出力数据。然后,针对基本K-means聚类算法聚类参数和初始聚类中心盲目性高的问题,结合聚类有效性指标(Density based index,DBI)和层次聚类对其进行改进并利用改进K-means聚类算法生成光伏典型日出力场景。最后,基于华中地区某地轨道交通基础设施分布式光伏系统对所提方法的有效性和优越性进行验证,并通过定性和定量分析各典型场景的出力特性揭示轨道交通基础设施分布式光伏出力的规律和特点。 展开更多
关键词 分布式光伏出力 改进k-means聚类算法 典型出力场景 出力特性分析
下载PDF
基于改进K-means聚类和皮尔逊相关系数户变关系异常诊断 被引量:4
6
作者 周纲 黄瑞 +3 位作者 刘度度 张芝敏 胡军华 高云鹏 《电测与仪表》 北大核心 2024年第3期76-82,152,共8页
用电信息采集系统易出现台区户变关系错误问题,传统诊断技术主要针对少用户台区出现异常用户情况,但对于多达数百用户台区,存在多相邻台区异常用户特征提取难题。文中首先通过主成分分析对GIS系统获取台区总表和用户电表电压数据实现降... 用电信息采集系统易出现台区户变关系错误问题,传统诊断技术主要针对少用户台区出现异常用户情况,但对于多达数百用户台区,存在多相邻台区异常用户特征提取难题。文中首先通过主成分分析对GIS系统获取台区总表和用户电表电压数据实现降维,建立改进K-means聚类提取电压数据特征,提出改进皮尔逊相关系数算法分析待检测用户,据此建立基于改进K-means聚类和改进皮尔逊相关系数的户变关系异常诊断方法,实现多异常用户所属正确台区诊断。实际算例分析结果表明,文中提出算法在识别同一台区一个及多个异常用户、不同台区多个异常用户情况下均能有效实现异常用户的准确检测与分析,相比传统检测方法,实现简单且准确性更高。 展开更多
关键词 户变关系 GIS系统 主成分分析 改进k-means聚类
下载PDF
A State of Art Analysis of Telecommunication Data by k-Means and k-Medoids Clustering Algorithms
7
作者 T. Velmurugan 《Journal of Computer and Communications》 2018年第1期190-202,共13页
Cluster analysis is one of the major data analysis methods widely used for many practical applications in emerging areas of data mining. A good clustering method will produce high quality clusters with high intra-clus... Cluster analysis is one of the major data analysis methods widely used for many practical applications in emerging areas of data mining. A good clustering method will produce high quality clusters with high intra-cluster similarity and low inter-cluster similarity. Clustering techniques are applied in different domains to predict future trends of available data and its uses for the real world. This research work is carried out to find the performance of two of the most delegated, partition based clustering algorithms namely k-Means and k-Medoids. A state of art analysis of these two algorithms is implemented and performance is analyzed based on their clustering result quality by means of its execution time and other components. Telecommunication data is the source data for this analysis. The connection oriented broadband data is given as input to find the clustering quality of the algorithms. Distance between the server locations and their connection is considered for clustering. Execution time for each algorithm is analyzed and the results are compared with one another. Results found in comparison study are satisfactory for the chosen application. 展开更多
关键词 k-means ALGORITHM k-Medoids ALGORITHM DATA clusterING Time COMPLEXITY TELECOMMUNICATION DATA
下载PDF
Campus Economic Analysis Based on K-Means Clustering and Hotspot Mining
8
作者 Xiuzhang Yang Shuai Wu +2 位作者 Huan Xia Yuanbo Li Xin Li 《Review of Educational Theory》 2020年第2期42-50,共9页
With the advent of the era of big data and the development and construction of smart campuses,the campus is gradually moving towards digitalization,networking and informationization.The campus card is an important par... With the advent of the era of big data and the development and construction of smart campuses,the campus is gradually moving towards digitalization,networking and informationization.The campus card is an important part of the construction of a smart campus,and the massive data it generates can indirectly reflect the living conditions of students at school.In the face of the campus card,how to quickly and accurately obtain the information required by users from the massive data sets has become an urgent problem that needs to be solved.This paper proposes a data mining algorithm based on K-Means clustering and time series.It analyzes the consumption data of a college student’s card to deeply mine and analyze the daily life consumer behavior habits of students,and to make an accurate judgment on the specific life consumer behavior.The algorithm proposed in this paper provides a practical reference for the construction of smart campuses in universities,and has important theoretical and application values. 展开更多
关键词 Machine learning k-means clustering Data mining Consumer behavior Campus economy Economic regionalization
下载PDF
基于K-means算法的病种成本聚类分析及精细化管理探究
9
作者 刘嘉慧 张萍 +1 位作者 曹瑾音 张芷菁 《卫生经济研究》 北大核心 2024年第8期37-40,44,共5页
目的:探索构建基于“投入-产出-频次”三个维度的病种成本分析模型,完善以病种为基础、以价值为导向的医院运营管理体系,助力精细化管理水平提升。方法:以某三甲医院为样本,基于某年度病案首页数据,运用K-means算法对病种进行聚类分析... 目的:探索构建基于“投入-产出-频次”三个维度的病种成本分析模型,完善以病种为基础、以价值为导向的医院运营管理体系,助力精细化管理水平提升。方法:以某三甲医院为样本,基于某年度病案首页数据,运用K-means算法对病种进行聚类分析。结果:将病种聚类为六类,从整体来看样本医院病种结构较好,但基层病种占比仍较大。结论:医院可以对病种进行精细化分类管理,对“重要价值病种”给予资源倾斜,对“一般价值病种”优化诊疗流程和模式,同时开展病种成本监测及分析,控制成本。 展开更多
关键词 聚类分析 病种成本 精细化管理
下载PDF
Composition Analysis and Identification of Ancient Glass Products Based on L1 Regularization Logistic Regression
10
作者 Yuqiao Zhou Xinyang Xu Wenjing Ma 《Applied Mathematics》 2024年第1期51-64,共14页
In view of the composition analysis and identification of ancient glass products, L1 regularization, K-Means cluster analysis, elbow rule and other methods were comprehensively used to build logical regression, cluste... In view of the composition analysis and identification of ancient glass products, L1 regularization, K-Means cluster analysis, elbow rule and other methods were comprehensively used to build logical regression, cluster analysis, hyper-parameter test and other models, and SPSS, Python and other tools were used to obtain the classification rules of glass products under different fluxes, sub classification under different chemical compositions, hyper-parameter K value test and rationality analysis. Research can provide theoretical support for the protection and restoration of ancient glass relics. 展开更多
关键词 Glass Composition L1 Regularization Logistic Regression Model k-means clustering analysis Elbow Rule Parameter Verification
下载PDF
Statistical Analysis of Abilities to Give Consent to Health Data Processing
11
作者 Antonella Massari Biagio Solarino +5 位作者 Paola Perchinunno Angela Maria D’Uggento Marcello Benevento Viviana D’Addosio Vittoria Claudia De Nicolò Samuela L’Abbate 《Applied Mathematics》 2024年第8期508-542,共35页
The recent pandemic crisis has highlighted the importance of the availability and management of health data to respond quickly and effectively to health emergencies, while respecting the fundamental rights of every in... The recent pandemic crisis has highlighted the importance of the availability and management of health data to respond quickly and effectively to health emergencies, while respecting the fundamental rights of every individual. In this context, it is essential to find a balance between the protection of privacy and the safeguarding of public health, using tools that guarantee transparency and consent to the processing of data by the population. This work, starting from a pilot investigation conducted in the Polyclinic of Bari as part of the Horizon Europe Seeds project entitled “Multidisciplinary analysis of technological tracing models of contagion: the protection of rights in the management of health data”, has the objective of promoting greater patient awareness regarding the processing of their health data and the protection of privacy. The methodology used the PHICAT (Personal Health Information Competence Assessment Tool) as a tool and, through the administration of a questionnaire, the aim was to evaluate the patients’ ability to express their consent to the release and processing of health data. The results that emerged were analyzed in relation to the 4 domains in which the process is divided which allows evaluating the patients’ ability to express a conscious choice and, also, in relation to the socio-demographic and clinical characteristics of the patients themselves. This study can contribute to understanding patients’ ability to give their consent and improve information regarding the management of health data by increasing confidence in granting the use of their data for research and clinical management. 展开更多
关键词 PRIVACY Health Data Consent cluster analysis LOGIT
下载PDF
Study Progress Analysis of Effluent Quality Prediction in Activated Sludge Process Based on CiteSpace
12
作者 Kemeng Xue 《Journal of Water Resource and Protection》 CAS 2024年第6期450-465,共16页
In this paper, CiteSpace, a bibliometrics software, was adopted to collect research papers published on the Web of Science, which are relevant to biological model and effluent quality prediction in activated sludge pr... In this paper, CiteSpace, a bibliometrics software, was adopted to collect research papers published on the Web of Science, which are relevant to biological model and effluent quality prediction in activated sludge process in the wastewater treatment. By the way of trend map, keyword knowledge map, and co-cited knowledge map, specific visualization analysis and identification of the authors, institutions and regions were concluded. Furthermore, the topics and hotspots of water quality prediction in activated sludge process through the literature-co-citation-based cluster analysis and literature citation burst analysis were also determined, which not only reflected the historical evolution progress to a certain extent, but also provided the direction and insight of the knowledge structure of water quality prediction and activated sludge process for future research. 展开更多
关键词 Biological Model Effluent Quality Prediction Activated Sludge Process CITESPACE Knowledge Map Co-Citation cluster analysis
下载PDF
Investigation of the J-TEXT plasma events by k-means clustering algorithm 被引量:1
13
作者 李建超 张晓卿 +11 位作者 张昱 Abba Alhaji BALA 柳惠平 周帼红 王能超 李达 陈忠勇 杨州军 陈志鹏 董蛟龙 丁永华 the J-TEXT Team 《Plasma Science and Technology》 SCIE EI CAS CSCD 2023年第8期38-43,共6页
Various types of plasma events emerge in specific parameter ranges and exhibit similar characteristics in diagnostic signals,which can be applied to identify these events.A semisupervised machine learning algorithm,th... Various types of plasma events emerge in specific parameter ranges and exhibit similar characteristics in diagnostic signals,which can be applied to identify these events.A semisupervised machine learning algorithm,the k-means clustering algorithm,is utilized to investigate and identify plasma events in the J-TEXT plasma.This method can cluster diverse plasma events with homogeneous features,and then these events can be identified if given few manually labeled examples based on physical understanding.A survey of clustered events reveals that the k-means algorithm can make plasma events(rotating tearing mode,sawtooth oscillations,and locked mode)gathering in Euclidean space composed of multi-dimensional diagnostic data,like soft x-ray emission intensity,edge toroidal rotation velocity,the Mirnov signal amplitude and so on.Based on the cluster analysis results,an approximate analytical model is proposed to rapidly identify plasma events in the J-TEXT plasma.The cluster analysis method is conducive to data markers of massive diagnostic data. 展开更多
关键词 k-means cluster analysis plasma event machine learning
下载PDF
Exploring Motor Imagery EEG: Enhanced EEG Microstate Analysis with GMD-Driven Density Canopy Method
14
作者 Xin Xiong Jing Zhang +3 位作者 Sanli Yi Chunwu Wang Ruixiang Liu Jianfeng He 《Computers, Materials & Continua》 SCIE EI 2024年第6期4659-4681,共23页
The analysis of microstates in EEG signals is a crucial technique for understanding the spatiotemporal dynamics of brain electrical activity.Traditional methods such as Atomic Agglomerative Hierarchical Clustering(AAH... The analysis of microstates in EEG signals is a crucial technique for understanding the spatiotemporal dynamics of brain electrical activity.Traditional methods such as Atomic Agglomerative Hierarchical Clustering(AAHC),K-means clustering,Principal Component Analysis(PCA),and Independent Component Analysis(ICA)are limited by a fixed number of microstate maps and insufficient capability in cross-task feature extraction.Tackling these limitations,this study introduces a Global Map Dissimilarity(GMD)-driven density canopy K-means clustering algorithm.This innovative approach autonomously determines the optimal number of EEG microstate topographies and employs Gaussian kernel density estimation alongside the GMD index for dynamic modeling of EEG data.Utilizing this advanced algorithm,the study analyzes the Motor Imagery(MI)dataset from the GigaScience database,GigaDB.The findings reveal six distinct microstates during actual right-hand movement and five microstates across other task conditions,with microstate C showing superior performance in all task states.During imagined movement,microstate A was significantly enhanced.Comparison with existing algorithms indicates a significant improvement in clustering performance by the refined method,with an average Calinski-Harabasz Index(CHI)of 35517.29 and a Davis-Bouldin Index(DBI)average of 2.57.Furthermore,an information-theoretical analysis of the microstate sequences suggests that imagined movement exhibits higher complexity and disorder than actual movement.By utilizing the extracted microstate sequence parameters as features,the improved algorithm achieved a classification accuracy of 98.41%in EEG signal categorization for motor imagery.A performance of 78.183%accuracy was achieved in a four-class motor imagery task on the BCI-IV-2a dataset.These results demonstrate the potential of the advanced algorithm in microstate analysis,offering a more effective tool for a deeper understanding of the spatiotemporal features of EEG signals. 展开更多
关键词 EEG microstate motor imagery k-means clustering algorithm gaus sian kernel function shannon entropy Lempel-Ziv complexity
下载PDF
Evolution and spatiotemporal analysis of earthquake public opinion based on social media data
15
作者 Chenyu Wang Yanjun Ye +2 位作者 Yingqiao Qiu Chen Li Meiqing Du 《Earthquake Science》 2024年第5期387-406,共20页
As critical conduits for the dissemination of online public opinion,social media platforms offer a timely and effective means for managing emergencies during major disasters,such as earthquakes.This study focuses on t... As critical conduits for the dissemination of online public opinion,social media platforms offer a timely and effective means for managing emergencies during major disasters,such as earthquakes.This study focuses on the analysis of online public opinions following the Maduo M7.4 earthquake in Qinghai Province and the Yangbi M6.4 earthquake in Yunnan Province.By collecting,cleaning,and organizing post-earthquake Sina Weibo(short for Weibo)data,we employed the Latent Dirichlet Allocation(LDA)model to extract information pertinent to public opinion on these earthquakes.This analysis included a comparison of the nature and temporal evolution of online public opinions related to both events.An emotion analysis,utilizing an emotion dictionary,categorized the emotional content of post-earthquake Weibo posts,facilitating a comparative study of the characteristics and temporal trends of online public emotions following the earthquakes.The findings were visualized using Geographic Information System(GIS)techniques.The analysis revealed certain commonalities in online public opinion following both earthquakes.Notably,the peak of online engagement occurred within the first 24 hours post-earthquake,with a rapid decline observed between 24 to 48 hours thereafter.The variation in popularity of online public opinion was linked to aftershock occurrences.Adjusted for population factors,online engagement in areas surrounding the earthquake sites and in Sichuan Province was significantly high.Initially dominated by feelings of“fear”and“surprise”,the public sentiment shifted towards a more positive outlook with the onset of rescue operations.However,distinctions in the online public response to each earthquake were also noted.Following the Yangbi earthquake,Yunnan Province reported the highest number of Weibo posts nationwide;in contrast,Qinghai Province ranked third post-Maduo earthquake,attributable to its smaller population size and extensive damage to communication infrastructure.This research offers a methodological approach for the analysis of online public opinion related to earthquakes,providing insights for the enhancement of post-disaster emergency management and public mental health support. 展开更多
关键词 internet public opinion topic clustering emotional analysis psychological crisis intervention
下载PDF
融合专家领域知识和K-means聚类的三支风险评级方法
16
作者 段维怡 梁德翠 《陕西师范大学学报(自然科学版)》 CAS CSCD 北大核心 2024年第3期26-36,共11页
金融和医疗等实际环境中的决策关键在于决策风险的权衡考虑,准确预测和分类风险级别非常必要。然而,传统的群体决策关注专家评价意见的一致性和共识,对于获得客观的专家评价意见和决策质量的考虑较少,在风险评级场景中难以量化和评估决... 金融和医疗等实际环境中的决策关键在于决策风险的权衡考虑,准确预测和分类风险级别非常必要。然而,传统的群体决策关注专家评价意见的一致性和共识,对于获得客观的专家评价意见和决策质量的考虑较少,在风险评级场景中难以量化和评估决策实际效果。因此,引入数据驱动的思想,利用数据和聚类结果辅助发现专家评估意见,在三支决策理论框架下优化群体意见,改进和计算逻辑回归的判别点,并基于UCI和Kaggle的4个信贷风险和疾病诊断公开数据集,完成风险评级分类。通过数据实验的结果可以发现:与经典的机器学习方法相比,文中提出的基于群体决策的三支分类方法更加关注风险的规避,在各个数据集上的分类表现均有稳定且较优的结果,说明通过发现专家领域知识,利用数据的客观信息辅助专家评估风险有助于解决不同背景的决策问题。 展开更多
关键词 专家领域知识 聚类分析 风险评级 三支决策 决策质量
下载PDF
基于融合改进K-means聚类算法的数据检测技术 被引量:3
17
作者 郭克难 《电子设计工程》 2024年第5期41-45,共5页
针对现有医疗财务数据分析系统平台老旧,采用传统K-means算法进行数据处理时性能较差的问题,文中设计了一种财务异常数据检测算法。对于传统K-means算法存在的分类效果不佳、运行效率偏低等不足,该算法结合密度峰值法对样本点的局部密... 针对现有医疗财务数据分析系统平台老旧,采用传统K-means算法进行数据处理时性能较差的问题,文中设计了一种财务异常数据检测算法。对于传统K-means算法存在的分类效果不佳、运行效率偏低等不足,该算法结合密度峰值法对样本点的局部密度和高密度距离进行计算,进而优化簇中心的选择。同时融合PCA降维算法减少了数据的冗余信息,进一步提高了运行效率。通过引入LOF离群检测算法对分簇后的数据进行检测,从而得到异常数据结果。实验测试中,所提算法在人工数据集上的平均ARI指标为0.844,真实数据集的准确率则达到了79.2%,在所有对比算法中均为最优,表明该算法具有良好的性能,可以对财务异常数据进行准确地检测。 展开更多
关键词 k-means聚类 密度峰值检测 主成分分析法 离群检测算法 异常数据检测
下载PDF
Analysis of the Employment Situation of Non Private Enterprises in Various Regions of China
18
作者 Junyi Wang 《Open Journal of Applied Sciences》 2024年第1期131-144,共14页
In the past 30 years, Chinese enterprises have been a hot topic of discussion and concern among the general public in terms of economic and social status, ownership structure, business mechanism, and management level.... In the past 30 years, Chinese enterprises have been a hot topic of discussion and concern among the general public in terms of economic and social status, ownership structure, business mechanism, and management level. Solving the problem of employment for the people is an important prerequisite for their peaceful living and work, as well as a prerequisite and foundation for building a harmonious society. The employment situation of private enterprises has always been of great concern to the outside world, and these two major jobs have always occupied an important position in the employment field of China that cannot be ignored. With the establishment of the market economy system, individual and private enterprises have become important components of the socialist economy, making significant contributions to economic development and social progress. The rapid development of China’s economy, on the one hand, is the embodiment of the superiority of China’s socialist market economic system, and on the other hand, it is the role of the tertiary industry and private enterprises in promoting the national economy. Since the 1990s, China’s private enterprises have become a new economic growth point for local and even national countries, and are one of the important ways to arrange employment and achieve social stability. This paper studies the employment of private enterprises and individuals from the perspective of statistics, extracts relevant data from China statistical Yearbook, uses the relevant knowledge of statistics to process the data, obtains the conclusion and puts forward relevant constructive suggestions. 展开更多
关键词 Correlation analysis of Employment Numbers Factor analysis Principal Component analysis cluster analysis
下载PDF
基于K-means与Word2vec的哺乳文胸评论主题挖掘研究
19
作者 刘妍 刘驰 《人类工效学》 2024年第2期40-45,共6页
目的为了了解消费者在网络平台购买哺乳文胸时的关注侧重点,文章从在线评论中抽取有效关键词构建哺乳文胸主题,并通过计算主题的重要程度协助商家了解消费者关注重点方向。方法选用TF-IDF关键词抽取算法,结合K-means和Word2vec进行语义... 目的为了了解消费者在网络平台购买哺乳文胸时的关注侧重点,文章从在线评论中抽取有效关键词构建哺乳文胸主题,并通过计算主题的重要程度协助商家了解消费者关注重点方向。方法选用TF-IDF关键词抽取算法,结合K-means和Word2vec进行语义聚类、主题识别、主题词挖掘及主题重要度计算。结果哺乳文胸评论文本聚类后的主题重要程度排名是:产品品质(45.47%)、产品外观(35.83%)、产品服务(18.79%)。结论通过该方法能够有效的识别和构建哺乳文胸主题及主题词,同时,通过主题的重要程度,能够了解消费者对于网络平台购买哺乳文胸时关注的重点方向,为哺乳内衣企业进行产品改善及生产等提供理论参考。 展开更多
关键词 服装工程 文本聚类分析 哺乳文胸 在线评论 k-means Word2vec 主题挖掘 主题重要程度 文献计量分析
下载PDF
Incident Detection Based on Differential Analysis
20
作者 Mohammed Ali Elseddig Mohamed Mejri 《Journal of Information Security》 2024年第3期378-409,共32页
Internet services and web-based applications play pivotal roles in various sensitive domains, encompassing e-commerce, e-learning, e-healthcare, and e-payment. However, safeguarding these services poses a significant ... Internet services and web-based applications play pivotal roles in various sensitive domains, encompassing e-commerce, e-learning, e-healthcare, and e-payment. However, safeguarding these services poses a significant challenge, as the need for robust security measures becomes increasingly imperative. This paper presented an innovative method based on differential analyses to detect abrupt changes in network traffic characteristics. The core concept revolves around identifying abrupt alterations in certain characteristics such as input/output volume, the number of TCP connections, or DNS queries—within the analyzed traffic. Initially, the traffic is segmented into distinct sequences of slices, followed by quantifying specific characteristics for each slice. Subsequently, the distance between successive values of these measured characteristics is computed and clustered to detect sudden changes. To accomplish its objectives, the approach combined several techniques, including propositional logic, distance metrics (e.g., Kullback-Leibler Divergence), and clustering algorithms (e.g., K-means). When applied to two distinct datasets, the proposed approach demonstrates exceptional performance, achieving detection rates of up to 100%. 展开更多
关键词 IDS SOC SIEM KL-Divergence k-mean clustering Algorithms Elbow Method
下载PDF
上一页 1 2 250 下一页 到第
使用帮助 返回顶部