Human-human interaction recognition is crucial in computer vision fields like surveillance,human-computer interaction,and social robotics.It enhances systems’ability to interpret and respond to human behavior precise...Human-human interaction recognition is crucial in computer vision fields like surveillance,human-computer interaction,and social robotics.It enhances systems’ability to interpret and respond to human behavior precisely.This research focuses on recognizing human interaction behaviors using a static image,which is challenging due to the complexity of diverse actions.The overall purpose of this study is to develop a robust and accurate system for human interaction recognition.This research presents a novel image-based human interaction recognition method using a Hidden Markov Model(HMM).The technique employs hue,saturation,and intensity(HSI)color transformation to enhance colors in video frames,making them more vibrant and visually appealing,especially in low-contrast or washed-out scenes.Gaussian filters reduce noise and smooth imperfections followed by silhouette extraction using a statistical method.Feature extraction uses the features from Accelerated Segment Test(FAST),Oriented FAST,and Rotated BRIEF(ORB)techniques.The application of Quadratic Discriminant Analysis(QDA)for feature fusion and discrimination enables high-dimensional data to be effectively analyzed,thus further enhancing the classification process.It ensures that the final features loaded into the HMM classifier accurately represent the relevant human activities.The impressive accuracy rates of 93%and 94.6%achieved in the BIT-Interaction and UT-Interaction datasets respectively,highlight the success and reliability of the proposed technique.The proposed approach addresses challenges in various domains by focusing on frame improvement,silhouette and feature extraction,feature fusion,and HMM classification.This enhances data quality,accuracy,adaptability,reliability,and reduction of errors.展开更多
The recognition of pathological voice is considered a difficult task for speech analysis.Moreover,otolaryngologists needed to rely on oral communication with patients to discover traces of voice pathologies like dysph...The recognition of pathological voice is considered a difficult task for speech analysis.Moreover,otolaryngologists needed to rely on oral communication with patients to discover traces of voice pathologies like dysphonia that are caused by voice alteration of vocal folds and their accuracy is between 60%–70%.To enhance detection accuracy and reduce processing speed of dysphonia detection,a novel approach is proposed in this paper.We have leveraged Linear Discriminant Analysis(LDA)to train multiple Machine Learning(ML)models for dysphonia detection.Several ML models are utilized like Support Vector Machine(SVM),Logistic Regression,and K-nearest neighbor(K-NN)to predict the voice pathologies based on features like Mel-Frequency Cepstral Coefficients(MFCC),Fundamental Frequency(F0),Shimmer(%),Jitter(%),and Harmonic to Noise Ratio(HNR).The experiments were performed using Saarbrucken Voice Data-base(SVD)and a privately collected dataset.The K-fold cross-validation approach was incorporated to increase the robustness and stability of the ML models.According to the experimental results,our proposed approach has a 70%increase in processing speed over Principal Component Analysis(PCA)and performs remarkably well with a recognition accuracy of 95.24%on the SVD dataset surpassing the previous best accuracy of 82.37%.In the case of the private dataset,our proposed method achieved an accuracy rate of 93.37%.It can be an effective non-invasive method to detect dysphonia.展开更多
GIS and remote sensing were utilized for prioritizing the W. Mujib catchment. Fifty three fourth-order sub-watersheds were prioritized based on morphometric analysis of linear and shape parameters. ASTER DEM (v.2), to...GIS and remote sensing were utilized for prioritizing the W. Mujib catchment. Fifty three fourth-order sub-watersheds were prioritized based on morphometric analysis of linear and shape parameters. ASTER DEM (v.2), topographical maps, and Arc GIS (10.1) software, have been employed to delineate the 53 sub-basins, to extract the drainage networks, and to compute the required basic, linear, and shape parameters, and to compile the necessary thematic maps such as elevation and slope categories. The land use/land cover map was generated using ERDAS Imagine (2015), LANDSAT 8 image, and supervised classification (Maximum Likelihood Method). Soil map was digitized using the Arc GIS tool. Each sub-basin is prioritized by assigning ranks based on the calculated compound parameter (Cp). The final score for each sub-basin is ascribed as per erosion threat. The 53 sub-watersheds were grouped into four categories of priority: very high (15 sub-basins, 28.3% of the total), high (17 sub-basins, 32% of the total), moderate (16 sub-basins, 30.2% of the total), and low (5 sub-basins, 9.5% of the total). Sub-basins categorized as very high and high priority (60.3% of the total) are subjected to high erosion risk, thus, creating an urgent need for applying soil and water conservation measures. The validity of the prioritized four groups was tested statistically by means of Discriminant Analysis (DA), and a significant difference was found between the four priority classes. A relatively complete separation exists between the recognized priority classes;thus, they are statistically valid, distinct, and different from each other. The present results intend to help decision makers pay sufficient attention to soil and water conservation programs, and to encourage tree plantation over the government-owned sloping land. Such procedures are essential in order to minimize soil erosion loss, and to increase soil moisture on farms, thus, reducing the impact of recurrent droughts and the possibility of flooding downstream.展开更多
Quantitatively determining the sources of dune sand is one of the problems necessarily and urgently to be solved in aeolian landforms and desertification research. Based on the granulometric data of sand materials fro...Quantitatively determining the sources of dune sand is one of the problems necessarily and urgently to be solved in aeolian landforms and desertification research. Based on the granulometric data of sand materials from the Hulun Buir Sandy Land, the paper employs the stepwise discriminant analysis technique (SDA) for two groups to select the principal factors determining the differences between surface loose sediments. The extent of similarity between two statistical populations can be described quantitatively by three factors such as the number of principal variables, Mahalanobis distance D 2 and confidence level 琢for F-test. Results reveal that: 1) Aeolian dune sand in the region mainly derives from Hailar Formation (Q 3 ), while fluvial sand and palaeosol also supply partially source sand for dunes; and 2) in the vicinity of Cuogang Town and west of the broad valley of the lower reaches of Hailar River, fluvial sand can naturally become principal supplier for dune sand.展开更多
The purpose of this study is to apply some statistical and soft computing methods such as Fisher discriminant analysis (FDA) and support vector machines (SVMs) methodology to the determination of pillar stability ...The purpose of this study is to apply some statistical and soft computing methods such as Fisher discriminant analysis (FDA) and support vector machines (SVMs) methodology to the determination of pillar stability for underground mines selected from various coal and stone mines by using some index and mechanical properties, including the width, the height, the ratio of the pillar width to its height, the uniaxial compressive strength of the rock and pillar stress. The study includes four main stages: sampling, testing, modeling and assessment of the model performances. During the modeling stage, two pillar stability prediction models were investigated with FDA and SVMs methodology based on the statistical learning theory. After using 40 sets of measured data in various mines in the world for training and testing, the model was applied to other 6 data for validating the trained proposed models. The prediction results of SVMs were compared with those of FDA as well as the measured field values. The general performance of models developed in this study is close; however, the SVMs exhibit the best performance considering the performance index with the correct classification rate Prs by re-substitution method and Pcv by cross validation method. The results show that the SVMs approach has the potential to be a reliable and practical tool for determination of pillar stability for underground mines.展开更多
Linear discriminant analysis and kernel vector quantization are integrated into vector quantization based speech recognition system for improving the recognition accuracy of Mandarin digits. These techniques increase ...Linear discriminant analysis and kernel vector quantization are integrated into vector quantization based speech recognition system for improving the recognition accuracy of Mandarin digits. These techniques increase the class separability and optimize the clustering procedure. Speaker-dependent (SD) and speaker-independent (SI) experiments are performed to evaluate the performance of the proposed method. The experiment results show that the proposed method is capable of reaching the word error rate of 3.76% in SD case and 6.60 % in SI case. Such a system can be suitable for being embedded in personal digital assistant(PDA), mobile phone and so on to perform voice controlling such as digit dialing, calculating, etc.展开更多
A Bayes discriminant analysis method to identify the risky of complicated goaf in mines was presented. Nine factors influencing the stability of goaf risky, including uniaxial compressive strength of rock, elastic mod...A Bayes discriminant analysis method to identify the risky of complicated goaf in mines was presented. Nine factors influencing the stability of goaf risky, including uniaxial compressive strength of rock, elastic modulus of rock, rock quality designation (RQD), area ratio of pillar, ratio of width to height of pillar, depth of ore body, volume of goaf, dip of ore body and area of goal, were selected as discriminant indexes in the stability analysis of goal. The actual data of 40 goals were used as training samples to establish a discriminant analysis model to identify the stability of goaf. The results show that this discriminant analysis model has high precision and misdiscriminant ratio is 0.025 in re-substitution process. The instability identification of a metal mine was distinguished by using this model and the identification result is identical with that of practical situation.展开更多
In gene prediction, the Fisher discriminant analysis (FDA) is used to separate protein coding region (exon) from non-coding regions (intron). Usually, the positive data set and the negative data set are of the same si...In gene prediction, the Fisher discriminant analysis (FDA) is used to separate protein coding region (exon) from non-coding regions (intron). Usually, the positive data set and the negative data set are of the same size if the number of the data is big enough. But for some situations the data are not sufficient or not equal, the threshold used in FDA may have important influence on prediction results. This paper presents a study on the selection of the threshold. The eigen value of each exon/intron sequence is computed using the Z-curve method with 69 variables. The experiments results suggest that the size and the standard deviation of the data sets and the threshold are the three key elements to be taken into consideration to improve the prediction results.展开更多
Clinical indications for herbal drugs very often only rely on traditional knowledge. Single plant-derived preparations are used for many purposes and cannot be classified to belong to a single category like calming or...Clinical indications for herbal drugs very often only rely on traditional knowledge. Single plant-derived preparations are used for many purposes and cannot be classified to belong to a single category like calming or stimulating drugs. With respect to the brain a unique possibility exists to analyze drug effects by recording the EEG. It is common knowledge that many drugs change the frequency content of electric brain activity. Quantitative analysis of the EEG by Fast Fourier Transformation reveals parameters like spectral power, which can be processed further (CATEEM®). Source density was determined from 17 channels of the quantitative EEG from 10 clinical studies recorded in a relaxed state with open eyes. Linear discriminant analysis was used to differentiate the effects of Placebo (circadian rhythm) from CNS-active herbal drugs in comparison to Valium®. Calmvalera®, L-Theanine, Lasea®, Neurapas®, Neuravena®, Neurexan®, Nutrifin Relax®, Pascoflair®(herbal calming drugs) as well as memoLoges®, Zembrin®(herbal stimulating drugs) induced different changes of the frequency content of brain electric activity. Discriminant analysis revealed that Nutrifin Relax®, Pascoflair®and Suntheanine®could not be separated well from each other indicating a similar mechanism of action. The effect of Valium®was projected at a very isolated position far away from the herbal preparations indicating a totally different mechanism of action. Zembrin®and memoLoges®grouped together with respect to the first three discriminant functions, but were different with respect to the 4th to 6th discriminant function. Lasea®as anxiolytic drug and Neurapas®as antidepressive drug were projected at isolated positions indicating their different clinical indications. The results indicate that discriminant analysis of human quantitative EEG data allows for unique pharmacological description of individual effect profiles of herbal drugs.展开更多
The bug tracking system is well known as the project support tool of open source software. There are many categorical data sets recorded on the bug tracking system. In the past, many reliability assessment methods hav...The bug tracking system is well known as the project support tool of open source software. There are many categorical data sets recorded on the bug tracking system. In the past, many reliability assessment methods have been proposed in the research area of software reliability. Also, there are several software project analyses based on the software effort data such as the earned value management. In particular, the software reliability growth models can </span><span style="font-family:Verdana;">apply to the system testing phase of software development. On the other</span><span style="font-family:Verdana;"> hand, the software effort analysis can apply to all development phase, because the fault data is only recorded on the testing phase. We focus on the big fault data and effort data of open source software. Then, it is difficult to assess by using the typical statistical assessment method, because the data recorded on the bug tracking system is large scale. Also, we discuss the jump diffusion process model based on the estimation method of jump parameters by using the discriminant analysis. Moreover, we analyze actual big fault data to show numerical examples of software effort assessment considering many categorical data set.展开更多
We revisit a comparison of two discriminant analysis procedures, namely the linear combination classifier of Chung and Han (2000) and the maximum likelihood estimation substitution classifier for the problem of classi...We revisit a comparison of two discriminant analysis procedures, namely the linear combination classifier of Chung and Han (2000) and the maximum likelihood estimation substitution classifier for the problem of classifying unlabeled multivariate normal observations with equal covariance matrices into one of two classes. Both classes have matching block monotone missing training data. Here, we demonstrate that for intra-class covariance structures with at least small correlation among the variables with missing data and the variables without block missing data, the maximum likelihood estimation substitution classifier outperforms the Chung and Han (2000) classifier regardless of the percent of missing observations. Specifically, we examine the differences in the estimated expected error rates for these classifiers using a Monte Carlo simulation, and we compare the two classifiers using two real data sets with monotone missing data via parametric bootstrap simulations. Our results contradict the conclusions of Chung and Han (2000) that their linear combination classifier is superior to the MLE classifier for block monotone missing multivariate normal data.展开更多
Semi-supervised discriminant analysis SDA which uses a combination of multiple embedding graphs and kernel SDA KSDA are adopted in supervised speech emotion recognition.When the emotional factors of speech signal samp...Semi-supervised discriminant analysis SDA which uses a combination of multiple embedding graphs and kernel SDA KSDA are adopted in supervised speech emotion recognition.When the emotional factors of speech signal samples are preprocessed different categories of features including pitch zero-cross rate energy durance formant and Mel frequency cepstrum coefficient MFCC as well as their statistical parameters are extracted from the utterances of samples.In the dimensionality reduction stage before the feature vectors are sent into classifiers parameter-optimized SDA and KSDA are performed to reduce dimensionality.Experiments on the Berlin speech emotion database show that SDA for supervised speech emotion recognition outperforms some other state-of-the-art dimensionality reduction methods based on spectral graph learning such as linear discriminant analysis LDA locality preserving projections LPP marginal Fisher analysis MFA etc. when multi-class support vector machine SVM classifiers are used.Additionally KSDA can achieve better recognition performance based on kernelized data mapping compared with the above methods including SDA.展开更多
Air quality is a critical concern for public health and environmental regulation. The Air Quality Index (AQI), a widely adopted index by the US Environmental Protection Agency (EPA), serves as a crucial metric for rep...Air quality is a critical concern for public health and environmental regulation. The Air Quality Index (AQI), a widely adopted index by the US Environmental Protection Agency (EPA), serves as a crucial metric for reporting site-specific air pollution levels. Accurately predicting air quality, as measured by the AQI, is essential for effective air pollution management. In this study, we aim to identify the most reliable regression model among linear discriminant analysis (LDA), quadratic discriminant analysis (QDA), logistic regression, and K-nearest neighbors (KNN). We conducted four different regression analyses using a machine learning approach to determine the model with the best performance. By employing the confusion matrix and error percentages, we selected the best-performing model, which yielded prediction error rates of 22%, 23%, 20%, and 27%, respectively, for LDA, QDA, logistic regression, and KNN models. The logistic regression model outperformed the other three statistical models in predicting AQI. Understanding these models' performance can help address an existing gap in air quality research and contribute to the integration of regression techniques in AQI studies, ultimately benefiting stakeholders like environmental regulators, healthcare professionals, urban planners, and researchers.展开更多
Fault diagnosis and monitoring are very important for complex chemical process. There are numerous methods that have been studied in this field, in which the effective visualization method is still challenging. In ord...Fault diagnosis and monitoring are very important for complex chemical process. There are numerous methods that have been studied in this field, in which the effective visualization method is still challenging. In order to get a better visualization effect, a novel fault diagnosis method which combines self-organizing map (SOM) with Fisher discriminant analysis (FDA) is proposed. FDA can reduce the dimension of the data in terms of maximizing the separability of the classes. After feature extraction by FDA, SOM can distinguish the different states on the output map clearly and it can also be employed to monitor abnormal states. Tennessee Eastman (TE) process is employed to illustrate the fault diagnosis and monitoring performance of the proposed method. The result shows that the SOM integrated with FDA method is efficient and capable for real-time monitoring and fault diagnosis in complex chemical process.展开更多
Based on the principle of Mahalanobis distance discriminant analysis (DDA) theory, a stability classification model for mine-lane surrounding rock was established, including six indexes of discriminant factors that re...Based on the principle of Mahalanobis distance discriminant analysis (DDA) theory, a stability classification model for mine-lane surrounding rock was established, including six indexes of discriminant factors that reflect the engineering quality of surrounding rock: lane depth below surface, span of lane, ratio of directly top layer thickness to coal thickness, uniaxial comprehensive strength of surrounding rock, development degree coefficient of surrounding rock joint and range of broken surrounding rock zone. A DDA model was obtained through training 15 practical measuring samples. The re-substitution method was introduced to verify the stability of DDA model and the ratio of mis-discrimination is zero. The DDA model was used to discriminate 3 new samples and the results are identical with actual rock kind. Compared with the artificial neural network method and support vector mechanic method, the results show that this model has high prediction accuracy and can be used in practical engineering.展开更多
Since there are not enough fault data in historical data sets, it is very difficult to diagnose faults for batch processes. In addition, a complete batch trajectory can be obtained till the end of its operation. In or...Since there are not enough fault data in historical data sets, it is very difficult to diagnose faults for batch processes. In addition, a complete batch trajectory can be obtained till the end of its operation. In order to overcome the need for estimated or filled up future unmeasured values in the online fault diagnosis, sufficiently utilize the finite information of faults, and enhance the diagnostic performance, an improved multi-model Fisher discriminant analysis is represented. The trait of the proposed method is that the training data sets are made of the current measured information and the past major discriminant information, and not only the current information or the whole batch data. An industrial typical multi-stage streptomycin fermentation process is used to test the performance of fault diagnosis of the proposed method.展开更多
As the basal group of Polypodiales, the specific taxonomy of Dicksoniaceae is still being debated. As aquantitative analysis method, numerical taxonomy has been applied to the taxonomic study of many plant families an...As the basal group of Polypodiales, the specific taxonomy of Dicksoniaceae is still being debated. As aquantitative analysis method, numerical taxonomy has been applied to the taxonomic study of many plant families andgenera in recent years due to its simplicity and high accuracy. However, the numerical analysis of the Dicksoniaceae fossilshas not been reported at present. In the present study, the pinnule morphological data of 42 Mesozoic fossil species of theDicksoniaceae were analyzed using cluster analysis, principal component analysis and correlation analysis. The resultsrevealed that 42 taxonomic units could be divided into six representative groups, which are consistent with the traditionaltaxonomy. After screening, an identification key on 28 fossil species of four genera with a definite taxonomic position wasestablished. According to the quantitative analysis, a Bayes discriminant model was established for the selected species.Lastly, the model was tested using the morphological data of the fossil pinnules in Dicksoniaceae from the YaojieFormation, suggesting that the discriminant model is accurate to a certain extent. As a result, the numerical taxonomy canbe applied to the classification of the Dicksoniaceae fossils.展开更多
A direct linear discriminant analysis algorithm based on economic singular value decomposition (DLDA/ESVD) is proposed to address the computationally complex problem of the conventional DLDA algorithm, which directl...A direct linear discriminant analysis algorithm based on economic singular value decomposition (DLDA/ESVD) is proposed to address the computationally complex problem of the conventional DLDA algorithm, which directly uses ESVD to reduce dimension and extract eigenvectors corresponding to nonzero eigenvalues. Then a DLDA algorithm based on column pivoting orthogonal triangular (QR) decomposition and ESVD (DLDA/QR-ESVD) is proposed to improve the performance of the DLDA/ESVD algorithm by processing a high-dimensional low rank matrix, which uses column pivoting QR decomposition to reduce dimension and ESVD to extract eigenvectors corresponding to nonzero eigenvalues. The experimental results on ORL, FERET and YALE face databases show that the proposed two algorithms can achieve almost the same performance and outperform the conventional DLDA algorithm in terms of computational complexity and training time. In addition, the experimental results on random data matrices show that the DLDA/QR-ESVD algorithm achieves better performance than the DLDA/ESVD algorithm by processing high-dimensional low rank matrices.展开更多
The identification of liquor brands is very important for food safety. Most of the fake liquors are usually made into the products with the same flavor and alcohol content as regular brand, so the identification for t...The identification of liquor brands is very important for food safety. Most of the fake liquors are usually made into the products with the same flavor and alcohol content as regular brand, so the identification for the liquor brands with the same flavor and the same alcohol content is essential. However, it is also difficult because the components of such liquor samples are very similar. Near-infrared (NIR) spectroscopy combined with partial least squares discriminant analysis (PLS-DA) was applied to identification of liquor brands with the same flavor and alcohol content. A total of 160 samples of Luzhou Laojiao liquor and 200 samples of non-Luzhou Laojiao liquor with the same flavor and alcohol content were used for identification. Samples of each type were randomly divided into the modeling and validation sets. The modeling samples were further divided into calibration and prediction sets using the Kennard-Stone algorithm to achieve uniformity and representativeness. In the modeling and validation processes based on PLS-DA method, the recognition rates of samples achieved 99.1% and 98.7%, respectively. The results show high prediction performance for the identification of liquor brands, and were obviously better than those obtained from the principal component linear discriminant analysis method. NIR spectroscopy combined with the PLS-DA method provides a quick and effective means of the discriminant analysis of liquor brands, and is also a promising tool for large-scale inspection of liquor food safety.展开更多
基金funding this work under the Research Group Funding Program Grant Code(NU/RG/SERC/12/6)supported via funding from Prince Satam bin Abdulaziz University Project Number(PSAU/2023/R/1444)+1 种基金Princess Nourah bint Abdulrahman University Researchers Supporting Project Number(PNURSP2023R348)Princess Nourah bint Abdulrahman University,Riyadh,Saudi Arabia,and this work was also supported by the Ministry of Science and ICT(MSIT),South Korea,through the ICT Creative Consilience Program supervised by the Institute for Information and Communications Technology Planning and Evaluation(IITP)under Grant IITP-2023-2020-0-01821.
文摘Human-human interaction recognition is crucial in computer vision fields like surveillance,human-computer interaction,and social robotics.It enhances systems’ability to interpret and respond to human behavior precisely.This research focuses on recognizing human interaction behaviors using a static image,which is challenging due to the complexity of diverse actions.The overall purpose of this study is to develop a robust and accurate system for human interaction recognition.This research presents a novel image-based human interaction recognition method using a Hidden Markov Model(HMM).The technique employs hue,saturation,and intensity(HSI)color transformation to enhance colors in video frames,making them more vibrant and visually appealing,especially in low-contrast or washed-out scenes.Gaussian filters reduce noise and smooth imperfections followed by silhouette extraction using a statistical method.Feature extraction uses the features from Accelerated Segment Test(FAST),Oriented FAST,and Rotated BRIEF(ORB)techniques.The application of Quadratic Discriminant Analysis(QDA)for feature fusion and discrimination enables high-dimensional data to be effectively analyzed,thus further enhancing the classification process.It ensures that the final features loaded into the HMM classifier accurately represent the relevant human activities.The impressive accuracy rates of 93%and 94.6%achieved in the BIT-Interaction and UT-Interaction datasets respectively,highlight the success and reliability of the proposed technique.The proposed approach addresses challenges in various domains by focusing on frame improvement,silhouette and feature extraction,feature fusion,and HMM classification.This enhances data quality,accuracy,adaptability,reliability,and reduction of errors.
文摘The recognition of pathological voice is considered a difficult task for speech analysis.Moreover,otolaryngologists needed to rely on oral communication with patients to discover traces of voice pathologies like dysphonia that are caused by voice alteration of vocal folds and their accuracy is between 60%–70%.To enhance detection accuracy and reduce processing speed of dysphonia detection,a novel approach is proposed in this paper.We have leveraged Linear Discriminant Analysis(LDA)to train multiple Machine Learning(ML)models for dysphonia detection.Several ML models are utilized like Support Vector Machine(SVM),Logistic Regression,and K-nearest neighbor(K-NN)to predict the voice pathologies based on features like Mel-Frequency Cepstral Coefficients(MFCC),Fundamental Frequency(F0),Shimmer(%),Jitter(%),and Harmonic to Noise Ratio(HNR).The experiments were performed using Saarbrucken Voice Data-base(SVD)and a privately collected dataset.The K-fold cross-validation approach was incorporated to increase the robustness and stability of the ML models.According to the experimental results,our proposed approach has a 70%increase in processing speed over Principal Component Analysis(PCA)and performs remarkably well with a recognition accuracy of 95.24%on the SVD dataset surpassing the previous best accuracy of 82.37%.In the case of the private dataset,our proposed method achieved an accuracy rate of 93.37%.It can be an effective non-invasive method to detect dysphonia.
文摘GIS and remote sensing were utilized for prioritizing the W. Mujib catchment. Fifty three fourth-order sub-watersheds were prioritized based on morphometric analysis of linear and shape parameters. ASTER DEM (v.2), topographical maps, and Arc GIS (10.1) software, have been employed to delineate the 53 sub-basins, to extract the drainage networks, and to compute the required basic, linear, and shape parameters, and to compile the necessary thematic maps such as elevation and slope categories. The land use/land cover map was generated using ERDAS Imagine (2015), LANDSAT 8 image, and supervised classification (Maximum Likelihood Method). Soil map was digitized using the Arc GIS tool. Each sub-basin is prioritized by assigning ranks based on the calculated compound parameter (Cp). The final score for each sub-basin is ascribed as per erosion threat. The 53 sub-watersheds were grouped into four categories of priority: very high (15 sub-basins, 28.3% of the total), high (17 sub-basins, 32% of the total), moderate (16 sub-basins, 30.2% of the total), and low (5 sub-basins, 9.5% of the total). Sub-basins categorized as very high and high priority (60.3% of the total) are subjected to high erosion risk, thus, creating an urgent need for applying soil and water conservation measures. The validity of the prioritized four groups was tested statistically by means of Discriminant Analysis (DA), and a significant difference was found between the four priority classes. A relatively complete separation exists between the recognized priority classes;thus, they are statistically valid, distinct, and different from each other. The present results intend to help decision makers pay sufficient attention to soil and water conservation programs, and to encourage tree plantation over the government-owned sloping land. Such procedures are essential in order to minimize soil erosion loss, and to increase soil moisture on farms, thus, reducing the impact of recurrent droughts and the possibility of flooding downstream.
文摘Quantitatively determining the sources of dune sand is one of the problems necessarily and urgently to be solved in aeolian landforms and desertification research. Based on the granulometric data of sand materials from the Hulun Buir Sandy Land, the paper employs the stepwise discriminant analysis technique (SDA) for two groups to select the principal factors determining the differences between surface loose sediments. The extent of similarity between two statistical populations can be described quantitatively by three factors such as the number of principal variables, Mahalanobis distance D 2 and confidence level 琢for F-test. Results reveal that: 1) Aeolian dune sand in the region mainly derives from Hailar Formation (Q 3 ), while fluvial sand and palaeosol also supply partially source sand for dunes; and 2) in the vicinity of Cuogang Town and west of the broad valley of the lower reaches of Hailar River, fluvial sand can naturally become principal supplier for dune sand.
基金Project (50934006) supported by the National Natural Science Foundation of ChinaProject (2010CB732004) supported by the National Basic Research Program of ChinaProject (CX2011B119) supported by the Graduated Students’ Research and Innovation Fund Project of Hunan Province of China
文摘The purpose of this study is to apply some statistical and soft computing methods such as Fisher discriminant analysis (FDA) and support vector machines (SVMs) methodology to the determination of pillar stability for underground mines selected from various coal and stone mines by using some index and mechanical properties, including the width, the height, the ratio of the pillar width to its height, the uniaxial compressive strength of the rock and pillar stress. The study includes four main stages: sampling, testing, modeling and assessment of the model performances. During the modeling stage, two pillar stability prediction models were investigated with FDA and SVMs methodology based on the statistical learning theory. After using 40 sets of measured data in various mines in the world for training and testing, the model was applied to other 6 data for validating the trained proposed models. The prediction results of SVMs were compared with those of FDA as well as the measured field values. The general performance of models developed in this study is close; however, the SVMs exhibit the best performance considering the performance index with the correct classification rate Prs by re-substitution method and Pcv by cross validation method. The results show that the SVMs approach has the potential to be a reliable and practical tool for determination of pillar stability for underground mines.
文摘Linear discriminant analysis and kernel vector quantization are integrated into vector quantization based speech recognition system for improving the recognition accuracy of Mandarin digits. These techniques increase the class separability and optimize the clustering procedure. Speaker-dependent (SD) and speaker-independent (SI) experiments are performed to evaluate the performance of the proposed method. The experiment results show that the proposed method is capable of reaching the word error rate of 3.76% in SD case and 6.60 % in SI case. Such a system can be suitable for being embedded in personal digital assistant(PDA), mobile phone and so on to perform voice controlling such as digit dialing, calculating, etc.
基金Project (2010CB732004) supported by the National Basic Research Program of China
文摘A Bayes discriminant analysis method to identify the risky of complicated goaf in mines was presented. Nine factors influencing the stability of goaf risky, including uniaxial compressive strength of rock, elastic modulus of rock, rock quality designation (RQD), area ratio of pillar, ratio of width to height of pillar, depth of ore body, volume of goaf, dip of ore body and area of goal, were selected as discriminant indexes in the stability analysis of goal. The actual data of 40 goals were used as training samples to establish a discriminant analysis model to identify the stability of goaf. The results show that this discriminant analysis model has high precision and misdiscriminant ratio is 0.025 in re-substitution process. The instability identification of a metal mine was distinguished by using this model and the identification result is identical with that of practical situation.
文摘In gene prediction, the Fisher discriminant analysis (FDA) is used to separate protein coding region (exon) from non-coding regions (intron). Usually, the positive data set and the negative data set are of the same size if the number of the data is big enough. But for some situations the data are not sufficient or not equal, the threshold used in FDA may have important influence on prediction results. This paper presents a study on the selection of the threshold. The eigen value of each exon/intron sequence is computed using the Z-curve method with 69 variables. The experiments results suggest that the size and the standard deviation of the data sets and the threshold are the three key elements to be taken into consideration to improve the prediction results.
文摘Clinical indications for herbal drugs very often only rely on traditional knowledge. Single plant-derived preparations are used for many purposes and cannot be classified to belong to a single category like calming or stimulating drugs. With respect to the brain a unique possibility exists to analyze drug effects by recording the EEG. It is common knowledge that many drugs change the frequency content of electric brain activity. Quantitative analysis of the EEG by Fast Fourier Transformation reveals parameters like spectral power, which can be processed further (CATEEM®). Source density was determined from 17 channels of the quantitative EEG from 10 clinical studies recorded in a relaxed state with open eyes. Linear discriminant analysis was used to differentiate the effects of Placebo (circadian rhythm) from CNS-active herbal drugs in comparison to Valium®. Calmvalera®, L-Theanine, Lasea®, Neurapas®, Neuravena®, Neurexan®, Nutrifin Relax®, Pascoflair®(herbal calming drugs) as well as memoLoges®, Zembrin®(herbal stimulating drugs) induced different changes of the frequency content of brain electric activity. Discriminant analysis revealed that Nutrifin Relax®, Pascoflair®and Suntheanine®could not be separated well from each other indicating a similar mechanism of action. The effect of Valium®was projected at a very isolated position far away from the herbal preparations indicating a totally different mechanism of action. Zembrin®and memoLoges®grouped together with respect to the first three discriminant functions, but were different with respect to the 4th to 6th discriminant function. Lasea®as anxiolytic drug and Neurapas®as antidepressive drug were projected at isolated positions indicating their different clinical indications. The results indicate that discriminant analysis of human quantitative EEG data allows for unique pharmacological description of individual effect profiles of herbal drugs.
文摘The bug tracking system is well known as the project support tool of open source software. There are many categorical data sets recorded on the bug tracking system. In the past, many reliability assessment methods have been proposed in the research area of software reliability. Also, there are several software project analyses based on the software effort data such as the earned value management. In particular, the software reliability growth models can </span><span style="font-family:Verdana;">apply to the system testing phase of software development. On the other</span><span style="font-family:Verdana;"> hand, the software effort analysis can apply to all development phase, because the fault data is only recorded on the testing phase. We focus on the big fault data and effort data of open source software. Then, it is difficult to assess by using the typical statistical assessment method, because the data recorded on the bug tracking system is large scale. Also, we discuss the jump diffusion process model based on the estimation method of jump parameters by using the discriminant analysis. Moreover, we analyze actual big fault data to show numerical examples of software effort assessment considering many categorical data set.
文摘We revisit a comparison of two discriminant analysis procedures, namely the linear combination classifier of Chung and Han (2000) and the maximum likelihood estimation substitution classifier for the problem of classifying unlabeled multivariate normal observations with equal covariance matrices into one of two classes. Both classes have matching block monotone missing training data. Here, we demonstrate that for intra-class covariance structures with at least small correlation among the variables with missing data and the variables without block missing data, the maximum likelihood estimation substitution classifier outperforms the Chung and Han (2000) classifier regardless of the percent of missing observations. Specifically, we examine the differences in the estimated expected error rates for these classifiers using a Monte Carlo simulation, and we compare the two classifiers using two real data sets with monotone missing data via parametric bootstrap simulations. Our results contradict the conclusions of Chung and Han (2000) that their linear combination classifier is superior to the MLE classifier for block monotone missing multivariate normal data.
基金The National Natural Science Foundation of China(No.61231002,61273266)the Ph.D.Programs Foundation of Ministry of Education of China(No.20110092130004)
文摘Semi-supervised discriminant analysis SDA which uses a combination of multiple embedding graphs and kernel SDA KSDA are adopted in supervised speech emotion recognition.When the emotional factors of speech signal samples are preprocessed different categories of features including pitch zero-cross rate energy durance formant and Mel frequency cepstrum coefficient MFCC as well as their statistical parameters are extracted from the utterances of samples.In the dimensionality reduction stage before the feature vectors are sent into classifiers parameter-optimized SDA and KSDA are performed to reduce dimensionality.Experiments on the Berlin speech emotion database show that SDA for supervised speech emotion recognition outperforms some other state-of-the-art dimensionality reduction methods based on spectral graph learning such as linear discriminant analysis LDA locality preserving projections LPP marginal Fisher analysis MFA etc. when multi-class support vector machine SVM classifiers are used.Additionally KSDA can achieve better recognition performance based on kernelized data mapping compared with the above methods including SDA.
文摘Air quality is a critical concern for public health and environmental regulation. The Air Quality Index (AQI), a widely adopted index by the US Environmental Protection Agency (EPA), serves as a crucial metric for reporting site-specific air pollution levels. Accurately predicting air quality, as measured by the AQI, is essential for effective air pollution management. In this study, we aim to identify the most reliable regression model among linear discriminant analysis (LDA), quadratic discriminant analysis (QDA), logistic regression, and K-nearest neighbors (KNN). We conducted four different regression analyses using a machine learning approach to determine the model with the best performance. By employing the confusion matrix and error percentages, we selected the best-performing model, which yielded prediction error rates of 22%, 23%, 20%, and 27%, respectively, for LDA, QDA, logistic regression, and KNN models. The logistic regression model outperformed the other three statistical models in predicting AQI. Understanding these models' performance can help address an existing gap in air quality research and contribute to the integration of regression techniques in AQI studies, ultimately benefiting stakeholders like environmental regulators, healthcare professionals, urban planners, and researchers.
基金Supported by the National Basic Research Program of China (2013CB733600), the National Natural Science Foundation of China (21176073), the Doctoral Fund of Ministry of Education of China (20090074110005), the Program for New Century Excellent Talents in University (NCET-09-0346), Shu Guang Project (09SG29) and the Fundamental Research Funds for the Central Universities.
文摘Fault diagnosis and monitoring are very important for complex chemical process. There are numerous methods that have been studied in this field, in which the effective visualization method is still challenging. In order to get a better visualization effect, a novel fault diagnosis method which combines self-organizing map (SOM) with Fisher discriminant analysis (FDA) is proposed. FDA can reduce the dimension of the data in terms of maximizing the separability of the classes. After feature extraction by FDA, SOM can distinguish the different states on the output map clearly and it can also be employed to monitor abnormal states. Tennessee Eastman (TE) process is employed to illustrate the fault diagnosis and monitoring performance of the proposed method. The result shows that the SOM integrated with FDA method is efficient and capable for real-time monitoring and fault diagnosis in complex chemical process.
基金Project(50490274) supported by the National Natural Science Foundation of China
文摘Based on the principle of Mahalanobis distance discriminant analysis (DDA) theory, a stability classification model for mine-lane surrounding rock was established, including six indexes of discriminant factors that reflect the engineering quality of surrounding rock: lane depth below surface, span of lane, ratio of directly top layer thickness to coal thickness, uniaxial comprehensive strength of surrounding rock, development degree coefficient of surrounding rock joint and range of broken surrounding rock zone. A DDA model was obtained through training 15 practical measuring samples. The re-substitution method was introduced to verify the stability of DDA model and the ratio of mis-discrimination is zero. The DDA model was used to discriminate 3 new samples and the results are identical with actual rock kind. Compared with the artificial neural network method and support vector mechanic method, the results show that this model has high prediction accuracy and can be used in practical engineering.
基金Supported by the National Natural Science Foundation of China (No.60421002).
文摘Since there are not enough fault data in historical data sets, it is very difficult to diagnose faults for batch processes. In addition, a complete batch trajectory can be obtained till the end of its operation. In order to overcome the need for estimated or filled up future unmeasured values in the online fault diagnosis, sufficiently utilize the finite information of faults, and enhance the diagnostic performance, an improved multi-model Fisher discriminant analysis is represented. The trait of the proposed method is that the training data sets are made of the current measured information and the past major discriminant information, and not only the current information or the whole batch data. An industrial typical multi-stage streptomycin fermentation process is used to test the performance of fault diagnosis of the proposed method.
基金supported by National Natural Science Foundation of China(60802069,61273270)the Fundamental Research Funds for the Central Universities of China+1 种基金Natural Science Foundation of Guangdong Province(2014A030313173)Science and Technology Program of Guangzhou(2014Y2-00165,2014J4100114,2014J4100095)
基金support from the National Natural Science Foundation of China(Grant No41262001)the Science and Technology Support Fund of Gansu Province(Grant No.1104FKCA116)
文摘As the basal group of Polypodiales, the specific taxonomy of Dicksoniaceae is still being debated. As aquantitative analysis method, numerical taxonomy has been applied to the taxonomic study of many plant families andgenera in recent years due to its simplicity and high accuracy. However, the numerical analysis of the Dicksoniaceae fossilshas not been reported at present. In the present study, the pinnule morphological data of 42 Mesozoic fossil species of theDicksoniaceae were analyzed using cluster analysis, principal component analysis and correlation analysis. The resultsrevealed that 42 taxonomic units could be divided into six representative groups, which are consistent with the traditionaltaxonomy. After screening, an identification key on 28 fossil species of four genera with a definite taxonomic position wasestablished. According to the quantitative analysis, a Bayes discriminant model was established for the selected species.Lastly, the model was tested using the morphological data of the fossil pinnules in Dicksoniaceae from the YaojieFormation, suggesting that the discriminant model is accurate to a certain extent. As a result, the numerical taxonomy canbe applied to the classification of the Dicksoniaceae fossils.
基金The National Natural Science Foundation of China (No.61374194)
文摘A direct linear discriminant analysis algorithm based on economic singular value decomposition (DLDA/ESVD) is proposed to address the computationally complex problem of the conventional DLDA algorithm, which directly uses ESVD to reduce dimension and extract eigenvectors corresponding to nonzero eigenvalues. Then a DLDA algorithm based on column pivoting orthogonal triangular (QR) decomposition and ESVD (DLDA/QR-ESVD) is proposed to improve the performance of the DLDA/ESVD algorithm by processing a high-dimensional low rank matrix, which uses column pivoting QR decomposition to reduce dimension and ESVD to extract eigenvectors corresponding to nonzero eigenvalues. The experimental results on ORL, FERET and YALE face databases show that the proposed two algorithms can achieve almost the same performance and outperform the conventional DLDA algorithm in terms of computational complexity and training time. In addition, the experimental results on random data matrices show that the DLDA/QR-ESVD algorithm achieves better performance than the DLDA/ESVD algorithm by processing high-dimensional low rank matrices.
文摘The identification of liquor brands is very important for food safety. Most of the fake liquors are usually made into the products with the same flavor and alcohol content as regular brand, so the identification for the liquor brands with the same flavor and the same alcohol content is essential. However, it is also difficult because the components of such liquor samples are very similar. Near-infrared (NIR) spectroscopy combined with partial least squares discriminant analysis (PLS-DA) was applied to identification of liquor brands with the same flavor and alcohol content. A total of 160 samples of Luzhou Laojiao liquor and 200 samples of non-Luzhou Laojiao liquor with the same flavor and alcohol content were used for identification. Samples of each type were randomly divided into the modeling and validation sets. The modeling samples were further divided into calibration and prediction sets using the Kennard-Stone algorithm to achieve uniformity and representativeness. In the modeling and validation processes based on PLS-DA method, the recognition rates of samples achieved 99.1% and 98.7%, respectively. The results show high prediction performance for the identification of liquor brands, and were obviously better than those obtained from the principal component linear discriminant analysis method. NIR spectroscopy combined with the PLS-DA method provides a quick and effective means of the discriminant analysis of liquor brands, and is also a promising tool for large-scale inspection of liquor food safety.