Underwater pulse waveform recognition is an important method for underwater object detection. Most existing works focus on applying traditional pattern recognition methods, which ignore the time- and space-varying characteristics of sound propagation channels and cannot easily extract valuable waveform features. Sound propagation channels in seawater are time- and space-varying convolutional channels. When the waveform features of underwater acoustic signals are extracted, high recognition accuracy is achieved by eliminating the influence of these time- and space-varying convolutional channels to the greatest extent possible. We propose a hash aggregate discriminative network (HADN), which combines hash learning and deep learning to minimize the time- and space-varying effects of convolutional channels and adaptively learns effective underwater waveform features to achieve high-accuracy underwater pulse waveform recognition. In the extraction of the hash features of acoustic signals, a discrete constraint between clusters within a hash feature class is introduced. This constraint ensures that the influence of convolutional channels on the hash features is minimized. In addition, we design a new loss function called aggregate discriminative loss (AD-loss). Using AD-loss together with softmax loss increases the discriminativeness of the learned hash features. Experimental results show that on pool and ocean datasets, which were collected in pools and oceans, respectively, using acoustic collectors, the proposed HADN outperforms the compared models in terms of accuracy and mAP.
This paper extends the misclassification ratio criterion for discriminant models and presents a new method for selecting a discriminant model. To select the discriminant model, the method establishes a misclassification degree ratio rule from the misclassification ratio of the discriminant model and the misclassification degree of the samples. To test the method, seven UCI data sets are used. Numerical experiments on these examples indicate that the method is reasonable to a certain extent and selects discriminant models more effectively.
Human-human interaction recognition is crucial in computer vision fields such as surveillance, human-computer interaction, and social robotics. It enhances a system's ability to interpret and respond to human behavior precisely. This research focuses on recognizing human interaction behaviors from a static image, which is challenging because of the complexity of diverse actions. The overall purpose of this study is to develop a robust and accurate system for human interaction recognition. The research presents a novel image-based human interaction recognition method using a Hidden Markov Model (HMM). The technique employs hue, saturation, and intensity (HSI) color transformation to enhance colors in video frames, making them more vibrant, especially in low-contrast or washed-out scenes. Gaussian filters reduce noise and smooth imperfections, followed by silhouette extraction using a statistical method. Feature extraction uses the Features from Accelerated Segment Test (FAST) and Oriented FAST and Rotated BRIEF (ORB) techniques. Quadratic Discriminant Analysis (QDA) is applied for feature fusion and discrimination, which allows high-dimensional data to be analyzed effectively and further improves classification. It ensures that the final features passed to the HMM classifier accurately represent the relevant human activities. Accuracy rates of 93% and 94.6% on the BIT-Interaction and UT-Interaction datasets, respectively, highlight the success and reliability of the proposed technique. The proposed approach addresses challenges in various domains by focusing on frame improvement, silhouette and feature extraction, feature fusion, and HMM classification. This improves data quality, accuracy, adaptability, and reliability, and reduces errors.
The recognition of pathological voice is considered a difficult task in speech analysis. Moreover, otolaryngologists need to rely on oral communication with patients to discover traces of voice pathologies such as dysphonia, which is caused by alteration of the vocal folds, and their accuracy is between 60% and 70%. To enhance the accuracy and increase the processing speed of dysphonia detection, a novel approach is proposed in this paper. We leverage Linear Discriminant Analysis (LDA) to train multiple Machine Learning (ML) models for dysphonia detection. Several ML models, including Support Vector Machine (SVM), Logistic Regression, and K-nearest neighbor (K-NN), are used to predict voice pathologies from features such as Mel-Frequency Cepstral Coefficients (MFCC), Fundamental Frequency (F0), Shimmer (%), Jitter (%), and Harmonic to Noise Ratio (HNR). The experiments were performed using the Saarbrucken Voice Database (SVD) and a privately collected dataset. K-fold cross-validation was incorporated to increase the robustness and stability of the ML models. According to the experimental results, the proposed approach achieves a 70% increase in processing speed over Principal Component Analysis (PCA) and a recognition accuracy of 95.24% on the SVD dataset, surpassing the previous best accuracy of 82.37%. On the private dataset, the proposed method achieved an accuracy of 93.37%. It can be an effective non-invasive method for detecting dysphonia.
To reduce the cost and increase the efficiency of plant genetic marker fingerprinting for variety discrimination, it is desirable to identify the optimal marker combinations. We describe a marker combination screening model based on the genetic algorithm (GA) and implemented in a software tool, Loci Scan. Among multiple fitness functions, ratio-based variety discrimination power provided the largest optimization space. Among GA parameters, increasing the population size and the number of generations enlarged the optimization depth but also the calculation workload. An exhaustive algorithm afforded the same optimization depth as the GA but vastly increased calculation time. In comparison with two other software tools, Loci Scan accommodated missing data, reduced calculation time, and offered more fitness functions. In large datasets, the sample size of the training data exerted the strongest influence on calculation time, whereas the marker size of the training data showed no effect, and the target marker number had a limited effect on analysis speed.
The objective of this study is to investigate methods for soil liquefaction discrimination. Typically, predicting soil liquefaction potential involves conducting the standard penetration test (SPT), which requires field testing and can be time-consuming and labor-intensive. In contrast, the cone penetration test (CPT) is more convenient and offers detailed and continuous information about soil layers. In this study, a feature matrix based on CPT data is proposed to predict the SPT blow count N. The feature matrix comprises the CPT characteristic parameters at specific depths, such as tip resistance qc, sleeve resistance fs, and depth H. To fuse the features in the matrix, a convolutional neural network (CNN) is employed for feature extraction. Additionally, a genetic algorithm (GA) is used to obtain the best combination of convolutional kernels and number of neurons. The robustness of the proposed model was evaluated using multiple engineering field data sets. Results demonstrated that the proposed model outperformed conventional methods in predicting N values for various soil categories, including sandy silt, silty sand, and clayey silt. Finally, the proposed model was employed for liquefaction discrimination. Liquefaction discrimination based on the predicted N values was compared with that based on the measured N values, and the two agreed in 75% of cases. The study has practical application value for foundation liquefaction engineering, and the method adopted provides new ideas for research in related fields.
Intelligent diagnosis of mechanical faults driven by big data is an important means of ensuring the safe operation of equipment. Among these methods, deep learning-based machinery fault diagnosis approaches have received increasing attention and achieved some results. However, using transfer learning alone may give insufficient performance, and domain bias may cause misclassification of target samples when deep models are built to learn domain-invariant features. To address these problems, a deep discriminative adversarial domain adaptation neural network (DDADAN) is proposed for bearing fault diagnosis. In this method, the raw vibration data are first converted into frequency-domain data by the Fast Fourier Transform, and an improved deep convolutional neural network with wide first-layer kernels is used as a feature extractor to extract deep fault features. Then, domain-invariant features are learned from the fault data with correlation alignment-based domain adversarial training. Furthermore, to enhance the discriminative property of the features, discriminative feature learning is embedded into the network to make the features compact within each class and separable between classes. Finally, the performance and anti-noise capability of the proposed method are evaluated using two bearing fault datasets. The results demonstrate that the proposed method can handle the domain offset caused by different working conditions and maintains more than 97.53% accuracy on various transfer tasks. Furthermore, the proposed method achieves high diagnostic accuracy under varying noise levels.
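The first preprocessing step described above, converting a raw vibration record into frequency-domain data, can be sketched as follows. This is only an illustration under stated assumptions: it uses a direct O(N^2) discrete Fourier transform from the standard library, which yields the same coefficients as an FFT but is not the authors' implementation, and the function name `amplitude_spectrum` and the synthetic tone are hypothetical.

```python
import cmath
import math


def amplitude_spectrum(signal):
    """Single-sided amplitude spectrum via a direct O(N^2) DFT.

    An FFT would produce the same coefficients faster; the direct sum is
    used here only to keep the sketch dependency-free.
    """
    n = len(signal)
    spectrum = []
    for k in range(n // 2 + 1):
        coeff = sum(
            x * cmath.exp(-2j * math.pi * k * t / n)
            for t, x in enumerate(signal)
        )
        spectrum.append(abs(coeff) / n)
    return spectrum


# Hypothetical vibration record: a pure tone completing 5 cycles in 64 samples.
N = 64
vibration = [math.sin(2 * math.pi * 5 * t / N) for t in range(N)]
spectrum = amplitude_spectrum(vibration)

# The dominant frequency bin is what a downstream feature extractor would see.
peak_bin = max(range(len(spectrum)), key=spectrum.__getitem__)
```

In a pipeline like the one in the abstract, the resulting spectrum (rather than the raw waveform) would be passed to the convolutional feature extractor.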
Zero-shot learning enables the recognition of samples from new classes by transferring models learned from semantic features and existing sample features to things that have never been seen before. The consistency of different types of features and domain shift are two of the critical issues in zero-shot learning. To address both issues, this paper proposes a new model structure. The traditional approach maps semantic features and visual features into the same feature space; building on this, the proposed model uses a dual discriminator approach, which further enhances the consistency between semantic and visual features. At the same time, this approach aligns unseen-class semantic features with training set samples, providing part of the information about the unseen classes. In addition, a new feature fusion method is proposed. This method is equivalent to adding a perturbation to the seen-class features, which reduces the degree to which the classification results are biased towards the seen classes. The feature fusion method also provides part of the information about the unseen classes, improving classification accuracy in generalized zero-shot learning and reducing domain bias. The proposed method is validated and compared with other methods on four datasets, and the experimental results show that it achieves promising results.
To detect radioactive substances with low activity levels, an anticoincidence detector and a high-purity germanium (HPGe) detector are typically used simultaneously to suppress the Compton scattering background, resulting in an extremely low detection limit and improved measurement accuracy. However, the complex and expensive hardware required hinders the application and promotion of this method. Thus, a method is proposed in this study to discriminate the digital waveform of the pulse signals output by an HPGe detector, whereby the Compton scattering background is suppressed and a low minimum detectable activity (MDA) is achieved without an expensive and complex anticoincidence detector and device. The electric-field-strength and energy-deposition distributions of the detector are simulated to determine the relationship between pulse shape and energy-deposition location, as well as the characteristics of the energy-deposition distributions for full- and partial-energy deposition events. This relationship is used to develop a pulse-shape-discrimination algorithm based on an artificial neural network for pulse-feature identification. To accurately determine the relationship between the deposited energy of gamma (γ) rays in the detector and the deposition location, we extract four shape parameters from the pulse signals output by the detector. The four shape parameters are used as inputs to the machine-learning model. Subsequently, the pulse signals are identified and classified to discriminate between partial- and full-energy deposition events. Some partial-energy deposition events are removed to suppress Compton scattering. The proposed method effectively decreases the MDA of an HPGe γ-energy dispersive spectrometer. Test results show that the Compton suppression factors for energy spectra obtained from measurements on ¹⁵²Eu, ¹³⁷Cs, and ⁶⁰Co radioactive sources are 1.13 (344 keV), 1.11 (662 keV), and 1.08 (1332 keV), respectively, and that the corresponding MDAs are 1.4%, 5.3%, and 21.6% lower, respectively.
Orthogonal Time Frequency and Space (OTFS) modulation is expected to provide high-speed and ultra-reliable communications for emerging mobile applications, including low-orbit satellite communications. Using the Doppler frequency for positioning is a promising research direction in the integration of communication and navigation. To tackle the high Doppler frequency and low signal-to-noise ratio (SNR) in satellite communication, this paper proposes a Red and Blue Frequency Shift Discriminator (RBFSD) based on the pseudo-noise (PN) sequence. The paper derives that the cross-correlation function in the Doppler domain exhibits the characteristic of a sinc function. Therefore, it applies modulation in the Delay-Doppler domain using a PN sequence and adjusts the Doppler frequency estimate by red-shifting or blue-shifting. Simulation results show that the performance of Doppler frequency estimation is close to the Cramér-Rao Lower Bound when the SNR is greater than -15 dB. The complexity of the proposed algorithm is about 1/D of that of the existing PN pilot sequence algorithm, where D is the resolution of the fractional Doppler.
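The sinc-like shape of the correlation under a Doppler offset can be illustrated with a much simpler time-domain model than the Delay-Doppler derivation in the paper. The sketch below assumes a hypothetical ±1 chip sequence (not a true PN sequence from the paper) and a normalized frequency offset in cycles per sample. Because each chip squares to 1, the correlation with a frequency-shifted copy reduces to a geometric sum whose magnitude is the Dirichlet kernel |sin(πfN) / (N sin(πf))|, which is close to |sinc(fN)| for small f. The function names are illustrative only.

```python
import cmath
import math


def doppler_correlation(chips, offset):
    """Magnitude of the normalized correlation between `chips` and a copy
    of itself rotated by `offset` cycles per sample (simplified model)."""
    n = len(chips)
    acc = sum(
        chip * chip * cmath.exp(2j * math.pi * offset * i)
        for i, chip in enumerate(chips)
    )
    return abs(acc) / n


def dirichlet(offset, n):
    """Closed form |sin(pi*f*N) / (N*sin(pi*f))| of the same sum."""
    if offset == 0:
        return 1.0
    return abs(math.sin(math.pi * offset * n) / (n * math.sin(math.pi * offset)))


# Hypothetical 32-chip +/-1 sequence, used only for illustration.
chips = [1, -1, 1, 1, -1, -1, 1, -1] * 4

# Compare the numeric correlation with the closed form at a few offsets.
pairs = [
    (doppler_correlation(chips, f), dirichlet(f, len(chips)))
    for f in (0.0, 0.005, 0.01, 0.02)
]
```

The main lobe peaks at zero offset and first vanishes at an offset of 1/N, which is the property a frequency discriminator can exploit by comparing correlations at slightly lower ("red") and higher ("blue") trial frequencies.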
Given the prominence and magnitude of airport incentive schemes, it is surprising that the literature has so far remained silent on their effectiveness. In this paper, the relationship between airport incentive schemes and the route development behavior of airlines is analyzed. Because findings in the extant literature on the variables that influence whether an airport attracts airlines are rare and often controversial, expert interviews are used as a complement to formulate testable hypotheses. A fixed effects regression model is used to test the hypotheses with a dataset that covers all seat capacity offered at the 22 largest German commercial airports in week 46 of each year from 2004 to 2011. It is found that incentives from both primary choice and secondary choice airports have a significant influence on Low Cost Carriers. Furthermore, Low Cost Carriers in general do not leave either type of airport when the incentives cease. For Network Carriers, no case is found in which a carrier joins a primary choice airport and receives an incentive. Because data on Network Carriers at secondary choice airports after incentives have ceased are insufficient, no statement can be made for that case.
Most face recognition techniques have been successful in dealing with high-resolution (HR) frontal face images. However, real-world face recognition systems are often confronted with low-resolution (LR) face images with pose and illumination variations. This is a very challenging issue, especially under the constraint of using only a single gallery image per person. To address the problem, we propose a novel approach called coupled kernel-based enhanced discriminant analysis (CKEDA). CKEDA aims to simultaneously project the features from LR non-frontal probe images and HR frontal gallery images into a common space where the discrimination property is maximized. The proposed approach has four advantages: 1) by using an appropriate kernel function, the data become linearly separable, which is beneficial for recognition; 2) inspired by linear discriminant analysis (LDA), we integrate multiple discriminant factors into our objective function to enhance the discrimination property; 3) we use the gallery extended trick to improve recognition performance for the single gallery image per person problem; 4) our approach can address the problem of matching LR non-frontal probe images with HR frontal gallery images, which is difficult for most existing face recognition techniques. Experimental evaluation on the Multi-PIE dataset demonstrates the highly competitive performance of our algorithm.
As the basal group of Polypodiales, the specific taxonomy of Dicksoniaceae is still being debated. As a quantitative analysis method, numerical taxonomy has been applied to the taxonomic study of many plant families and genera in recent years because of its simplicity and high accuracy. However, numerical analysis of Dicksoniaceae fossils has not yet been reported. In the present study, the pinnule morphological data of 42 Mesozoic fossil species of the Dicksoniaceae were analyzed using cluster analysis, principal component analysis, and correlation analysis. The results revealed that the 42 taxonomic units could be divided into six representative groups, which are consistent with the traditional taxonomy. After screening, an identification key for 28 fossil species of four genera with a definite taxonomic position was established. Based on the quantitative analysis, a Bayes discriminant model was established for the selected species. Lastly, the model was tested using the morphological data of fossil Dicksoniaceae pinnules from the Yaojie Formation, suggesting that the discriminant model is accurate to a certain extent. As a result, numerical taxonomy can be applied to the classification of Dicksoniaceae fossils.
The identification of liquor brands is very important for food safety. Most fake liquors are made to have the same flavor and alcohol content as a regular brand, so identifying liquor brands with the same flavor and the same alcohol content is essential. However, it is also difficult because the components of such liquor samples are very similar. Near-infrared (NIR) spectroscopy combined with partial least squares discriminant analysis (PLS-DA) was applied to the identification of liquor brands with the same flavor and alcohol content. A total of 160 samples of Luzhou Laojiao liquor and 200 samples of non-Luzhou Laojiao liquor with the same flavor and alcohol content were used for identification. Samples of each type were randomly divided into modeling and validation sets. The modeling samples were further divided into calibration and prediction sets using the Kennard-Stone algorithm to achieve uniformity and representativeness. In the modeling and validation processes based on the PLS-DA method, the recognition rates reached 99.1% and 98.7%, respectively. The results show high prediction performance for the identification of liquor brands and were clearly better than those obtained with the principal component linear discriminant analysis method. NIR spectroscopy combined with the PLS-DA method provides a quick and effective means of discriminant analysis of liquor brands and is a promising tool for large-scale inspection of liquor food safety.
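The Kennard-Stone division of modeling samples mentioned above can be sketched as follows. This is a minimal standard-library version of the usual Kennard-Stone rule (start from the two most distant samples, then repeatedly add the sample whose nearest already-selected neighbour is farthest away), assuming Euclidean distance; the abstract does not state the distance metric or set sizes, and the toy data and the name `kennard_stone` are hypothetical.

```python
import math


def kennard_stone(samples, n_select):
    """Return indices of `n_select` samples chosen by the Kennard-Stone rule."""
    n = len(samples)
    if not 2 <= n_select <= n:
        raise ValueError("n_select must be between 2 and len(samples)")
    dist = [[math.dist(a, b) for b in samples] for a in samples]

    # Start with the two samples that are farthest apart.
    first, second = max(
        ((i, j) for i in range(n) for j in range(i + 1, n)),
        key=lambda pair: dist[pair[0]][pair[1]],
    )
    selected = [first, second]
    remaining = [i for i in range(n) if i not in selected]

    # Repeatedly add the sample whose nearest selected neighbour is farthest.
    while len(selected) < n_select:
        best = max(remaining, key=lambda i: min(dist[i][s] for s in selected))
        selected.append(best)
        remaining.remove(best)
    return selected


# Toy one-dimensional "spectra" (hypothetical values, not from the paper).
toy = [(0.0,), (1.0,), (2.0,), (9.0,), (10.0,)]
calibration_idx = kennard_stone(toy, 3)
prediction_idx = [i for i in range(len(toy)) if i not in calibration_idx]
```

The selected indices would form the calibration set and the rest the prediction set, so the calibration samples span the measured range evenly instead of clustering where samples happen to be dense.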
OBJECTIVE: To estimate the operative mortality in patients with malignant obstructive jaundice. METHODS: Twelve risk factors were analyzed using multivariate discriminant analysis in 90 patients who had been operated on. RESULTS: Operative mortality was significantly related to the following factors: age, duration of jaundice, packed RBC volume, white blood cell (WBC) count, and concentration of blood urea nitrogen; it was not significantly related to the disease or the type of operation. The following formula was obtained: packed RBC volume × 0.09954 - age × 0.04018 - blood urea nitrogen × 0.23693 - duration of jaundice × 2.07388 - WBC count × 0.21118 + 5.26593. With this formula, an operative mortality of 77.8% was predicted. CONCLUSION: With a positive value from the formula, the patient should be operated on; otherwise non-operative treatment is advocated.
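The discriminant function quoted above is a plain linear score, and evaluating it can be sketched as below. The coefficients and the positive-value rule are taken from the abstract; everything else is an assumption made for illustration only. In particular, the abstract does not give the units of any variable, so the example inputs are hypothetical numbers with no clinical meaning, and the function names are not from the source.

```python
# Coefficients copied from the abstract. The units of each variable are NOT
# stated there, so any numeric inputs used with this sketch are hypothetical.
COEFFS = {
    "packed_rbc_volume": 0.09954,
    "age": -0.04018,
    "blood_urea_nitrogen": -0.23693,
    "jaundice_duration": -2.07388,
    "wbc_count": -0.21118,
}
INTERCEPT = 5.26593


def discriminant_score(packed_rbc_volume, age, blood_urea_nitrogen,
                       jaundice_duration, wbc_count):
    """Evaluate the linear discriminant function quoted in the abstract."""
    values = {
        "packed_rbc_volume": packed_rbc_volume,
        "age": age,
        "blood_urea_nitrogen": blood_urea_nitrogen,
        "jaundice_duration": jaundice_duration,
        "wbc_count": wbc_count,
    }
    return INTERCEPT + sum(COEFFS[name] * values[name] for name in COEFFS)


def favours_operation(score):
    """Decision rule from the abstract: a positive value favours operating."""
    return score > 0


# Hypothetical inputs chosen only to exercise the arithmetic.
example_score = discriminant_score(40, 60, 5, 1, 8)
```

With these placeholder inputs the score works out to about 1.889, which falls on the positive side of the rule; the signs of the coefficients show that only packed RBC volume raises the score, while the other four factors lower it.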
Fetal distress is one of the main reasons for cesarean section in obstetrics and gynecology. If the fetus lacks oxygen in the uterus, fetal health is threatened and fetal death can occur. Cardiotocography (CTG) is the most widely used technique for monitoring fetal health, and the fetal heart rate (FHR) is an important index for identifying the occurrence of fetal distress. This study proposes discriminant analysis (DA), decision tree (DT), and artificial neural network (ANN) models to evaluate fetal distress. The results show that the accuracies of DA, DT, and ANN are 82.1%, 86.36%, and 97.78%, respectively.
Partial least squares discriminant analysis (PLS-DA) with integrated moving-window (MW) waveband screening was applied to the discriminant analysis of liquor brands with near-infrared (NIR) spectroscopy. Luzhou Laojiao, a popular liquor with a strong fragrant flavor, was used as the brand to be identified (160 samples, negative, 52% vol). Liquors of 10 other brands with a strong fragrant flavor were used as the interfering brands (200 samples, positive, 52% vol). The Kennard-Stone algorithm was used to divide the modeling samples to achieve uniformity and representativeness. Based on MW-PLS-DA, a simplified optimal model set with 157 wavebands was further proposed. This set contained five types of wavebands corresponding to the NIR absorption bands of water, ethanol, and other micronutrients (i.e., acids, aldehydes, phenols, and aromatic compounds) in liquor for practical choice. Using five selected simple models with wavebands of 4775-4239, 7804-6569, 6264-5844, 9435-7896, and 12066-10373 cm⁻¹, validation recognition rates of 99.3% or higher were obtained. The results show good prediction performance and low model complexity and provide a valuable reference for designing small dedicated instruments. The proposed method is a promising tool for large-scale inspection of liquor food safety.
Considering the limitations of Linear Discriminant Analysis (LDA) and Marginal Fisher Analysis (MFA), a novel discriminant analysis method called Local Correlation Discriminant Analysis (LCDA) is proposed in this paper. The main idea behind LCDA is to use a more robust similarity measure, the correlation metric, to measure the local similarity between image data. This results in better classification performance. In addition, to further improve the discriminant power of LCDA, we extend it to the semi-supervised case, which can make use of both labeled and unlabeled data to perform discriminant analysis. Extensive experimental results on the ORL and AR face databases demonstrate that the proposed LCDA and its semi-supervised version are superior to Principal Component Analysis (PCA), LDA, CEA, and MFA.
High-end wine brands are made from high-quality grape varieties and yeast strains through a unique process. Such wine is not only rich in nutrients but also has a unique taste and a fragrant scent. Brand identification of wine is difficult and complex because of the high similarity between wines. In this paper, visible and near-infrared (Vis-NIR) spectroscopy combined with partial least squares discriminant analysis (PLS-DA) was used to explore the feasibility of wine brand identification. Chilean Aoyo wine (2016 vintage) was selected as the brand to be identified (negative, 100 samples), and various other brands of wine were used as interfering brands (positive, 373 samples). Samples of each type were randomly divided into calibration, prediction, and validation sets. For comparison, PLS-DA models were established in three individual wavebands and two combined wavebands: visible (400-780 nm), short-NIR (780-1100 nm), long-NIR (1100-2498 nm), whole NIR (780-2498 nm), and whole scanning (400-2498 nm). In independent validation, all five models achieved good discriminant effects. Among them, the visible region model performed best, with validation recognition accuracy rates of 100%, 95.6%, and 97.5% for negative, positive, and total samples, respectively. The results indicate the feasibility of wine brand identification with Vis-NIR spectroscopy.
Funding (HADN underwater pulse waveform recognition study): partially supported by the National Key Research and Development Program of China (No. 2018AAA0100400), the Natural Science Foundation of Shandong Province (Nos. ZR2020MF131 and ZR2021ZD19), and the Science and Technology Program of Qingdao (No. 21-1-4-ny-19-nsh).
Funding (discriminant model selection study): supported by the National Natural Science Foundation of China (52070119) and the Key Laboratory of Financial Mathematics of Fujian Province University (Putian University) (JR201801).
Funding (human-human interaction recognition study): funding this work under the Research Group Funding Program Grant Code (NU/RG/SERC/12/6); supported via funding from Prince Satam bin Abdulaziz University Project Number (PSAU/2023/R/1444); Princess Nourah bint Abdulrahman University Researchers Supporting Project Number (PNURSP2023R348), Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia; and this work was also supported by the Ministry of Science and ICT (MSIT), South Korea, through the ICT Creative Consilience Program supervised by the Institute for Information and Communications Technology Planning and Evaluation (IITP) under Grant IITP-2023-2020-0-01821.
Abstract: Human-human interaction recognition is crucial in computer vision fields like surveillance, human-computer interaction, and social robotics. It enhances systems' ability to interpret and respond to human behavior precisely. This research focuses on recognizing human interaction behaviors using a static image, which is challenging due to the complexity of diverse actions. The overall purpose of this study is to develop a robust and accurate system for human interaction recognition. This research presents a novel image-based human interaction recognition method using a Hidden Markov Model (HMM). The technique employs hue, saturation, and intensity (HSI) color transformation to enhance colors in video frames, making them more vibrant and visually appealing, especially in low-contrast or washed-out scenes. Gaussian filters reduce noise and smooth imperfections, followed by silhouette extraction using a statistical method. Feature extraction uses the Features from Accelerated Segment Test (FAST) and Oriented FAST and Rotated BRIEF (ORB) techniques. The application of Quadratic Discriminant Analysis (QDA) for feature fusion and discrimination enables high-dimensional data to be effectively analyzed, thus further enhancing the classification process. It ensures that the final features loaded into the HMM classifier accurately represent the relevant human activities. The accuracy rates of 93% and 94.6% achieved on the BIT-Interaction and UT-Interaction datasets, respectively, highlight the success and reliability of the proposed technique. The proposed approach addresses challenges in various domains by focusing on frame improvement, silhouette and feature extraction, feature fusion, and HMM classification. This enhances data quality, accuracy, adaptability, and reliability, and reduces errors.
Abstract: The recognition of pathological voice is considered a difficult task for speech analysis. Moreover, otolaryngologists need to rely on oral communication with patients to discover traces of voice pathologies like dysphonia, which are caused by alterations of the vocal folds, and their accuracy is between 60% and 70%. To enhance the detection accuracy and increase the processing speed of dysphonia detection, a novel approach is proposed in this paper. We have leveraged Linear Discriminant Analysis (LDA) to train multiple Machine Learning (ML) models for dysphonia detection. Several ML models, such as Support Vector Machine (SVM), Logistic Regression, and K-nearest neighbor (K-NN), are utilized to predict the voice pathologies based on features like Mel-Frequency Cepstral Coefficients (MFCC), Fundamental Frequency (F0), Shimmer (%), Jitter (%), and Harmonic to Noise Ratio (HNR). The experiments were performed using the Saarbrucken Voice Database (SVD) and a privately collected dataset. The K-fold cross-validation approach was incorporated to increase the robustness and stability of the ML models. According to the experimental results, our proposed approach has a 70% increase in processing speed over Principal Component Analysis (PCA) and achieves a recognition accuracy of 95.24% on the SVD dataset, surpassing the previous best accuracy of 82.37%. On the private dataset, our proposed method achieved an accuracy rate of 93.37%. It can be an effective non-invasive method to detect dysphonia.
Funding: Supported by the Scientific and Technological Innovation 2030 Major Project (2022ZD04019), the Science and Technology Innovation Capacity Building Project of BAAFS (KJCX20230303), the Hainan Province Science and Technology Special Fund (ZDYF2023XDNY077), and the Beijing Scholars Program (BSP041).
Abstract: To reduce the cost and increase the efficiency of plant genetic marker fingerprinting for variety discrimination, it is desirable to identify the optimal marker combinations. We describe a marker combination screening model based on the genetic algorithm (GA) and implemented in a software tool, Loci Scan. Ratio-based variety discrimination power provided the largest optimization space among multiple fitness functions. Among GA parameters, an increase in population size and generation number enlarged optimization depth but also calculation workload. The exhaustive algorithm afforded the same optimization depth as GA but vastly increased calculation time. In comparison with two other software tools, Loci Scan accommodated missing data, reduced calculation time, and offered more fitness functions. In large datasets, the sample size of training data exerted the strongest influence on calculation time, whereas the marker size of training data showed no effect, and target marker number had limited effect on analysis speed.
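The idea of scoring a marker combination by a ratio-based discrimination power can be sketched as follows. The abstract does not give the exact fitness definition, so this sketch assumes one: the fraction of variety pairs that differ at one or more of the selected markers, with missing calls (`None`) never counted as a difference. The function names, the toy genotype table, and the exhaustive search are all hypothetical illustrations, not the Loci Scan implementation.

```python
from itertools import combinations


def discrimination_power(genotypes, markers):
    """Fraction of variety pairs separated by at least one selected marker."""
    names = sorted(genotypes)
    pairs = list(combinations(names, 2))
    if not pairs:
        return 0.0
    separated = 0
    for a, b in pairs:
        for m in markers:
            ga, gb = genotypes[a][m], genotypes[b][m]
            # Assumption: a missing call (None) is never counted as a difference.
            if ga is not None and gb is not None and ga != gb:
                separated += 1
                break
    return separated / len(pairs)


def best_combination(genotypes, n_markers, k):
    """Exhaustively score every k-marker subset (feasible only for small inputs)."""
    return max(combinations(range(n_markers), k),
               key=lambda combo: discrimination_power(genotypes, combo))


# Hypothetical toy data: four varieties typed at four markers.
genotypes = {
    "V1": ["A", "C", "G", None],
    "V2": ["A", "T", "G", "T"],
    "V3": ["G", "C", "G", "T"],
    "V4": ["G", "T", "A", "C"],
}
best = best_combination(genotypes, n_markers=4, k=2)
```

The exhaustive search grows combinatorially with the number of markers, which is consistent with the abstract's remark that it matches the GA in depth but costs far more time; a GA would replace `best_combination` with a population-based search over the same fitness function.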
Funding: the Center University (Grant No. B220202013); Qinglan Project of Jiangsu Province (2022).
Abstract: The objective of this study is to investigate methods for soil liquefaction discrimination. Typically, predicting soil liquefaction potential involves conducting the standard penetration test (SPT), which requires field testing and can be time-consuming and labor-intensive. In contrast, the cone penetration test (CPT) provides a more convenient method and offers detailed and continuous information about soil layers. In this study, a feature matrix based on CPT data is proposed to predict the standard penetration test blow count N. The feature matrix comprises the CPT characteristic parameters at specific depths, such as tip resistance q_c, sleeve resistance f_s, and depth H. To fuse the features in the matrix, a convolutional neural network (CNN) is employed for feature extraction. Additionally, a Genetic Algorithm (GA) is utilized to obtain the best combination of convolutional kernels and the number of neurons. The study evaluated the robustness of the proposed model using multiple engineering field data sets. Results demonstrated that the proposed model outperformed conventional methods in predicting N values for various soil categories, including sandy silt, silty sand, and clayey silt. Finally, the proposed model was employed for liquefaction discrimination. The liquefaction discrimination based on the predicted N values was compared with that based on the measured N values, and the results were in 75% agreement. The study has important practical application value for foundation liquefaction engineering. Also, the novel method adopted in this research provides new ideas and methods for research in related fields, which is of great academic significance.
Funding: the Natural Science Foundation of Henan Province (232300420094); the Science and Technology Research Project of Henan Province (222102220092).
Abstract: Intelligent diagnosis of mechanical faults driven by big data is an important means to ensure the safe operation of equipment. Among these methods, deep learning-based machinery fault diagnosis approaches have received increasing attention and achieved some results. However, using transfer learning alone may lead to insufficient performance, and domain bias may cause misclassification of target samples when deep models are built to learn domain-invariant features. To address these problems, a deep discriminative adversarial domain adaptation neural network (DDADAN) for bearing fault diagnosis is proposed. In this method, the raw vibration data are first converted into frequency-domain data by the Fast Fourier Transform, and an improved deep convolutional neural network with wide first-layer kernels is used as a feature extractor to extract deep fault features. Then, domain-invariant features are learned from the fault data with correlation alignment-based domain adversarial training. Furthermore, to enhance the discriminative property of the features, discriminative feature learning is embedded into this network to make the features compact within each class and separable between classes. Finally, the performance and anti-noise capability of the proposed method are evaluated using two bearing fault datasets. The results demonstrate that the proposed method is capable of handling the domain offset caused by different working conditions and maintaining more than 97.53% accuracy on various transfer tasks. Furthermore, the proposed method can achieve high diagnostic accuracy under varying noise levels.
Abstract: Zero-shot learning enables the recognition of new class samples by migrating models learned from semantic features and existing sample features to things that have never been seen before. The consistency of different types of features and domain shift are two of the critical issues in zero-shot learning. To address both of these issues, this paper proposes a new modeling structure. The traditional approach maps semantic features and visual features into the same feature space; building on this, a dual discriminator approach is used in the proposed model. This dual discriminator approach can further enhance the consistency between semantic and visual features. At the same time, this approach can also align unseen class semantic features and training set samples, providing a portion of information about the unseen classes. In addition, a new feature fusion method is proposed in the model. This method is equivalent to adding perturbation to the seen class features, which can reduce the degree to which the classification results of the model are biased towards the seen classes. At the same time, this feature fusion method can provide part of the information of the unseen classes, improving classification accuracy in generalized zero-shot learning and reducing domain bias. The proposed method is validated and compared with other methods on four datasets, and the experimental results show that it achieves promising results.
Funding: This work was supported by the National Key R&D Program of China (Nos. 2022YFF0709503, 2022YFB1902700, 2017YFC0602101), the Key Research and Development Program of Sichuan province (No. 2023YFG0347), and the Key Research and Development Program of Sichuan province (No. 2020ZDZX0007).
Abstract: To detect radioactive substances with low activity levels, an anticoincidence detector and a high-purity germanium (HPGe) detector are typically used simultaneously to suppress Compton scattering background, thereby resulting in an extremely low detection limit and improving the measurement accuracy. However, the complex and expensive hardware required does not facilitate the application or promotion of this method. Thus, a method is proposed in this study to discriminate the digital waveform of pulse signals output using an HPGe detector, whereby Compton scattering background is suppressed and a low minimum detectable activity (MDA) is achieved without using an expensive and complex anticoincidence detector and device. The electric-field-strength and energy-deposition distributions of the detector are simulated to determine the relationship between pulse shape and energy-deposition location, as well as the characteristics of energy-deposition distributions for full- and partial-energy deposition events. This relationship is used to develop a pulse-shape-discrimination algorithm based on an artificial neural network for pulse-feature identification. To accurately determine the relationship between the deposited energy of gamma (γ) rays in the detector and the deposition location, we extract four shape parameters from the pulse signals output by the detector. Machine learning is used to input the four shape parameters into the detector. Subsequently, the pulse signals are identified and classified to discriminate between partial- and full-energy deposition events. Some partial-energy deposition events are removed to suppress Compton scattering. The proposed method effectively decreases the MDA of an HPGe γ-energy dispersive spectrometer. Test results show that the Compton suppression factors for energy spectra obtained from measurements on ^(152)Eu, ^(137)Cs, and ^(60)Co radioactive sources are 1.13 (344 keV), 1.11 (662 keV), and 1.08 (1332 keV), respectively, and that the corresponding MDAs are 1.4%, 5.3%, and 21.6% lower, respectively.
Abstract: Orthogonal Time Frequency and Space (OTFS) modulation is expected to provide high-speed and ultra-reliable communications for emerging mobile applications, including low-orbit satellite communications. Using the Doppler frequency for positioning is a promising research direction on communication and navigation integration. To tackle the high Doppler frequency and low signal-to-noise ratio (SNR) in satellite communication, this paper proposes a Red and Blue Frequency Shift Discriminator (RBFSD) based on the pseudo-noise (PN) sequence. The paper derives that the cross-correlation function on the Doppler domain exhibits the characteristic of a Sinc function. Therefore, it applies modulation onto the Delay-Doppler domain using PN sequence and adjusts Doppler frequency estimation by red-shifting or blue-shifting. Simulation results show that the performance of Doppler frequency estimation is close to the Cramér-Rao Lower Bound when the SNR is greater than -15 dB. The proposed algorithm is about 1/D times less complex than the existing PN pilot sequence algorithm, where D is the resolution of the fractional Doppler.
Abstract: Given the prominence and magnitude of airport incentive schemes, it is surprising that the literature has so far remained silent as to their effectiveness. In this paper, the relationship between airport incentive schemes and the route development behavior of airlines is analyzed. Because findings in the extant literature on the variables that influence an airport's ability to attract airlines are rare and often controversial, expert interviews are used as a complement to formulate testable hypotheses in this regard. A fixed effects regression model is used to test the hypotheses with a dataset that covers all seat capacity offered at the 22 largest German commercial airports in week 46 from 2004 to 2011. It is found that incentives from primary choice as well as secondary choice airports have a significant influence on Low Cost Carriers. Furthermore, Low Cost Carriers, in general, do not leave either type of airport when the incentives cease. In the case of Network Carriers, no case is found where one joins a primary choice airport and receives an incentive. Insufficient data on Network Carriers at secondary choice airports after incentives have ceased means that no statement can be given.
Funding: Supported by the National Natural Science Foundation of China (60802069, 61273270), the Fundamental Research Funds for the Central Universities of China, the Natural Science Foundation of Guangdong Province (2014A030313173), and the Science and Technology Program of Guangzhou (2014Y2-00165, 2014J4100114, 2014J4100095).
Abstract: Most face recognition techniques have been successful in dealing with high-resolution (HR) frontal face images. However, real-world face recognition systems are often confronted with low-resolution (LR) face images with pose and illumination variations. This is a very challenging issue, especially under the constraint of using only a single gallery image per person. To address the problem, we propose a novel approach called coupled kernel-based enhanced discriminant analysis (CKEDA). CKEDA aims to simultaneously project the features from LR non-frontal probe images and HR frontal gallery ones into a common space where the discrimination property is maximized. There are four advantages of the proposed approach: 1) by using the appropriate kernel function, the data becomes linearly separable, which is beneficial for recognition; 2) inspired by linear discriminant analysis (LDA), we integrate multiple discriminant factors into our objective function to enhance the discrimination property; 3) we use the gallery extended trick to improve the recognition performance for the single gallery image per person problem; 4) our approach can address the problem of matching LR non-frontal probe images with HR frontal gallery images, which is difficult for most existing face recognition techniques. Experimental evaluation on the multi-PIE dataset shows highly competitive performance of our algorithm.
Funding: Support from the National Natural Science Foundation of China (Grant No. 41262001) and the Science and Technology Support Fund of Gansu Province (Grant No. 1104FKCA116).
Abstract: As the basal group of Polypodiales, the specific taxonomy of Dicksoniaceae is still being debated. As a quantitative analysis method, numerical taxonomy has been applied to the taxonomic study of many plant families and genera in recent years due to its simplicity and high accuracy. However, the numerical analysis of the Dicksoniaceae fossils has not been reported at present. In the present study, the pinnule morphological data of 42 Mesozoic fossil species of the Dicksoniaceae were analyzed using cluster analysis, principal component analysis and correlation analysis. The results revealed that the 42 taxonomic units could be divided into six representative groups, which are consistent with the traditional taxonomy. After screening, an identification key on 28 fossil species of four genera with a definite taxonomic position was established. According to the quantitative analysis, a Bayes discriminant model was established for the selected species. Lastly, the model was tested using the morphological data of the fossil pinnules in Dicksoniaceae from the Yaojie Formation, suggesting that the discriminant model is accurate to a certain extent. As a result, numerical taxonomy can be applied to the classification of the Dicksoniaceae fossils.
Abstract: The identification of liquor brands is very important for food safety. Most fake liquors are made to have the same flavor and alcohol content as the regular brand, so identifying liquor brands with the same flavor and the same alcohol content is essential. However, it is also difficult because the components of such liquor samples are very similar. Near-infrared (NIR) spectroscopy combined with partial least squares discriminant analysis (PLS-DA) was applied to the identification of liquor brands with the same flavor and alcohol content. A total of 160 samples of Luzhou Laojiao liquor and 200 samples of non-Luzhou Laojiao liquor with the same flavor and alcohol content were used for identification. Samples of each type were randomly divided into the modeling and validation sets. The modeling samples were further divided into calibration and prediction sets using the Kennard-Stone algorithm to achieve uniformity and representativeness. In the modeling and validation processes based on the PLS-DA method, the recognition rates of samples reached 99.1% and 98.7%, respectively. The results show high prediction performance for the identification of liquor brands, and were clearly better than those obtained from the principal component linear discriminant analysis method. NIR spectroscopy combined with the PLS-DA method provides a quick and effective means of discriminant analysis of liquor brands, and is also a promising tool for large-scale inspection of liquor food safety.
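The Kennard-Stone split mentioned above can be sketched in a few lines. This is a minimal version of the standard algorithm, assuming Euclidean distance between spectra: it starts from the two mutually most distant samples and then repeatedly adds the sample whose nearest already-selected neighbor is farthest away. The function name and the toy two-dimensional "spectra" are hypothetical; the abstract does not describe the authors' implementation or distance measure.

```python
import math


def kennard_stone(samples, n_select):
    """Return indices of n_select samples chosen by the Kennard-Stone rule."""
    n = len(samples)
    if not 2 <= n_select <= n:
        raise ValueError("n_select must be between 2 and len(samples)")
    # Pairwise Euclidean distances (assumed metric).
    dist = [[math.dist(a, b) for b in samples] for a in samples]
    # Start with the two mutually most distant samples.
    i0, j0 = max(((i, j) for i in range(n) for j in range(i + 1, n)),
                 key=lambda ij: dist[ij[0]][ij[1]])
    selected = [i0, j0]
    remaining = [i for i in range(n) if i not in selected]
    while len(selected) < n_select:
        # Pick the candidate whose nearest selected sample is farthest away.
        nxt = max(remaining, key=lambda i: min(dist[i][s] for s in selected))
        selected.append(nxt)
        remaining.remove(nxt)
    return selected


# Hypothetical toy "spectra" with two features each.
spectra = [(0.0, 0.0), (0.1, 0.0), (1.0, 0.0), (0.5, 0.0), (0.9, 0.0)]
calibration = kennard_stone(spectra, 3)
prediction = [i for i in range(len(spectra)) if i not in calibration]
```

Here the selected indices would form the calibration set and the rest the prediction set, so the calibration samples cover the spread of the spectra rather than clustering in one region.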
Abstract: OBJECTIVE: To estimate the operative mortality in patients with malignant obstructive jaundice. METHODS: Twelve risk factors were analyzed using multivariate discriminant analysis in 90 patients who had been operated on. RESULTS: Operative mortality was significantly related to the following factors: age, duration of jaundice, packed RBC volume, white blood cell count and concentration of blood urea nitrogen; it was not significantly related to diseases and types of operation. The following formula was obtained: packed RBC volume × 0.09954 - age × 0.04018 - blood urea nitrogen × 0.23693 - duration of jaundice × 2.07388 - WBC count × 0.21118 + 5.26593. With this formula, an operative mortality of 77.8% was predicted. CONCLUSION: With a positive value from the formula, the patient should be operated on; otherwise non-operative treatment is advocated.
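The discriminant formula above is a plain linear score, and its arithmetic can be sketched as follows. The coefficients are copied from the abstract; the function and parameter names are hypothetical, and the abstract does not state the measurement units of any variable, so the input values below are purely illustrative and show only how the sign of the score is obtained.

```python
def discriminant_score(packed_rbc_volume, age, blood_urea_nitrogen,
                       jaundice_duration, wbc_count):
    """Linear discriminant score using the coefficients quoted in the abstract."""
    return (0.09954 * packed_rbc_volume
            - 0.04018 * age
            - 0.23693 * blood_urea_nitrogen
            - 2.07388 * jaundice_duration
            - 0.21118 * wbc_count
            + 5.26593)


# Illustrative inputs only: units are not given in the abstract.
score = discriminant_score(packed_rbc_volume=40, age=60, blood_urea_nitrogen=5,
                           jaundice_duration=1, wbc_count=8)
favors_operation = score > 0  # the abstract's rule: positive value -> operate
```

Every term except packed RBC volume carries a negative coefficient, so under this formula higher age, blood urea nitrogen, jaundice duration, or white blood cell count all push the score toward the non-operative side.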
Abstract: Fetal distress is one of the main causes of cesarean section in obstetrics and gynecology. If the fetus lacks oxygen in the uterus, fetal health is threatened and fetal death can occur. Cardiotocography (CTG) is the most widely used technique for monitoring fetal health, and fetal heart rate (FHR) is an important index for identifying the occurrence of fetal distress. This study proposes using discriminant analysis (DA), decision tree (DT), and artificial neural network (ANN) to evaluate fetal distress. The results show that the accuracies of DA, DT and ANN are 82.1%, 86.36% and 97.78%, respectively.
Abstract: Partial least squares discriminant analysis (PLS-DA) with integrated moving-window (MW) waveband screening was applied to the discriminant analysis of liquor brands with near-infrared (NIR) spectroscopy. Luzhou Laojiao, a popular liquor with strong fragrant flavor, was used as the identified liquor brand (160 samples, negative, 52 vol alcoholicity). Liquors of 10 other brands with strong fragrant flavor were used as the interferential brands (200 samples, positive, 52 vol alcoholicity). The Kennard-Stone algorithm was used for the division of modeling samples to achieve uniformity and representativeness. Based on the MW-PLS-DA, a simplified optimal model set with 157 wavebands was further proposed. This set contained five types of wavebands corresponding to the NIR absorption bands of water, ethanol, and other micronutrients (i.e., acids, aldehydes, phenols, and aromatic compounds) in liquor for practical choice. Using five selected simple models with 4775 - 4239, 7804 - 6569, 6264 - 5844, 9435 - 7896, and 12066 - 10373 cm^-1, validation recognition rates of 99.3% or higher were obtained. The results show good prediction performance and low model complexity, and also provide a valuable reference for designing small dedicated instruments. The proposed method is a promising tool for large-scale inspection of liquor food safety.
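Moving-window waveband screening can be pictured as enumerating every contiguous block of spectral points within a range of widths and scoring a model on each block. The sketch below shows only the enumeration step, under the assumption that a window is defined by a start index and a width; the function name, parameters, and the tiny example are hypothetical, and the PLS-DA fit that would score each window is not shown.

```python
def moving_windows(n_points, min_width, max_width, step=1):
    """Yield (start, stop) index pairs for every candidate waveband."""
    for width in range(min_width, max_width + 1, step):
        for start in range(0, n_points - width + 1, step):
            # stop is exclusive, so spectrum[start:stop] has `width` points.
            yield start, start + width


# Hypothetical example: a spectrum with 6 points, windows of width 2 or 3.
windows = list(moving_windows(n_points=6, min_width=2, max_width=3))
```

In a full screening loop, each `(start, stop)` pair would select the matching spectral columns, a discriminant model would be fitted on them, and the windows would be ranked by recognition rate.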
Funding: Supported by the National Natural Science Foundation of China (No. 60875004), the Natural Science Foundation of Jiangsu Province of China (No. BK2009184), and the Natural Science Foundation of the Jiangsu Higher Education Institutions of China (No. 07KJB520133).
Abstract: Considering the limitations of Linear Discriminant Analysis (LDA) and Marginal Fisher Analysis (MFA), a novel discriminant analysis called Local Correlation Discriminant Analysis (LCDA) is proposed in this paper. The main idea behind LCDA is to use a more robust similarity measure, the correlation metric, to measure the local similarity between image data. This results in better classification performance. In addition, to further improve the discriminant power of LCDA, we extend LCDA to the semi-supervised case, which can make use of both labeled and unlabeled data to perform discriminant analysis. Extensive experimental results on the ORL and AR face databases demonstrate that the proposed LCDA and its semi-supervised version are superior to Principal Component Analysis (PCA), LDA, CEA, and MFA.
Abstract: High-end wine is made from high-quality grape varieties and yeast strains through a unique process. It is not only rich in nutrients but also has a unique taste and a fragrant scent. Brand identification of wine is difficult and complex because of the high similarity between wines. In this paper, visible and near-infrared (NIR) spectroscopy combined with partial least squares discriminant analysis (PLS-DA) was used to explore the feasibility of wine brand identification. Chilean Aoyo wine (2016 vintage) was selected as the identification brand (negative, 100 samples), and various other brands of wine were used as interference brands (positive, 373 samples). Samples of each type were randomly divided into the calibration, prediction and validation sets. For comparison, the PLS-DA models were established in three independent and two complex wavebands: visible (400 - 780 nm), short-NIR (780 - 1100 nm), long-NIR (1100 - 2498 nm), whole NIR (780 - 2498 nm) and whole scanning (400 - 2498 nm). In independent validation, all five models achieved good discriminant effects. Among them, the visible region model achieved the best effect. The validation recognition-accuracy rates for negative, positive and total samples were 100%, 95.6% and 97.5%, respectively. The results indicated the feasibility of wine brand identification with Vis-NIR spectroscopy.