Video streaming applications have grown considerably in recent years.As a result,this becomes one of the most significant contributors to global internet traffic.According to recent studies,the telecommunications indu...Video streaming applications have grown considerably in recent years.As a result,this becomes one of the most significant contributors to global internet traffic.According to recent studies,the telecommunications industry loses millions of dollars due to poor video Quality of Experience(QoE)for users.Among the standard proposals for standardizing the quality of video streaming over internet service providers(ISPs)is the Mean Opinion Score(MOS).However,the accurate finding of QoE by MOS is subjective and laborious,and it varies depending on the user.A fully automated data analytics framework is required to reduce the inter-operator variability characteristic in QoE assessment.This work addresses this concern by suggesting a novel hybrid XGBStackQoE analytical model using a two-level layering technique.Level one combines multiple Machine Learning(ML)models via a layer one Hybrid XGBStackQoE-model.Individual ML models at level one are trained using the entire training data set.The level two Hybrid XGBStackQoE-Model is fitted using the outputs(meta-features)of the layer one ML models.The proposed model outperformed the conventional models,with an accuracy improvement of 4 to 5 percent,which is still higher than the current traditional models.The proposed framework could significantly improve video QoE accuracy.展开更多
Different sedimentary zones in coral reefs lead to significant anisotropy in the pore structure of coral reef limestone(CRL),making it difficult to study mechanical behaviors.With X-ray computed tomography(CT),112 CRL...Different sedimentary zones in coral reefs lead to significant anisotropy in the pore structure of coral reef limestone(CRL),making it difficult to study mechanical behaviors.With X-ray computed tomography(CT),112 CRL samples were utilized for training the support vector machine(SVM)-,random forest(RF)-,and back propagation neural network(BPNN)-based models,respectively.Simultaneously,the machine learning model was embedded into genetic algorithm(GA)for parameter optimization to effectively predict uniaxial compressive strength(UCS)of CRL.Results indicate that the BPNN model with five hidden layers presents the best training effect in the data set of CRL.The SVM-based model shows a tendency to overfitting in the training set and poor generalization ability in the testing set.The RF-based model is suitable for training CRL samples with large data.Analysis of Pearson correlation coefficient matrix and the percentage increment method of performance metrics shows that the dry density,pore structure,and porosity of CRL are strongly correlated to UCS.However,the P-wave velocity is almost uncorrelated to the UCS,which is significantly distinct from the law for homogenous geomaterials.In addition,the pore tensor proposed in this paper can effectively reflect the pore structure of coral framework limestone(CFL)and coral boulder limestone(CBL),realizing the quantitative characterization of the heterogeneity and anisotropy of pore.The pore tensor provides a feasible idea to establish the relationship between pore structure and mechanical behavior of CRL.展开更多
To improve the performance of video compression for machine vision analysis tasks,a video coding for machines(VCM)standard working group was established to promote standardization procedures.In this paper,recent advan...To improve the performance of video compression for machine vision analysis tasks,a video coding for machines(VCM)standard working group was established to promote standardization procedures.In this paper,recent advances in video coding for machine standards are presented and comprehensive introductions to the use cases,requirements,evaluation frameworks and corresponding metrics of the VCM standard are given.Then the existing methods are presented,introducing the existing proposals by category and the research progress of the latest VCM conference.Finally,we give conclusions.展开更多
The quick spread of the CoronavirusDisease(COVID-19)infection around the world considered a real danger for global health.The biological structure and symptoms of COVID-19 are similar to other viral chest maladies,whi...The quick spread of the CoronavirusDisease(COVID-19)infection around the world considered a real danger for global health.The biological structure and symptoms of COVID-19 are similar to other viral chest maladies,which makes it challenging and a big issue to improve approaches for efficient identification of COVID-19 disease.In this study,an automatic prediction of COVID-19 identification is proposed to automatically discriminate between healthy and COVID-19 infected subjects in X-ray images using two successful moderns are traditional machine learning methods(e.g.,artificial neural network(ANN),support vector machine(SVM),linear kernel and radial basis function(RBF),k-nearest neighbor(k-NN),Decision Tree(DT),andCN2 rule inducer techniques)and deep learningmodels(e.g.,MobileNets V2,ResNet50,GoogleNet,DarkNet andXception).A largeX-ray dataset has been created and developed,namely the COVID-19 vs.Normal(400 healthy cases,and 400 COVID cases).To the best of our knowledge,it is currently the largest publicly accessible COVID-19 dataset with the largest number of X-ray images of confirmed COVID-19 infection cases.Based on the results obtained from the experiments,it can be concluded that all the models performed well,deep learning models had achieved the optimum accuracy of 98.8%in ResNet50 model.In comparison,in traditional machine learning techniques, the SVM demonstrated the best result for an accuracy of 95% and RBFaccuracy 94% for the prediction of coronavirus disease 2019.展开更多
A general prediction model for seven heavy metals was established using the heavy metal contents of 207soil samples measured by a portable X-ray fluorescence spectrometer(XRF)and six environmental factors as model cor...A general prediction model for seven heavy metals was established using the heavy metal contents of 207soil samples measured by a portable X-ray fluorescence spectrometer(XRF)and six environmental factors as model correction coefficients.The eXtreme Gradient Boosting(XGBoost)model was used to fit the relationship between the content of heavy metals and environment characteristics to evaluate the soil ecological risk of the smelting site.The results demonstrated that the generalized prediction model developed for Pb,Cd,and As was highly accurate with fitted coefficients(R^(2))values of 0.911,0.950,and 0.835,respectively.Topsoil presented the highest ecological risk,and there existed high potential ecological risk at some positions with different depths due to high mobility of Cd.Generally,the application of machine learning significantly increased the accuracy of pXRF measurements,and identified key environmental factors.The adapted potential ecological risk assessment emphasized the need to focus on Pb,Cd,and As in future site remediation efforts.展开更多
The motivation for this study is that the quality of deep fakes is constantly improving,which leads to the need to develop new methods for their detection.The proposed Customized Convolutional Neural Network method in...The motivation for this study is that the quality of deep fakes is constantly improving,which leads to the need to develop new methods for their detection.The proposed Customized Convolutional Neural Network method involves extracting structured data from video frames using facial landmark detection,which is then used as input to the CNN.The customized Convolutional Neural Network method is the date augmented-based CNN model to generate‘fake data’or‘fake images’.This study was carried out using Python and its libraries.We used 242 films from the dataset gathered by the Deep Fake Detection Challenge,of which 199 were made up and the remaining 53 were real.Ten seconds were allotted for each video.There were 318 videos used in all,199 of which were fake and 119 of which were real.Our proposedmethod achieved a testing accuracy of 91.47%,loss of 0.342,and AUC score of 0.92,outperforming two alternative approaches,CNN and MLP-CNN.Furthermore,our method succeeded in greater accuracy than contemporary models such as XceptionNet,Meso-4,EfficientNet-BO,MesoInception-4,VGG-16,and DST-Net.The novelty of this investigation is the development of a new Convolutional Neural Network(CNN)learning model that can accurately detect deep fake face photos.展开更多
Cardiac coronary angiography is a major technique that assists physicians during interventional heart surgery.Under X-ray irradiation,the physician injects a contrast agent through a catheter and determines the corona...Cardiac coronary angiography is a major technique that assists physicians during interventional heart surgery.Under X-ray irradiation,the physician injects a contrast agent through a catheter and determines the coronary arteries’state in real time.However,to obtain a more accurate state of the coronary arteries,physicians need to increase the fre-quency and intensity of X-ray exposure,which will inevitably increase the potential for harm to both the patient and the surgeon.In the work reported here,we use advanced deep learning algorithms to fi nd a method of frame interpola-tion for coronary angiography videos that reduces the frequency of X-ray exposure by reducing the frame rate of the coronary angiography video,thereby reducing X-ray-induced damage to physicians.We established a new coronary angiography image group dataset containing 95,039 groups of images extracted from 31 videos.Each group includes three consecutive images,which are used to train the video interpolation network model.We apply six popular frame interpolation methods to this dataset to confi rm that the video frame interpolation technology can reduce the video frame rate and reduce exposure of physicians to X-rays.展开更多
Live video streaming is one of the newly emerged services over the Internet that has attracted immense interest of the service providers.Since Internet was not designed for such services during its inception,such a se...Live video streaming is one of the newly emerged services over the Internet that has attracted immense interest of the service providers.Since Internet was not designed for such services during its inception,such a service poses some serious challenges including cost and scalability.Peer-to-Peer(P2P)Internet Protocol Television(IPTV)is an application-level distributed paradigm to offer live video contents.In terms of ease of deployment,it has emerged as a serious alternative to client server,Content Delivery Network(CDN)and IP multicast solutions.Nevertheless,P2P approach has struggled to provide the desired streaming quality due to a number of issues.Stability of peers in a network is one of themajor issues among these.Most of the existing approaches address this issue through older-stable principle.This paper first extensively investigates the older-stable principle to observe its validity in different scenarios.It is observed that the older-stable principle does not hold in several of them.Then,it utilizes machine learning approach to predict the stability of peers.This work evaluates the accuracy of severalmachine learning algorithms over the prediction of stability,where the Gradient Boosting Regressor(GBR)out-performs other algorithms.Finally,this work presents a proof-of-concept simulation to compare the effectiveness of older-stable rule and machine learning-based predictions for the stabilization of the overlay.The results indicate that machine learning-based stability estimation significantly improves the system.展开更多
X-ray image has been widely used in many fields such as medical diagnosis,industrial inspection,and so on.Unfortunately,due to the physical characteristics of X-ray and imaging system,distortion of the projected image...X-ray image has been widely used in many fields such as medical diagnosis,industrial inspection,and so on.Unfortunately,due to the physical characteristics of X-ray and imaging system,distortion of the projected image will happen,which restrict the application of X-ray image,especially in high accuracy fields.Distortion correction can be performed using algorithms that can be classified as global or local according to the method used,both having specific advantages and disadvantages.In this paper,a new global method based on support vector regression(SVR)machine for distortion correction is proposed.In order to test the presented method,a calibration phantom is specially designed for this purpose.A comparison of the proposed method with the traditional global distortion correction techniques is performed.The experimental results show that the proposed correction method performs better than the traditional global one.展开更多
Like the Covid-19 pandemic,smallpox virus infection broke out in the last century,wherein 500 million deaths were reported along with enormous economic loss.But unlike smallpox,the Covid-19 recorded a low exponential ...Like the Covid-19 pandemic,smallpox virus infection broke out in the last century,wherein 500 million deaths were reported along with enormous economic loss.But unlike smallpox,the Covid-19 recorded a low exponential infection rate and mortality rate due to advancement inmedical aid and diagnostics.Data analytics,machine learning,and automation techniques can help in early diagnostics and supporting treatments of many reported patients.This paper proposes a robust and efficient methodology for the early detection of COVID-19 from Chest X-Ray scans utilizing enhanced deep learning techniques.Our study suggests that using the Prediction and Deconvolutional Modules in combination with the SSD architecture can improve the performance of the model trained at this task.We used a publicly open CXR image dataset and implemented the detectionmodelwith task-specific pre-processing and near 80:20 split.This achieved a competitive specificity of 0.9474 and a sensibility/accuracy of 0.9597,which shall help better decision-making for various aspects of identification and treat the infection.展开更多
Real-time video transport over wireless Internet faces many challenges due to the heterogeneous environment including wireline and wireless networks. A robust network condition classification algorithm using multiple ...Real-time video transport over wireless Internet faces many challenges due to the heterogeneous environment including wireline and wireless networks. A robust network condition classification algorithm using multiple end-to-end metrics and Support Vector Machine (SVM) is proposed to classify different network events and model the transition pattern of network conditions. End-to-end Quality-of-Service (QoS) mechanisms like congestion control, error control, and power control can benefit from the network condition information and react to different network situations appropriately. The proposed network condition classifica- tion algorithm uses SVM as a classifier to cluster different end-to-end metrics such as end-to-end delay, delay jitter, throughput and packet loss-rate for the UDP traffic with TCP-friendly Rate Control (TFRC), which is used for video transport. The algorithm is also flexible for classifying different numbers of states representing different levels of network events such as wireline congestion and wireless channel loss. Simulation results using network simulator 2 (ns2) showed the effectiveness of the proposed scheme.展开更多
A number of automated video shot boundary detection methods for indexing a videosequence to facilitate browsing and retrieval have been proposed in recent years.Among these methods,the dissolve shot boundary isn't...A number of automated video shot boundary detection methods for indexing a videosequence to facilitate browsing and retrieval have been proposed in recent years.Among these methods,the dissolve shot boundary isn't accurately detected because it involves the camera operation and objectmovement.In this paper,a method based on support vector machine (SVM) is proposed to detect thedissolve shot boundary in MPEG compressed sequence.The problem of detection between the dissolveshot boundary and other boundaries is considered as two-class classification in our method.Featuresfrom the compressed sequences are directly extracted without decoding them,and the optimal classboundary between two classes are learned from training data by using SVM.Experiments,whichcompare various classification methods,show that using proposed method encourages performance ofvideo shot boundary detection.展开更多
This paper presents a human detection system in a vision-based hospital surveillance environment. The system is composed of three subsystems, i.e. background segmentation subsystem (BSS), human feature extraction su...This paper presents a human detection system in a vision-based hospital surveillance environment. The system is composed of three subsystems, i.e. background segmentation subsystem (BSS), human feature extraction subsystem (HFES), and human recognition subsystem (HRS). The codebook background model is applied in the BSS, the histogram of oriented gradients (HOG) features are used in the HFES, and the support vector machine (SVM) classification is employed in the HRS. By means of the integration of these subsystems, the human detection in a vision-based hospital surveillance environment is performed. Experimental results show that the proposed system can effectively detect most of the people in hospital surveillance video sequences.展开更多
Background Video anomaly detection has always been a hot topic and has attracted increasing attention.Many of the existing methods for video anomaly detection depend on processing the entire video rather than consider...Background Video anomaly detection has always been a hot topic and has attracted increasing attention.Many of the existing methods for video anomaly detection depend on processing the entire video rather than considering only the significant context. Method This paper proposes a novel video anomaly detection method called COVAD that mainly focuses on the region of interest in the video instead of the entire video. Our proposed COVAD method is based on an autoencoded convolutional neural network and a coordinated attention mechanism,which can effectively capture meaningful objects in the video and dependencies among different objects. Relying on the existing memory-guided video frame prediction network, our algorithm can significantly predict the future motion and appearance of objects in a video more effectively. Result The proposed algorithm obtained better experimental results on multiple datasets and outperformed the baseline models considered in our analysis. Simultaneously, we provide an improved visual test that can provide pixel-level anomaly explanations.展开更多
In order to respond to the need of social development,cultivate international talents,and improve the current English teaching mode,this paper studies video English visual-audio-oral learning system based on machine l...In order to respond to the need of social development,cultivate international talents,and improve the current English teaching mode,this paper studies video English visual-audio-oral learning system based on machine learning from the perspective of teaching and learning video English.It mainly analyzes the knowledge discovery process of machine learning,the design and application of video English visual-audio-oral learning system.It is found that the video English visual-audio-oral learning system based on machine learning has much higher level of practicality and efficiency compared with the traditional English language teaching in real life.The application of this system can also be of great significance in changes on language learning modes and methods in the future.展开更多
In this era of pandemic, the future of healthcare industry has never been more exciting. Artificial intelligence and machine learning (AI & ML) present opportunities to develop solutions that cater for very specif...In this era of pandemic, the future of healthcare industry has never been more exciting. Artificial intelligence and machine learning (AI & ML) present opportunities to develop solutions that cater for very specific needs within the industry. Deep learning in healthcare had become incredibly powerful for supporting clinics and in transforming patient care in general. Deep learning is increasingly being applied for the detection of clinically important features in the images beyond what can be perceived by the naked human eye. Chest X-ray images are one of the most common clinical method for diagnosing a number of diseases such as pneumonia, lung cancer and many other abnormalities like lesions and fractures. Proper diagnosis of a disease from X-ray images is often challenging task for even expert radiologists and there is a growing need for computerized support systems due to the large amount of information encoded in X-Ray images. The goal of this paper is to develop a lightweight solution to detect 14 different chest conditions from an X ray image. Given an X-ray image as input, our classifier outputs a label vector indicating which of 14 disease classes does the image fall into. Along with the image features, we are also going to use non-image features available in the data such as X-ray view type, age, gender etc. The original study conducted Stanford ML Group is our base line. Original study focuses on predicting 5 diseases. Our aim is to improve upon previous work, expand prediction to 14 diseases and provide insight for future chest radiography research.展开更多
文摘Video streaming applications have grown considerably in recent years.As a result,this becomes one of the most significant contributors to global internet traffic.According to recent studies,the telecommunications industry loses millions of dollars due to poor video Quality of Experience(QoE)for users.Among the standard proposals for standardizing the quality of video streaming over internet service providers(ISPs)is the Mean Opinion Score(MOS).However,the accurate finding of QoE by MOS is subjective and laborious,and it varies depending on the user.A fully automated data analytics framework is required to reduce the inter-operator variability characteristic in QoE assessment.This work addresses this concern by suggesting a novel hybrid XGBStackQoE analytical model using a two-level layering technique.Level one combines multiple Machine Learning(ML)models via a layer one Hybrid XGBStackQoE-model.Individual ML models at level one are trained using the entire training data set.The level two Hybrid XGBStackQoE-Model is fitted using the outputs(meta-features)of the layer one ML models.The proposed model outperformed the conventional models,with an accuracy improvement of 4 to 5 percent,which is still higher than the current traditional models.The proposed framework could significantly improve video QoE accuracy.
基金supported by the National Natural Science Foundation of China(Grant Nos.41877267 and 41877260)the Priority Research Program of the Chinese Academy of Sciences(Grant No.XDA13010201).
文摘Different sedimentary zones in coral reefs lead to significant anisotropy in the pore structure of coral reef limestone(CRL),making it difficult to study mechanical behaviors.With X-ray computed tomography(CT),112 CRL samples were utilized for training the support vector machine(SVM)-,random forest(RF)-,and back propagation neural network(BPNN)-based models,respectively.Simultaneously,the machine learning model was embedded into genetic algorithm(GA)for parameter optimization to effectively predict uniaxial compressive strength(UCS)of CRL.Results indicate that the BPNN model with five hidden layers presents the best training effect in the data set of CRL.The SVM-based model shows a tendency to overfitting in the training set and poor generalization ability in the testing set.The RF-based model is suitable for training CRL samples with large data.Analysis of Pearson correlation coefficient matrix and the percentage increment method of performance metrics shows that the dry density,pore structure,and porosity of CRL are strongly correlated to UCS.However,the P-wave velocity is almost uncorrelated to the UCS,which is significantly distinct from the law for homogenous geomaterials.In addition,the pore tensor proposed in this paper can effectively reflect the pore structure of coral framework limestone(CFL)and coral boulder limestone(CBL),realizing the quantitative characterization of the heterogeneity and anisotropy of pore.The pore tensor provides a feasible idea to establish the relationship between pore structure and mechanical behavior of CRL.
基金supported by ZTE Industry-University-Institute Cooperation Funds.
文摘To improve the performance of video compression for machine vision analysis tasks,a video coding for machines(VCM)standard working group was established to promote standardization procedures.In this paper,recent advances in video coding for machine standards are presented and comprehensive introductions to the use cases,requirements,evaluation frameworks and corresponding metrics of the VCM standard are given.Then the existing methods are presented,introducing the existing proposals by category and the research progress of the latest VCM conference.Finally,we give conclusions.
文摘The quick spread of the CoronavirusDisease(COVID-19)infection around the world considered a real danger for global health.The biological structure and symptoms of COVID-19 are similar to other viral chest maladies,which makes it challenging and a big issue to improve approaches for efficient identification of COVID-19 disease.In this study,an automatic prediction of COVID-19 identification is proposed to automatically discriminate between healthy and COVID-19 infected subjects in X-ray images using two successful moderns are traditional machine learning methods(e.g.,artificial neural network(ANN),support vector machine(SVM),linear kernel and radial basis function(RBF),k-nearest neighbor(k-NN),Decision Tree(DT),andCN2 rule inducer techniques)and deep learningmodels(e.g.,MobileNets V2,ResNet50,GoogleNet,DarkNet andXception).A largeX-ray dataset has been created and developed,namely the COVID-19 vs.Normal(400 healthy cases,and 400 COVID cases).To the best of our knowledge,it is currently the largest publicly accessible COVID-19 dataset with the largest number of X-ray images of confirmed COVID-19 infection cases.Based on the results obtained from the experiments,it can be concluded that all the models performed well,deep learning models had achieved the optimum accuracy of 98.8%in ResNet50 model.In comparison,in traditional machine learning techniques, the SVM demonstrated the best result for an accuracy of 95% and RBFaccuracy 94% for the prediction of coronavirus disease 2019.
基金financially supported from the National Key Research and Development Program of China(No.2019YFC1803601)the Fundamental Research Funds for the Central Universities of Central South University,China(No.2023ZZTS0801)+1 种基金the Postgraduate Innovative Project of Central South University,China(No.2023XQLH068)the Postgraduate Scientific Research Innovation Project of Hunan Province,China(No.QL20230054)。
文摘A general prediction model for seven heavy metals was established using the heavy metal contents of 207soil samples measured by a portable X-ray fluorescence spectrometer(XRF)and six environmental factors as model correction coefficients.The eXtreme Gradient Boosting(XGBoost)model was used to fit the relationship between the content of heavy metals and environment characteristics to evaluate the soil ecological risk of the smelting site.The results demonstrated that the generalized prediction model developed for Pb,Cd,and As was highly accurate with fitted coefficients(R^(2))values of 0.911,0.950,and 0.835,respectively.Topsoil presented the highest ecological risk,and there existed high potential ecological risk at some positions with different depths due to high mobility of Cd.Generally,the application of machine learning significantly increased the accuracy of pXRF measurements,and identified key environmental factors.The adapted potential ecological risk assessment emphasized the need to focus on Pb,Cd,and As in future site remediation efforts.
基金Science and Technology Funds from the Liaoning Education Department(Serial Number:LJKZ0104).
文摘The motivation for this study is that the quality of deep fakes is constantly improving,which leads to the need to develop new methods for their detection.The proposed Customized Convolutional Neural Network method involves extracting structured data from video frames using facial landmark detection,which is then used as input to the CNN.The customized Convolutional Neural Network method is the date augmented-based CNN model to generate‘fake data’or‘fake images’.This study was carried out using Python and its libraries.We used 242 films from the dataset gathered by the Deep Fake Detection Challenge,of which 199 were made up and the remaining 53 were real.Ten seconds were allotted for each video.There were 318 videos used in all,199 of which were fake and 119 of which were real.Our proposedmethod achieved a testing accuracy of 91.47%,loss of 0.342,and AUC score of 0.92,outperforming two alternative approaches,CNN and MLP-CNN.Furthermore,our method succeeded in greater accuracy than contemporary models such as XceptionNet,Meso-4,EfficientNet-BO,MesoInception-4,VGG-16,and DST-Net.The novelty of this investigation is the development of a new Convolutional Neural Network(CNN)learning model that can accurately detect deep fake face photos.
文摘Cardiac coronary angiography is a major technique that assists physicians during interventional heart surgery.Under X-ray irradiation,the physician injects a contrast agent through a catheter and determines the coronary arteries’state in real time.However,to obtain a more accurate state of the coronary arteries,physicians need to increase the fre-quency and intensity of X-ray exposure,which will inevitably increase the potential for harm to both the patient and the surgeon.In the work reported here,we use advanced deep learning algorithms to fi nd a method of frame interpola-tion for coronary angiography videos that reduces the frequency of X-ray exposure by reducing the frame rate of the coronary angiography video,thereby reducing X-ray-induced damage to physicians.We established a new coronary angiography image group dataset containing 95,039 groups of images extracted from 31 videos.Each group includes three consecutive images,which are used to train the video interpolation network model.We apply six popular frame interpolation methods to this dataset to confi rm that the video frame interpolation technology can reduce the video frame rate and reduce exposure of physicians to X-rays.
文摘Live video streaming is one of the newly emerged services over the Internet that has attracted immense interest of the service providers.Since Internet was not designed for such services during its inception,such a service poses some serious challenges including cost and scalability.Peer-to-Peer(P2P)Internet Protocol Television(IPTV)is an application-level distributed paradigm to offer live video contents.In terms of ease of deployment,it has emerged as a serious alternative to client server,Content Delivery Network(CDN)and IP multicast solutions.Nevertheless,P2P approach has struggled to provide the desired streaming quality due to a number of issues.Stability of peers in a network is one of themajor issues among these.Most of the existing approaches address this issue through older-stable principle.This paper first extensively investigates the older-stable principle to observe its validity in different scenarios.It is observed that the older-stable principle does not hold in several of them.Then,it utilizes machine learning approach to predict the stability of peers.This work evaluates the accuracy of severalmachine learning algorithms over the prediction of stability,where the Gradient Boosting Regressor(GBR)out-performs other algorithms.Finally,this work presents a proof-of-concept simulation to compare the effectiveness of older-stable rule and machine learning-based predictions for the stabilization of the overlay.The results indicate that machine learning-based stability estimation significantly improves the system.
基金National Natural Science Foundation of China(No.61305118)
文摘X-ray image has been widely used in many fields such as medical diagnosis,industrial inspection,and so on.Unfortunately,due to the physical characteristics of X-ray and imaging system,distortion of the projected image will happen,which restrict the application of X-ray image,especially in high accuracy fields.Distortion correction can be performed using algorithms that can be classified as global or local according to the method used,both having specific advantages and disadvantages.In this paper,a new global method based on support vector regression(SVR)machine for distortion correction is proposed.In order to test the presented method,a calibration phantom is specially designed for this purpose.A comparison of the proposed method with the traditional global distortion correction techniques is performed.The experimental results show that the proposed correction method performs better than the traditional global one.
文摘Like the Covid-19 pandemic,smallpox virus infection broke out in the last century,wherein 500 million deaths were reported along with enormous economic loss.But unlike smallpox,the Covid-19 recorded a low exponential infection rate and mortality rate due to advancement inmedical aid and diagnostics.Data analytics,machine learning,and automation techniques can help in early diagnostics and supporting treatments of many reported patients.This paper proposes a robust and efficient methodology for the early detection of COVID-19 from Chest X-Ray scans utilizing enhanced deep learning techniques.Our study suggests that using the Prediction and Deconvolutional Modules in combination with the SSD architecture can improve the performance of the model trained at this task.We used a publicly open CXR image dataset and implemented the detectionmodelwith task-specific pre-processing and near 80:20 split.This achieved a competitive specificity of 0.9474 and a sensibility/accuracy of 0.9597,which shall help better decision-making for various aspects of identification and treat the infection.
基金Project supported by the Croucher Foundation Fellowship fromHong Kong, China
文摘Real-time video transport over wireless Internet faces many challenges due to the heterogeneous environment including wireline and wireless networks. A robust network condition classification algorithm using multiple end-to-end metrics and Support Vector Machine (SVM) is proposed to classify different network events and model the transition pattern of network conditions. End-to-end Quality-of-Service (QoS) mechanisms like congestion control, error control, and power control can benefit from the network condition information and react to different network situations appropriately. The proposed network condition classifica- tion algorithm uses SVM as a classifier to cluster different end-to-end metrics such as end-to-end delay, delay jitter, throughput and packet loss-rate for the UDP traffic with TCP-friendly Rate Control (TFRC), which is used for video transport. The algorithm is also flexible for classifying different numbers of states representing different levels of network events such as wireline congestion and wireless channel loss. Simulation results using network simulator 2 (ns2) showed the effectiveness of the proposed scheme.
文摘A number of automated video shot boundary detection methods for indexing a videosequence to facilitate browsing and retrieval have been proposed in recent years.Among these methods,the dissolve shot boundary isn't accurately detected because it involves the camera operation and objectmovement.In this paper,a method based on support vector machine (SVM) is proposed to detect thedissolve shot boundary in MPEG compressed sequence.The problem of detection between the dissolveshot boundary and other boundaries is considered as two-class classification in our method.Featuresfrom the compressed sequences are directly extracted without decoding them,and the optimal classboundary between two classes are learned from training data by using SVM.Experiments,whichcompare various classification methods,show that using proposed method encourages performance ofvideo shot boundary detection.
基金supported by the“MOST”under Grant No.103-2221-E-468-008-MY2
文摘This paper presents a human detection system in a vision-based hospital surveillance environment. The system is composed of three subsystems, i.e. background segmentation subsystem (BSS), human feature extraction subsystem (HFES), and human recognition subsystem (HRS). The codebook background model is applied in the BSS, the histogram of oriented gradients (HOG) features are used in the HFES, and the support vector machine (SVM) classification is employed in the HRS. By means of the integration of these subsystems, the human detection in a vision-based hospital surveillance environment is performed. Experimental results show that the proposed system can effectively detect most of the people in hospital surveillance video sequences.
文摘Background Video anomaly detection has always been a hot topic and has attracted increasing attention.Many of the existing methods for video anomaly detection depend on processing the entire video rather than considering only the significant context. Method This paper proposes a novel video anomaly detection method called COVAD that mainly focuses on the region of interest in the video instead of the entire video. Our proposed COVAD method is based on an autoencoded convolutional neural network and a coordinated attention mechanism,which can effectively capture meaningful objects in the video and dependencies among different objects. Relying on the existing memory-guided video frame prediction network, our algorithm can significantly predict the future motion and appearance of objects in a video more effectively. Result The proposed algorithm obtained better experimental results on multiple datasets and outperformed the baseline models considered in our analysis. Simultaneously, we provide an improved visual test that can provide pixel-level anomaly explanations.
文摘In order to respond to the need of social development,cultivate international talents,and improve the current English teaching mode,this paper studies video English visual-audio-oral learning system based on machine learning from the perspective of teaching and learning video English.It mainly analyzes the knowledge discovery process of machine learning,the design and application of video English visual-audio-oral learning system.It is found that the video English visual-audio-oral learning system based on machine learning has much higher level of practicality and efficiency compared with the traditional English language teaching in real life.The application of this system can also be of great significance in changes on language learning modes and methods in the future.
文摘In this era of pandemic, the future of healthcare industry has never been more exciting. Artificial intelligence and machine learning (AI & ML) present opportunities to develop solutions that cater for very specific needs within the industry. Deep learning in healthcare had become incredibly powerful for supporting clinics and in transforming patient care in general. Deep learning is increasingly being applied for the detection of clinically important features in the images beyond what can be perceived by the naked human eye. Chest X-ray images are one of the most common clinical method for diagnosing a number of diseases such as pneumonia, lung cancer and many other abnormalities like lesions and fractures. Proper diagnosis of a disease from X-ray images is often challenging task for even expert radiologists and there is a growing need for computerized support systems due to the large amount of information encoded in X-Ray images. The goal of this paper is to develop a lightweight solution to detect 14 different chest conditions from an X ray image. Given an X-ray image as input, our classifier outputs a label vector indicating which of 14 disease classes does the image fall into. Along with the image features, we are also going to use non-image features available in the data such as X-ray view type, age, gender etc. The original study conducted Stanford ML Group is our base line. Original study focuses on predicting 5 diseases. Our aim is to improve upon previous work, expand prediction to 14 diseases and provide insight for future chest radiography research.