The study delves into the expanding role of network platforms in our daily lives, encompassing various mediums like blogs, forums, online chats, and prominent social media platforms such as Facebook, Twitter, and Inst...The study delves into the expanding role of network platforms in our daily lives, encompassing various mediums like blogs, forums, online chats, and prominent social media platforms such as Facebook, Twitter, and Instagram. While these platforms offer avenues for self-expression and community support, they concurrently harbor negative impacts, fostering antisocial behaviors like phishing, impersonation, hate speech, cyberbullying, cyberstalking, cyberterrorism, fake news propagation, spamming, and fraud. Notably, individuals also leverage these platforms to connect with authorities and seek aid during disasters. The overarching objective of this research is to address the dual nature of network platforms by proposing innovative methodologies aimed at enhancing their positive aspects and mitigating their negative repercussions. To achieve this, the study introduces a weight learning method grounded in multi-linear attribute ranking. This approach serves to evaluate the significance of attribute combinations across all feature spaces. Additionally, a novel clustering method based on tensors is proposed to elevate the quality of clustering while effectively distinguishing selected features. The methodology incorporates a weighted average similarity matrix and optionally integrates weighted Euclidean distance, contributing to a more nuanced understanding of attribute importance. The analysis of the proposed methods yields significant findings. The weight learning method proves instrumental in discerning the importance of attribute combinations, shedding light on key aspects within feature spaces. Simultaneously, the clustering method based on tensors exhibits improved efficacy in enhancing clustering quality and feature distinction. This not only advances our understanding of attribute importance but also paves the way for more nuanced data analysis methodologies. In conclusion, this research underscores the pivotal role of network platforms in contemporary society, emphasizing their potential for both positive contributions and adverse consequences. The proposed methodologies offer novel approaches to address these dualities, providing a foundation for future research and practical applications. Ultimately, this study contributes to the ongoing discourse on optimizing the utility of network platforms while minimizing their negative impacts.展开更多
This paper explores a highly accurate identification modeling approach for the ship maneuvering motion with fullscale trial. A multi-innovation gradient iterative(MIGI) approach is proposed to optimize the distance me...This paper explores a highly accurate identification modeling approach for the ship maneuvering motion with fullscale trial. A multi-innovation gradient iterative(MIGI) approach is proposed to optimize the distance metric of locally weighted learning(LWL), and a novel non-parametric modeling technique is developed for a nonlinear ship maneuvering system. This proposed method’s advantages are as follows: first, it can avoid the unmodeled dynamics and multicollinearity inherent to the conventional parametric model; second, it eliminates the over-learning or underlearning and obtains the optimal distance metric; and third, the MIGI is not sensitive to the initial parameter value and requires less time during the training phase. These advantages result in a highly accurate mathematical modeling technique that can be conveniently implemented in applications. To verify the characteristics of this mathematical model, two examples are used as the model platforms to study the ship maneuvering.展开更多
The groundwater potential map is an important tool for a sustainable water management and land use planning,particularly for agricultural countries like Vietnam.In this article,we proposed new machine learning ensembl...The groundwater potential map is an important tool for a sustainable water management and land use planning,particularly for agricultural countries like Vietnam.In this article,we proposed new machine learning ensemble techniques namely AdaBoost ensemble(ABLWL),Bagging ensemble(BLWL),Multi Boost ensemble(MBLWL),Rotation Forest ensemble(RFLWL)with Locally Weighted Learning(LWL)algorithm as a base classifier to build the groundwater potential map of Gia Lai province in Vietnam.For this study,eleven conditioning factors(aspect,altitude,curvature,slope,Stream Transport Index(STI),Topographic Wetness Index(TWI),soil,geology,river density,rainfall,land-use)and 134 wells yield data was used to create training(70%)and testing(30%)datasets for the development and validation of the models.Several statistical indices were used namely Positive Predictive Value(PPV),Negative Predictive Value(NPV),Sensitivity(SST),Specificity(SPF),Accuracy(ACC),Kappa,and Receiver Operating Characteristics(ROC)curve to validate and compare performance of models.Results show that performance of all the models is good to very good(AUC:0.75 to 0.829)but the ABLWL model with AUC=0.89 is the best.All the models applied in this study can support decision-makers to streamline the management of the groundwater and to develop economy not only of specific territories but also in other regions across the world with minor changes of the input parameters.展开更多
In this paper, we propose two weighted learning methods for the construction of single hidden layer feedforward neural networks. Both methods incorporate weighted least squares. Our idea is to allow the training insta...In this paper, we propose two weighted learning methods for the construction of single hidden layer feedforward neural networks. Both methods incorporate weighted least squares. Our idea is to allow the training instances nearer to the query to offer bigger contributions to the estimated output. By minimizing the weighted mean square error function, optimal networks can be obtained. The results of a number of experiments demonstrate the effectiveness of our proposed methods.展开更多
Accurately forecasting ocean waves during typhoon events is extremely important in aiding the mitigation and minimization of their potential damage to the coastal infrastructure, and the protection of coastal communit...Accurately forecasting ocean waves during typhoon events is extremely important in aiding the mitigation and minimization of their potential damage to the coastal infrastructure, and the protection of coastal communities. However, due to the complex hydrological and meteorological interaction and uncertainties arising from different modeling systems, quantifying the uncertainties and improving the forecasting accuracy of modeled typhoon-induced waves remain challenging. This paper presents a practical approach to optimizing model-ensemble wave heights in an attempt to improve the accuracy of real-time typhoon wave forecasting. A locally weighted learning algorithm is used to obtain the weights for the wave heights computed by the WAVEWATCH III wave model driven by winds from four different weather models (model-ensembles). The optimized weights are subsequently used to calculate the resulting wave heights from the model-ensembles. The results show that the opti- mization is capable of capturing the different behavioral effects of the different weather models on wave generation. Comparison with the measurements at the selected wave buoy locations shows that the optimized weights, obtained through a training process, can significantly improve the accuracy of the forecasted wave heights over the standard mean values, particularly for typhoon-induced peak waves. The results also indicate that the algorithm is easy to imnlement and practieal for real-time wave forecasting.展开更多
The Extreme Learning Machine(ELM) and its variants are effective in many machine learning applications such as Imbalanced Learning(IL) or Big Data(BD) learning. However, they are unable to solve both imbalanced ...The Extreme Learning Machine(ELM) and its variants are effective in many machine learning applications such as Imbalanced Learning(IL) or Big Data(BD) learning. However, they are unable to solve both imbalanced and large-volume data learning problems. This study addresses the IL problem in BD applications. The Distributed and Weighted ELM(DW-ELM) algorithm is proposed, which is based on the Map Reduce framework. To confirm the feasibility of parallel computation, first, the fact that matrix multiplication operators are decomposable is illustrated.Then, to further improve the computational efficiency, an Improved DW-ELM algorithm(IDW-ELM) is developed using only one Map Reduce job. The successful operations of the proposed DW-ELM and IDW-ELM algorithms are finally validated through experiments.展开更多
Van der Pauw's function is often used in the measurement of a semiconductor's resistivity. However, it is difficult to obtain its value from voltage measurements because it has an implicit form. If it can be express...Van der Pauw's function is often used in the measurement of a semiconductor's resistivity. However, it is difficult to obtain its value from voltage measurements because it has an implicit form. If it can be expressed as a polynomial, a semiconductor's resistivity can be obtained from such measurements. Normally, five orders of the abscissa can provide sufficient precision during the expression of any non-linear function. Therefore, the key is to determine the coefficients of the polynomial. By taking five coefficients as weights to construct a neuronetwork, neurocomputing has been used to solve this problem. Finally, the polynomial expression for van der Pauw's function is obtained.展开更多
Unlike the traditional fossil energy, wind, as the clean renewable energy, can reduce the emission of the greenhouse gas. To take full advantage of the environmental benefits of wind energy, wind power forecasting has...Unlike the traditional fossil energy, wind, as the clean renewable energy, can reduce the emission of the greenhouse gas. To take full advantage of the environmental benefits of wind energy, wind power forecasting has to be studied to overcome the troubles brought by the variable nature of wind. Power forecasting for regional wind farm groups is the problem that many power system operators care about. The high-dimensional feature sets with redundant information are frequently encountered when dealing with this problem. In this paper, two kinds of feature set construction methods are proposed which can achieve the proper feature set either by selecting the subsets or by transforming the original variables with specific combinations. The former method selects the subset according to the criterion of minimal-redundancy-maximal-relevance (mRMR), while the latter does so based on the method of principal component analysis (PCA). A locally weighted learning method is also proposed to utilize the processed feature set to produce the power forecast results. The proposed model is simple and easy to use with parameters optimized automatically. Finally, a case study of 28 wind farms in East China is provided to verify the effectiveness of the proposed method.展开更多
In personalised medicine,the goal is tomake a treatment recommendation for each patient with a given set of covariates tomaximise the treatment benefitmeasured by patient’s response to the treatment.In application,su...In personalised medicine,the goal is tomake a treatment recommendation for each patient with a given set of covariates tomaximise the treatment benefitmeasured by patient’s response to the treatment.In application,such a treatment assignment rule is constructed using a sample training data consisting of patients’responses and covariates.Instead of modelling responses using treatments and covariates,an alternative approach is maximising a response-weighted target function whose value directly reflects the effectiveness of treatment assignments.Since the target function involves a loss function,efforts have been made recently on the choice of the loss function to ensure a computationally feasible and theoretically sound solution.We propose to use a smooth hinge loss function so that the target function is convex and differentiable,which possesses good asymptotic properties and numerical advantages.To further simplify the computation and interpretability,we focus on the rules that are linear functions of covariates and discuss their asymptotic properties.We also examine the performances of our method with simulation studies and real data analysis.展开更多
基金sponsored by the National Natural Science Foundation of P.R.China(Nos.62102194 and 62102196)Six Talent Peaks Project of Jiangsu Province(No.RJFW-111)Postgraduate Research and Practice Innovation Program of Jiangsu Province(Nos.KYCX23_1087 and KYCX22_1027).
文摘The study delves into the expanding role of network platforms in our daily lives, encompassing various mediums like blogs, forums, online chats, and prominent social media platforms such as Facebook, Twitter, and Instagram. While these platforms offer avenues for self-expression and community support, they concurrently harbor negative impacts, fostering antisocial behaviors like phishing, impersonation, hate speech, cyberbullying, cyberstalking, cyberterrorism, fake news propagation, spamming, and fraud. Notably, individuals also leverage these platforms to connect with authorities and seek aid during disasters. The overarching objective of this research is to address the dual nature of network platforms by proposing innovative methodologies aimed at enhancing their positive aspects and mitigating their negative repercussions. To achieve this, the study introduces a weight learning method grounded in multi-linear attribute ranking. This approach serves to evaluate the significance of attribute combinations across all feature spaces. Additionally, a novel clustering method based on tensors is proposed to elevate the quality of clustering while effectively distinguishing selected features. The methodology incorporates a weighted average similarity matrix and optionally integrates weighted Euclidean distance, contributing to a more nuanced understanding of attribute importance. The analysis of the proposed methods yields significant findings. The weight learning method proves instrumental in discerning the importance of attribute combinations, shedding light on key aspects within feature spaces. Simultaneously, the clustering method based on tensors exhibits improved efficacy in enhancing clustering quality and feature distinction. This not only advances our understanding of attribute importance but also paves the way for more nuanced data analysis methodologies. In conclusion, this research underscores the pivotal role of network platforms in contemporary society, emphasizing their potential for both positive contributions and adverse consequences. The proposed methodologies offer novel approaches to address these dualities, providing a foundation for future research and practical applications. Ultimately, this study contributes to the ongoing discourse on optimizing the utility of network platforms while minimizing their negative impacts.
基金financially supported in part by the National High Technology Research and Development Program of China(863Program,Grant No.2015AA016404)the National Natural Science Foundation of China(Grant Nos.51109020,51179019 and 51779029)the Fundamental Research Program for Key Laboratory of the Education Department of Liaoning Province(Grant No.LZ2015006)
文摘This paper explores a highly accurate identification modeling approach for the ship maneuvering motion with fullscale trial. A multi-innovation gradient iterative(MIGI) approach is proposed to optimize the distance metric of locally weighted learning(LWL), and a novel non-parametric modeling technique is developed for a nonlinear ship maneuvering system. This proposed method’s advantages are as follows: first, it can avoid the unmodeled dynamics and multicollinearity inherent to the conventional parametric model; second, it eliminates the over-learning or underlearning and obtains the optimal distance metric; and third, the MIGI is not sensitive to the initial parameter value and requires less time during the training phase. These advantages result in a highly accurate mathematical modeling technique that can be conveniently implemented in applications. To verify the characteristics of this mathematical model, two examples are used as the model platforms to study the ship maneuvering.
基金funded by Vietnam National Foundation for Science and Technology Development(NAFOSTED)under grant number 105.08-2019.03.
文摘The groundwater potential map is an important tool for a sustainable water management and land use planning,particularly for agricultural countries like Vietnam.In this article,we proposed new machine learning ensemble techniques namely AdaBoost ensemble(ABLWL),Bagging ensemble(BLWL),Multi Boost ensemble(MBLWL),Rotation Forest ensemble(RFLWL)with Locally Weighted Learning(LWL)algorithm as a base classifier to build the groundwater potential map of Gia Lai province in Vietnam.For this study,eleven conditioning factors(aspect,altitude,curvature,slope,Stream Transport Index(STI),Topographic Wetness Index(TWI),soil,geology,river density,rainfall,land-use)and 134 wells yield data was used to create training(70%)and testing(30%)datasets for the development and validation of the models.Several statistical indices were used namely Positive Predictive Value(PPV),Negative Predictive Value(NPV),Sensitivity(SST),Specificity(SPF),Accuracy(ACC),Kappa,and Receiver Operating Characteristics(ROC)curve to validate and compare performance of models.Results show that performance of all the models is good to very good(AUC:0.75 to 0.829)but the ABLWL model with AUC=0.89 is the best.All the models applied in this study can support decision-makers to streamline the management of the groundwater and to develop economy not only of specific territories but also in other regions across the world with minor changes of the input parameters.
基金supported by the NSC under Grant No.NSC-100-2221-E-110-083-MY3 and NSC-101-2622-E-110-011-CC3"Aim for the Top University Plan"of the National Sun-Yat-Sen University and Ministry of Education
文摘In this paper, we propose two weighted learning methods for the construction of single hidden layer feedforward neural networks. Both methods incorporate weighted least squares. Our idea is to allow the training instances nearer to the query to offer bigger contributions to the estimated output. By minimizing the weighted mean square error function, optimal networks can be obtained. The results of a number of experiments demonstrate the effectiveness of our proposed methods.
基金supported by the European Commission within FP7-THEME 6(Grant No.244104)the Natural Environment Research Council(NERC)of the UK(Grant No.NE/J005541/1)the Ministry of Science and Technology(MOST)of Taiwan(Grant No.MOST 104-2221-E-006-183)
文摘Accurately forecasting ocean waves during typhoon events is extremely important in aiding the mitigation and minimization of their potential damage to the coastal infrastructure, and the protection of coastal communities. However, due to the complex hydrological and meteorological interaction and uncertainties arising from different modeling systems, quantifying the uncertainties and improving the forecasting accuracy of modeled typhoon-induced waves remain challenging. This paper presents a practical approach to optimizing model-ensemble wave heights in an attempt to improve the accuracy of real-time typhoon wave forecasting. A locally weighted learning algorithm is used to obtain the weights for the wave heights computed by the WAVEWATCH III wave model driven by winds from four different weather models (model-ensembles). The optimized weights are subsequently used to calculate the resulting wave heights from the model-ensembles. The results show that the opti- mization is capable of capturing the different behavioral effects of the different weather models on wave generation. Comparison with the measurements at the selected wave buoy locations shows that the optimized weights, obtained through a training process, can significantly improve the accuracy of the forecasted wave heights over the standard mean values, particularly for typhoon-induced peak waves. The results also indicate that the algorithm is easy to imnlement and practieal for real-time wave forecasting.
基金partially supported by the National Natural Science Foundation of China(Nos.61402089,61472069,and 61501101)the Fundamental Research Funds for the Central Universities(Nos.N161904001,N161602003,and N150408001)+2 种基金the Natural Science Foundation of Liaoning Province(No.2015020553)the China Postdoctoral Science Foundation(No.2016M591447)the Postdoctoral Science Foundation of Northeastern University(No.20160203)
文摘The Extreme Learning Machine(ELM) and its variants are effective in many machine learning applications such as Imbalanced Learning(IL) or Big Data(BD) learning. However, they are unable to solve both imbalanced and large-volume data learning problems. This study addresses the IL problem in BD applications. The Distributed and Weighted ELM(DW-ELM) algorithm is proposed, which is based on the Map Reduce framework. To confirm the feasibility of parallel computation, first, the fact that matrix multiplication operators are decomposable is illustrated.Then, to further improve the computational efficiency, an Improved DW-ELM algorithm(IDW-ELM) is developed using only one Map Reduce job. The successful operations of the proposed DW-ELM and IDW-ELM algorithms are finally validated through experiments.
文摘Van der Pauw's function is often used in the measurement of a semiconductor's resistivity. However, it is difficult to obtain its value from voltage measurements because it has an implicit form. If it can be expressed as a polynomial, a semiconductor's resistivity can be obtained from such measurements. Normally, five orders of the abscissa can provide sufficient precision during the expression of any non-linear function. Therefore, the key is to determine the coefficients of the polynomial. By taking five coefficients as weights to construct a neuronetwork, neurocomputing has been used to solve this problem. Finally, the polynomial expression for van der Pauw's function is obtained.
文摘Unlike the traditional fossil energy, wind, as the clean renewable energy, can reduce the emission of the greenhouse gas. To take full advantage of the environmental benefits of wind energy, wind power forecasting has to be studied to overcome the troubles brought by the variable nature of wind. Power forecasting for regional wind farm groups is the problem that many power system operators care about. The high-dimensional feature sets with redundant information are frequently encountered when dealing with this problem. In this paper, two kinds of feature set construction methods are proposed which can achieve the proper feature set either by selecting the subsets or by transforming the original variables with specific combinations. The former method selects the subset according to the criterion of minimal-redundancy-maximal-relevance (mRMR), while the latter does so based on the method of principal component analysis (PCA). A locally weighted learning method is also proposed to utilize the processed feature set to produce the power forecast results. The proposed model is simple and easy to use with parameters optimized automatically. Finally, a case study of 28 wind farms in East China is provided to verify the effectiveness of the proposed method.
基金Research reported in this article was partially funded through a Patient-Centered Outcomes Research Institute(PCORI)Award[ME-1409-21219]The second author’s research was also partially supported by the Chinese 111 Project[B14019]the US National Science Foundation[grant number DMS-1612873].
文摘In personalised medicine,the goal is tomake a treatment recommendation for each patient with a given set of covariates tomaximise the treatment benefitmeasured by patient’s response to the treatment.In application,such a treatment assignment rule is constructed using a sample training data consisting of patients’responses and covariates.Instead of modelling responses using treatments and covariates,an alternative approach is maximising a response-weighted target function whose value directly reflects the effectiveness of treatment assignments.Since the target function involves a loss function,efforts have been made recently on the choice of the loss function to ensure a computationally feasible and theoretically sound solution.We propose to use a smooth hinge loss function so that the target function is convex and differentiable,which possesses good asymptotic properties and numerical advantages.To further simplify the computation and interpretability,we focus on the rules that are linear functions of covariates and discuss their asymptotic properties.We also examine the performances of our method with simulation studies and real data analysis.