Spatial heterogeneity refers to the variation in characteristics or features across different locations or areas in space. Spatial data, also known as geospatial data or geographic information, is information that explicitly or indirectly belongs to a particular geographic region or location. Focusing on spatial heterogeneity, we present a hybrid machine learning model combining two competitive algorithms: the Random Forest (RF) Regressor and CNN. The model is fine-tuned using cross-validation for hyper-parameter adjustment and performance evaluation, ensuring robustness and generalization. Our approach uses global Moran's I to examine global spatial autocorrelation and local Moran's I to assess local spatial autocorrelation in the residuals. To validate our approach, we implemented the hybrid model on a real-world dataset and compared its performance with that of the traditional machine learning models. Results indicate superior performance, with an R-squared of 0.90 for the hybrid model against 0.84 for RF and 0.74 for CNN. This study contributes to a detailed understanding of spatial variations in data by considering the geographical information (longitude and latitude) present in the dataset. Our results, also assessed using the Root Mean Squared Error (RMSE), indicate that the hybrid model yielded lower errors, showing a deviation of 53.65% from the RF model and 63.24% from the CNN model. Additionally, the global Moran's I index was observed to be 0.10. This study underscores that the hybrid model was able to correctly predict house prices both in clusters and in dispersed areas.
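The global Moran's I check on residuals mentioned in the abstract above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function name `global_morans_i`, the binary k-nearest-neighbour weighting, and the default `k=4` are all assumptions, since the abstract does not say how the spatial weights were built.

```python
import numpy as np

def global_morans_i(values, coords, k=4):
    """Global Moran's I with binary k-nearest-neighbour weights (assumed scheme)."""
    values = np.asarray(values, dtype=float)
    coords = np.asarray(coords, dtype=float)
    n = values.size

    # Pairwise Euclidean distances; a point is never its own neighbour.
    diff = coords[:, None, :] - coords[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1))
    np.fill_diagonal(dist, np.inf)

    # w[i, j] = 1 if j is one of the k nearest neighbours of i.
    weights = np.zeros((n, n))
    nearest = np.argsort(dist, axis=1)[:, :k]
    rows = np.repeat(np.arange(n), k)
    weights[rows, nearest.ravel()] = 1.0

    # I = (n / S0) * (z' W z) / (z' z), with z the mean-centred values.
    z = values - values.mean()
    s0 = weights.sum()
    return (n / s0) * (z @ weights @ z) / (z @ z)
```

Applied to model residuals with their longitude and latitude as `coords`, a value near zero suggests little remaining spatial autocorrelation, while clearly positive values suggest that neighbouring residuals are still similar.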
A hybrid GSI (Grid-point Statistical Interpolation)-ETKF (Ensemble Transform Kalman Filter) data assimilation system has recently been developed for the WRF (Weather Research and Forecasting) model and tested with simulated observations for tropical cyclone (TC) forecasting. This system is based on the existing GSI but incorporates ensemble background information. As a follow-up, this work extends the new system to assimilate real observations in order to further understand the hybrid scheme. As a first effort to explore the system with real observations, a relatively coarse grid resolution (27 km) is used. A case study of Typhoon Muifa (2011) is performed to assimilate real observations, including conventional in-situ and satellite data. The hybrid system with flow-dependent ensemble covariance shows significant improvements in track forecast compared with the standard GSI system, which in theory is a three-dimensional variational analysis (3DVAR). By comparing the analyses, analysis increments and forecasts, the hybrid system is found to be potentially able to recognize the existence of the TC vortex, adjust its position systematically, better describe the asymmetric structure of Typhoon Muifa and maintain the dynamic and thermodynamic balance in the typhoon initial field. In addition, a cold-start hybrid approach that uses global ensembles to provide the flow-dependent error is tested, and its results are similar to those from the cycled GSI-ETKF approach.
Data aggregation from various web sources is very significant for the web data analysis domain. In addition, the recognition of coherent micro-clusters is one of the most interesting issues in the field of data aggregation. Until now, many algorithms have been proposed to work on this issue. However, the deficiency of these solutions is that they cannot recognize the micro-cluster data stream accurately. A semantic-based coherent micro-cluster recognition algorithm for hybrid web data streams is proposed. Firstly, an objective function is proposed to recognize the coherent micro-cluster, and then the semantic-based coherent micro-cluster recognition algorithm for hybrid web data streams is presented. Fi-
An 8×10 GHz receiver optical sub-assembly (ROSA) consisting of an 8-channel arrayed waveguide grating (AWG) and an 8-channel PIN photodetector (PD) array is designed and fabricated based on silica hybrid integration technology. Multimode output waveguides in the silica AWG with 2% refractive index difference are used to obtain flat-top spectra. The output waveguide facet is polished to a 45° bevel to redirect the light into the mesa-type PIN PD, which simplifies the packaging process. The experimental results show that the single-channel 1 dB bandwidth of the AWG ranges from 2.12 nm to 3.06 nm, the ROSA responsivity ranges from 0.097 A/W to 0.158 A/W, and the 3 dB bandwidth is up to 11 GHz. The device is promising for eight-lane WDM transmission systems in data center interconnection.
Under Type-II progressive hybrid censoring, this paper discusses statistical inference and optimal design for a step-stress partially accelerated life test of a hybrid system in the presence of masked data. It is assumed that the lifetimes of the components in the hybrid system follow independent and identical modified Weibull distributions. The maximum likelihood estimates (MLEs) of the unknown parameters, acceleration factor and reliability indexes are derived using the Newton-Raphson algorithm. The asymptotic variance-covariance matrix and the approximate confidence intervals are obtained from the normal approximation to the asymptotic distribution of the MLEs of the model parameters. Moreover, two bootstrap confidence intervals are constructed using the parametric bootstrap method. The optimal time for changing stress levels is determined under the D-optimality and A-optimality criteria. Finally, a Monte Carlo simulation study is carried out to illustrate the proposed procedures.
Decision rule mining is an important issue in machine learning and data mining. However, most proposed algorithms mine categorical data at a single level, and the resulting rules are neither easy to understand nor really useful for users. Thus, this paper provides a new approach to hierarchical decision rule mining, in which a similarity direction measure is introduced to deal with hybrid data. This approach can mine hierarchical decision rules by adjusting the similarity measure parameters and the level of the concept hierarchy trees.
Cloud storage is widely used by large companies to store vast amounts of data and files, offering flexibility, financial savings, and security. However, information theft poses significant threats, potentially leading to poor performance and privacy breaches. Blockchain-based cognitive computing can help protect and maintain information security and privacy in cloud platforms, ensuring businesses can focus on business development. To ensure data security in cloud platforms, this research proposes a blockchain-based Hybridized Data Driven Cognitive Computing (HD2C) model. The proposed HD2C framework addresses breaches of the private information of mixed Internet of Things (IoT) participants in the cloud. HD2C is developed by combining Federated Learning (FL) with a Blockchain consensus algorithm to connect smart contracts with Proof of Authority (PoA). The "Data Island" problem can be solved by FL's emphasis on privacy and lightning-fast processing, while Blockchain provides a decentralized incentive structure that is impervious to poisoning. FL with Blockchain allows quick consensus through smart member selection and verification. The HD2C paradigm significantly improves the computational processing efficiency of intelligent manufacturing. Extensive analysis results derived from IIoT datasets confirm HD2C's superiority. When compared to other consensus algorithms, the Blockchain PoA's foundational cost is significant. The accuracy and memory utilization evaluation results predict the total benefits of the system. In comparison to the values 0.004 and 0.04, the value of 0.4 achieves good accuracy. According to the experimental results, the number of transactions per second has minimal impact on memory requirements. The findings of this study resulted in the development of a brand-new IIoT framework based on blockchain technology.
The majority of spatial data reveal some degree of spatial dependence. The term "spatial dependence" refers to the tendency for phenomena to be more similar when they occur close together than when they occur far apart in space. This property is ignored in machine learning (ML) for spatial domains of application. Most classical machine learning algorithms are generally inappropriate unless modified in some way to account for it. In this study, we propose an approach that aims to improve an ML model to detect the dependence without incorporating any spatial features in the learning process. To detect this dependence while also improving performance, a hybrid model (HM) based on two representative algorithms was used. In addition, cross-validation was used to make the model stable. Furthermore, global Moran's I and local Moran's I were used to capture the spatial dependence in the residuals. The results show that the HM performs significantly better, with an R² of 99.91%, compared with RBFNN and RF, which have R² values of 74.22% and 82.26%, respectively. With lower errors, the HM achieved an average test error of 0.033% and a positive global Moran's I of 0.12. We concluded that as the R² value increases, the models become weaker in terms of capturing the dependence.
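The local Moran statistic used on the residuals can be illustrated with a short sketch. It assumes a precomputed neighbour matrix and row-standardised weights; the function name `local_morans_i` and these choices are illustrative assumptions, not details taken from the abstract.

```python
import numpy as np

def local_morans_i(residuals, weights):
    """Local Moran's I_i for each observation, using row-standardised weights."""
    z = np.asarray(residuals, dtype=float)
    z = z - z.mean()

    w = np.array(weights, dtype=float)  # copy so the caller's matrix is untouched
    np.fill_diagonal(w, 0.0)            # an observation is not its own neighbour
    row_sums = w.sum(axis=1, keepdims=True)
    if np.any(row_sums == 0):
        raise ValueError("every observation needs at least one neighbour")
    w = w / row_sums                    # each row now sums to 1

    # I_i = z_i * sum_j w_ij z_j / m2, with m2 the mean squared deviation.
    m2 = (z @ z) / z.size
    return z * (w @ z) / m2
```

With row-standardised weights, the mean of the local values equals the global Moran's I, so positive local values mark observations whose residuals resemble those of their neighbours and negative values mark local outliers.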
Development of renewable energy (RE) and mitigation of carbon dioxide, the two largest climate action initiatives, are the most challenging factors for the new generation of green data centers (GDCs). Reducing conventional electricity consumption and the cost of electricity (COE) while maintaining the preferred quality of service (QoS) has been recognized as an interesting research topic in the Information and Communication Technology (ICT) sector. Moreover, it is challenging to design a large-scale sustainable GDC with a standalone RE supply. This paper focuses on a hybrid energy supply solution for the GDC that reduces grid electricity usage and minimizes net system cost. The proposed framework includes RE sources such as solar photovoltaic and wind turbine, and non-renewable energy sources such as a Diesel Generator (DG) and Battery. A hybrid optimization model is designed using HOMER software for cost assessment and energy evaluation to validate the effectiveness of the suggested scheme, focusing on its eco-friendly implications.
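A hybrid EnSRF-En3DVar data assimilation system was built on the WRF model; the system uses the EnSRF scheme to update the ensemble perturbations directly. Using this system, two sets of sensitivity experiments were carried out for Typhoon Saomai: one on the ensemble covariance weight and one on assimilating different radar observations. The ensemble covariance weight experiments show that with weights of 0.25, 0.5 and 0.75 the assimilation outperforms the 3DVar experiment, with the weight of 0.75 giving the best estimate of the analysis field; with a weight of 1.0 the analysis field is the worst. The radar experiments show that jointly assimilating radar radial velocity and reflectivity effectively improves the atmospheric humidity and wind fields, although the improvement in the wind field is smaller than when only radial velocity is assimilated. A combined analysis of the EnSRF ensemble perturbation update scheme and the perturbed-observation scheme shows that the perturbed-observation scheme has a smaller ensemble spread and a higher computational cost, so the EnSRF scheme is superior.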
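The role of the ensemble covariance weight can be illustrated with a small sketch. The abstract does not give the blending formula, so the linear combination below, which is a common way hybrid systems mix a static and an ensemble-derived background error covariance, is an assumption; the function names are hypothetical, and covariance localization is omitted.

```python
import numpy as np

def ensemble_covariance(members):
    """Sample covariance of ensemble members, shape (n_members, n_state)."""
    members = np.asarray(members, dtype=float)
    perturbations = members - members.mean(axis=0)
    return perturbations.T @ perturbations / (members.shape[0] - 1)

def hybrid_covariance(static_cov, members, ensemble_weight):
    """Assumed linear blend: (1 - w) * static + w * ensemble covariance."""
    if not 0.0 <= ensemble_weight <= 1.0:
        raise ValueError("ensemble_weight must lie in [0, 1]")
    static_cov = np.asarray(static_cov, dtype=float)
    return ((1.0 - ensemble_weight) * static_cov
            + ensemble_weight * ensemble_covariance(members))
```

Under this assumption, a weight of 0 reduces to the static covariance used by 3DVar and a weight of 1.0 relies on the ensemble alone, which matches the range of weights (0.25, 0.5, 0.75, 1.0) explored in the sensitivity experiments.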
Funding (hybrid GSI-ETKF data assimilation study): Project for Public Welfare (Meteorology) of China (GYHY201206006); 973 Program (2013CB430305); National Natural Science Foundation of China (41575107); Project of Shanghai Meteorological Bureau (YJ201401); Key Project of Science and Technology Commission of Shanghai Municipality (13231203300).
Funding (coherent micro-cluster recognition study): Supported by the National High Technology Research and Development Programme of China (No. 2011AA120300, 2011AA120302) and the National Key Technology Support Program of China (No. 2013BAH66F02).
Funding (8×10 GHz ROSA study): Supported by the National High Technology Research and Development Program of China under Grant No. 2015AA016902, the National Natural Science Foundation of China under Grant Nos. 61435013 and 61405188, and the K. C. Wong Education Foundation.
Funding (step-stress partially accelerated life test study): Supported by the National Natural Science Foundation of China (71401134, 71571144, 71171164) and the Program of International Cooperation and Exchanges in Science and Technology Funded by Shaanxi Province (2016KW-033).
Funding (hierarchical decision rule mining study): Supported by the National Natural Science Foundation of China under Grant Nos. 60775036 and 60970061, and the Higher Education Nature Science Research Fund Project of Jiangsu Province under Grant No. 09KJD520004.