Accurate gas viscosity determination is an important issue in the oil and gas industries.Experimental approaches for gas viscosity measurement are timeconsuming,expensive and hardly possible at high pressures and high...Accurate gas viscosity determination is an important issue in the oil and gas industries.Experimental approaches for gas viscosity measurement are timeconsuming,expensive and hardly possible at high pressures and high temperatures(HPHT).In this study,a number of correlations were developed to estimate gas viscosity by the use of group method of data handling(GMDH)type neural network and gene expression programming(GEP)techniques using a large data set containing more than 3000 experimental data points for methane,nitrogen,and hydrocarbon gas mixtures.It is worth mentioning that unlike many of viscosity correlations,the proposed ones in this study could compute gas viscosity at pressures ranging between 34 and 172 MPa and temperatures between 310 and 1300 K.Also,a comparison was performed between the results of these established models and the results of ten wellknown models reported in the literature.Average absolute relative errors of GMDH models were obtained 4.23%,0.64%,and 0.61%for hydrocarbon gas mixtures,methane,and nitrogen,respectively.In addition,graphical analyses indicate that the GMDH can predict gas viscosity with higher accuracy than GEP at HPHT conditions.Also,using leverage technique,valid,suspected and outlier data points were determined.Finally,trends of gas viscosity models at different conditions were evaluated.展开更多
This paper proposes the solution of tasks set required for autonomous robotic group behavior optimization during the mission on a distributed area in a cluttered hazardous terrain.The navigation scheme uses the benefi...This paper proposes the solution of tasks set required for autonomous robotic group behavior optimization during the mission on a distributed area in a cluttered hazardous terrain.The navigation scheme uses the benefits of the original real-time technical vision system(TVS)based on a dynamic triangulation principle.The method uses TVS output data with fuzzy logic rules processing for resolution stabilization.Based on previous researches,the dynamic communication network model is modified to implement the propagation of information with a feedback method for more stable data exchange inside the robotic group.According to the comparative analysis of approximation methods,in this paper authors are proposing to use two-steps post-processing path planning aiming to get a smooth and energy-saving trajectory.The article provides a wide range of studies and computational experiment results for different scenarios for evaluation of common cloud point influence on robotic motion planning.展开更多
Consider the bivariate exponential distribution due to Marshall and Olkin[2], whose survival function is F(x, g) = exp[-λ1x-λ2y-λ12 max(x, y)] (x 0,y 0)with unknown Parameters λ1 > 0, λ2 > 0 and λ12 0.Base...Consider the bivariate exponential distribution due to Marshall and Olkin[2], whose survival function is F(x, g) = exp[-λ1x-λ2y-λ12 max(x, y)] (x 0,y 0)with unknown Parameters λ1 > 0, λ2 > 0 and λ12 0.Based on grouped data, a newestimator for λ1, λ2 and λ12 is derived and its asymptotic properties are discussed.Besides, some test procedures of equal marginals and independence are given. Asimulation result is given, too.展开更多
A new method of multi sensor location data fusion is proposed.The method is based on group consensus approach, which constructs group utility function (or its density) based on uncertainty of each sensor, and the loc...A new method of multi sensor location data fusion is proposed.The method is based on group consensus approach, which constructs group utility function (or its density) based on uncertainty of each sensor, and the location estimation is obtained based on the group utility function (or its density). The simulation results show that the method is better than those of mean and median estimation, and outlier and sensor failure can not affect the location estimation.展开更多
Machine-type communication (MTC) devices provide a broad range of data collection especially on the massive data generated environments such as urban, industrials and event-enabled areas. In dense deployments, the dat...Machine-type communication (MTC) devices provide a broad range of data collection especially on the massive data generated environments such as urban, industrials and event-enabled areas. In dense deployments, the data collected at the closest locations between the MTC devices are spatially correlated. In this paper, we propose a k-means grouping technique to combine all MTC devices based on spatially correlated. The MTC devices collect the data on the event-based area and then transmit to the centralized aggregator for processing and computing. With the limitation of computational resources at the centralized aggregator, some grouped MTC devices data offloaded to the nearby base station collocated with the mobile edge-computing server. As a sensing capability adopted on MTC devices, we use a power exponential function model to compute a correlation coefficient existing between the MTC devices. Based on this framework, we compare the energy consumption when all data processed locally at centralized aggregator or offloaded at mobile edge computing server with optimal solution obtained by the brute force method. Then, the simulation results revealed that the proposed k-means grouping technique reduce the energy consumption at centralized aggregator while satisfying the required completion time.展开更多
Accurate and reliable nuclear data libraries are essential for calculation and design of advanced nuclea systems. A 1200 fine group nuclear data library Hybrid Evaluated Nuclear Data Library/Fine Group(HENDL/FG with n...Accurate and reliable nuclear data libraries are essential for calculation and design of advanced nuclea systems. A 1200 fine group nuclear data library Hybrid Evaluated Nuclear Data Library/Fine Group(HENDL/FG with neutrons of up to 150 Me V has been developed to improve the accuracy of neutronics calculations and anal ysis. Corrections of Doppler, resonance self-shielding, and thermal upscatter effects were done for HENDL/FG Shielding and critical safety benchmarks were performed to test the accuracy and reliability of the library. The dis crepancy between calculated and measured nuclea parameters fell into a reasonable range.展开更多
To bridge the performance gap between original probability data association (PDA) algorithm and the optimum maximum a posterior (MAP) algorithm for multi-input multi-output (MIMO) detection, a grouped PDA (GP-...To bridge the performance gap between original probability data association (PDA) algorithm and the optimum maximum a posterior (MAP) algorithm for multi-input multi-output (MIMO) detection, a grouped PDA (GP-PDA) detection algorithm is proposed. The proposed GP-PDA method divides all the transmit antennas into groups, and then updates the symbol probabilities group by group using PDA computations. In each group, joint a posterior probability (APP) is computed to obtain the APP of a single symbol in this group, like the MAP algorithm. Such new algorithm combines the characters of MAP and PDA. MAP and original PDA algorithm can be regarded as a special case of the proposed GP-PDA. Simulations show that the proposed GP-PDA provides a performance and complexity trade, off between original PDA and MAP algorithm.展开更多
In this paper, the weighted Kolmogrov-Smirnov, Cramer von-Miss and the Anderson Darling test statistics are considered as goodness of fit tests for the generalized Rayleigh interval grouped data. An extensive simulati...In this paper, the weighted Kolmogrov-Smirnov, Cramer von-Miss and the Anderson Darling test statistics are considered as goodness of fit tests for the generalized Rayleigh interval grouped data. An extensive simulation process is conducted to evaluate their controlling of type 1 error and their power functions. Generally, the weighted Kolmogrov-Smirnov test statistics show a relatively better performance than both, the Cramer von-Miss and the Anderson Darling test statistics. For large sample values, the Anderson Darling test statistics cannot control type 1 error but for relatively small sample values it indicates a better performance than the Cramer von-Miss test statistics. Best selection of the test statistics and highlights for future studies are also explored.展开更多
This paper proposes the use of Group Method of Data Handling (GMDH) technique for modeling Magneto-Rheological (MR) dampers in the context of system identification. GMDH is a multilayer network of quadratic neurons th...This paper proposes the use of Group Method of Data Handling (GMDH) technique for modeling Magneto-Rheological (MR) dampers in the context of system identification. GMDH is a multilayer network of quadratic neurons that offers an effective solution to modeling non-linear systems. As such, we propose the use of GMDH to approximate the forward and inverse dynamic behaviors of MR dampers. We also introduce two enhanced GMDH-based solutions. Firstly, a two-tier architecture is proposed whereby an enhanced GMD model is generated by the aid of a feedback scheme. Secondly, stepwise regression is used as a feature selection method prior to GMDH modeling. The proposed enhancements to GMDH are found to offer improved prediction results in terms of reducing the root-mean-squared error by around 40%.展开更多
Within the framework of the Deep-time Digital Earth(DDE)project,thematic databases driven by scientific issues will have strong scientific vitality.In the field of sedimentology,thematic databases based on the current...Within the framework of the Deep-time Digital Earth(DDE)project,thematic databases driven by scientific issues will have strong scientific vitality.In the field of sedimentology,thematic databases based on the current unified sedimentary knowledge tree established by the Sedimentary Data Group(Fig.1),can solve specific scientific problems effectively and improve the scope and utility of the DDE platform significantly.展开更多
We used simulated data to investigate both the small and large sample properties of the within-groups (WG) estimator and the first difference generalized method of moments (FD-GMM) estimator of a dynamic panel data (D...We used simulated data to investigate both the small and large sample properties of the within-groups (WG) estimator and the first difference generalized method of moments (FD-GMM) estimator of a dynamic panel data (DPD) model. The magnitude of WG and FD-GMM estimates are almost the same for square panels. WG estimator performs best for long panels such as those with time dimension as large as 50. The advantage of FD-GMM estimator however, is observed on panels that are long and wide, say with time dimension at least 25 and cross-section dimension size of at least 30. For small-sized panels, the two methods failed since their optimality was established in the context of asymptotic theory. We developed parametric bootstrap versions of WG and FD-GMM estimators. Simulation study indicates the advantages of the bootstrap methods under small sample cases on the assumption that variances of the individual effects and the disturbances are of similar magnitude. The boostrapped WG and FD-GMM estimators are optimal for small samples.展开更多
This paper simultaneously investigates variable selection and imputation estimation of semiparametric partially linear varying-coefficient model in that case where there exist missing responses for cluster data. As is...This paper simultaneously investigates variable selection and imputation estimation of semiparametric partially linear varying-coefficient model in that case where there exist missing responses for cluster data. As is well known, commonly used approach to deal with missing data is complete-case data. Combined the idea of complete-case data with a discussion of shrinkage estimation is made on different cluster. In order to avoid the biased results as well as improve the estimation efficiency, this article introduces Group Least Absolute Shrinkage and Selection Operator (Group Lasso) to semiparametric model. That is to say, the method combines the approach of local polynomial smoothing and the Least Absolute Shrinkage and Selection Operator. In that case, it can conduct nonparametric estimation and variable selection in a computationally efficient manner. According to the same criterion, the parametric estimators are also obtained. Additionally, for each cluster, the nonparametric and parametric estimators are derived, and then compute the weighted average per cluster as finally estimators. Moreover, the large sample properties of estimators are also derived respectively.展开更多
Online monitoring methods have been widely used in many major devices, however the normal and abnormal states of equipment are estimated mainly based on the monitoring results whether monitored parameters exceed the s...Online monitoring methods have been widely used in many major devices, however the normal and abnormal states of equipment are estimated mainly based on the monitoring results whether monitored parameters exceed the setting thresholds. Using these monitoring methods may cause serious false positive or false negative results. In order to precisely monitor the state of equipment, the problem of abnormality degree detection without fault sample is studied with a new detection method called negative potential field group detectors(NPFG-detectors). This method achieves the quantitative expression of abnormality degree and provides the better detection results compared with other methods. In the process of Iris data set simulation, the new algorithm obtains the successful results in abnormal detection. The detection rates for 3 types of Iris data set respectively reach 100%, 91.6%, and 95.24% with 50% training samples. The problem of Bearing abnormality degree detection via an abnormality degree curve is successfully solved.展开更多
Online learning is a very important means of study, and has been adopted in many countries worldwide. However, only recently are researchers able to collect and analyze massive online learning datasets due to the COVI...Online learning is a very important means of study, and has been adopted in many countries worldwide. However, only recently are researchers able to collect and analyze massive online learning datasets due to the COVID-19 epidemic. In this article, we analyze the difference between online learner groups by using an unsupervised machine learning technique, i.e., k-prototypes clustering. Specifically, we use questionnaires designed by domain experts to collect various online learning data, and investigate students’ online learning behavior and learning outcomes through analyzing the collected questionnaire data. Our analysis results suggest that students with better learning media generally have better online learning behavior and learning result than those with poor online learning media. In addition, both in economically developed or undeveloped regions, the number of students with better learning media is less than the number of students with poor learning media. Finally, the results presented here show that whether in an economically developed or an economically undeveloped region, the number of students who are enriched with learning media available is an important factor that affects online learning behavior and learning outcomes.展开更多
文摘Accurate gas viscosity determination is an important issue in the oil and gas industries.Experimental approaches for gas viscosity measurement are timeconsuming,expensive and hardly possible at high pressures and high temperatures(HPHT).In this study,a number of correlations were developed to estimate gas viscosity by the use of group method of data handling(GMDH)type neural network and gene expression programming(GEP)techniques using a large data set containing more than 3000 experimental data points for methane,nitrogen,and hydrocarbon gas mixtures.It is worth mentioning that unlike many of viscosity correlations,the proposed ones in this study could compute gas viscosity at pressures ranging between 34 and 172 MPa and temperatures between 310 and 1300 K.Also,a comparison was performed between the results of these established models and the results of ten wellknown models reported in the literature.Average absolute relative errors of GMDH models were obtained 4.23%,0.64%,and 0.61%for hydrocarbon gas mixtures,methane,and nitrogen,respectively.In addition,graphical analyses indicate that the GMDH can predict gas viscosity with higher accuracy than GEP at HPHT conditions.Also,using leverage technique,valid,suspected and outlier data points were determined.Finally,trends of gas viscosity models at different conditions were evaluated.
文摘This paper proposes the solution of tasks set required for autonomous robotic group behavior optimization during the mission on a distributed area in a cluttered hazardous terrain.The navigation scheme uses the benefits of the original real-time technical vision system(TVS)based on a dynamic triangulation principle.The method uses TVS output data with fuzzy logic rules processing for resolution stabilization.Based on previous researches,the dynamic communication network model is modified to implement the propagation of information with a feedback method for more stable data exchange inside the robotic group.According to the comparative analysis of approximation methods,in this paper authors are proposing to use two-steps post-processing path planning aiming to get a smooth and energy-saving trajectory.The article provides a wide range of studies and computational experiment results for different scenarios for evaluation of common cloud point influence on robotic motion planning.
文摘Consider the bivariate exponential distribution due to Marshall and Olkin[2], whose survival function is F(x, g) = exp[-λ1x-λ2y-λ12 max(x, y)] (x 0,y 0)with unknown Parameters λ1 > 0, λ2 > 0 and λ12 0.Based on grouped data, a newestimator for λ1, λ2 and λ12 is derived and its asymptotic properties are discussed.Besides, some test procedures of equal marginals and independence are given. Asimulation result is given, too.
文摘A new method of multi sensor location data fusion is proposed.The method is based on group consensus approach, which constructs group utility function (or its density) based on uncertainty of each sensor, and the location estimation is obtained based on the group utility function (or its density). The simulation results show that the method is better than those of mean and median estimation, and outlier and sensor failure can not affect the location estimation.
文摘Machine-type communication (MTC) devices provide a broad range of data collection especially on the massive data generated environments such as urban, industrials and event-enabled areas. In dense deployments, the data collected at the closest locations between the MTC devices are spatially correlated. In this paper, we propose a k-means grouping technique to combine all MTC devices based on spatially correlated. The MTC devices collect the data on the event-based area and then transmit to the centralized aggregator for processing and computing. With the limitation of computational resources at the centralized aggregator, some grouped MTC devices data offloaded to the nearby base station collocated with the mobile edge-computing server. As a sensing capability adopted on MTC devices, we use a power exponential function model to compute a correlation coefficient existing between the MTC devices. Based on this framework, we compare the energy consumption when all data processed locally at centralized aggregator or offloaded at mobile edge computing server with optimal solution obtained by the brute force method. Then, the simulation results revealed that the proposed k-means grouping technique reduce the energy consumption at centralized aggregator while satisfying the required completion time.
基金supported by the Natural Science Foundation of China(Nos.11405204 11305205 and 10675123)Special Program for Informatization of Chinese Academy of Sciences(No.XXH12504-1-09)the National Special Program for ITER(No.2014GB1120001)
文摘Accurate and reliable nuclear data libraries are essential for calculation and design of advanced nuclea systems. A 1200 fine group nuclear data library Hybrid Evaluated Nuclear Data Library/Fine Group(HENDL/FG with neutrons of up to 150 Me V has been developed to improve the accuracy of neutronics calculations and anal ysis. Corrections of Doppler, resonance self-shielding, and thermal upscatter effects were done for HENDL/FG Shielding and critical safety benchmarks were performed to test the accuracy and reliability of the library. The dis crepancy between calculated and measured nuclea parameters fell into a reasonable range.
基金Sponsored by the National Natural Science Foundation of China(60572120)
文摘To bridge the performance gap between original probability data association (PDA) algorithm and the optimum maximum a posterior (MAP) algorithm for multi-input multi-output (MIMO) detection, a grouped PDA (GP-PDA) detection algorithm is proposed. The proposed GP-PDA method divides all the transmit antennas into groups, and then updates the symbol probabilities group by group using PDA computations. In each group, joint a posterior probability (APP) is computed to obtain the APP of a single symbol in this group, like the MAP algorithm. Such new algorithm combines the characters of MAP and PDA. MAP and original PDA algorithm can be regarded as a special case of the proposed GP-PDA. Simulations show that the proposed GP-PDA provides a performance and complexity trade, off between original PDA and MAP algorithm.
文摘In this paper, the weighted Kolmogrov-Smirnov, Cramer von-Miss and the Anderson Darling test statistics are considered as goodness of fit tests for the generalized Rayleigh interval grouped data. An extensive simulation process is conducted to evaluate their controlling of type 1 error and their power functions. Generally, the weighted Kolmogrov-Smirnov test statistics show a relatively better performance than both, the Cramer von-Miss and the Anderson Darling test statistics. For large sample values, the Anderson Darling test statistics cannot control type 1 error but for relatively small sample values it indicates a better performance than the Cramer von-Miss test statistics. Best selection of the test statistics and highlights for future studies are also explored.
文摘This paper proposes the use of Group Method of Data Handling (GMDH) technique for modeling Magneto-Rheological (MR) dampers in the context of system identification. GMDH is a multilayer network of quadratic neurons that offers an effective solution to modeling non-linear systems. As such, we propose the use of GMDH to approximate the forward and inverse dynamic behaviors of MR dampers. We also introduce two enhanced GMDH-based solutions. Firstly, a two-tier architecture is proposed whereby an enhanced GMD model is generated by the aid of a feedback scheme. Secondly, stepwise regression is used as a feature selection method prior to GMDH modeling. The proposed enhancements to GMDH are found to offer improved prediction results in terms of reducing the root-mean-squared error by around 40%.
文摘Within the framework of the Deep-time Digital Earth(DDE)project,thematic databases driven by scientific issues will have strong scientific vitality.In the field of sedimentology,thematic databases based on the current unified sedimentary knowledge tree established by the Sedimentary Data Group(Fig.1),can solve specific scientific problems effectively and improve the scope and utility of the DDE platform significantly.
文摘We used simulated data to investigate both the small and large sample properties of the within-groups (WG) estimator and the first difference generalized method of moments (FD-GMM) estimator of a dynamic panel data (DPD) model. The magnitude of WG and FD-GMM estimates are almost the same for square panels. WG estimator performs best for long panels such as those with time dimension as large as 50. The advantage of FD-GMM estimator however, is observed on panels that are long and wide, say with time dimension at least 25 and cross-section dimension size of at least 30. For small-sized panels, the two methods failed since their optimality was established in the context of asymptotic theory. We developed parametric bootstrap versions of WG and FD-GMM estimators. Simulation study indicates the advantages of the bootstrap methods under small sample cases on the assumption that variances of the individual effects and the disturbances are of similar magnitude. The boostrapped WG and FD-GMM estimators are optimal for small samples.
文摘This paper simultaneously investigates variable selection and imputation estimation of semiparametric partially linear varying-coefficient model in that case where there exist missing responses for cluster data. As is well known, commonly used approach to deal with missing data is complete-case data. Combined the idea of complete-case data with a discussion of shrinkage estimation is made on different cluster. In order to avoid the biased results as well as improve the estimation efficiency, this article introduces Group Least Absolute Shrinkage and Selection Operator (Group Lasso) to semiparametric model. That is to say, the method combines the approach of local polynomial smoothing and the Least Absolute Shrinkage and Selection Operator. In that case, it can conduct nonparametric estimation and variable selection in a computationally efficient manner. According to the same criterion, the parametric estimators are also obtained. Additionally, for each cluster, the nonparametric and parametric estimators are derived, and then compute the weighted average per cluster as finally estimators. Moreover, the large sample properties of estimators are also derived respectively.
基金Supported by National Natural Science Foundation of China(Grant No.51175316)Specialized Research Fund for the Doctoral Program of Higher Education,China(Grant No.20103108110006)Basic Research Project of Shanghai Science and Technology Commission,China(Grant No.11JC1404100)
文摘Online monitoring methods have been widely used in many major devices, however the normal and abnormal states of equipment are estimated mainly based on the monitoring results whether monitored parameters exceed the setting thresholds. Using these monitoring methods may cause serious false positive or false negative results. In order to precisely monitor the state of equipment, the problem of abnormality degree detection without fault sample is studied with a new detection method called negative potential field group detectors(NPFG-detectors). This method achieves the quantitative expression of abnormality degree and provides the better detection results compared with other methods. In the process of Iris data set simulation, the new algorithm obtains the successful results in abnormal detection. The detection rates for 3 types of Iris data set respectively reach 100%, 91.6%, and 95.24% with 50% training samples. The problem of Bearing abnormality degree detection via an abnormality degree curve is successfully solved.
文摘Online learning is a very important means of study, and has been adopted in many countries worldwide. However, only recently are researchers able to collect and analyze massive online learning datasets due to the COVID-19 epidemic. In this article, we analyze the difference between online learner groups by using an unsupervised machine learning technique, i.e., k-prototypes clustering. Specifically, we use questionnaires designed by domain experts to collect various online learning data, and investigate students’ online learning behavior and learning outcomes through analyzing the collected questionnaire data. Our analysis results suggest that students with better learning media generally have better online learning behavior and learning result than those with poor online learning media. In addition, both in economically developed or undeveloped regions, the number of students with better learning media is less than the number of students with poor learning media. Finally, the results presented here show that whether in an economically developed or an economically undeveloped region, the number of students who are enriched with learning media available is an important factor that affects online learning behavior and learning outcomes.