Research on strain anomalies and large earthquakes based on temporal and spatial crustal activities has been rapidly growing due to data availability, especially in Japan and Indonesia. However, many research works us...Research on strain anomalies and large earthquakes based on temporal and spatial crustal activities has been rapidly growing due to data availability, especially in Japan and Indonesia. However, many research works used local-scale case studies that focused on a specific earthquake characteristic using knowledgedriven techniques, such as crustal deformation analysis. In this study, a data-driven-based analysis is used to detect anomalies using displacement rates and deformation pattern features extracted from daily global navigation satellite system(GNSS) data using a machine learning algorithm. The GNSS data with188 and 1181 continuously operating reference stations from Indonesia and Japan, respectively, are used to identify the anomaly of recent major earthquakes in the last two decades. Feature displacement rates and deformation patterns are processed in several window times with 2560 experiment scenarios to produce the best detection using tree-based algorithms. Tree-based algorithms with a single estimator(decision tree), ensemble bagging(bagging, random forest and Extra Trees), and ensemble boosting(AdaBoost, gradient boosting, LGBM, and XGB) are applied in the study. The experiment test using realtime scenario GNSSdailydatareveals high F1-scores and accuracy for anomaly detection using slope windowing 365 and 730 days of 91-day displacement rates and then 7-day deformation pattern features in tree-based algorithms. The results show the potential for medium-term anomaly detection using GNSS data without the need for multiple vulnerability assessments.展开更多
Wireless Mesh Network (WMN) is seen as an effective Intemet access solution for dynamic wireless applications. For the low mobility of mesh routers in WMN, the backbone topography can be effectively maintained by pr...Wireless Mesh Network (WMN) is seen as an effective Intemet access solution for dynamic wireless applications. For the low mobility of mesh routers in WMN, the backbone topography can be effectively maintained by proactive routing protocol. Pre-proposals like Tree Based Routing (TBR) protocol and Root Driven Routing (RDR) protocol are so centralized that they make the gateway becorre a bottleneck which severely restricts the network performance. We proposed an Optimized Tree-based Routing (OTR) protocol that logically separated the proactive tree into pieces. Route is partly computed by the branches instead of root. We also discussed the operation of multipie Intemet gateways which is a main issue in WMN. The new proposal lightens the load in root, reduces the overhead and improves the throughput. Numerical analysis and simulation results confirm that the perforrmnce of WMN is improved and OTR is more suitable for large scale WMN.展开更多
The sensor virus is a serious threat,as an attacker can simply send a single packet to compromise the entire sensor network.Epidemics become drastic with link additions among sensors when the small world phenomena occ...The sensor virus is a serious threat,as an attacker can simply send a single packet to compromise the entire sensor network.Epidemics become drastic with link additions among sensors when the small world phenomena occur.Two immunization strategies,uniform immunization and temporary immunization,are conducted on small worlds of tree-based wireless sensor networks to combat the sensor viruses.With the former strategy,the infection extends exponentially,although the immunization effectively reduces the contagion speed.With the latter strategy,recurrent contagion oscillations occur in the small world when the spatial-temporal dynamics of the epidemic are considered.The oscillations come from the small-world structure and the temporary immunization.Mathematical analyses on the small world of the Cayley tree are presented to reveal the epidemic dynamics with the two immunization strategies.展开更多
Recently,so-called tree-based phylogenetic networks have attracted considerable attention.These networks can be constructed from a phylogenetic tree,called the base tree,by adding additional edges.The primary aim of t...Recently,so-called tree-based phylogenetic networks have attracted considerable attention.These networks can be constructed from a phylogenetic tree,called the base tree,by adding additional edges.The primary aim of this study is to provide sufficient criteria for tree-basedness by reducing phylogenetic networks to related graph structures.Even though it is generally known that determining whether a network is tree-based is an NP-complete problem,one of these criteria,namely edge-basedness,can be verified in linear time.Surprisingly,the class of edgebased networks is closely related to a well-known family of graphs,namely,the class of generalized series-parallel graphs,and we explore this relationship in full detail.Additionally,we introduce further classes of tree-based networks and analyze their relationships.展开更多
This article proposes the high-speed and high-accuracy code clone detection method based on the combination of tree-based and token-based methods. Existence of duplicated program codes, called code clone, is one of th...This article proposes the high-speed and high-accuracy code clone detection method based on the combination of tree-based and token-based methods. Existence of duplicated program codes, called code clone, is one of the main factors that reduces the quality and maintainability of software. If one code fragment contains faults (bugs) and they are copied and modified to other locations, it is necessary to correct all of them. But it is not easy to find all code clones in large and complex software. Much research efforts have been done for code clone detection. There are mainly two methods for code clone detection. One is token-based and the other is tree-based method. Token-based method is fast and requires less resources. However it cannot detect all kinds of code clones. Tree-based method can detect all kinds of code clones, but it is slow and requires much computing resources. In this paper combination of these two methods was proposed to improve the efficiency and accuracy of detecting code clones. Firstly some candidates of code clones will be extracted by token-based method that is fast and lightweight. Then selected candidates will be checked more precisely by using tree-based method that can find all kinds of code clones. The prototype system was developed. This system accepts source code and tokenizes it in the first step. Then token-based method is applied to this token sequence to find candidates of code clones. After extracting several candidates, selected source codes will be converted into abstract syntax tree (AST) for applying tree-based method. Some sample source codes were used to evaluate the proposed method. This evaluation proved the improvement of efficiency and precision of code clones detecting.展开更多
Towards the crossing and coupling permissions in tasks existed widely in many fields and considering the design of role view must rely on the activities of the tasks process,based on Role Based Accessing Control (RBAC...Towards the crossing and coupling permissions in tasks existed widely in many fields and considering the design of role view must rely on the activities of the tasks process,based on Role Based Accessing Control (RBAC) model,this paper put forward a Role Tree-Based Access Control (RTBAC) model. In addition,the model definition and its constraint formal description is also discussed in this paper. RTBAC model is able to realize the dynamic organizing,self-determination and convenience of the design of role view,and guarantee the least role permission when task separating in the mean time.展开更多
With the increasing interest in e-commerce shopping, customer reviews have become one of the most important elements that determine customer satisfaction regarding products. This demonstrates the importance of working...With the increasing interest in e-commerce shopping, customer reviews have become one of the most important elements that determine customer satisfaction regarding products. This demonstrates the importance of working with Text Mining. This study is based on The Women’s Clothing E-Commerce Reviews database, which consists of reviews written by real customers. The aim of this paper is to conduct a Text Mining approach on a set of customer reviews. Each review was classified as either a positive or negative review by employing a classification method. Four tree-based methods were applied to solve the classification problem, namely Classification Tree, Random Forest, Gradient Boosting and XGBoost. The dataset was categorized into training and test sets. The results indicate that the Random Forest method displays an overfitting, XGBoost displays an overfitting if the number of trees is too high, Classification Tree is good at detecting negative reviews and bad at detecting positive reviews and the Gradient Boosting shows stable values and quality measures above 77% for the test dataset. A consensus between the applied methods is noted for important classification terms.展开更多
Tunnel Boring Machines(TBMs)are vital for tunnel and underground construction due to their high safety and efficiency.Accurately predicting TBM operational parameters based on the surrounding environment is crucial fo...Tunnel Boring Machines(TBMs)are vital for tunnel and underground construction due to their high safety and efficiency.Accurately predicting TBM operational parameters based on the surrounding environment is crucial for planning schedules and managing costs.This study investigates the effectiveness of tree-based machine learning models,including Random Forest,Extremely Randomized Trees,Adaptive Boosting Machine,Gradient Boosting Machine,Extreme Gradient Boosting Machine(XGBoost),Light Gradient Boosting Machine,and CatBoost,in predicting the Penetration Rate(PR)of TBMs by considering rock mass and material characteristics.These techniques are able to provide a good relationship between input(s)and output parameters;hence,obtaining a high level of accuracy.To do that,a comprehensive database comprising various rock mass and material parameters,including Rock Mass Rating,Brazilian Tensile Strength,and Weathering Zone,was utilized for model development.The practical application of these models was assessed with a new dataset representing diverse rock mass and material properties.To evaluate model performance,ranking systems and Taylor diagrams were employed.CatBoost emerged as the most accurate model during training and testing,with R2 scores of 0.927 and 0.861,respectively.However,during validation,XGBoost demonstrated superior performance with an R2 of 0.713.Despite these variations,all tree-based models showed promising accuracy in predicting TBM performance,providing valuable insights for similar projects in the future.展开更多
This paper proposes a tree-based backoff (TBB) protocol that reduces the number of iterations implemented in the procedure of tag collision arbitration in radio frequency identification (RFID) systems. This is ach...This paper proposes a tree-based backoff (TBB) protocol that reduces the number of iterations implemented in the procedure of tag collision arbitration in radio frequency identification (RFID) systems. This is achieved by employing the following mechanisms: one is send the request command iteratively to all tags in the interrogation zone until a single tag is identified. The other is backward to the parent node instead of root node to obtain the request parameters and send the request command again until all tags are identified. Compared with the traditional tree-based protocol, on average, simulated results show that the TBB protocol reduces the number of the iterations by 72.3% and the identification delay by 58.6% and achieves the goal of fast tag identification.展开更多
Hybrid Peer-to-Peer (P2P) systems that construct overlay networks structured among superpeers have great potential in that they can give the benefits such as scalability, search speed and network traffic, taking adv...Hybrid Peer-to-Peer (P2P) systems that construct overlay networks structured among superpeers have great potential in that they can give the benefits such as scalability, search speed and network traffic, taking advantages of superpeer-based and the structured P2P systems. In this article, we enhance keyword search in hybrid P2P systems by constructing a tree-based index overlay among directory nodes that maintain indices, according to the load and popularity of a keyword. The mathematical analysis shows that the keyword search based on semi-structured P2P overlay can improve the search performance, reducing the message traffic and maintenance costs.展开更多
In the mining industry,precise forecasting of rock fragmentation is critical for optimising blasting processes.In this study,we address the challenge of enhancing rock fragmentation assessment by developing a novel hy...In the mining industry,precise forecasting of rock fragmentation is critical for optimising blasting processes.In this study,we address the challenge of enhancing rock fragmentation assessment by developing a novel hybrid predictive model named GWO-RF.This model combines the grey wolf optimization(GWO)algorithm with the random forest(RF)technique to predict the D_(80)value,a critical parameter in evaluating rock fragmentation quality.The study is conducted using a dataset from Sarcheshmeh Copper Mine,employing six different swarm sizes for the GWO-RF hybrid model construction.The GWO-RF model’s hyperparameters are systematically optimized within established bounds,and its performance is rigorously evaluated using multiple evaluation metrics.The results show that the GWO-RF hybrid model has higher predictive skills,exceeding traditional models in terms of accuracy.Furthermore,the interpretability of the GWO-RF model is enhanced through the utilization of SHapley Additive exPlanations(SHAP)values.The insights gained from this research contribute to optimizing blasting operations and rock fragmentation outcomes in the mining industry.展开更多
Barshore is a small village in the Pishin District,Balochistan,Pakistan,with dry summers and cold rainy winters.This is an agrarian region,mostly with orchards of various fruit trees.This study investigated the physic...Barshore is a small village in the Pishin District,Balochistan,Pakistan,with dry summers and cold rainy winters.This is an agrarian region,mostly with orchards of various fruit trees.This study investigated the physico-chemical properties and macrofauna of soils under various agricultural management practices of this region.The concentrations of soil organic matter(SOM),soil organic carbon(SOC),nutrients,pH,electrical conductivity,soil texture,and the abundance and number of species of soil macrofauna of the agricultural fields were measured.Fifteen agricultural fields were sampled.Fourteen fields were orchards of apple,apricot or the mixture of apple and apricot trees and one field was a cropland,cultivated with wheat as a monocrop.The orchards were under conservation agricultural practices;whereas,the cropland was under conventional management.These agricultural lands were 2-26 years old.The concentration of soil organic matter(SOM)in the upper 0-10 cm depth of these field sites ranged from 11.6 g kg^(-1)to 32.8 g kg^(-1)soil.As compared to cropland,orchards had significantly higher concentration of SOM and SOC.A total of 18 soil macrofauna species were found and the most common and abundant were ants(Monomorium minimum,Camponotus pennsylvanicus,Solenopsis invicta,and Lasius niger)followed by Arion ssp.(Brown Slug)and earthworm Lumbricus terrestris.Regression analysis revealed non-significant relationship of the age and the concentration of SOM with the number of macrofauna species and with the concentrations of total mineral nitrogen,bioavailable phosphorus and clay.The existence of ants had no relationship with the concentration of SOM;whereas,existence of Lumbricus terrestris tended to had a positive relationship with the concentration of SOM.The field of tree-based intercropping system was 2 years of age since the land was converted from rangeland to a cropland,had two ant species coexisting.This indicates the positive influence of crop diversification on soil macrofauna.展开更多
In recent years there has been an increasing interest in developing spatial statistical models for data sets that are seemingly spatially independent.This lack of spatial structure makes it difficult,if not impossible...In recent years there has been an increasing interest in developing spatial statistical models for data sets that are seemingly spatially independent.This lack of spatial structure makes it difficult,if not impossible to use optimal predictors such as ordinary kriging for modeling the spatial variability in the data.In many instances,the data still contain a wealth of information that could be used to gain flexibility and precision in estimation.In this paper we propose using a combination of regression analysis to describe the large-scale spatial variability in a set of survey data and a tree-based stratification design to enhance the estimation process of the small-scale spatial variability.With this approach,sample units(i.e.,pixel of a satellite image) are classified with respect to predictions of error attributes into homogeneous classes,and the classes are then used as strata in the stratified analysis.Independent variables used as a basis of stratification included terrain data and satellite imagery.A decision rule was used to identify a tree size that minimized the error in estimating the variance of the mean response and prediction uncertainties at new spatial locations.This approach was applied to a set of n=937 forested plots from a state-wide inventory conducted in 2006 in the Mexican State of Jalisco.The final models accounted for 62% to 82% of the variability observed in canopy closure(%),basal area(m2·ha-1),cubic volumes(m3·ha-1) and biomass(t·ha-1) on the sample plots.The spatial models provided unbiased estimates and when averaged over all sample units in the population,estimates of forest structure were very close to those obtained using classical estimates based on the sampling strategy used in the state-wide inventory.The spatial models also provided unbiased estimates of model variances leading to confidence and prediction coverage rates close to the 0.95 nominal rate.展开更多
This study introduces a generic framework for geotechnical subsurface modeling, which accounts for spatial autocorrelation with local mapping machine learning(ML) methods. Instead of using XY coordinate fields directl...This study introduces a generic framework for geotechnical subsurface modeling, which accounts for spatial autocorrelation with local mapping machine learning(ML) methods. Instead of using XY coordinate fields directly as model input, a series of autocorrelated geotechnical distance fields(GDFs) is designed to enable the ML models to infer the spatial relationship between the sampled locations and unknown locations. The whole framework using GDF with ML methods is named GDF-ML. This framework is purely data-driven which avoids the tedious work in the scale of fluctuations(SOFs)estimating and data detrending in the conventional spatial interpolation methods. Six local mapping ML methods(extra trees(ETs), gradient boosting(GB), extreme gradient boosting(XGBoost), random forest(RF), general regression neural network(GRNN) and k-nearest neighbors(KNN)) are compared in the GDF-ML framework. The results show that the GDFs are better than the conventional XY coordinate fields based ML methods in both accuracy and spatial continuity. GDF-ML is flexible which can be applied to high-dimensional, multi-variable and incomplete datasets. Among these six methods, GDF with ET method(GDF-ET) clearly shows the best accuracy and best spatial continuity. The proposed GDF-ET method can provide a fast and accurate interpretation of the soil property profile. Sensitivity analysis shows that this method is applicable to very small training dataset size. The associated statistical uncertainty can also be quantified so that the reliability of the subsurface modeling results can be estimated objectively and explicitly. The uncertainty results clearly show that the prediction becomes more accurate when more sampled data are available.展开更多
This paper addresses an interesting security problem in wireless ad hoc networks: the dynamic group key agreement key establishment. For secure group communication in an ad hoc network, a group key shared by all group...This paper addresses an interesting security problem in wireless ad hoc networks: the dynamic group key agreement key establishment. For secure group communication in an ad hoc network, a group key shared by all group members is required. This group key should be updated when there are membership changes (when the new member joins or current member leaves) in the group. In this paper, we propose a novel, secure, scalable and efficient region-based group key agreement protocol for ad hoc networks. This is implemented by a two-level structure and a new scheme of group key update. The idea is to divide the group into subgroups, each maintaining its subgroup keys using group elliptic curve diffie-hellman (GECDH) Protocol and links with other subgroups in a tree structure using tree-based group elliptic curve diffie-hellman (TGECDH) protocol. By introducing region-based approach, messages and key updates will be limited within subgroup and outer group;hence computation load is distributed to many hosts. Both theoretical analysis and experimental results show that this Region-based key agreement protocol performs well for the key establishment problem in ad hoc network in terms of memory cost, computation cost and communication cost.展开更多
In wireless sensor networks, topology control plays an important role for data forwarding efficiency in the data gathering applications. In this paper, we present a novel topology control and data forwarding mechanism...In wireless sensor networks, topology control plays an important role for data forwarding efficiency in the data gathering applications. In this paper, we present a novel topology control and data forwarding mechanism called REMUDA, which is designed for a practical indoor parking lot management system. REMUDA forms a tree-based hierarchical network topology which brings as many nodes as possible to be leaf nodes and constructs a virtual cluster structure. Meanwhile, it takes the reliability, stability and path length into account in the tree construction process. Through an experiment in a network of 30 real sensor nodes, we evaluate the performance of REMUDA and compare it with LEPS which is also a practical routing protocol in TinyOS. Experiment results show that REMUDA can achieve better performance than LEPS.展开更多
Field experiments were conducted at the experimental farm Cocoa Re-search Institute of Nigeria (CRIN) Sub-Station, Ochaja, in the Southern Guinea Savannaagro ecological zone of Nigeria to examine uptake and use effici...Field experiments were conducted at the experimental farm Cocoa Re-search Institute of Nigeria (CRIN) Sub-Station, Ochaja, in the Southern Guinea Savannaagro ecological zone of Nigeria to examine uptake and use efficien-cies of nutrients by Sesame and Bambara nut alley crops as influenced by manuring in a Cashew-based intercropping system. Experimental treatments were based on responses of sole and intercrop mixtures of Sesame and Bam-bara nut alley crops to Cocoa Pod Husk (CPH), pelletized organic fertilizer and NPK fertilizer in a cashew-based intercropping system. Data were collected on the growth and yield variables of the alley crops. Highest nitrogen harvest in-dex (NHI) for seed and leaf of alley crops were obtained from un-manure treated plants. Cocoa pod husk (CPH) significantly enhanced P uptake com-pared with other fertilizers applied. CPH improved Na, Ca, Mg Zn, Cu, P, K and carbohydrate in the leaves and Ca, Mg, Zn, Fe, Cu, crude fibre and car-bohydrate contents of seeds of sole crops while Sesame + Bambara had en-hanced contents of N, Ca, Mg, Zn, Cu, P, N, K, moisture, protein, and crude fi-bre, crude protein, moisture content in leaves. The effects of NPK were signifi-cant for N, K Ca, Zn, Fe, Cu, P, moisture and crude fibre, while in the un-manure (control) plots influenced N, fat and protein and nitrogen harvest index (NHI) of leaf and seeds. CPH and NPK fertilizers enhanced nutrient up-take and nitrogen harvest index of alley crops. Nutrient uptake was similar for the varieties of Sesame and Bambara nut as affected by the application of 4.84 and 9.68 Kg pelletized organic fertilizer. Sole Bambara had higher N and K concentration in leaves compared with Bambara +Sesame. In addition, sole Bambara had higher values of Physiology efficiency (PE), and fertilizer use ef-ficiency (FAE) compared to the mixed crops of Bambara + sesame. However, physiology efficiency (PE), and fertilizer use efficiency (FAE) were significantly lower for Bambara + Sesame. The un-manure plants had enhanced N, P and K uptake. Varietal effects were pronounced for most of the resource use effi-ciency variables measured. The alley crop varieties responded differently to 4.84 and 9.68 kg pelletized fertilizer treatments (Agronomy Efficiency (AE), N-removed at harvest and Internal Utilization Efficiency (IE) and partial fac-tor productivity (PFP)). Sesame variety NCRIBen04E had enhanced AE, N-remove at harvest, IE and PFP while variety E8 had significantly higher ap-parent Recovery Efficiency (RE), apparent Recovery Efficiency by difference (RE%), Physiology Efficiency (PE), Utilization Efficiency (UE), and internal Utilization Efficient (IE). Bambara variety TVSu999 had higher IUE, Agron-omy Efficiency (AE), Apparent Recovery Efficiency (RE), Physiology Effi-ciency (PE) and Fertilizer Agronomy using Efficiency respectively (FAE) com-pared to variety TVSu1166. The fertilizers affected most of the indicators of nutrient use efficiency (NUE) measured. The effects were significant on AE, agronomic N-use efficiency (ANUE), RE, UE and PFP. NPK fertilizer enhanced Physiology efficiency (PE) and Partial factor production. NPK fertilizer signifi-cantly enhanced NUE parameters compared to CPH and un-manure. CPH manure significantly influenced RE%, PE and IE. The Internal Utilization Effi-ciency and N-remove at harvest were compared with the un-manure plants (control). The effects of 9.68 kg/plot pelletized fertilizer, were pronounced on Agronomy Efficiency (AE), Apparent Recovery Efficiency by difference (RE%), Physiology Efficiency (PE), Utilization Efficiency (UE), N-removed at harvest and Internal Utilization Efficiency (IE). Similar trends were observed in the responses NUE of Sesame and Bambara manuring. The responses sole crops in terms of RE, PE UE PFP were similar while their intercrop combina-tions had significantly higher AE, RE, UE, PFP and N removed at harvest. Sole Sesame significantly influence Agronomy Efficiency (AE), Utilization Effi-ciency (UE), Internal Efficiency (IE) and Partial Fertilizer Production (PFP) and sole Bambara under NPK fertilizer had enhanced N-removed at harvest and apparent recovery by difference (RE%). Bambara + Sesame under cocoa pod husk (CPH) manure had enhanced apparent recovery efficiency by difference (RE%), fertilizer use efficiency (FAE) and internal utilization efficiency (IE). Sesame variety NCRIBen04E had enhanced Agronomy Efficiency (AE), N-removed at harvest and Internal Utilization Efficiency (IE) under 9.68 kg treatment while variety E8 had higher partial factor productivity (PFP) at 4.84 kg/plot pelletized organic fertilizer.展开更多
In recent years, introduction of alternative energy sources such as solar energy is expected. Solar heat energy utilization systems are rapidly gaining acceptance as one of the best solutions to be an alternative ener...In recent years, introduction of alternative energy sources such as solar energy is expected. Solar heat energy utilization systems are rapidly gaining acceptance as one of the best solutions to be an alternative energy source. However, thermal energy collection is influenced by solar radiation and weather conditions. In order to control a solar heat energy utilization system as accurate as possible, it requires method of solar radiation estimation. This paper proposes the forecast technique of a thermal energy collection of solar heat energy utilization system based on solar radiation forecasting at one-day-ahead 24-hour thermal energy collection by using three different NN models. The proposed technique with application of NN is trained by weather data based on tree-based model, and tested according to forecast day. Since tree-based-model classifies a meteorological data exactly, NN will train a solar radiation with smoothly. The validity of the proposed technique is confirmed by computer simulations by use of actual meteorological data.展开更多
Utilizing both borehole and Cone Penetration Testing(CPT)data in soil stratification helps to get more convincing soil stratification results.However,the soil classification results revealed by borehole(Unified Soil C...Utilizing both borehole and Cone Penetration Testing(CPT)data in soil stratification helps to get more convincing soil stratification results.However,the soil classification results revealed by borehole(Unified Soil Classification System,USCS)and CPT tests(soil behavior type,SBT)are commonly not con-sistent.This study proposes a feasible solution to integrate the borehole and CPT data with the tree-based method.The tree-based method is naturally suitable for soil stratification tasks as it aims to divide the subsurface space into several clusters based on the similarities of the soil types.A novel boundary dic-tionary method is proposed to enhance the model performance on complex soil layer conditions.A prob-abilistic mapping matrix between the USCS-SBT system is built based on a collected municipal database with collocated borehole and CPT data.The optimal soil stratification results can be selected based on considering multiple borehole information and pruning the structure of trees.The structure of the trees can be optimized in a back analysis perspective with the Sequential Model-Based Global Optimization(SMBO)algorithm which aims to maximize the possibility of observing the borehole information based on the USCS-SBT probabilistic mapping matrix.The uncertainties of the optimal soil stratification results can be estimated based on a weighted Gini index method.The performance of the proposed method is validated based on a real case in New Zealand with a cross-validation method.The results indicate that the proposed method is robust and effective.展开更多
Surface chokes are widely utilized equipment installed on wellheads to control hydrocarbon flow rates.Several correlations have been suggested to model the multiphase flow of oil and gas via surface chokes.However,sub...Surface chokes are widely utilized equipment installed on wellheads to control hydrocarbon flow rates.Several correlations have been suggested to model the multiphase flow of oil and gas via surface chokes.However,substantial errors have been reported in empirical fitting models and correlations to estimate hydrocarbon flow because of the reservoir's heterogeneity,anisotropism,variance in reservoir fluid characteristics at diverse subsurface depths,which introduces complexity in production data.Therefore,the estimation of daily oil and gas production rates is still challenging for the petroleum industry.Recently,hybrid data-driven techniques have been reported to be effective for estimation problems in various aspects of the petroleum domain.This paper investigates hybrid ensemble data-driven approaches to forecast multiphase flow rates through the surface choke(viz.stacked generalization and voting architectures),followed by an assessment of the impact of input production control variables.Otherwise,machine learning models are also trained and tested individually on the production data of hydrocarbon wells located in North Sea.Feature engineering has been properly applied to select the most suitable contributing control variables for daily production rate forecasting.This study provides a chronological explanation of the data analytics required for the interpretation of production data.The test results reveal the estimation performance of the stacked generalization architecture has outperformed other significant paradigms considered for production forecasting.展开更多
基金the Program PenelitianKolaborasi Indonesia(PPKI)Non APBN Universitas Diponegoro Universitas Diponegoro Indonesia under Grant 117-03/UN7.6.1/PP/2021.
文摘Research on strain anomalies and large earthquakes based on temporal and spatial crustal activities has been rapidly growing due to data availability, especially in Japan and Indonesia. However, many research works used local-scale case studies that focused on a specific earthquake characteristic using knowledgedriven techniques, such as crustal deformation analysis. In this study, a data-driven-based analysis is used to detect anomalies using displacement rates and deformation pattern features extracted from daily global navigation satellite system(GNSS) data using a machine learning algorithm. The GNSS data with188 and 1181 continuously operating reference stations from Indonesia and Japan, respectively, are used to identify the anomaly of recent major earthquakes in the last two decades. Feature displacement rates and deformation patterns are processed in several window times with 2560 experiment scenarios to produce the best detection using tree-based algorithms. Tree-based algorithms with a single estimator(decision tree), ensemble bagging(bagging, random forest and Extra Trees), and ensemble boosting(AdaBoost, gradient boosting, LGBM, and XGB) are applied in the study. The experiment test using realtime scenario GNSSdailydatareveals high F1-scores and accuracy for anomaly detection using slope windowing 365 and 730 days of 91-day displacement rates and then 7-day deformation pattern features in tree-based algorithms. The results show the potential for medium-term anomaly detection using GNSS data without the need for multiple vulnerability assessments.
基金Acknowledgements This paper was supported by the Major National Science and Technology program under Grant No. 2011ZX03005-002 the National Natural Science Foundation of China under Grant No. 61100233 the Fundamental Universities under Grant No Research Funds for the Central K50510030010.
文摘Wireless Mesh Network (WMN) is seen as an effective Intemet access solution for dynamic wireless applications. For the low mobility of mesh routers in WMN, the backbone topography can be effectively maintained by proactive routing protocol. Pre-proposals like Tree Based Routing (TBR) protocol and Root Driven Routing (RDR) protocol are so centralized that they make the gateway becorre a bottleneck which severely restricts the network performance. We proposed an Optimized Tree-based Routing (OTR) protocol that logically separated the proactive tree into pieces. Route is partly computed by the branches instead of root. We also discussed the operation of multipie Intemet gateways which is a main issue in WMN. The new proposal lightens the load in root, reduces the overhead and improves the throughput. Numerical analysis and simulation results confirm that the perforrmnce of WMN is improved and OTR is more suitable for large scale WMN.
文摘The sensor virus is a serious threat,as an attacker can simply send a single packet to compromise the entire sensor network.Epidemics become drastic with link additions among sensors when the small world phenomena occur.Two immunization strategies,uniform immunization and temporary immunization,are conducted on small worlds of tree-based wireless sensor networks to combat the sensor viruses.With the former strategy,the infection extends exponentially,although the immunization effectively reduces the contagion speed.With the latter strategy,recurrent contagion oscillations occur in the small world when the spatial-temporal dynamics of the epidemic are considered.The oscillations come from the small-world structure and the temporary immunization.Mathematical analyses on the small world of the Cayley tree are presented to reveal the epidemic dynamics with the two immunization strategies.
基金funded by the state Mecklenburg-Western Pomerania by the Landesgraduierten-Studentshipfunded by the University of Greifswald by the Bogislaw-Studentshipfunded by the German Academic Scholarship Foundation by a studentship.
文摘Recently,so-called tree-based phylogenetic networks have attracted considerable attention.These networks can be constructed from a phylogenetic tree,called the base tree,by adding additional edges.The primary aim of this study is to provide sufficient criteria for tree-basedness by reducing phylogenetic networks to related graph structures.Even though it is generally known that determining whether a network is tree-based is an NP-complete problem,one of these criteria,namely edge-basedness,can be verified in linear time.Surprisingly,the class of edgebased networks is closely related to a well-known family of graphs,namely,the class of generalized series-parallel graphs,and we explore this relationship in full detail.Additionally,we introduce further classes of tree-based networks and analyze their relationships.
文摘This article proposes the high-speed and high-accuracy code clone detection method based on the combination of tree-based and token-based methods. Existence of duplicated program codes, called code clone, is one of the main factors that reduces the quality and maintainability of software. If one code fragment contains faults (bugs) and they are copied and modified to other locations, it is necessary to correct all of them. But it is not easy to find all code clones in large and complex software. Much research efforts have been done for code clone detection. There are mainly two methods for code clone detection. One is token-based and the other is tree-based method. Token-based method is fast and requires less resources. However it cannot detect all kinds of code clones. Tree-based method can detect all kinds of code clones, but it is slow and requires much computing resources. In this paper combination of these two methods was proposed to improve the efficiency and accuracy of detecting code clones. Firstly some candidates of code clones will be extracted by token-based method that is fast and lightweight. Then selected candidates will be checked more precisely by using tree-based method that can find all kinds of code clones. The prototype system was developed. This system accepts source code and tokenizes it in the first step. Then token-based method is applied to this token sequence to find candidates of code clones. After extracting several candidates, selected source codes will be converted into abstract syntax tree (AST) for applying tree-based method. Some sample source codes were used to evaluate the proposed method. This evaluation proved the improvement of efficiency and precision of code clones detecting.
基金Knowledge Innovation Project and Intelligent Infor mation Service and Support Project of the Shanghai Education Commission, China
文摘Towards the crossing and coupling permissions in tasks existed widely in many fields and considering the design of role view must rely on the activities of the tasks process,based on Role Based Accessing Control (RBAC) model,this paper put forward a Role Tree-Based Access Control (RTBAC) model. In addition,the model definition and its constraint formal description is also discussed in this paper. RTBAC model is able to realize the dynamic organizing,self-determination and convenience of the design of role view,and guarantee the least role permission when task separating in the mean time.
文摘With the increasing interest in e-commerce shopping, customer reviews have become one of the most important elements that determine customer satisfaction regarding products. This demonstrates the importance of working with Text Mining. This study is based on The Women’s Clothing E-Commerce Reviews database, which consists of reviews written by real customers. The aim of this paper is to conduct a Text Mining approach on a set of customer reviews. Each review was classified as either a positive or negative review by employing a classification method. Four tree-based methods were applied to solve the classification problem, namely Classification Tree, Random Forest, Gradient Boosting and XGBoost. The dataset was categorized into training and test sets. The results indicate that the Random Forest method displays an overfitting, XGBoost displays an overfitting if the number of trees is too high, Classification Tree is good at detecting negative reviews and bad at detecting positive reviews and the Gradient Boosting shows stable values and quality measures above 77% for the test dataset. A consensus between the applied methods is noted for important classification terms.
文摘Tunnel Boring Machines(TBMs)are vital for tunnel and underground construction due to their high safety and efficiency.Accurately predicting TBM operational parameters based on the surrounding environment is crucial for planning schedules and managing costs.This study investigates the effectiveness of tree-based machine learning models,including Random Forest,Extremely Randomized Trees,Adaptive Boosting Machine,Gradient Boosting Machine,Extreme Gradient Boosting Machine(XGBoost),Light Gradient Boosting Machine,and CatBoost,in predicting the Penetration Rate(PR)of TBMs by considering rock mass and material characteristics.These techniques are able to provide a good relationship between input(s)and output parameters;hence,obtaining a high level of accuracy.To do that,a comprehensive database comprising various rock mass and material parameters,including Rock Mass Rating,Brazilian Tensile Strength,and Weathering Zone,was utilized for model development.The practical application of these models was assessed with a new dataset representing diverse rock mass and material properties.To evaluate model performance,ranking systems and Taylor diagrams were employed.CatBoost emerged as the most accurate model during training and testing,with R2 scores of 0.927 and 0.861,respectively.However,during validation,XGBoost demonstrated superior performance with an R2 of 0.713.Despite these variations,all tree-based models showed promising accuracy in predicting TBM performance,providing valuable insights for similar projects in the future.
基金supported by the National Natural Science Foundation of China(61261023)the Guangxi Natural Science Foundation(2012GXNSFBA053160,2011GXNSFA018169,2011GXNSFD018024)+1 种基金the Guangxi Education Department Science Foundation(201010LX016)the Guangxi Science Research and Technology Development Program(12118017-9A)
文摘This paper proposes a tree-based backoff (TBB) protocol that reduces the number of iterations implemented in the procedure of tag collision arbitration in radio frequency identification (RFID) systems. This is achieved by employing the following mechanisms: one is send the request command iteratively to all tags in the interrogation zone until a single tag is identified. The other is backward to the parent node instead of root node to obtain the request parameters and send the request command again until all tags are identified. Compared with the traditional tree-based protocol, on average, simulated results show that the TBB protocol reduces the number of the iterations by 72.3% and the identification delay by 58.6% and achieves the goal of fast tag identification.
基金supported by the Korea Research Foundation Grant funded by the Korean Government(MOEHRD) under Grant No.KRF-2007-314-D00223.
文摘Hybrid Peer-to-Peer (P2P) systems that construct overlay networks structured among superpeers have great potential in that they can give the benefits such as scalability, search speed and network traffic, taking advantages of superpeer-based and the structured P2P systems. In this article, we enhance keyword search in hybrid P2P systems by constructing a tree-based index overlay among directory nodes that maintain indices, according to the load and popularity of a keyword. The mathematical analysis shows that the keyword search based on semi-structured P2P overlay can improve the search performance, reducing the message traffic and maintenance costs.
基金Projects(42177164,52474121)supported by the National Science Foundation of ChinaProject(PBSKL2023A12)supported by the State Key Laboratory of Precision Blasting and Hubei Key Laboratory of Blasting Engineering,China。
文摘In the mining industry,precise forecasting of rock fragmentation is critical for optimising blasting processes.In this study,we address the challenge of enhancing rock fragmentation assessment by developing a novel hybrid predictive model named GWO-RF.This model combines the grey wolf optimization(GWO)algorithm with the random forest(RF)technique to predict the D_(80)value,a critical parameter in evaluating rock fragmentation quality.The study is conducted using a dataset from Sarcheshmeh Copper Mine,employing six different swarm sizes for the GWO-RF hybrid model construction.The GWO-RF model’s hyperparameters are systematically optimized within established bounds,and its performance is rigorously evaluated using multiple evaluation metrics.The results show that the GWO-RF hybrid model has higher predictive skills,exceeding traditional models in terms of accuracy.Furthermore,the interpretability of the GWO-RF model is enhanced through the utilization of SHapley Additive exPlanations(SHAP)values.The insights gained from this research contribute to optimizing blasting operations and rock fragmentation outcomes in the mining industry.
基金We are grateful to the Department of Soil Science,Agricultural,Research Institute,Quetta,Pakistan for provision of laboratory facilities for chemical analysis of soil samples.We extend our gratitude to the Department of Zoology,University of Balochistan,Pakistan for the identification of soil fauna。
文摘Barshore is a small village in the Pishin District,Balochistan,Pakistan,with dry summers and cold rainy winters.This is an agrarian region,mostly with orchards of various fruit trees.This study investigated the physico-chemical properties and macrofauna of soils under various agricultural management practices of this region.The concentrations of soil organic matter(SOM),soil organic carbon(SOC),nutrients,pH,electrical conductivity,soil texture,and the abundance and number of species of soil macrofauna of the agricultural fields were measured.Fifteen agricultural fields were sampled.Fourteen fields were orchards of apple,apricot or the mixture of apple and apricot trees and one field was a cropland,cultivated with wheat as a monocrop.The orchards were under conservation agricultural practices;whereas,the cropland was under conventional management.These agricultural lands were 2-26 years old.The concentration of soil organic matter(SOM)in the upper 0-10 cm depth of these field sites ranged from 11.6 g kg^(-1)to 32.8 g kg^(-1)soil.As compared to cropland,orchards had significantly higher concentration of SOM and SOC.A total of 18 soil macrofauna species were found and the most common and abundant were ants(Monomorium minimum,Camponotus pennsylvanicus,Solenopsis invicta,and Lasius niger)followed by Arion ssp.(Brown Slug)and earthworm Lumbricus terrestris.Regression analysis revealed non-significant relationship of the age and the concentration of SOM with the number of macrofauna species and with the concentrations of total mineral nitrogen,bioavailable phosphorus and clay.The existence of ants had no relationship with the concentration of SOM;whereas,existence of Lumbricus terrestris tended to had a positive relationship with the concentration of SOM.The field of tree-based intercropping system was 2 years of age since the land was converted from rangeland to a cropland,had two ant species coexisting.This indicates the positive influence of crop diversification on soil macrofauna.
文摘In recent years there has been an increasing interest in developing spatial statistical models for data sets that are seemingly spatially independent.This lack of spatial structure makes it difficult,if not impossible to use optimal predictors such as ordinary kriging for modeling the spatial variability in the data.In many instances,the data still contain a wealth of information that could be used to gain flexibility and precision in estimation.In this paper we propose using a combination of regression analysis to describe the large-scale spatial variability in a set of survey data and a tree-based stratification design to enhance the estimation process of the small-scale spatial variability.With this approach,sample units(i.e.,pixel of a satellite image) are classified with respect to predictions of error attributes into homogeneous classes,and the classes are then used as strata in the stratified analysis.Independent variables used as a basis of stratification included terrain data and satellite imagery.A decision rule was used to identify a tree size that minimized the error in estimating the variance of the mean response and prediction uncertainties at new spatial locations.This approach was applied to a set of n=937 forested plots from a state-wide inventory conducted in 2006 in the Mexican State of Jalisco.The final models accounted for 62% to 82% of the variability observed in canopy closure(%),basal area(m2·ha-1),cubic volumes(m3·ha-1) and biomass(t·ha-1) on the sample plots.The spatial models provided unbiased estimates and when averaged over all sample units in the population,estimates of forest structure were very close to those obtained using classical estimates based on the sampling strategy used in the state-wide inventory.The spatial models also provided unbiased estimates of model variances leading to confidence and prediction coverage rates close to the 0.95 nominal rate.
基金funded by the Australian Government through the Australian Research Council’s Discovery Projects funding scheme (Project DP190101592)the National Natural Science Foundation of China (Grant Nos. 41972280 and 52179103)。
文摘This study introduces a generic framework for geotechnical subsurface modeling, which accounts for spatial autocorrelation with local mapping machine learning(ML) methods. Instead of using XY coordinate fields directly as model input, a series of autocorrelated geotechnical distance fields(GDFs) is designed to enable the ML models to infer the spatial relationship between the sampled locations and unknown locations. The whole framework using GDF with ML methods is named GDF-ML. This framework is purely data-driven which avoids the tedious work in the scale of fluctuations(SOFs)estimating and data detrending in the conventional spatial interpolation methods. Six local mapping ML methods(extra trees(ETs), gradient boosting(GB), extreme gradient boosting(XGBoost), random forest(RF), general regression neural network(GRNN) and k-nearest neighbors(KNN)) are compared in the GDF-ML framework. The results show that the GDFs are better than the conventional XY coordinate fields based ML methods in both accuracy and spatial continuity. GDF-ML is flexible which can be applied to high-dimensional, multi-variable and incomplete datasets. Among these six methods, GDF with ET method(GDF-ET) clearly shows the best accuracy and best spatial continuity. The proposed GDF-ET method can provide a fast and accurate interpretation of the soil property profile. Sensitivity analysis shows that this method is applicable to very small training dataset size. The associated statistical uncertainty can also be quantified so that the reliability of the subsurface modeling results can be estimated objectively and explicitly. The uncertainty results clearly show that the prediction becomes more accurate when more sampled data are available.
文摘This paper addresses an interesting security problem in wireless ad hoc networks: the dynamic group key agreement key establishment. For secure group communication in an ad hoc network, a group key shared by all group members is required. This group key should be updated when there are membership changes (when the new member joins or current member leaves) in the group. In this paper, we propose a novel, secure, scalable and efficient region-based group key agreement protocol for ad hoc networks. This is implemented by a two-level structure and a new scheme of group key update. The idea is to divide the group into subgroups, each maintaining its subgroup keys using group elliptic curve diffie-hellman (GECDH) Protocol and links with other subgroups in a tree structure using tree-based group elliptic curve diffie-hellman (TGECDH) protocol. By introducing region-based approach, messages and key updates will be limited within subgroup and outer group;hence computation load is distributed to many hosts. Both theoretical analysis and experimental results show that this Region-based key agreement protocol performs well for the key establishment problem in ad hoc network in terms of memory cost, computation cost and communication cost.
基金Supported by National Natural Science Foundation of P. R. China (60673178) National Basic Research Program of P.R. China (2006 CB 303000)
文摘In wireless sensor networks, topology control plays an important role for data forwarding efficiency in the data gathering applications. In this paper, we present a novel topology control and data forwarding mechanism called REMUDA, which is designed for a practical indoor parking lot management system. REMUDA forms a tree-based hierarchical network topology which brings as many nodes as possible to be leaf nodes and constructs a virtual cluster structure. Meanwhile, it takes the reliability, stability and path length into account in the tree construction process. Through an experiment in a network of 30 real sensor nodes, we evaluate the performance of REMUDA and compare it with LEPS which is also a practical routing protocol in TinyOS. Experiment results show that REMUDA can achieve better performance than LEPS.
文摘Field experiments were conducted at the experimental farm Cocoa Re-search Institute of Nigeria (CRIN) Sub-Station, Ochaja, in the Southern Guinea Savannaagro ecological zone of Nigeria to examine uptake and use efficien-cies of nutrients by Sesame and Bambara nut alley crops as influenced by manuring in a Cashew-based intercropping system. Experimental treatments were based on responses of sole and intercrop mixtures of Sesame and Bam-bara nut alley crops to Cocoa Pod Husk (CPH), pelletized organic fertilizer and NPK fertilizer in a cashew-based intercropping system. Data were collected on the growth and yield variables of the alley crops. Highest nitrogen harvest in-dex (NHI) for seed and leaf of alley crops were obtained from un-manure treated plants. Cocoa pod husk (CPH) significantly enhanced P uptake com-pared with other fertilizers applied. CPH improved Na, Ca, Mg Zn, Cu, P, K and carbohydrate in the leaves and Ca, Mg, Zn, Fe, Cu, crude fibre and car-bohydrate contents of seeds of sole crops while Sesame + Bambara had en-hanced contents of N, Ca, Mg, Zn, Cu, P, N, K, moisture, protein, and crude fi-bre, crude protein, moisture content in leaves. The effects of NPK were signifi-cant for N, K Ca, Zn, Fe, Cu, P, moisture and crude fibre, while in the un-manure (control) plots influenced N, fat and protein and nitrogen harvest index (NHI) of leaf and seeds. CPH and NPK fertilizers enhanced nutrient up-take and nitrogen harvest index of alley crops. Nutrient uptake was similar for the varieties of Sesame and Bambara nut as affected by the application of 4.84 and 9.68 Kg pelletized organic fertilizer. Sole Bambara had higher N and K concentration in leaves compared with Bambara +Sesame. In addition, sole Bambara had higher values of Physiology efficiency (PE), and fertilizer use ef-ficiency (FAE) compared to the mixed crops of Bambara + sesame. However, physiology efficiency (PE), and fertilizer use efficiency (FAE) were significantly lower for Bambara + Sesame. The un-manure plants had enhanced N, P and K uptake. Varietal effects were pronounced for most of the resource use effi-ciency variables measured. The alley crop varieties responded differently to 4.84 and 9.68 kg pelletized fertilizer treatments (Agronomy Efficiency (AE), N-removed at harvest and Internal Utilization Efficiency (IE) and partial fac-tor productivity (PFP)). Sesame variety NCRIBen04E had enhanced AE, N-remove at harvest, IE and PFP while variety E8 had significantly higher ap-parent Recovery Efficiency (RE), apparent Recovery Efficiency by difference (RE%), Physiology Efficiency (PE), Utilization Efficiency (UE), and internal Utilization Efficient (IE). Bambara variety TVSu999 had higher IUE, Agron-omy Efficiency (AE), Apparent Recovery Efficiency (RE), Physiology Effi-ciency (PE) and Fertilizer Agronomy using Efficiency respectively (FAE) com-pared to variety TVSu1166. The fertilizers affected most of the indicators of nutrient use efficiency (NUE) measured. The effects were significant on AE, agronomic N-use efficiency (ANUE), RE, UE and PFP. NPK fertilizer enhanced Physiology efficiency (PE) and Partial factor production. NPK fertilizer signifi-cantly enhanced NUE parameters compared to CPH and un-manure. CPH manure significantly influenced RE%, PE and IE. The Internal Utilization Effi-ciency and N-remove at harvest were compared with the un-manure plants (control). The effects of 9.68 kg/plot pelletized fertilizer, were pronounced on Agronomy Efficiency (AE), Apparent Recovery Efficiency by difference (RE%), Physiology Efficiency (PE), Utilization Efficiency (UE), N-removed at harvest and Internal Utilization Efficiency (IE). Similar trends were observed in the responses NUE of Sesame and Bambara manuring. The responses sole crops in terms of RE, PE UE PFP were similar while their intercrop combina-tions had significantly higher AE, RE, UE, PFP and N removed at harvest. Sole Sesame significantly influence Agronomy Efficiency (AE), Utilization Effi-ciency (UE), Internal Efficiency (IE) and Partial Fertilizer Production (PFP) and sole Bambara under NPK fertilizer had enhanced N-removed at harvest and apparent recovery by difference (RE%). Bambara + Sesame under cocoa pod husk (CPH) manure had enhanced apparent recovery efficiency by difference (RE%), fertilizer use efficiency (FAE) and internal utilization efficiency (IE). Sesame variety NCRIBen04E had enhanced Agronomy Efficiency (AE), N-removed at harvest and Internal Utilization Efficiency (IE) under 9.68 kg treatment while variety E8 had higher partial factor productivity (PFP) at 4.84 kg/plot pelletized organic fertilizer.
文摘In recent years, introduction of alternative energy sources such as solar energy is expected. Solar heat energy utilization systems are rapidly gaining acceptance as one of the best solutions to be an alternative energy source. However, thermal energy collection is influenced by solar radiation and weather conditions. In order to control a solar heat energy utilization system as accurate as possible, it requires method of solar radiation estimation. This paper proposes the forecast technique of a thermal energy collection of solar heat energy utilization system based on solar radiation forecasting at one-day-ahead 24-hour thermal energy collection by using three different NN models. The proposed technique with application of NN is trained by weather data based on tree-based model, and tested according to forecast day. Since tree-based-model classifies a meteorological data exactly, NN will train a solar radiation with smoothly. The validity of the proposed technique is confirmed by computer simulations by use of actual meteorological data.
基金funded by the Australian Government through the Australian Research Council’s Discovery Projects funding scheme (Project DP190101592)the National Natural Science Foundation of China (No.41972280,52179103).
文摘Utilizing both borehole and Cone Penetration Testing(CPT)data in soil stratification helps to get more convincing soil stratification results.However,the soil classification results revealed by borehole(Unified Soil Classification System,USCS)and CPT tests(soil behavior type,SBT)are commonly not con-sistent.This study proposes a feasible solution to integrate the borehole and CPT data with the tree-based method.The tree-based method is naturally suitable for soil stratification tasks as it aims to divide the subsurface space into several clusters based on the similarities of the soil types.A novel boundary dic-tionary method is proposed to enhance the model performance on complex soil layer conditions.A prob-abilistic mapping matrix between the USCS-SBT system is built based on a collected municipal database with collocated borehole and CPT data.The optimal soil stratification results can be selected based on considering multiple borehole information and pruning the structure of trees.The structure of the trees can be optimized in a back analysis perspective with the Sequential Model-Based Global Optimization(SMBO)algorithm which aims to maximize the possibility of observing the borehole information based on the USCS-SBT probabilistic mapping matrix.The uncertainties of the optimal soil stratification results can be estimated based on a weighted Gini index method.The performance of the proposed method is validated based on a real case in New Zealand with a cross-validation method.The results indicate that the proposed method is robust and effective.
文摘Surface chokes are widely utilized equipment installed on wellheads to control hydrocarbon flow rates.Several correlations have been suggested to model the multiphase flow of oil and gas via surface chokes.However,substantial errors have been reported in empirical fitting models and correlations to estimate hydrocarbon flow because of the reservoir's heterogeneity,anisotropism,variance in reservoir fluid characteristics at diverse subsurface depths,which introduces complexity in production data.Therefore,the estimation of daily oil and gas production rates is still challenging for the petroleum industry.Recently,hybrid data-driven techniques have been reported to be effective for estimation problems in various aspects of the petroleum domain.This paper investigates hybrid ensemble data-driven approaches to forecast multiphase flow rates through the surface choke(viz.stacked generalization and voting architectures),followed by an assessment of the impact of input production control variables.Otherwise,machine learning models are also trained and tested individually on the production data of hydrocarbon wells located in North Sea.Feature engineering has been properly applied to select the most suitable contributing control variables for daily production rate forecasting.This study provides a chronological explanation of the data analytics required for the interpretation of production data.The test results reveal the estimation performance of the stacked generalization architecture has outperformed other significant paradigms considered for production forecasting.