Accurate capacity estimation is of great importance for the reliable state monitoring,timely maintenance,and second-life utilization of lithium-ion batteries.Despite numerous works on battery capacity estimation using...Accurate capacity estimation is of great importance for the reliable state monitoring,timely maintenance,and second-life utilization of lithium-ion batteries.Despite numerous works on battery capacity estimation using laboratory datasets,most of them are applied to battery cells and lack satisfactory fidelity when extended to real-world electric vehicle(EV)battery packs.The challenges intensify for large-sized EV battery packs,where unpredictable operating profiles and low-quality data acquisition hinder precise capacity estimation.To fill the gap,this study introduces a novel data-driven battery pack capacity estimation method grounded in field data.The proposed approach begins by determining labeled capacity through an innovative combination of the inverse ampere-hour integral,open circuit voltage-based,and resistance-based correction methods.Then,multiple health features are extracted from incremental capacity curves,voltage curves,equivalent circuit model parameters,and operating temperature to thoroughly characterize battery aging behavior.A feature selection procedure is performed to determine the optimal feature set based on the Pearson correlation coefficient.Moreover,a convolutional neural network and bidirectional gated recurrent unit,enhanced by an attention mechanism,are employed to estimate the battery pack capacity in real-world EV applications.Finally,the proposed method is validated with a field dataset from two EVs,covering approximately 35,000 kilometers.The results demonstrate that the proposed method exhibits better estimation performance with an error of less than 1.1%compared to existing methods.This work shows great potential for accurate large-sized EV battery pack capacity estimation based on field data,which provides significant insights into reliable labeled capacity calculation,effective features extraction,and machine learning-enabled health diagnosis.展开更多
The estimation of covariance matrices is very important in many fields, such as statistics. In real applications, data are frequently influenced by high dimensions and noise. However, most relevant studies are based o...The estimation of covariance matrices is very important in many fields, such as statistics. In real applications, data are frequently influenced by high dimensions and noise. However, most relevant studies are based on complete data. This paper studies the optimal estimation of high-dimensional covariance matrices based on missing and noisy sample under the norm. First, the model with sub-Gaussian additive noise is presented. The generalized sample covariance is then modified to define a hard thresholding estimator , and the minimax upper bound is derived. After that, the minimax lower bound is derived, and it is concluded that the estimator presented in this article is rate-optimal. Finally, numerical simulation analysis is performed. The result shows that for missing samples with sub-Gaussian noise, if the true covariance matrix is sparse, the hard thresholding estimator outperforms the traditional estimate method.展开更多
This paper investigates the security issue of multisensor remote estimation systems.An optimal stealthy false data injection(FDI)attack scheme based on historical and current residuals,which only tampers with the meas...This paper investigates the security issue of multisensor remote estimation systems.An optimal stealthy false data injection(FDI)attack scheme based on historical and current residuals,which only tampers with the measurement residuals of partial sensors due to limited attack resources,is proposed to maximally degrade system estimation performance.The attack stealthiness condition is given,and then the estimation error covariance in compromised state is derived to quantify the system performance under attack.The optimal attack strategy is obtained by solving several convex optimization problems which maximize the trace of the compromised estimation error covariance subject to the stealthiness condition.Moreover,due to the constraint of attack resources,the selection principle of the attacked sensor is provided to determine which sensor is attacked so as to hold the most impact on system performance.Finally,simulation results are presented to verify the theoretical analysis.展开更多
This paper aims to explore the application of Extreme Value Theory (EVT) in estimating the conditional extreme quantile for time-to-event outcomes by examining the functional relationship between ambulatory blood pres...This paper aims to explore the application of Extreme Value Theory (EVT) in estimating the conditional extreme quantile for time-to-event outcomes by examining the functional relationship between ambulatory blood pressure trajectories and clinical outcomes in stroke patients. The study utilizes EVT to analyze the functional connection between ambulatory blood pressure trajectories and clinical outcomes in a sample of 297 stroke patients. The 24-hour ambulatory blood pressure measurement curves for every 15 minutes are considered, acknowledging a censored rate of 40%. The findings reveal that the sample mean excess function exhibits a positive gradient above a specific threshold, confirming the heavy-tailed distribution of data in stroke patients with a positive extreme value index. Consequently, the estimated conditional extreme quantile indicates that stroke patients with higher blood pressure measurements face an elevated risk of recurrent stroke occurrence at an early stage. This research contributes to the understanding of the relationship between ambulatory blood pressure and recurrent stroke, providing valuable insights for clinical considerations and potential interventions in stroke management.展开更多
For many environmental and agricultural applications, an accurate estimation of surface soil moisture is essential. This study sought to determine whether combining Sentinel-1A, Sentinel-2A, and meteorological data wi...For many environmental and agricultural applications, an accurate estimation of surface soil moisture is essential. This study sought to determine whether combining Sentinel-1A, Sentinel-2A, and meteorological data with artificial neural networks (ANN) could improve soil moisture estimation in various land cover types. To train and evaluate the model’s performance, we used field data (provided by La Tuscia University) on the study area collected during time periods between October 2022, and December 2022. Surface soil moisture was measured at 29 locations. The performance of the model was trained, validated, and tested using input features in a 60:10:30 ratio, using the feed-forward ANN model. It was found that the ANN model exhibited high precision in predicting soil moisture. The model achieved a coefficient of determination (R<sup>2</sup>) of 0.71 and correlation coefficient (R) of 0.84. Furthermore, the incorporation of Random Forest (RF) algorithms for soil moisture prediction resulted in an improved R<sup>2</sup> of 0.89. The unique combination of active microwave, meteorological data and multispectral data provides an opportunity to exploit the complementary nature of the datasets. Through preprocessing, fusion, and ANN modeling, this research contributes to advancing soil moisture estimation techniques and providing valuable insights for water resource management and agricultural planning in the study area.展开更多
Purpose–The study aims to provide a basis for the effective use of safety-related information data and a quantitative assessment way for the occurrence probability of the safety risk such as the fatigue fracture of t...Purpose–The study aims to provide a basis for the effective use of safety-related information data and a quantitative assessment way for the occurrence probability of the safety risk such as the fatigue fracture of the key components.Design/methodology/approach–The fatigue crack growth rate is of dispersion,which is often used to accurately describe with probability density.In view of the external dispersion caused by the load,a simple and applicable probability expression of fatigue crack growth rate is adopted based on the fatigue growth theory.Considering the isolation among the pairs of crack length a and crack formation time t(a∼t data)obtained from same kind of structural parts,a statistical analysis approach of t distribution is proposed,which divides the crack length in several segments.Furthermore,according to the compatibility criterion of crack growth,that is,there is statistical development correspondence among a∼t data,the probability model of crack growth rate is established.Findings–The results show that the crack growth rate in the stable growth stage can be approximately expressed by the crack growth control curve da/dt=5 Q•a,and the probability density of the crack growth parameter Q represents the external dispersion;t follows two-parameter Weibull distribution in certain a values.Originality/value–The probability density f(Q)can be estimated by using the probability model of crack growth rate,and a calculation example shows that the estimation method is effective and practical.展开更多
The application of frequency distribution statistics to data provides objective means to assess the nature of the data distribution and viability of numerical models that are used to visualize and interpret data.Two c...The application of frequency distribution statistics to data provides objective means to assess the nature of the data distribution and viability of numerical models that are used to visualize and interpret data.Two commonly used tools are the kernel density estimation and reduced chi-squared statistic used in combination with a weighted mean.Due to the wide applicability of these tools,we present a Java-based computer application called KDX to facilitate the visualization of data and the utilization of these numerical tools.展开更多
An adaptive estimation of forecast error covariance matrices is proposed for Kalman filtering data assim- ilation. A forecast error covariance matrix is initially estimated using an ensemble of perturbation forecasts....An adaptive estimation of forecast error covariance matrices is proposed for Kalman filtering data assim- ilation. A forecast error covariance matrix is initially estimated using an ensemble of perturbation forecasts. This initially estimated matrix is then adjusted with scale parameters that are adaptively estimated by minimizing -2log-likelihood of observed-minus-forecast residuals. The proposed approach could be applied to Kalman filtering data assimilation with imperfect models when the model error statistics are not known. A simple nonlinear model (Burgers' equation model) is used to demonstrate the efficacy of the proposed approach.展开更多
In order to estimate vehicular queue length at signalized intersections accurately and overcome the shortcomings and restrictions of existing studies especially those based on shockwave theory,a new methodology is pre...In order to estimate vehicular queue length at signalized intersections accurately and overcome the shortcomings and restrictions of existing studies especially those based on shockwave theory,a new methodology is presented for estimating vehicular queue length using data from both point detectors and probe vehicles. The methodology applies the shockwave theory to model queue evolution over time and space. Using probe vehicle locations and times as well as point detector measured traffic states,analytical formulations for calculating the maximum and minimum( residual) queue length are developed. The proposed methodology is verified using ground truth data collected from numerical experiments conducted in Shanghai,China. It is found that the methodology has a mean absolute percentage error of 17. 09%,which is reasonably effective in estimating the queue length at traffic signalized intersections. Limitations of the proposed models and algorithms are also discussed in the paper.展开更多
Phase errors in synthetic aperture sonar (SAS) imaging must be reduced to less than one eighth of a wavelength so as to avoid image destruction. Most of the phase errors occur as a result of platform motion errors, fo...Phase errors in synthetic aperture sonar (SAS) imaging must be reduced to less than one eighth of a wavelength so as to avoid image destruction. Most of the phase errors occur as a result of platform motion errors, for example, sway yaw and surge that are the most important error sources. The phase error of a wide band synthetic aperture sonar is modeled and solutions to sway yaw and surge motion estimation based on the raw sonar echo data with a Displaced Phase Center Antenna (DPCA) method are proposed and their implementations are detailed in this paper. It is shown that the sway estimates can be obtained from the correlation lag and phase difference between the returns at coincident phase centers. An estimate of yaw is also possible if such a technique is applied to more than one overlapping phase center positions. Surge estimates can be obtained by identifying pairs of phase centers with a maximum correlation coefficient. The method works only if the platform velocity is low enough such that a number of phase centers from adjacent pings overlap.展开更多
A direction-of-arrival (DOA) estimation algorithm based on direct data domain (D3) approach is presented. This method can accuracy estimate DOA using one snapshot modified data, called the temporal and spatial two...A direction-of-arrival (DOA) estimation algorithm based on direct data domain (D3) approach is presented. This method can accuracy estimate DOA using one snapshot modified data, called the temporal and spatial two-dimensional vector reconstruction (TSR) method. The key idea is to apply the D3 approach which can extract the signal of given frequency but null out other frequency signals in temporal domain. Then the spatial vector reconstruction processing is used to estimate the angle of the spatial coherent signal source based on extract signal data. Compared with the common temporal and spatial processing approach, the TSR method has a lower computational load, higher real-time performance, robustness and angular accuracy of DOA. The proposed algorithm can be directly applied to the phased array radar of coherent pulses. Simulation results demonstrate the performance of the proposed technique.展开更多
Sea surface salinity(SSS)is an essential variable of ocean dynamics and climate research.The Soil Moisture and Ocean Salinity(SMOS),Aquarius,and Soil Moisture Active Passive(SMAP)satellite missions all provide SSS mea...Sea surface salinity(SSS)is an essential variable of ocean dynamics and climate research.The Soil Moisture and Ocean Salinity(SMOS),Aquarius,and Soil Moisture Active Passive(SMAP)satellite missions all provide SSS measurements.The European Space Agency(ESA)Climate Change Initiative Sea Surface Salinity(CCI-SSS)project merged these three satellite SSS data to produce CCI L4SSS products.We validated the accuracy of the four satellite products(CCI,SMOS,Aquarius,and SMAP)using in-situ gridded data and Argo floats in the South China Sea(SCS).Compared with in-situ gridded data,it shows that the CCI achieved the best performance(RMSD:0.365)on monthly time scales.The RMSD of SMOS,Aquarius,and SMAP(SMOS:0.389;Aquarius:0.409;SMAP:0.391)are close,and the SMOS takes a slight advantage in contrast with Aquarius and SMAP.Large discrepancies can be found near the coastline and in the shelf seas.Meanwhile,CCI with lower RMSD(0.295)perform better than single satellite data(SMOS:0.517;SMAP:0.297)on weekly time scales compared with Argo floats.Overall,the merged CCI have the smallest RMSD among the four satellite products in the SCS on both weekly time scales and monthly time scales,which illustrates the improved accuracy of merged CCI compared with the individual satellite data.展开更多
In this paper,a systematic description of the artificial intelligence(AI)-based channel estimation track of the 2nd Wireless Communication AI Competition(WAIC)is provided,which is hosted by IMT-2020(5G)Promotion Group...In this paper,a systematic description of the artificial intelligence(AI)-based channel estimation track of the 2nd Wireless Communication AI Competition(WAIC)is provided,which is hosted by IMT-2020(5G)Promotion Group 5G+AIWork Group.Firstly,the system model of demodulation reference signal(DMRS)based channel estimation problem and its corresponding dataset are introduced.Then the potential approaches for enhancing the performance of AI based channel estimation are discussed from the viewpoints of data analysis,pre-processing,key components and backbone network structures.At last,the final competition results composed of different solutions are concluded.It is expected that the AI-based channel estimation track of the 2nd WAIC could provide insightful guidance for both the academia and industry.展开更多
In wireless sensor networks, the missing of sensor data is inevitable due to the inherent characteristic of wireless sensor networks, and it causes many difficulties in various applications. To solve the problem, the ...In wireless sensor networks, the missing of sensor data is inevitable due to the inherent characteristic of wireless sensor networks, and it causes many difficulties in various applications. To solve the problem, the missing data should be estimated as accurately as possible. In this paper, a k-nearest neighbor based missing data estimation algorithm is proposed based on the temporal and spatial correlation of sensor data. It adopts the linear regression model to describe the spatial correlation of sensor data among different sensor nodes, and utilizes the data information of multiple neighbor nodes to estimate the missing data jointly rather than independently, so that a stable and reliable estimation performance can be achieved. Experimental results on two real-world datasets show that the proposed algorithm can estimate the missing data accurately.展开更多
This study presents a simplified multivariate bias correction scheme that is sequentially implemented in the GEOS5 data assimilation system and compared against a control experiment without model bias correction. The ...This study presents a simplified multivariate bias correction scheme that is sequentially implemented in the GEOS5 data assimilation system and compared against a control experiment without model bias correction. The results show considerable improvement in terms of the mean biases of rawinsonde observation-minus-background (OmB) residuals for observed water vapor, wind and temperature variables. The time series spectral analysis shows whitening of bias-corrected OmB residuals, and mean biases for rawinsonde observation-minus-analysis (OmA) are also improved. Some wind and temperature biases in the control experiment near the equatorial tropopause nearly vanish from the bias-corrected experiment. Despite the analysis improvement, the bias correction scheme has only a moderate impact on forecast skill. Significant interaction is also found among quality-control, satellite observation bias correction, and background bias correction, and the latter positively impacts satellite bias correction.展开更多
Mineral exploration is done by different methods. Geophysical and geochemical studies are two powerful tools in this field. In integrated studies, the results of each study are used to determine the location of the dr...Mineral exploration is done by different methods. Geophysical and geochemical studies are two powerful tools in this field. In integrated studies, the results of each study are used to determine the location of the drilling boreholes. The purpose of this study is to use field geophysics to calculate the depth of mineral reserve. The study area is located 38 km from Zarand city called Jalalabad iron mine. In this study, gravimetric data were measured and mineral depth was calculated using the Euler method. 1314 readings have been performed in this area. The rocks of the region include volcanic and sedimentary. The source of the mineralization in the area is hydrothermal processes. After gravity measuring in the region, the data were corrected, then various methods such as anomalous map remaining in levels one and two, upward expansion, first and second-degree vertical derivatives, analytical method, and analytical signal were drawn, and finally, the depth of the deposit was estimated by Euler method. As a result, the depth of the mineral deposit was calculated to be between 20 and 30 meters on average.展开更多
The parameter estimation and the coefficient of contamination for the regression models with repeated measures are studied when its response variables are contaminated by another random variable sequence.Under the sui...The parameter estimation and the coefficient of contamination for the regression models with repeated measures are studied when its response variables are contaminated by another random variable sequence.Under the suitable conditions it is proved that the estimators which are established in the paper are strongly consistent estimators.展开更多
The state of charge(SOC)estimation of lithium-ion battery is an important function in the battery management system(BMS)of electric vehicles.The long short term memory(LSTM)model can be employed for SOC estimation,whi...The state of charge(SOC)estimation of lithium-ion battery is an important function in the battery management system(BMS)of electric vehicles.The long short term memory(LSTM)model can be employed for SOC estimation,which is capable of estimating the future changing states of a nonlinear system.Since the BMS usually works under complicated operating conditions,i.e the real measurement data used for model training may be corrupted by non-Gaussian noise,and thus the performance of the original LSTM with the mean square error(MSE)loss may deteriorate.Therefore,a novel LSTM with mixture kernel mean p-power error(MKMPE)loss,called MKMPE-LSTM,is developed by using the MKMPE loss to replace the MSE as the learning criterion in LSTM framework,which can achieve robust SOC estimation under the measurement data contaminated with non-Gaussian noises(or outliers)because of the MKMPE containing the p-order moments of the error distribution.In addition,a meta-heuristic algorithm,called heap-based-optimizer(HBO),is employed to optimize the hyper-parameters(mainly including learning rate,number of hidden layer neuron and value of p in MKMPE)of the proposed MKMPE-LSTM model to further improve its flexibility and generalization performance,and a novel hybrid model(HBO-MKMPE-LSTM)is established for SOC estimation under non-Gaussian noise cases.Finally,several tests are performed under various cases through a benchmark to evaluate the performance of the proposed HBO-MKMPE-LSTM model,and the results demonstrate that the proposed hybrid method can provide a good robustness and accuracy under different non-Gaussian measurement noises,and the SOC estimation results in terms of mean square error(MSE),root MSE(RMSE),mean absolute relative error(MARE),and determination coefficient R2are less than 0.05%,3%,3%,and above 99.8%at 25℃,respectively.展开更多
Next Generation Sequencing (NGS) provides an effective basis for estimating the survival time of cancer patients, but it also poses the problem of high data dimensionality, in addition to the fact that some patients d...Next Generation Sequencing (NGS) provides an effective basis for estimating the survival time of cancer patients, but it also poses the problem of high data dimensionality, in addition to the fact that some patients drop out of the study, making the data missing, so a method for estimating the mean of the response variable with missing values for the ultra-high dimensional datasets is needed. In this paper, we propose a two-stage ultra-high dimensional variable screening method, RF-SIS, based on random forest regression, which effectively solves the problem of estimating missing values due to excessive data dimension. After the dimension reduction process by applying RF-SIS, mean interpolation is executed on the missing responses. The results of the simulated data show that compared with the estimation method of directly deleting missing observations, the estimation results of RF-SIS-MI have significant advantages in terms of the proportion of intervals covered, the average length of intervals, and the average absolute deviation.展开更多
The smart grid is an evolving critical infrastructure,which combines renewable energy and the most advanced information and communication technologies to provide more economic and secure power supply services.To cope ...The smart grid is an evolving critical infrastructure,which combines renewable energy and the most advanced information and communication technologies to provide more economic and secure power supply services.To cope with the intermittency of ever-increasing renewable energy and ensure the security of the smart grid,state estimation,which serves as a basic tool for understanding the true states of a smart grid,should be performed with high frequency.More complete system state data are needed to support high-frequency state estimation.The data completeness problem for smart grid state estimation is therefore studied in this paper.The problem of improving data completeness by recovering highfrequency data from low-frequency data is formulated as a super resolution perception(SRP)problem in this paper.A novel machine-learning-based SRP approach is thereafter proposed.The proposed method,namely the Super Resolution Perception Net for State Estimation(SRPNSE),consists of three steps:feature extraction,information completion,and data reconstruction.Case studies have demonstrated the effectiveness and value of the proposed SRPNSE approach in recovering high-frequency data from low-frequency data for the state estimation.展开更多
基金supported in part by the National Key Research and Development Program of China(No.2022YFB3305403)Project of basic research funds for central universities(2022CDJDX006)+1 种基金Talent Plan Project of Chongqing(No.cstc2021ycjhbgzxm0295)National Natural Science Foundation of China(No.52111530194)。
文摘Accurate capacity estimation is of great importance for the reliable state monitoring,timely maintenance,and second-life utilization of lithium-ion batteries.Despite numerous works on battery capacity estimation using laboratory datasets,most of them are applied to battery cells and lack satisfactory fidelity when extended to real-world electric vehicle(EV)battery packs.The challenges intensify for large-sized EV battery packs,where unpredictable operating profiles and low-quality data acquisition hinder precise capacity estimation.To fill the gap,this study introduces a novel data-driven battery pack capacity estimation method grounded in field data.The proposed approach begins by determining labeled capacity through an innovative combination of the inverse ampere-hour integral,open circuit voltage-based,and resistance-based correction methods.Then,multiple health features are extracted from incremental capacity curves,voltage curves,equivalent circuit model parameters,and operating temperature to thoroughly characterize battery aging behavior.A feature selection procedure is performed to determine the optimal feature set based on the Pearson correlation coefficient.Moreover,a convolutional neural network and bidirectional gated recurrent unit,enhanced by an attention mechanism,are employed to estimate the battery pack capacity in real-world EV applications.Finally,the proposed method is validated with a field dataset from two EVs,covering approximately 35,000 kilometers.The results demonstrate that the proposed method exhibits better estimation performance with an error of less than 1.1%compared to existing methods.This work shows great potential for accurate large-sized EV battery pack capacity estimation based on field data,which provides significant insights into reliable labeled capacity calculation,effective features extraction,and machine learning-enabled health diagnosis.
文摘The estimation of covariance matrices is very important in many fields, such as statistics. In real applications, data are frequently influenced by high dimensions and noise. However, most relevant studies are based on complete data. This paper studies the optimal estimation of high-dimensional covariance matrices based on missing and noisy sample under the norm. First, the model with sub-Gaussian additive noise is presented. The generalized sample covariance is then modified to define a hard thresholding estimator , and the minimax upper bound is derived. After that, the minimax lower bound is derived, and it is concluded that the estimator presented in this article is rate-optimal. Finally, numerical simulation analysis is performed. The result shows that for missing samples with sub-Gaussian noise, if the true covariance matrix is sparse, the hard thresholding estimator outperforms the traditional estimate method.
基金supported by the National Natural Science Foundation of China(61925303,62173034,62088101,U20B2073,62173002)the National Key Research and Development Program of China(2021YFB1714800)Beijing Natural Science Foundation(4222045)。
文摘This paper investigates the security issue of multisensor remote estimation systems.An optimal stealthy false data injection(FDI)attack scheme based on historical and current residuals,which only tampers with the measurement residuals of partial sensors due to limited attack resources,is proposed to maximally degrade system estimation performance.The attack stealthiness condition is given,and then the estimation error covariance in compromised state is derived to quantify the system performance under attack.The optimal attack strategy is obtained by solving several convex optimization problems which maximize the trace of the compromised estimation error covariance subject to the stealthiness condition.Moreover,due to the constraint of attack resources,the selection principle of the attacked sensor is provided to determine which sensor is attacked so as to hold the most impact on system performance.Finally,simulation results are presented to verify the theoretical analysis.
文摘This paper aims to explore the application of Extreme Value Theory (EVT) in estimating the conditional extreme quantile for time-to-event outcomes by examining the functional relationship between ambulatory blood pressure trajectories and clinical outcomes in stroke patients. The study utilizes EVT to analyze the functional connection between ambulatory blood pressure trajectories and clinical outcomes in a sample of 297 stroke patients. The 24-hour ambulatory blood pressure measurement curves for every 15 minutes are considered, acknowledging a censored rate of 40%. The findings reveal that the sample mean excess function exhibits a positive gradient above a specific threshold, confirming the heavy-tailed distribution of data in stroke patients with a positive extreme value index. Consequently, the estimated conditional extreme quantile indicates that stroke patients with higher blood pressure measurements face an elevated risk of recurrent stroke occurrence at an early stage. This research contributes to the understanding of the relationship between ambulatory blood pressure and recurrent stroke, providing valuable insights for clinical considerations and potential interventions in stroke management.
文摘For many environmental and agricultural applications, an accurate estimation of surface soil moisture is essential. This study sought to determine whether combining Sentinel-1A, Sentinel-2A, and meteorological data with artificial neural networks (ANN) could improve soil moisture estimation in various land cover types. To train and evaluate the model’s performance, we used field data (provided by La Tuscia University) on the study area collected during time periods between October 2022, and December 2022. Surface soil moisture was measured at 29 locations. The performance of the model was trained, validated, and tested using input features in a 60:10:30 ratio, using the feed-forward ANN model. It was found that the ANN model exhibited high precision in predicting soil moisture. The model achieved a coefficient of determination (R<sup>2</sup>) of 0.71 and correlation coefficient (R) of 0.84. Furthermore, the incorporation of Random Forest (RF) algorithms for soil moisture prediction resulted in an improved R<sup>2</sup> of 0.89. The unique combination of active microwave, meteorological data and multispectral data provides an opportunity to exploit the complementary nature of the datasets. Through preprocessing, fusion, and ANN modeling, this research contributes to advancing soil moisture estimation techniques and providing valuable insights for water resource management and agricultural planning in the study area.
基金This research was supported by the China National Railway Group Co.,Ltd.Research and Development Project(N2022T008).
文摘Purpose–The study aims to provide a basis for the effective use of safety-related information data and a quantitative assessment way for the occurrence probability of the safety risk such as the fatigue fracture of the key components.Design/methodology/approach–The fatigue crack growth rate is of dispersion,which is often used to accurately describe with probability density.In view of the external dispersion caused by the load,a simple and applicable probability expression of fatigue crack growth rate is adopted based on the fatigue growth theory.Considering the isolation among the pairs of crack length a and crack formation time t(a∼t data)obtained from same kind of structural parts,a statistical analysis approach of t distribution is proposed,which divides the crack length in several segments.Furthermore,according to the compatibility criterion of crack growth,that is,there is statistical development correspondence among a∼t data,the probability model of crack growth rate is established.Findings–The results show that the crack growth rate in the stable growth stage can be approximately expressed by the crack growth control curve da/dt=5 Q•a,and the probability density of the crack growth parameter Q represents the external dispersion;t follows two-parameter Weibull distribution in certain a values.Originality/value–The probability density f(Q)can be estimated by using the probability model of crack growth rate,and a calculation example shows that the estimation method is effective and practical.
文摘The application of frequency distribution statistics to data provides objective means to assess the nature of the data distribution and viability of numerical models that are used to visualize and interpret data.Two commonly used tools are the kernel density estimation and reduced chi-squared statistic used in combination with a weighted mean.Due to the wide applicability of these tools,we present a Java-based computer application called KDX to facilitate the visualization of data and the utilization of these numerical tools.
基金The study has been continued under the support of the Foundation for Research Science and Technology of New Zealand under contract C01X0401
文摘An adaptive estimation of forecast error covariance matrices is proposed for Kalman filtering data assim- ilation. A forecast error covariance matrix is initially estimated using an ensemble of perturbation forecasts. This initially estimated matrix is then adjusted with scale parameters that are adaptively estimated by minimizing -2log-likelihood of observed-minus-forecast residuals. The proposed approach could be applied to Kalman filtering data assimilation with imperfect models when the model error statistics are not known. A simple nonlinear model (Burgers' equation model) is used to demonstrate the efficacy of the proposed approach.
基金Sponsored by the National Natural Science Foundation of China(Grant No.51138003)
文摘In order to estimate vehicular queue length at signalized intersections accurately and overcome the shortcomings and restrictions of existing studies especially those based on shockwave theory,a new methodology is presented for estimating vehicular queue length using data from both point detectors and probe vehicles. The methodology applies the shockwave theory to model queue evolution over time and space. Using probe vehicle locations and times as well as point detector measured traffic states,analytical formulations for calculating the maximum and minimum( residual) queue length are developed. The proposed methodology is verified using ground truth data collected from numerical experiments conducted in Shanghai,China. It is found that the methodology has a mean absolute percentage error of 17. 09%,which is reasonably effective in estimating the queue length at traffic signalized intersections. Limitations of the proposed models and algorithms are also discussed in the paper.
文摘Phase errors in synthetic aperture sonar (SAS) imaging must be reduced to less than one eighth of a wavelength so as to avoid image destruction. Most of the phase errors occur as a result of platform motion errors, for example, sway yaw and surge that are the most important error sources. The phase error of a wide band synthetic aperture sonar is modeled and solutions to sway yaw and surge motion estimation based on the raw sonar echo data with a Displaced Phase Center Antenna (DPCA) method are proposed and their implementations are detailed in this paper. It is shown that the sway estimates can be obtained from the correlation lag and phase difference between the returns at coincident phase centers. An estimate of yaw is also possible if such a technique is applied to more than one overlapping phase center positions. Surge estimates can be obtained by identifying pairs of phase centers with a maximum correlation coefficient. The method works only if the platform velocity is low enough such that a number of phase centers from adjacent pings overlap.
文摘A direction-of-arrival (DOA) estimation algorithm based on direct data domain (D3) approach is presented. This method can accuracy estimate DOA using one snapshot modified data, called the temporal and spatial two-dimensional vector reconstruction (TSR) method. The key idea is to apply the D3 approach which can extract the signal of given frequency but null out other frequency signals in temporal domain. Then the spatial vector reconstruction processing is used to estimate the angle of the spatial coherent signal source based on extract signal data. Compared with the common temporal and spatial processing approach, the TSR method has a lower computational load, higher real-time performance, robustness and angular accuracy of DOA. The proposed algorithm can be directly applied to the phased array radar of coherent pulses. Simulation results demonstrate the performance of the proposed technique.
基金Supported by the National Natural Science Foundation of China(No.42075149)。
文摘Sea surface salinity(SSS)is an essential variable of ocean dynamics and climate research.The Soil Moisture and Ocean Salinity(SMOS),Aquarius,and Soil Moisture Active Passive(SMAP)satellite missions all provide SSS measurements.The European Space Agency(ESA)Climate Change Initiative Sea Surface Salinity(CCI-SSS)project merged these three satellite SSS data to produce CCI L4SSS products.We validated the accuracy of the four satellite products(CCI,SMOS,Aquarius,and SMAP)using in-situ gridded data and Argo floats in the South China Sea(SCS).Compared with in-situ gridded data,it shows that the CCI achieved the best performance(RMSD:0.365)on monthly time scales.The RMSD of SMOS,Aquarius,and SMAP(SMOS:0.389;Aquarius:0.409;SMAP:0.391)are close,and the SMOS takes a slight advantage in contrast with Aquarius and SMAP.Large discrepancies can be found near the coastline and in the shelf seas.Meanwhile,CCI with lower RMSD(0.295)perform better than single satellite data(SMOS:0.517;SMAP:0.297)on weekly time scales compared with Argo floats.Overall,the merged CCI have the smallest RMSD among the four satellite products in the SCS on both weekly time scales and monthly time scales,which illustrates the improved accuracy of merged CCI compared with the individual satellite data.
文摘In this paper,a systematic description of the artificial intelligence(AI)-based channel estimation track of the 2nd Wireless Communication AI Competition(WAIC)is provided,which is hosted by IMT-2020(5G)Promotion Group 5G+AIWork Group.Firstly,the system model of demodulation reference signal(DMRS)based channel estimation problem and its corresponding dataset are introduced.Then the potential approaches for enhancing the performance of AI based channel estimation are discussed from the viewpoints of data analysis,pre-processing,key components and backbone network structures.At last,the final competition results composed of different solutions are concluded.It is expected that the AI-based channel estimation track of the 2nd WAIC could provide insightful guidance for both the academia and industry.
文摘In wireless sensor networks, the missing of sensor data is inevitable due to the inherent characteristic of wireless sensor networks, and it causes many difficulties in various applications. To solve the problem, the missing data should be estimated as accurately as possible. In this paper, a k-nearest neighbor based missing data estimation algorithm is proposed based on the temporal and spatial correlation of sensor data. It adopts the linear regression model to describe the spatial correlation of sensor data among different sensor nodes, and utilizes the data information of multiple neighbor nodes to estimate the missing data jointly rather than independently, so that a stable and reliable estimation performance can be achieved. Experimental results on two real-world datasets show that the proposed algorithm can estimate the missing data accurately.
文摘This study presents a simplified multivariate bias correction scheme that is sequentially implemented in the GEOS5 data assimilation system and compared against a control experiment without model bias correction. The results show considerable improvement in terms of the mean biases of rawinsonde observation-minus-background (OmB) residuals for observed water vapor, wind and temperature variables. The time series spectral analysis shows whitening of bias-corrected OmB residuals, and mean biases for rawinsonde observation-minus-analysis (OmA) are also improved. Some wind and temperature biases in the control experiment near the equatorial tropopause nearly vanish from the bias-corrected experiment. Despite the analysis improvement, the bias correction scheme has only a moderate impact on forecast skill. Significant interaction is also found among quality-control, satellite observation bias correction, and background bias correction, and the latter positively impacts satellite bias correction.
文摘Mineral exploration is done by different methods. Geophysical and geochemical studies are two powerful tools in this field. In integrated studies, the results of each study are used to determine the location of the drilling boreholes. The purpose of this study is to use field geophysics to calculate the depth of mineral reserve. The study area is located 38 km from Zarand city called Jalalabad iron mine. In this study, gravimetric data were measured and mineral depth was calculated using the Euler method. 1314 readings have been performed in this area. The rocks of the region include volcanic and sedimentary. The source of the mineralization in the area is hydrothermal processes. After gravity measuring in the region, the data were corrected, then various methods such as anomalous map remaining in levels one and two, upward expansion, first and second-degree vertical derivatives, analytical method, and analytical signal were drawn, and finally, the depth of the deposit was estimated by Euler method. As a result, the depth of the mineral deposit was calculated to be between 20 and 30 meters on average.
文摘The parameter estimation and the coefficient of contamination for the regression models with repeated measures are studied when its response variables are contaminated by another random variable sequence.Under the suitable conditions it is proved that the estimators which are established in the paper are strongly consistent estimators.
基金supported by the National Key R.D Program of China(2021YFB2401904)the Joint Fund project of the National Natural Science Foundation of China(U21A20485)+1 种基金the National Natural Science Foundation of China(61976175)the Key Laboratory Project of Shaanxi Provincial Education Department Scientific Research Projects(20JS109)。
文摘The state of charge(SOC)estimation of lithium-ion battery is an important function in the battery management system(BMS)of electric vehicles.The long short term memory(LSTM)model can be employed for SOC estimation,which is capable of estimating the future changing states of a nonlinear system.Since the BMS usually works under complicated operating conditions,i.e the real measurement data used for model training may be corrupted by non-Gaussian noise,and thus the performance of the original LSTM with the mean square error(MSE)loss may deteriorate.Therefore,a novel LSTM with mixture kernel mean p-power error(MKMPE)loss,called MKMPE-LSTM,is developed by using the MKMPE loss to replace the MSE as the learning criterion in LSTM framework,which can achieve robust SOC estimation under the measurement data contaminated with non-Gaussian noises(or outliers)because of the MKMPE containing the p-order moments of the error distribution.In addition,a meta-heuristic algorithm,called heap-based-optimizer(HBO),is employed to optimize the hyper-parameters(mainly including learning rate,number of hidden layer neuron and value of p in MKMPE)of the proposed MKMPE-LSTM model to further improve its flexibility and generalization performance,and a novel hybrid model(HBO-MKMPE-LSTM)is established for SOC estimation under non-Gaussian noise cases.Finally,several tests are performed under various cases through a benchmark to evaluate the performance of the proposed HBO-MKMPE-LSTM model,and the results demonstrate that the proposed hybrid method can provide a good robustness and accuracy under different non-Gaussian measurement noises,and the SOC estimation results in terms of mean square error(MSE),root MSE(RMSE),mean absolute relative error(MARE),and determination coefficient R2are less than 0.05%,3%,3%,and above 99.8%at 25℃,respectively.
文摘Next Generation Sequencing (NGS) provides an effective basis for estimating the survival time of cancer patients, but it also poses the problem of high data dimensionality, in addition to the fact that some patients drop out of the study, making the data missing, so a method for estimating the mean of the response variable with missing values for the ultra-high dimensional datasets is needed. In this paper, we propose a two-stage ultra-high dimensional variable screening method, RF-SIS, based on random forest regression, which effectively solves the problem of estimating missing values due to excessive data dimension. After the dimension reduction process by applying RF-SIS, mean interpolation is executed on the missing responses. The results of the simulated data show that compared with the estimation method of directly deleting missing observations, the estimation results of RF-SIS-MI have significant advantages in terms of the proportion of intervals covered, the average length of intervals, and the average absolute deviation.
基金the Training Program of the Major Research Plan of the National Natural Science Foundation of China(91746118)the Shenzhen Municipal Science and Technology Innovation Committee Basic Research project(JCYJ20170410172224515)。
文摘The smart grid is an evolving critical infrastructure,which combines renewable energy and the most advanced information and communication technologies to provide more economic and secure power supply services.To cope with the intermittency of ever-increasing renewable energy and ensure the security of the smart grid,state estimation,which serves as a basic tool for understanding the true states of a smart grid,should be performed with high frequency.More complete system state data are needed to support high-frequency state estimation.The data completeness problem for smart grid state estimation is therefore studied in this paper.The problem of improving data completeness by recovering highfrequency data from low-frequency data is formulated as a super resolution perception(SRP)problem in this paper.A novel machine-learning-based SRP approach is thereafter proposed.The proposed method,namely the Super Resolution Perception Net for State Estimation(SRPNSE),consists of three steps:feature extraction,information completion,and data reconstruction.Case studies have demonstrated the effectiveness and value of the proposed SRPNSE approach in recovering high-frequency data from low-frequency data for the state estimation.