期刊文献+
共找到10篇文章
< 1 >
每页显示 20 50 100
Physics-informed neural network-based petroleum reservoir simulation with sparse data using domain decomposition
1
作者 Jiang-Xia Han Liang Xue +4 位作者 Yun-Sheng Wei Ya-Dong Qi Jun-Lei Wang Yue-Tian Liu Yu-Qi Zhang 《Petroleum Science》 SCIE EI CAS CSCD 2023年第6期3450-3460,共11页
Recent advances in deep learning have expanded new possibilities for fluid flow simulation in petroleum reservoirs.However,the predominant approach in existing research is to train neural networks using high-fidelity ... Recent advances in deep learning have expanded new possibilities for fluid flow simulation in petroleum reservoirs.However,the predominant approach in existing research is to train neural networks using high-fidelity numerical simulation data.This presents a significant challenge because the sole source of authentic wellbore production data for training is sparse.In response to this challenge,this work introduces a novel architecture called physics-informed neural network based on domain decomposition(PINN-DD),aiming to effectively utilize the sparse production data of wells for reservoir simulation with large-scale systems.To harness the capabilities of physics-informed neural networks(PINNs)in handling small-scale spatial-temporal domain while addressing the challenges of large-scale systems with sparse labeled data,the computational domain is divided into two distinct sub-domains:the well-containing and the well-free sub-domain.Moreover,the two sub-domains and the interface are rigorously constrained by the governing equations,data matching,and boundary conditions.The accuracy of the proposed method is evaluated on two problems,and its performance is compared against state-of-the-art PINNs through numerical analysis as a benchmark.The results demonstrate the superiority of PINN-DD in handling large-scale reservoir simulation with limited data and show its potential to outperform conventional PINNs in such scenarios. 展开更多
关键词 Physical-informed neural networks Fluid flow simulation Sparse data Domain decomposition
下载PDF
Fast Computation of Sparse Data Cubes with Constraints 被引量:2
2
作者 FengYu-cai ChenChang-qing FengJian-lin XiangLong-gang 《Wuhan University Journal of Natural Sciences》 EI CAS 2004年第2期167-172,共6页
For a data cube there are always constraints between dimensions or among attributes in a dimension, such as functional dependencies. We introduce the problem that when there are functional dependencies, how to use the... For a data cube there are always constraints between dimensions or among attributes in a dimension, such as functional dependencies. We introduce the problem that when there are functional dependencies, how to use them to speed up the computation of sparse data cubes. A new algorithm CFD (Computation by Functional Dependencies) is presented to satisfy this demand. CFD determines the order of dimensions by considering cardinalities of dimensions and functional dependencies between dimensions together, thus reduce the number of partitions for such dimensions. CFD also combines partitioning from bottom to up and aggregate computation from top to bottom to speed up the computation further. CFD can efficiently compute a data cube with hierarchies in a dimension from the smallest granularity to the coarsest one. Key words sparse data cube - functional dependency - dimension - partition - CFD CLC number TP 311 Foundation item: Supported by the E-Government Project of the Ministry of Science and Technology of China (2001BA110B01)Biography: Feng Yu-cai (1945-), male, Professor, research direction: database system. 展开更多
关键词 sparse data cube functional dependency DIMENSION PARTITION CFD
下载PDF
Probabilistic outlier detection for sparse multivariate geotechnical site investigation data using Bayesian learning 被引量:3
3
作者 Shuo Zheng Yu-Xin Zhu +3 位作者 Dian-Qing Li Zi-Jun Cao Qin-Xuan Deng Kok-Kwang Phoon 《Geoscience Frontiers》 SCIE CAS CSCD 2021年第1期425-439,共15页
Various uncertainties arising during acquisition process of geoscience data may result in anomalous data instances(i.e.,outliers)that do not conform with the expected pattern of regular data instances.With sparse mult... Various uncertainties arising during acquisition process of geoscience data may result in anomalous data instances(i.e.,outliers)that do not conform with the expected pattern of regular data instances.With sparse multivariate data obtained from geotechnical site investigation,it is impossible to identify outliers with certainty due to the distortion of statistics of geotechnical parameters caused by outliers and their associated statistical uncertainty resulted from data sparsity.This paper develops a probabilistic outlier detection method for sparse multivariate data obtained from geotechnical site investigation.The proposed approach quantifies the outlying probability of each data instance based on Mahalanobis distance and determines outliers as those data instances with outlying probabilities greater than 0.5.It tackles the distortion issue of statistics estimated from the dataset with outliers by a re-sampling technique and accounts,rationally,for the statistical uncertainty by Bayesian machine learning.Moreover,the proposed approach also suggests an exclusive method to determine outlying components of each outlier.The proposed approach is illustrated and verified using simulated and real-life dataset.It showed that the proposed approach properly identifies outliers among sparse multivariate data and their corresponding outlying components in a probabilistic manner.It can significantly reduce the masking effect(i.e.,missing some actual outliers due to the distortion of statistics by the outliers and statistical uncertainty).It also found that outliers among sparse multivariate data instances affect significantly the construction of multivariate distribution of geotechnical parameters for uncertainty quantification.This emphasizes the necessity of data cleaning process(e.g.,outlier detection)for uncertainty quantification based on geoscience data. 展开更多
关键词 Outlier detection Site investigation Sparse multivariate data Mahalanobis distance Resampling by half-means Bayesian machine learning
下载PDF
Minimum MSE Weighted Estimator to Make Inferences for a Common Risk Ratio across Sparse Meta-Analysis Data
4
作者 Chukiat Viwatwongkasem Sutthisak Srisawad +4 位作者 Pichitpong Soontornpipit Jutatip Sillabutra Pratana Satitvipawee Prasong Kitidamrongsuk Hathaikan Chootrakool 《Open Journal of Statistics》 2022年第1期49-69,共21页
The paper aims to discuss three interesting issues of statistical inferences for a common risk ratio (RR) in sparse meta-analysis data. Firstly, the conventional log-risk ratio estimator encounters a number of problem... The paper aims to discuss three interesting issues of statistical inferences for a common risk ratio (RR) in sparse meta-analysis data. Firstly, the conventional log-risk ratio estimator encounters a number of problems when the number of events in the experimental or control group is zero in sparse data of a 2 × 2 table. The adjusted log-risk ratio estimator with the continuity correction points  based upon the minimum Bayes risk with respect to the uniform prior density over (0, 1) and the Euclidean loss function is proposed. Secondly, the interest is to find the optimal weights of the pooled estimate  that minimize the mean square error (MSE) of  subject to the constraint on  where , , . Finally, the performance of this minimum MSE weighted estimator adjusted with various values of points  is investigated to compare with other popular estimators, such as the Mantel-Haenszel (MH) estimator and the weighted least squares (WLS) estimator (also equivalently known as the inverse-variance weighted estimator) in senses of point estimation and hypothesis testing via simulation studies. The results of estimation illustrate that regardless of the true values of RR, the MH estimator achieves the best performance with the smallest MSE when the study size is rather large  and the sample sizes within each study are small. The MSE of WLS estimator and the proposed-weight estimator adjusted by , or , or are close together and they are the best when the sample sizes are moderate to large (and) while the study size is rather small. 展开更多
关键词 Minimum MSE Weights Adjusted Log-Risk Ratio Estimator Sparse Meta-Analysis data Continuity Correction
下载PDF
Bayesian ensemble methods for predicting ground deformation due to tunnelling with sparse monitoring data
5
作者 Zilong Zhang Tingting Zhang +1 位作者 Xiaozhou Li Daniel Dias 《Underground Space》 SCIE EI CSCD 2024年第3期79-93,共15页
Numerous analytical models have been developed to predict ground deformations induced by tunneling,which is a critical issue in tunnel engineering.However,the accuracy of these predictions is often limited by errors a... Numerous analytical models have been developed to predict ground deformations induced by tunneling,which is a critical issue in tunnel engineering.However,the accuracy of these predictions is often limited by errors and uncertainties resulting from model selection and parameter fittings,given the paucity of monitoring data in field settings.This paper proposes a novel approach to estimate tunnelling-induced ground deformations by applying Bayesian model averaging to several representative prediction models.By accounting for both model and parameter uncertainties,this approach enables more realistic predictions of ground deformations than individual models.Specifically,our results indicate that the Gonzalez-Sagaseta model outperforms other models in predicting ground surface settlements,while the Loganathan-Poulos model is most suitable for predicting subsurface vertical and horizontal deformations.Importantly,our analysis reveals that when monitoring data are sparse,model uncertainties may contribute up to 78.7%of the total uncertainties.Thus,obtaining sufficient data for parameter fitting is crucial for accurate predictions.The proposed method in this study offers a more realistic and efficient prediction of tunnelling-induced ground deformations. 展开更多
关键词 Tunnelling-induced ground deformations Sparse data Model uncertainties Bayesian model averaging
原文传递
A generative deep learning framework for airfoil flow field prediction with sparse data 被引量:4
6
作者 Haizhou WU Xuejun LIU +1 位作者 Wei AN Hongqiang LYU 《Chinese Journal of Aeronautics》 SCIE EI CAS CSCD 2022年第1期470-484,共15页
Deep learning has been probed for the airfoil performance prediction in recent years.Compared with the expensive CFD simulations and wind tunnel experiments,deep learning models can be leveraged to somewhat mitigate s... Deep learning has been probed for the airfoil performance prediction in recent years.Compared with the expensive CFD simulations and wind tunnel experiments,deep learning models can be leveraged to somewhat mitigate such expenses with proper means.Nevertheless,effective training of the data-driven models in deep learning severely hinges on the data in diversity and quantity.In this paper,we present a novel data augmented Generative Adversarial Network(GAN),daGAN,for rapid and accurate flow filed prediction,allowing the adaption to the task with sparse data.The presented approach consists of two modules,pre-training module and fine-tuning module.The pre-training module utilizes a conditional GAN(cGAN)to preliminarily estimate the distribution of the training data.In the fine-tuning module,we propose a novel adversarial architecture with two generators one of which fulfils a promising data augmentation operation,so that the complement data is adequately incorporated to boost the generalization of the model.We use numerical simulation data to verify the generalization of daGAN on airfoils and flow conditions with sparse training data.The results show that daGAN is a promising tool for rapid and accurate evaluation of detailed flow field without the requirement for big training data. 展开更多
关键词 CFD Flow field Generative adversarial networks(GANs) Sparse data Supercritical airfoil
原文传递
An integrated autoencoder-based filter for sparse big data
7
作者 Wei Peng Baogui Xin 《Journal of Control and Decision》 EI 2021年第3期260-268,共9页
We propose a novel filter for sparse big data,called an integrated autoencoder(IAE),which utilises auxiliary information to mitigate data sparsity.The proposed model achieves an appropriate balance between prediction ... We propose a novel filter for sparse big data,called an integrated autoencoder(IAE),which utilises auxiliary information to mitigate data sparsity.The proposed model achieves an appropriate balance between prediction accuracy,convergence speed,and complexity.We implement experiments on a GPS trajectory dataset,and the results demonstrate that the IAE is more accurate and robust than some state-of-the-art methods. 展开更多
关键词 Sparse big data integrated autoencoder(IAE) data sparsity PREDICTION FILTER
原文传递
Generating high-resolution climate maps from sparse and irregular observations using a novel hybrid RBF network
8
作者 Yue Han Zhihua Zhang M.James C.Crabbe 《Big Earth Data》 EI CSCD 2023年第4期1120-1145,共26页
Sparse and irregular climate observations in many developing countries are not enough to satisfy the need of assessing climate change risks and planning suitable mitigation strategies.The wideused statistical downscal... Sparse and irregular climate observations in many developing countries are not enough to satisfy the need of assessing climate change risks and planning suitable mitigation strategies.The wideused statistical downscaling model(SDSM)software tools use multi-linear regression to extract linear relations between largescale and local climate variables and then produce high-resolution climate maps from sparse climate observations.The latest machine learning techniques(e.g.SRCNN,SRGAN)can extract nonlinear links,but they are only suitable for downscaling low-resolution grid data and cannot utilize the link to other climate variables to improve the downscaling performance.In this study,we proposed a novel hybrid RBF(Radial Basis Function)network by embedding several RBF networks into new RBF networks.Our model can well incorporate climate and topographical variables with different resolutions and extract their nonlinear relations for spatial downscaling.To test the performance of our model,we generated high-resolution precipitation,air temperature and humidity maps from 34 meteorological stations in Bangladesh.In terms of three statistical indicators,the accuracy of high-resolution climate maps generated by our hybrid RBF network clearly outperformed those using a multi-linear regression(MLR),Kriging interpolation or a pure RBF network. 展开更多
关键词 Hybrid RBF network climate map sparse observed climate data high resolution
原文传递
Short text classification based on strong feature thesaurus 被引量:7
9
作者 Bing-kun WANG Yong-feng HUANG +1 位作者 Wan-xia YANG Xing LI 《Journal of Zhejiang University-Science C(Computers and Electronics)》 SCIE EI 2012年第9期649-659,共11页
Data sparseness, the evident characteristic of short text, has always been regarded as the main cause of the low ac- curacy in the classification of short texts using statistical methods. Intensive research has been c... Data sparseness, the evident characteristic of short text, has always been regarded as the main cause of the low ac- curacy in the classification of short texts using statistical methods. Intensive research has been conducted in this area during the past decade. However, most researchers failed to notice that ignoring the semantic importance of certain feature terms might also contribute to low classification accuracy. In this paper we present a new method to tackle the problem by building a strong feature thesaurus (SFT) based on latent Dirichlet allocation (LDA) and information gain (IG) models. By giving larger weights to feature terms in SFT, the classification accuracy can be improved. Specifically, our method appeared to be more effective with more detailed classification. Experiments in two short text datasets demonstrate that our approach achieved improvement compared with the state-of-the-art methods including support vector machine (SVM) and Naive Bayes Multinomial. 展开更多
关键词 Short text CLASSIFICATION data sparseness SEMANTIC Strong feature thesaurus (SFT) Latent Dirichlet allocation(LDA)
原文传递
Factorization Machine Based on Bitwise Feature Importance for CTR Prediction
10
作者 Hao Li Caimao Li +2 位作者 Yuquan Hou Hao Lin Qiuhong Chen 《国际计算机前沿大会会议论文集》 2022年第1期29-40,共12页
Click-through-rate(CTR)prediction is a crucial task in recommendation systems.The accuracy of CTR prediction is strongly influenced by the precise extraction of essential data and the modeling strategy chosen.The data... Click-through-rate(CTR)prediction is a crucial task in recommendation systems.The accuracy of CTR prediction is strongly influenced by the precise extraction of essential data and the modeling strategy chosen.The data of the CTR task are often very sparse,and Factorization Machines(FMs)are a class of general predictors working effectively with it.However,the performance of FMs can be limited by the fixed feature representation and the same weight of different features.In this work,we propose an improved Bitwise Feature Importance Factorization Machine(BFIFM)to improve the accuracy.The necessity of learning the degree of effect of the same feature under various situations is learned through the low-order intersection method,and the deep neural network(DNN)in our model is used in parallel to study high-order intersections.According to the final results obtained,the BFIFM model significantly outperforms other state-of-the-art models. 展开更多
关键词 Factorization machines Deep learning RECOMMENDATION Sparse data
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部