Crime scene investigation(CSI)image is key evidence carrier during criminal investiga-tion,in which CSI image retrieval can assist the public police to obtain criminal clues.Moreover,with the rapid development of deep...Crime scene investigation(CSI)image is key evidence carrier during criminal investiga-tion,in which CSI image retrieval can assist the public police to obtain criminal clues.Moreover,with the rapid development of deep learning,data-driven paradigm has become the mainstreammethod of CSI image feature extraction and representation,and in this process,datasets provideeffective support for CSI retrieval performance.However,there is a lack of systematic research onCSI image retrieval methods and datasets.Therefore,we present an overview of the existing worksabout one-class and multi-class CSI image retrieval based on deep learning.According to theresearch,based on their technical functionalities and implementation methods,CSI image retrievalis roughly classified into five categories:feature representation,metric learning,generative adversar-ial networks,autoencoder networks and attention networks.Furthermore,We analyzed the remain-ing challenges and discussed future work directions in this field.展开更多
Machine learning is currently one of the research hotspots in the field of landslide prediction.To clarify and evaluate the differences in characteristics and prediction effects of different machine learning models,Co...Machine learning is currently one of the research hotspots in the field of landslide prediction.To clarify and evaluate the differences in characteristics and prediction effects of different machine learning models,Conghua District,which is the most prone to landslide disasters in Guangzhou,was selected for landslide susceptibility evaluation.The evaluation factors were selected by using correlation analysis and variance expansion factor method.Applying four machine learning methods namely Logistic Regression(LR),Random Forest(RF),Support Vector Machines(SVM),and Extreme Gradient Boosting(XGB),landslide models were constructed.Comparative analysis and evaluation of the model were conducted through statistical indices and receiver operating characteristic(ROC)curves.The results showed that LR,RF,SVM,and XGB models have good predictive performance for landslide susceptibility,with the area under curve(AUC)values of 0.752,0.965,0.996,and 0.998,respectively.XGB model had the highest predictive ability,followed by RF model,SVM model,and LR model.The frequency ratio(FR)accuracy of LR,RF,SVM,and XGB models was 0.775,0.842,0.759,and 0.822,respectively.RF and XGB models were superior to LR and SVM models,indicating that the integrated algorithm has better predictive ability than a single classification algorithm in regional landslide classification problems.展开更多
Geological Hazards Investigation and Evaluation is the core course of Environmental Geological Engineering,aiming to cultivate skilled talents with solid theoretical knowledge and excellent practical skills.At present...Geological Hazards Investigation and Evaluation is the core course of Environmental Geological Engineering,aiming to cultivate skilled talents with solid theoretical knowledge and excellent practical skills.At present,the course faces several issues,including a teaching environment disconnected from real-world work scenarios,course content that deviates from job-related tasks,a lack of digital teaching resources,and reliance on a single teaching method,leading to students’poor feedback from employers.Based on the concept of outcome-based education,the course team of Geological Hazards Investigation and Evaluation establishes a“five-step double-rotation”blended teaching model with the help of a Small Private Online Course platform.The program is designed to improve the teaching environment and expand the digitalized teaching resources in order to improve students’learning motivation,enhance learning effectiveness,and cultivate skillful talents who meet employers’satisfaction.展开更多
Various uncertainties arising during acquisition process of geoscience data may result in anomalous data instances(i.e.,outliers)that do not conform with the expected pattern of regular data instances.With sparse mult...Various uncertainties arising during acquisition process of geoscience data may result in anomalous data instances(i.e.,outliers)that do not conform with the expected pattern of regular data instances.With sparse multivariate data obtained from geotechnical site investigation,it is impossible to identify outliers with certainty due to the distortion of statistics of geotechnical parameters caused by outliers and their associated statistical uncertainty resulted from data sparsity.This paper develops a probabilistic outlier detection method for sparse multivariate data obtained from geotechnical site investigation.The proposed approach quantifies the outlying probability of each data instance based on Mahalanobis distance and determines outliers as those data instances with outlying probabilities greater than 0.5.It tackles the distortion issue of statistics estimated from the dataset with outliers by a re-sampling technique and accounts,rationally,for the statistical uncertainty by Bayesian machine learning.Moreover,the proposed approach also suggests an exclusive method to determine outlying components of each outlier.The proposed approach is illustrated and verified using simulated and real-life dataset.It showed that the proposed approach properly identifies outliers among sparse multivariate data and their corresponding outlying components in a probabilistic manner.It can significantly reduce the masking effect(i.e.,missing some actual outliers due to the distortion of statistics by the outliers and statistical uncertainty).It also found that outliers among sparse multivariate data instances affect significantly the construction of multivariate distribution of geotechnical parameters for uncertainty quantification.This emphasizes the necessity of data cleaning process(e.g.,outlier detection)for uncertainty quantification based on geoscience data.展开更多
Autonomous learning has been attracting more and more attention in the field of second language teaching and learning since it was put forward. In order to get a better understanding about autonomous learning competen...Autonomous learning has been attracting more and more attention in the field of second language teaching and learning since it was put forward. In order to get a better understanding about autonomous learning competence of freshmen of English major in university, this investigation was conducted in the form of questionnaire and was analyzed according to the data collected. The investigation found that autonomous learning competence of freshmen is poor and worrying. Freshmen have strong motivation for English learning, but they keep old learning habit and more rely on teachers. They are incapable of employing metacognitive strategies in their language learning and are not good at utilizing related learning resources available to them. All these deficiencies hinder improvement of them. So they are in great need of fostering the competence of automous learning.展开更多
This paper attempts to investigate the feasibility and learning effects of English fragmented learning via mobile devices.Questionnaires and interviews were employed to do the survey on 157 undergraduates from 24 univ...This paper attempts to investigate the feasibility and learning effects of English fragmented learning via mobile devices.Questionnaires and interviews were employed to do the survey on 157 undergraduates from 24 universities in China.The research findings reveal that English fragmented learning via mobile devices has some positive effects on the improvement of learners’knowledge and learning ability.In addition,there have been a certain number of useful learning resources and platforms with diversified features.The study has the implications that fragmented mobile learning is feasible and can be popularized in English learning.展开更多
English learning motivation plays a more and more important role in junior middle school students' study,and it is necessary for students to learn English effectively.Therefore,teachers should take responsibilitie...English learning motivation plays a more and more important role in junior middle school students' study,and it is necessary for students to learn English effectively.Therefore,teachers should take responsibilities for stimulating students' English learning motivation.This present thesis investigates 63 students who are from class 1 grade 1,class 2 grade 1 and class 3 grade 1 in NanJie country junior middle school in LinYing town LuoHe city by the way of questionnaire.And the thesis discusses the source of students' learning motivation,for the purpose of putting forward strategies of motivating students' English learning motivation in accordance with students' type of motivation.展开更多
This study investigated the application and the effect of Group Investigation(GI) in the College English Program in a Chinese University. A qualitative case study method was used to understand the GI system used by Ch...This study investigated the application and the effect of Group Investigation(GI) in the College English Program in a Chinese University. A qualitative case study method was used to understand the GI system used by Chinese instructors as well as the achievements acquired and challenges met by the participants. Three instructors and fifteen second-year-undergraduates taking a course titled Sources of European Culture participated. Interviews, observations, and documents were used to collect the data. Data analysis showed Chinese instructors applied a GI technique similar to that discussed by Johnson and Johnson(1999); however, GI in the Chinese context demanded more effort from the teacher for designing tasks and provided help in modeling uses of English and in preparing visual, especially Power Point, presentations. Although participants used their mother tongue at some stages, their autonomy over English learning was activated, and horizons in the course content were broadened.展开更多
This study introduces a generic framework for geotechnical subsurface modeling, which accounts for spatial autocorrelation with local mapping machine learning(ML) methods. Instead of using XY coordinate fields directl...This study introduces a generic framework for geotechnical subsurface modeling, which accounts for spatial autocorrelation with local mapping machine learning(ML) methods. Instead of using XY coordinate fields directly as model input, a series of autocorrelated geotechnical distance fields(GDFs) is designed to enable the ML models to infer the spatial relationship between the sampled locations and unknown locations. The whole framework using GDF with ML methods is named GDF-ML. This framework is purely data-driven which avoids the tedious work in the scale of fluctuations(SOFs)estimating and data detrending in the conventional spatial interpolation methods. Six local mapping ML methods(extra trees(ETs), gradient boosting(GB), extreme gradient boosting(XGBoost), random forest(RF), general regression neural network(GRNN) and k-nearest neighbors(KNN)) are compared in the GDF-ML framework. The results show that the GDFs are better than the conventional XY coordinate fields based ML methods in both accuracy and spatial continuity. GDF-ML is flexible which can be applied to high-dimensional, multi-variable and incomplete datasets. Among these six methods, GDF with ET method(GDF-ET) clearly shows the best accuracy and best spatial continuity. The proposed GDF-ET method can provide a fast and accurate interpretation of the soil property profile. Sensitivity analysis shows that this method is applicable to very small training dataset size. The associated statistical uncertainty can also be quantified so that the reliability of the subsurface modeling results can be estimated objectively and explicitly. The uncertainty results clearly show that the prediction becomes more accurate when more sampled data are available.展开更多
Personal credit risk assessment is an important part of the development of financial enterprises. Big data credit investigation is an inevitable trend of personal credit risk assessment, but some data are missing and ...Personal credit risk assessment is an important part of the development of financial enterprises. Big data credit investigation is an inevitable trend of personal credit risk assessment, but some data are missing and the amount of data is small, so it is difficult to train. At the same time, for different financial platforms, we need to use different models to train according to the characteristics of the current samples, which is time-consuming. <span style="font-family:Verdana;">In view of</span><span style="font-family:Verdana;"> these two problems, this paper uses the idea of transfer learning to build a transferable personal credit risk model based on Instance-based Transfer Learning (Instance-based TL). The model balances the weight of the samples in the source domain, and migrates the existing large dataset samples to the target domain of small samples, and finds out the commonness between them. At the same time, we have done a lot of experiments on the selection of base learners, including traditional machine learning algorithms and ensemble learning algorithms, such as decision tree, logistic regression, </span><span style="font-family:Verdana;">xgboost</span> <span style="font-family:Verdana;">and</span><span style="font-family:Verdana;"> so on. The datasets are from P2P platform and bank, the results show that the AUC value of Instance-based TL is 24% higher than that of the traditional machine learning model, which fully proves that the model in this paper has good application value. The model’s evaluation uses AUC, prediction, recall, F1. These criteria prove that this model has good application value from many aspects. At present, we are trying to apply this model to more fields to improve the robustness and applicability of the model;on the other hand, we are trying to do more in-depth research on domain adaptation to enrich the model.</span>展开更多
文摘Crime scene investigation(CSI)image is key evidence carrier during criminal investiga-tion,in which CSI image retrieval can assist the public police to obtain criminal clues.Moreover,with the rapid development of deep learning,data-driven paradigm has become the mainstreammethod of CSI image feature extraction and representation,and in this process,datasets provideeffective support for CSI retrieval performance.However,there is a lack of systematic research onCSI image retrieval methods and datasets.Therefore,we present an overview of the existing worksabout one-class and multi-class CSI image retrieval based on deep learning.According to theresearch,based on their technical functionalities and implementation methods,CSI image retrievalis roughly classified into five categories:feature representation,metric learning,generative adversar-ial networks,autoencoder networks and attention networks.Furthermore,We analyzed the remain-ing challenges and discussed future work directions in this field.
基金supported by the projects of the China Geological Survey(DD20221729,DD20190291)Zhuhai Urban Geological Survey(including informatization)(MZCD–2201–008).
文摘Machine learning is currently one of the research hotspots in the field of landslide prediction.To clarify and evaluate the differences in characteristics and prediction effects of different machine learning models,Conghua District,which is the most prone to landslide disasters in Guangzhou,was selected for landslide susceptibility evaluation.The evaluation factors were selected by using correlation analysis and variance expansion factor method.Applying four machine learning methods namely Logistic Regression(LR),Random Forest(RF),Support Vector Machines(SVM),and Extreme Gradient Boosting(XGB),landslide models were constructed.Comparative analysis and evaluation of the model were conducted through statistical indices and receiver operating characteristic(ROC)curves.The results showed that LR,RF,SVM,and XGB models have good predictive performance for landslide susceptibility,with the area under curve(AUC)values of 0.752,0.965,0.996,and 0.998,respectively.XGB model had the highest predictive ability,followed by RF model,SVM model,and LR model.The frequency ratio(FR)accuracy of LR,RF,SVM,and XGB models was 0.775,0.842,0.759,and 0.822,respectively.RF and XGB models were superior to LR and SVM models,indicating that the integrated algorithm has better predictive ability than a single classification algorithm in regional landslide classification problems.
基金Scientific Research Fund of Hunan Provincial Education Department Excellent Youth Project(23B0953)Hunan Province Vocational College Education and Teaching Reform Research Project(ZJGB2022427)。
文摘Geological Hazards Investigation and Evaluation is the core course of Environmental Geological Engineering,aiming to cultivate skilled talents with solid theoretical knowledge and excellent practical skills.At present,the course faces several issues,including a teaching environment disconnected from real-world work scenarios,course content that deviates from job-related tasks,a lack of digital teaching resources,and reliance on a single teaching method,leading to students’poor feedback from employers.Based on the concept of outcome-based education,the course team of Geological Hazards Investigation and Evaluation establishes a“five-step double-rotation”blended teaching model with the help of a Small Private Online Course platform.The program is designed to improve the teaching environment and expand the digitalized teaching resources in order to improve students’learning motivation,enhance learning effectiveness,and cultivate skillful talents who meet employers’satisfaction.
基金supported by the National Key R&D Program of China(Project No.2016YFC0800200)the NRF-NSFC 3rd Joint Research Grant(Earth Science)(Project No.41861144022)+2 种基金the National Natural Science Foundation of China(Project Nos.51679174,and 51779189)the Shenzhen Key Technology R&D Program(Project No.20170324)The financial support is grateful acknowledged。
文摘Various uncertainties arising during acquisition process of geoscience data may result in anomalous data instances(i.e.,outliers)that do not conform with the expected pattern of regular data instances.With sparse multivariate data obtained from geotechnical site investigation,it is impossible to identify outliers with certainty due to the distortion of statistics of geotechnical parameters caused by outliers and their associated statistical uncertainty resulted from data sparsity.This paper develops a probabilistic outlier detection method for sparse multivariate data obtained from geotechnical site investigation.The proposed approach quantifies the outlying probability of each data instance based on Mahalanobis distance and determines outliers as those data instances with outlying probabilities greater than 0.5.It tackles the distortion issue of statistics estimated from the dataset with outliers by a re-sampling technique and accounts,rationally,for the statistical uncertainty by Bayesian machine learning.Moreover,the proposed approach also suggests an exclusive method to determine outlying components of each outlier.The proposed approach is illustrated and verified using simulated and real-life dataset.It showed that the proposed approach properly identifies outliers among sparse multivariate data and their corresponding outlying components in a probabilistic manner.It can significantly reduce the masking effect(i.e.,missing some actual outliers due to the distortion of statistics by the outliers and statistical uncertainty).It also found that outliers among sparse multivariate data instances affect significantly the construction of multivariate distribution of geotechnical parameters for uncertainty quantification.This emphasizes the necessity of data cleaning process(e.g.,outlier detection)for uncertainty quantification based on geoscience data.
文摘Autonomous learning has been attracting more and more attention in the field of second language teaching and learning since it was put forward. In order to get a better understanding about autonomous learning competence of freshmen of English major in university, this investigation was conducted in the form of questionnaire and was analyzed according to the data collected. The investigation found that autonomous learning competence of freshmen is poor and worrying. Freshmen have strong motivation for English learning, but they keep old learning habit and more rely on teachers. They are incapable of employing metacognitive strategies in their language learning and are not good at utilizing related learning resources available to them. All these deficiencies hinder improvement of them. So they are in great need of fostering the competence of automous learning.
基金This paper was supported by“the National Social Science Fund of China”(Grant Number:17BYY098).
文摘This paper attempts to investigate the feasibility and learning effects of English fragmented learning via mobile devices.Questionnaires and interviews were employed to do the survey on 157 undergraduates from 24 universities in China.The research findings reveal that English fragmented learning via mobile devices has some positive effects on the improvement of learners’knowledge and learning ability.In addition,there have been a certain number of useful learning resources and platforms with diversified features.The study has the implications that fragmented mobile learning is feasible and can be popularized in English learning.
文摘English learning motivation plays a more and more important role in junior middle school students' study,and it is necessary for students to learn English effectively.Therefore,teachers should take responsibilities for stimulating students' English learning motivation.This present thesis investigates 63 students who are from class 1 grade 1,class 2 grade 1 and class 3 grade 1 in NanJie country junior middle school in LinYing town LuoHe city by the way of questionnaire.And the thesis discusses the source of students' learning motivation,for the purpose of putting forward strategies of motivating students' English learning motivation in accordance with students' type of motivation.
基金supported by the 2013 Fundamental Re-search Funds for the Central Universities of Xi’an Jiaotong UniversityThe Subject of Shaanxi Province Educational Science Twelveth-Five-Year Plan
文摘This study investigated the application and the effect of Group Investigation(GI) in the College English Program in a Chinese University. A qualitative case study method was used to understand the GI system used by Chinese instructors as well as the achievements acquired and challenges met by the participants. Three instructors and fifteen second-year-undergraduates taking a course titled Sources of European Culture participated. Interviews, observations, and documents were used to collect the data. Data analysis showed Chinese instructors applied a GI technique similar to that discussed by Johnson and Johnson(1999); however, GI in the Chinese context demanded more effort from the teacher for designing tasks and provided help in modeling uses of English and in preparing visual, especially Power Point, presentations. Although participants used their mother tongue at some stages, their autonomy over English learning was activated, and horizons in the course content were broadened.
基金funded by the Australian Government through the Australian Research Council’s Discovery Projects funding scheme (Project DP190101592)the National Natural Science Foundation of China (Grant Nos. 41972280 and 52179103)。
文摘This study introduces a generic framework for geotechnical subsurface modeling, which accounts for spatial autocorrelation with local mapping machine learning(ML) methods. Instead of using XY coordinate fields directly as model input, a series of autocorrelated geotechnical distance fields(GDFs) is designed to enable the ML models to infer the spatial relationship between the sampled locations and unknown locations. The whole framework using GDF with ML methods is named GDF-ML. This framework is purely data-driven which avoids the tedious work in the scale of fluctuations(SOFs)estimating and data detrending in the conventional spatial interpolation methods. Six local mapping ML methods(extra trees(ETs), gradient boosting(GB), extreme gradient boosting(XGBoost), random forest(RF), general regression neural network(GRNN) and k-nearest neighbors(KNN)) are compared in the GDF-ML framework. The results show that the GDFs are better than the conventional XY coordinate fields based ML methods in both accuracy and spatial continuity. GDF-ML is flexible which can be applied to high-dimensional, multi-variable and incomplete datasets. Among these six methods, GDF with ET method(GDF-ET) clearly shows the best accuracy and best spatial continuity. The proposed GDF-ET method can provide a fast and accurate interpretation of the soil property profile. Sensitivity analysis shows that this method is applicable to very small training dataset size. The associated statistical uncertainty can also be quantified so that the reliability of the subsurface modeling results can be estimated objectively and explicitly. The uncertainty results clearly show that the prediction becomes more accurate when more sampled data are available.
文摘Personal credit risk assessment is an important part of the development of financial enterprises. Big data credit investigation is an inevitable trend of personal credit risk assessment, but some data are missing and the amount of data is small, so it is difficult to train. At the same time, for different financial platforms, we need to use different models to train according to the characteristics of the current samples, which is time-consuming. <span style="font-family:Verdana;">In view of</span><span style="font-family:Verdana;"> these two problems, this paper uses the idea of transfer learning to build a transferable personal credit risk model based on Instance-based Transfer Learning (Instance-based TL). The model balances the weight of the samples in the source domain, and migrates the existing large dataset samples to the target domain of small samples, and finds out the commonness between them. At the same time, we have done a lot of experiments on the selection of base learners, including traditional machine learning algorithms and ensemble learning algorithms, such as decision tree, logistic regression, </span><span style="font-family:Verdana;">xgboost</span> <span style="font-family:Verdana;">and</span><span style="font-family:Verdana;"> so on. The datasets are from P2P platform and bank, the results show that the AUC value of Instance-based TL is 24% higher than that of the traditional machine learning model, which fully proves that the model in this paper has good application value. The model’s evaluation uses AUC, prediction, recall, F1. These criteria prove that this model has good application value from many aspects. At present, we are trying to apply this model to more fields to improve the robustness and applicability of the model;on the other hand, we are trying to do more in-depth research on domain adaptation to enrich the model.</span>
文摘目的研究听觉词汇学习测验(auditory vocabulary learning test,AVLT)对轻度认知障碍(mild cognitive impairment,MCI)进展为痴呆的预测能力。方法对257例MCI患者进行纵向随访,然后根据临床结果将其分为痴呆进展组和非痴呆进展组。比较这些组的基线人口统计学信息和AVLT评分。构建受试者工作特征(receiver operating characteristic,ROC)曲线以评估AVLT评分对MCI转归的区分值。结果在6年后的随访中,有45例受试者进展为痴呆,归为痴呆进展组(MCI progression,MCIp),3例受试者恢复正常认知,209例受试者维持MCI,一同归为非痴呆进展组(MCI non-progression,MCInp)。在基线时,MCIp组的AVLT评分明显低于MCInp,差异有统计学意义(P<0.05)。ROC曲线分析显示,AVLT延迟回忆(delayed recall,AVLT-DR)在区分MCI患者进展为痴呆方面有最大的曲线下面积(largest area under the curve,AUC),是重要预测指标。结论AVLT,尤其是AVLT-DR评分较低能较好预测MCI进展为痴呆,但由于其特异度偏低,需要联合其他特异度高的量表综合使用来运用于临床工作。