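To address the tendency of chi-squared automatic interaction detection (CHAID) decision trees to overfit, a CHAID random forest method (CHAID Random Forest, CHAID-RF) is proposed. The method combines random sampling, random feature selection, and ensembling, with CHAID decision trees as the base classifiers, to form CHAID-RF. To verify the effectiveness of CHAID-RF, CART, CHAID, SVM, and RF were chosen as comparison algorithms. Accuracy, weighted precision, weighted recall, and weighted F-score were used as evaluation metrics for the classification models, and root-mean-square error was used for the regression models. Validation was carried out on 10 classification datasets and 7 regression datasets. The experimental results show that CHAID-RF is feasible and effective.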
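The three ingredients named in the abstract (random sampling, random feature selection, and ensembling) can be illustrated with a minimal sketch. This is not the authors' implementation: the base learner here is a one-level, chi-square-driven multiway split standing in for a full CHAID tree (real CHAID also merges categories and compares adjusted p-values), and the names `chi2_stat`, `Chi2Stump`, and `ChaidStyleForest` are hypothetical.

```python
import numpy as np

def chi2_stat(x, y):
    """Pearson chi-square statistic of the contingency table of x versus y."""
    xs, xi = np.unique(x, return_inverse=True)
    ys, yi = np.unique(y, return_inverse=True)
    table = np.zeros((len(xs), len(ys)))
    np.add.at(table, (xi, yi), 1)
    expected = table.sum(1, keepdims=True) * table.sum(0, keepdims=True) / table.sum()
    return ((table - expected) ** 2 / expected).sum()

class Chi2Stump:
    """Simplified stand-in for a CHAID tree: a single multiway split."""

    def fit(self, X, y, feature_ids):
        self.default_ = self._majority(y)
        # Pick the candidate feature most associated with the label.
        # Assumption: raw statistics are compared; CHAID proper uses p-values.
        self.feature_ = max(feature_ids, key=lambda j: chi2_stat(X[:, j], y))
        col = X[:, self.feature_]
        self.leaf_ = {v: self._majority(y[col == v]) for v in np.unique(col)}
        return self

    @staticmethod
    def _majority(y):
        vals, counts = np.unique(y, return_counts=True)
        return vals[np.argmax(counts)]

    def predict(self, X):
        return np.array([self.leaf_.get(v, self.default_) for v in X[:, self.feature_]])

class ChaidStyleForest:
    def __init__(self, n_trees=51, n_features=3, seed=0):
        self.n_trees, self.n_features, self.seed = n_trees, n_features, seed

    def fit(self, X, y):
        rng = np.random.default_rng(self.seed)
        n, d = X.shape
        self.trees_ = []
        for _ in range(self.n_trees):
            rows = rng.integers(0, n, size=n)  # bootstrap sample of the rows
            feats = rng.choice(d, size=min(self.n_features, d), replace=False)
            self.trees_.append(Chi2Stump().fit(X[rows], y[rows], feats))
        return self

    def predict(self, X):
        votes = np.stack([t.predict(X) for t in self.trees_])  # (n_trees, n_samples)
        return np.array([Chi2Stump._majority(col) for col in votes.T])  # majority vote

# Toy categorical data: the label depends on feature 0, with 10% label noise.
rng = np.random.default_rng(1)
X = rng.integers(0, 3, size=(300, 4))
y = (X[:, 0] == 2).astype(int)
y_noisy = np.where(rng.random(300) < 0.1, 1 - y, y)

forest = ChaidStyleForest().fit(X, y_noisy)
accuracy = (forest.predict(X) == y).mean()
print(f"accuracy against the noise-free labels: {accuracy:.3f}")
```

Each base learner sees a different bootstrap sample and a different feature subset, so individual learners that fit label noise or an irrelevant feature tend to be outvoted, which is the mechanism the abstract relies on to reduce overfitting.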
The Lunar Environment heliospheric X-ray Imager (LEXI) and Solar wind Magnetosphere Ionosphere Link Explorer (SMILE) missions will image the Earth's dayside magnetopause and cusps in soft X-rays after their respective launches in the near future, to specify global magnetic reconnection modes for varying solar wind conditions. To support the success of these scientific missions, it is critical to develop techniques that extract the magnetopause locations from the observed soft X-ray images. In this research, we introduce a new geometric equation that calculates the subsolar magnetopause position (R_S) from a satellite position, the look direction of the instrument, and the angle at which the X-ray emission is maximized. Two assumptions are used in this method: (1) the look direction where soft X-ray emissions are maximized lies tangent to the magnetopause, and (2) the magnetopause surface near the subsolar point is almost spherical, and thus R_S is nearly equal to the radius of the magnetopause curvature. We create synthetic soft X-ray images by using the Open Geospace General Circulation Model (OpenGGCM) global magnetohydrodynamic model, the galactic background, the instrument point spread function, and Poisson noise. We then apply the fast Fourier transform and Gaussian low-pass filters to the synthetic images to remove noise and obtain accurate look angles for the soft X-ray peaks. From the filtered images, we calculate R_S and its accuracy for different LEXI locations, look directions, and solar wind densities by using the OpenGGCM subsolar magnetopause location as ground truth. Our method estimates R_S with an accuracy of <0.3 R_E when the solar wind density exceeds 10 cm^-3. The accuracy improves for greater solar wind densities and during southward interplanetary magnetic fields. The method captures the magnetopause motion during southward interplanetary magnetic field turnings. Consequently, the technique will enable quantitative analysis of the magnetopause motion and help reveal the dayside reconnection modes for dynamic solar wind conditions. This technique will support the LEXI and SMILE missions in achieving their scientific objectives.
Funding: Supported by NASA (Grant Nos. 80NSSC19K0844, 80NSSC20K1670, 80MSFC20C0019, and 80GSFC21M0002) and by NASA Goddard Space Flight Center internal funding programs (HIF, Internal Scientist Funding Model, and Internal Research and Development).
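The LEXI/SMILE abstract does not reproduce the geometric equation itself, so the following is only a sketch of what its two stated assumptions imply, not the authors' formula. If the brightest line of sight is tangent to the magnetopause, and the magnetopause near the subsolar point is treated as an Earth-centred sphere, then the sphere's radius equals the perpendicular distance from Earth's centre to that line of sight, |r_sat × d| for a unit look direction d. The satellite position, the toy emission profile, the filter width, and the function names `tangent_distance` and `gaussian_lowpass` are all assumptions made for illustration.

```python
import numpy as np

def tangent_distance(r_sat, look_dir):
    """Perpendicular distance from Earth's centre (the origin) to the line of sight.

    Under the spherical assumption this is the radius of the Earth-centred
    sphere that the line of sight is tangent to. It is meaningful only when
    the tangent point lies ahead of the satellite (r_sat . look_dir < 0).
    """
    r_sat = np.asarray(r_sat, dtype=float)
    d = np.asarray(look_dir, dtype=float)
    d = d / np.linalg.norm(d)
    return np.linalg.norm(np.cross(r_sat, d))

def gaussian_lowpass(image, sigma_cycles):
    """Low-pass filter: multiply the 2-D FFT by a Gaussian centred on zero frequency.

    sigma_cycles is the Gaussian width in cycles per pixel (an assumed knob).
    """
    fy = np.fft.fftfreq(image.shape[0])[:, None]
    fx = np.fft.fftfreq(image.shape[1])[None, :]
    mask = np.exp(-(fx**2 + fy**2) / (2.0 * sigma_cycles**2))
    return np.real(np.fft.ifft2(np.fft.fft2(image) * mask))

# Hypothetical setup: observer 60 R_E from Earth on the +y axis, true R_S = 10 R_E.
rng = np.random.default_rng(0)
r_sat = np.array([0.0, 60.0, 0.0])
rs_true = 10.0
theta_true = np.degrees(np.arcsin(rs_true / np.linalg.norm(r_sat)))

# Toy image: columns are look angles (degrees away from the Earth direction,
# toward the Sun); counts peak at the tangent angle. This bump is a stand-in
# for a simulated emission map, not physics.
angles = np.linspace(0.0, 20.0, 128)
profile = 5.0 + 40.0 * np.exp(-(angles - theta_true) ** 2 / (2 * 2.0**2))
noisy = rng.poisson(np.tile(profile, (64, 1))).astype(float)  # Poisson counting noise

# Smooth, locate the peak look angle, and convert it to a distance.
smooth = gaussian_lowpass(noisy, sigma_cycles=0.05)
theta_peak = np.radians(angles[np.argmax(smooth.mean(axis=0))])
look_dir = np.array([np.sin(theta_peak), -np.cos(theta_peak), 0.0])
rs_est = tangent_distance(r_sat, look_dir)
print(f"true R_S = {rs_true:.2f} R_E, estimated R_S = {rs_est:.2f} R_E")
```

In this geometry a small error in the peak look angle maps to an R_S error of roughly |r_sat| cos(theta) times the angular error in radians, which is why denoising before peak finding matters for a distant observer.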