Recurrent event gap times data frequently arise in biomedical studies and often more than one type of event is of interest. To evaluate the effects of covariates on the marginal recurrent event hazards functions, ther...Recurrent event gap times data frequently arise in biomedical studies and often more than one type of event is of interest. To evaluate the effects of covariates on the marginal recurrent event hazards functions, there exist two types of hazards models: the multiplicative hazards model and the additive hazards model. In the paper, we propose a more flexible additive-multiplicative hazards model for multiple type of recurrent gap times data, wherein some covariates are assumed to be additive while others are multiplicative. An estimating equation approach is presented to estimate the regression parameters. We establish asymptotic properties of the proposed estimators.展开更多
Software defined networking(SDN)has attracted significant attention from both academia and industry by its ability to reconfigure network devices with logically centralized applications.However,some critical security ...Software defined networking(SDN)has attracted significant attention from both academia and industry by its ability to reconfigure network devices with logically centralized applications.However,some critical security issues have also been introduced along with the benefits,which put an obstruction to the deployment of SDN.One root cause of these issues lies in the limited resources and capability of devices involved in the SDN architecture,especially the hardware switches lied in the data plane.In this paper,we analyze the vulnerability of SDN and present two kinds of SDN-targeted attacks:1)data-to-control plane saturation attack which exhausts resources of all SDN components,including control plane,data plane,and the in-between downlink channel and 2)control plane reflection attack which only attacks the data plane and gets conducted in a more efficient and hidden way.Finally,we propose the corresponding defense frameworks to mitigate such attacks.展开更多
The cause of substorm onset is not yet understood. Chen CX(2016) proposed an entropy switch model, in which substorm onset results from the development of interchange instability. In this study, we sought observationa...The cause of substorm onset is not yet understood. Chen CX(2016) proposed an entropy switch model, in which substorm onset results from the development of interchange instability. In this study, we sought observational evidence for this model by using Time History of Events and Macroscale Interactions during Substorms(THEMIS) data. We examined two events, one with and the other without a streamer before substorm onset. In contrast to the stable magnetosphere, where the total magnetic field strength is a decreasing function and entropy is an increasing function of the downtail distance, in both events the total magnetic field strength and entropy were reversed before substorm onset. After onset, the total magnetic field strength, entropy, and other plasma quantities fluctuated. In addition, a statistical study was performed. By confining the events with THEMIS satellites located in the downtail region between ~8 and ~12 Earth radii, and 3 hours before and after midnight, we found the occurrence rate of the total magnetic field strength reversal to be 69% and the occurrence rate of entropy reversal to be 77% of the total 205 events.展开更多
Phasor measurement units(PMUs) can provide real-time measurement data to construct the ubiquitous electric of the Internet of Things. However, due to complex factors on site, PMU data can be easily compromised by inte...Phasor measurement units(PMUs) can provide real-time measurement data to construct the ubiquitous electric of the Internet of Things. However, due to complex factors on site, PMU data can be easily compromised by interference or synchronization jitter. It will lead to various levels of PMU data quality issues, which can directly affect the PMU-based application and even threaten the safety of power systems. In order to improve the PMU data quality, a data-driven PMU bad data detection algorithm based on spectral clustering using single PMU data is proposed in this paper. The proposed algorithm does not require the system topology and parameters. Firstly, a data identification method based on a decision tree is proposed to distinguish event data and bad data by using the slope feature of each data. Then, a bad data detection method based on spectral clustering is developed. By analyzing the weighted relationships among all the data, this method can detect the bad data with a small deviation. Simulations and results of field recording data test illustrate that this data-driven method can achieve bad data identification and detection effectively. This technique can improve PMU data quality to guarantee its applications in the power systems.展开更多
Recurrent event data frequently occur in longitudinal studies, and it is often of interest to estimate the effects of covariates on the recurrent event rate. This paper considers a class of semiparametric transformati...Recurrent event data frequently occur in longitudinal studies, and it is often of interest to estimate the effects of covariates on the recurrent event rate. This paper considers a class of semiparametric transformation rate models for recurrent event data, which uses an additive AMen model as its covariate dependent baseline. The new models are flexible in that they allow for both additive and multiplicative covariate effects, and some covariate effects are allowed to be nonparametric and time-varying. An estimating procedure is proposed for parameter estimation, and the resulting estimators are shown to be consistent and asymptotically normal. Simulation studies and a real data analysis demonstrate that the proposed method performs well and is appropriate for practical use.展开更多
The Jiangmen Underground Neutrino Observatory(JUNO) detector is designed to determine the neutrino mass hierarchy and precisely measure oscillation parameters. The general purpose design also allows measurements of ...The Jiangmen Underground Neutrino Observatory(JUNO) detector is designed to determine the neutrino mass hierarchy and precisely measure oscillation parameters. The general purpose design also allows measurements of neutrinos from many terrestrial and non-terrestrial sources. The JUNO Event Data Model(EDM) plays a central role in the offline software system. It describes the event data entities through all processing stages for both simulated and collected data, and provides persistency via the input/output system. Also, the EDM is designed to enable flexible event handling such as event navigation, as well as the splitting of MC IBD signals and mixing of MC backgrounds. This paper describes the design, implementation and performance of the JUNO EDM.展开更多
Recurrent events data with a terminal event (e.g., death) often arise in clinical and ob- servational studies. Variable selection is an important issue in all regression analysis. In this paper, the authors first pr...Recurrent events data with a terminal event (e.g., death) often arise in clinical and ob- servational studies. Variable selection is an important issue in all regression analysis. In this paper, the authors first propose the estimation methods to select the significant variables, and then prove the asymptotic behavior of the proposed estimator. Furthermore, the authors discuss the computing algorithm to assess the proposed estimator via the linear function approximation and generalized cross validation method for determination of the tuning parameters. Finally, the finite sample estimation for the asymptotical covariance matrix is also proposed.展开更多
Longitudinal data often occur in follow-up studies, and in many situations, there may exist informative observation times and a dependent terminal event such as death that stops the follow-up. We propose a semiparamet...Longitudinal data often occur in follow-up studies, and in many situations, there may exist informative observation times and a dependent terminal event such as death that stops the follow-up. We propose a semiparametric mixed effect model with time-varying latent effects in the analysis of longitudinal data with informative observation times and a dependent terminal event. Estimating equation approaches are developed for parameter estimation, and asymptotic properties of the resulting estimators are established. The finite sample behavior of the proposed estimators is evaluated through simulation studies, and an application to a bladder cancer study is provided.展开更多
In this paper, we consider an inference method for recurrent event data in which the primary exposure covariate is assessed only in a validation set, while as an auxiliary covariate for the main exposure is available ...In this paper, we consider an inference method for recurrent event data in which the primary exposure covariate is assessed only in a validation set, while as an auxiliary covariate for the main exposure is available for the full cohort. Additive rate model is considered. The existing estimating equations in the absence of primary exposure are corrected by taking use of the validation data and auxiliary information, which yield consistent and asymptotically normal estimators of the regression parameters. The estimated baseline mean process is shown to converge weakly to a zero-mean Gaussian process. Extensive simulations are conducted to evaluate finite sample performance.展开更多
Machine learning for data-driven diagnosis has been actively studied in medicine to provide better healthcare.Supporting analysis of a patient cohort similar to a patient under treatment is a key task for clinicians t...Machine learning for data-driven diagnosis has been actively studied in medicine to provide better healthcare.Supporting analysis of a patient cohort similar to a patient under treatment is a key task for clinicians to make decisions with high confidence.However,such analysis is not straightforward due to the characteristics of medical records:high dimensionality,irregularity in time,and sparsity.To address this challenge,we introduce a method for similarity calculation of medical records.Our method employs event and sequence embeddings.While we use an autoencoder for the event embedding,we apply its variant with the self-attention mechanism for the sequence embedding.Moreover,in order to better handle the irregularity of data,we enhance the self-attention mechanism with consideration of different time intervals.We have developed a visual analytics system to support comparative studies of patient records.To make a comparison of sequences with different lengths easier,our system incorporates a sequence alignment method.Through its interactive interface,the user can quickly identify patients of interest and conveniently review both the temporal and multivariate aspects of the patient records.We demonstrate the effectiveness of our design and system with case studies using a real-world dataset from the neonatal intensive care unit of UC Davis.展开更多
基金The Science Foundation(JA12301)of Fujian Educational Committeethe Teaching Quality Project(ZL0902/TZ(SJ))of Higher Education in Fujian Provincial Education Department
文摘Recurrent event gap times data frequently arise in biomedical studies and often more than one type of event is of interest. To evaluate the effects of covariates on the marginal recurrent event hazards functions, there exist two types of hazards models: the multiplicative hazards model and the additive hazards model. In the paper, we propose a more flexible additive-multiplicative hazards model for multiple type of recurrent gap times data, wherein some covariates are assumed to be additive while others are multiplicative. An estimating equation approach is presented to estimate the regression parameters. We establish asymptotic properties of the proposed estimators.
基金supported in part by the National Key R&D Program of China under Grant No.2017YFB0801701the National Science Foundation of China under Grant No.61472213CERNET Innovation Project(NGII20160123)
文摘Software defined networking(SDN)has attracted significant attention from both academia and industry by its ability to reconfigure network devices with logically centralized applications.However,some critical security issues have also been introduced along with the benefits,which put an obstruction to the deployment of SDN.One root cause of these issues lies in the limited resources and capability of devices involved in the SDN architecture,especially the hardware switches lied in the data plane.In this paper,we analyze the vulnerability of SDN and present two kinds of SDN-targeted attacks:1)data-to-control plane saturation attack which exhausts resources of all SDN components,including control plane,data plane,and the in-between downlink channel and 2)control plane reflection attack which only attacks the data plane and gets conducted in a more efficient and hidden way.Finally,we propose the corresponding defense frameworks to mitigate such attacks.
基金supported by the National Natural Science Foundation of China(Grant No.NSFC41974204)。
文摘The cause of substorm onset is not yet understood. Chen CX(2016) proposed an entropy switch model, in which substorm onset results from the development of interchange instability. In this study, we sought observational evidence for this model by using Time History of Events and Macroscale Interactions during Substorms(THEMIS) data. We examined two events, one with and the other without a streamer before substorm onset. In contrast to the stable magnetosphere, where the total magnetic field strength is a decreasing function and entropy is an increasing function of the downtail distance, in both events the total magnetic field strength and entropy were reversed before substorm onset. After onset, the total magnetic field strength, entropy, and other plasma quantities fluctuated. In addition, a statistical study was performed. By confining the events with THEMIS satellites located in the downtail region between ~8 and ~12 Earth radii, and 3 hours before and after midnight, we found the occurrence rate of the total magnetic field strength reversal to be 69% and the occurrence rate of entropy reversal to be 77% of the total 205 events.
基金supported by the National Key R&D Program (No.2017YFB0902901)the National Natural Science Foundation of China (No.51627811,No.51725702,and No.51707064)。
文摘Phasor measurement units(PMUs) can provide real-time measurement data to construct the ubiquitous electric of the Internet of Things. However, due to complex factors on site, PMU data can be easily compromised by interference or synchronization jitter. It will lead to various levels of PMU data quality issues, which can directly affect the PMU-based application and even threaten the safety of power systems. In order to improve the PMU data quality, a data-driven PMU bad data detection algorithm based on spectral clustering using single PMU data is proposed in this paper. The proposed algorithm does not require the system topology and parameters. Firstly, a data identification method based on a decision tree is proposed to distinguish event data and bad data by using the slope feature of each data. Then, a bad data detection method based on spectral clustering is developed. By analyzing the weighted relationships among all the data, this method can detect the bad data with a small deviation. Simulations and results of field recording data test illustrate that this data-driven method can achieve bad data identification and detection effectively. This technique can improve PMU data quality to guarantee its applications in the power systems.
基金supported by National Natural Science Foundation of China (Grant Nos. 11301545, 11501578 and 11501579)
文摘Recurrent event data frequently occur in longitudinal studies, and it is often of interest to estimate the effects of covariates on the recurrent event rate. This paper considers a class of semiparametric transformation rate models for recurrent event data, which uses an additive AMen model as its covariate dependent baseline. The new models are flexible in that they allow for both additive and multiplicative covariate effects, and some covariate effects are allowed to be nonparametric and time-varying. An estimating procedure is proposed for parameter estimation, and the resulting estimators are shown to be consistent and asymptotically normal. Simulation studies and a real data analysis demonstrate that the proposed method performs well and is appropriate for practical use.
基金Supported by Joint Large-Scale Scientific Facility Funds of the NSFC and CAS(U1532258)the Program for New Century Excellent Talents in University(NCET-13-0342)+1 种基金the Shandong Natural Science Funds for Distinguished Young Scholar(JQ201402)the Strategic Priority Research Program of the Chinese Academy of Sciences(XDA10010900)
文摘The Jiangmen Underground Neutrino Observatory(JUNO) detector is designed to determine the neutrino mass hierarchy and precisely measure oscillation parameters. The general purpose design also allows measurements of neutrinos from many terrestrial and non-terrestrial sources. The JUNO Event Data Model(EDM) plays a central role in the offline software system. It describes the event data entities through all processing stages for both simulated and collected data, and provides persistency via the input/output system. Also, the EDM is designed to enable flexible event handling such as event navigation, as well as the splitting of MC IBD signals and mixing of MC backgrounds. This paper describes the design, implementation and performance of the JUNO EDM.
文摘Recurrent events data with a terminal event (e.g., death) often arise in clinical and ob- servational studies. Variable selection is an important issue in all regression analysis. In this paper, the authors first propose the estimation methods to select the significant variables, and then prove the asymptotic behavior of the proposed estimator. Furthermore, the authors discuss the computing algorithm to assess the proposed estimator via the linear function approximation and generalized cross validation method for determination of the tuning parameters. Finally, the finite sample estimation for the asymptotical covariance matrix is also proposed.
基金supported by National Natural Science Foundation of China (Grant Nos. 11231010, 11171330 and 11201315)Key Laboratory of Random Complex Structures and Data Science, Chinese Academy of Sciences (Grant No. 2008DP173182)Beijing Center for Mathematics and Information Interdisciplinary Sciences
文摘Longitudinal data often occur in follow-up studies, and in many situations, there may exist informative observation times and a dependent terminal event such as death that stops the follow-up. We propose a semiparametric mixed effect model with time-varying latent effects in the analysis of longitudinal data with informative observation times and a dependent terminal event. Estimating equation approaches are developed for parameter estimation, and asymptotic properties of the resulting estimators are established. The finite sample behavior of the proposed estimators is evaluated through simulation studies, and an application to a bladder cancer study is provided.
基金Supported by the National Natural Science Foundation of China(No.11571263,11371299)
文摘In this paper, we consider an inference method for recurrent event data in which the primary exposure covariate is assessed only in a validation set, while as an auxiliary covariate for the main exposure is available for the full cohort. Additive rate model is considered. The existing estimating equations in the absence of primary exposure are corrected by taking use of the validation data and auxiliary information, which yield consistent and asymptotically normal estimators of the regression parameters. The estimated baseline mean process is shown to converge weakly to a zero-mean Gaussian process. Extensive simulations are conducted to evaluate finite sample performance.
基金the U.S.National Science Foundation through grant IIS-1741536 and a 2019 Seed Fund Award from CITRIS and the Banatao Institute at the University of California,United States.
文摘Machine learning for data-driven diagnosis has been actively studied in medicine to provide better healthcare.Supporting analysis of a patient cohort similar to a patient under treatment is a key task for clinicians to make decisions with high confidence.However,such analysis is not straightforward due to the characteristics of medical records:high dimensionality,irregularity in time,and sparsity.To address this challenge,we introduce a method for similarity calculation of medical records.Our method employs event and sequence embeddings.While we use an autoencoder for the event embedding,we apply its variant with the self-attention mechanism for the sequence embedding.Moreover,in order to better handle the irregularity of data,we enhance the self-attention mechanism with consideration of different time intervals.We have developed a visual analytics system to support comparative studies of patient records.To make a comparison of sequences with different lengths easier,our system incorporates a sequence alignment method.Through its interactive interface,the user can quickly identify patients of interest and conveniently review both the temporal and multivariate aspects of the patient records.We demonstrate the effectiveness of our design and system with case studies using a real-world dataset from the neonatal intensive care unit of UC Davis.