N6-methyladenosine(m6A)is an important RNA methylation modification involved in regulating diverse biological processes across multiple species.Hence,the identification of m6A modification sites provides valuable insi...N6-methyladenosine(m6A)is an important RNA methylation modification involved in regulating diverse biological processes across multiple species.Hence,the identification of m6A modification sites provides valuable insight into the biological mechanisms of complex diseases at the post-transcriptional level.Although a variety of identification algorithms have been proposed recently,most of them capture the features of m6A modification sites by focusing on the sequential dependencies of nucleotides at different positions in RNA sequences,while ignoring the structural dependencies of nucleotides in their threedimensional structures.To overcome this issue,we propose a cross-species end-to-end deep learning model,namely CR-NSSD,which conduct a cross-domain representation learning process integrating nucleotide structural and sequential dependencies for RNA m6A site identification.Specifically,CR-NSSD first obtains the pre-coded representations of RNA sequences by incorporating the position information into single-nucleotide states with chaos game representation theory.It then constructs a crossdomain reconstruction encoder to learn the sequential and structural dependencies between nucleotides.By minimizing the reconstruction and binary cross-entropy losses,CR-NSSD is trained to complete the task of m6A site identification.Extensive experiments have demonstrated the promising performance of CR-NSSD by comparing it with several state-of-the-art m6A identification algorithms.Moreover,the results of cross-species prediction indicate that the integration of sequential and structural dependencies allows CR-NSSD to capture general features of m6A modification sites among different species,thus improving the accuracy of cross-species identification.展开更多
Visual fire detection technologies can detect fire and alarm warnings earlier than conventional fire detectors. This study proposes an effective visual fire detection method that combines the statistical fire color mo...Visual fire detection technologies can detect fire and alarm warnings earlier than conventional fire detectors. This study proposes an effective visual fire detection method that combines the statistical fire color model and sequential pattern mining technology to detect fire in an image. Furthermore, the proposed method also supports real-time fire detection by integrating adaptive background subtraction technologies. Experimental results show that the proposed method can effectively detect fire in test images and videos. The detection accuracy of the proposed hybrid method is better than that of Celik's method.展开更多
In order to reduce the computational and spatial complexity in rerunning algorithm of sequential patterns query, this paper proposes sequential patterns based and projection database based algorithm for fast interacti...In order to reduce the computational and spatial complexity in rerunning algorithm of sequential patterns query, this paper proposes sequential patterns based and projection database based algorithm for fast interactive sequential patterns mining algorithm (FISP), in which the number of frequent items of the projection databases constructed by the correct mining which based on the previously mined sequences has been reduced. Furthermore, the algorithm's iterative running times are reduced greatly by using global-threshold. The results of experiments testify that FISP outperforms PrefixSpan in interactive mining展开更多
This paper aims to propose the sequential pattern discovery method of Deoxyribonucleic Acid (DNA) sequence database in order to identify cancer disease. The DNA which is composed of amino acids of gene P53 is mutated....This paper aims to propose the sequential pattern discovery method of Deoxyribonucleic Acid (DNA) sequence database in order to identify cancer disease. The DNA which is composed of amino acids of gene P53 is mutated. It effects to change of P53 formation. Sequential pattern discovery is a process of extracting data to generate knowledge about the series of events that has the sequences in a certain frequency so that it creates a pattern. PrefixSpan is to propose method to find a pattern of DNA sequence database. As a result, there are various selected patterns of DNA sequence. The pattem which has high similarity is used as biomarker to identify the breast cancer disease. The performance measure of support value average is 0.8. It means that the frequent sequence pattern is high. Another measure is confidence. All of the confidence values are 1. Then, the last performance measure is lift ratio at average more than 1. It means that the composed sequence items in the pattern has high dependency and relatedness. Futhermore, the selected patterns are applied as biomarker with accuracy as 100%.展开更多
This paper presents a theoretical analysis of evolutionary process that involves organisms distribution and their interaction of spatially distributed population with diffusion in a Holling-III ratio-dependent predato...This paper presents a theoretical analysis of evolutionary process that involves organisms distribution and their interaction of spatially distributed population with diffusion in a Holling-III ratio-dependent predator-prey model, the sufficient conditions for diffusion-driven instability with Neumann boundary conditions are obtained. Furthermore, it presents novel numerical evidence of time evolution of patterns controlled by diffusion in the model, and finds that the model dynamics exhibits complex pattern replication, and the pattern formation depends on the choice of the initial conditions. The ideas in this paper may provide a better understanding of the pattern formation in ecosystems.展开更多
The pattern dependence in synergistic effects was studied in a 0.18 μm static random access memory(SRAM) circuit.Experiments were performed under two SEU test environments:3 Me V protons and heavy ions.Measured re...The pattern dependence in synergistic effects was studied in a 0.18 μm static random access memory(SRAM) circuit.Experiments were performed under two SEU test environments:3 Me V protons and heavy ions.Measured results show different trends.In heavy ion SEU test,the degradation in the peripheral circuitry also existed because the measured SEU cross section decreased regardless of the patterns written to the SRAM array.TCAD simulation was performed.TIDinduced degradation in n MOSFETs mainly induced the imprint effect in the SRAM cell,which is consistent with the measured results under the proton environment,but cannot explain the phenomena observed under heavy ion environment.A possible explanation could be the contribution from the radiation-induced GIDL in pMOSFETs.展开更多
Periodic patternmining has become a popular research subject in recent years;this approach involves the discoveryof frequently recurring patterns in a transaction sequence. However, previous algorithms for periodic pa...Periodic patternmining has become a popular research subject in recent years;this approach involves the discoveryof frequently recurring patterns in a transaction sequence. However, previous algorithms for periodic patternmining have ignored the utility (profit, value) of patterns. Additionally, these algorithms only identify periodicpatterns in a single sequence. However, identifying patterns of high utility that are common to a set of sequencesis more valuable. In several fields, identifying high-utility periodic frequent patterns in multiple sequences isimportant. In this study, an efficient algorithm called MHUPFPS was proposed to identify such patterns. To addressexisting problems, three new measures are defined: the utility, high support, and high-utility period sequenceratios. Further, a new upper bound, upSeqRa, and two new pruning properties were proposed. MHUPFPS usesa newly defined HUPFPS-list structure to significantly accelerate the reduction of the search space and improvethe overall performance of the algorithm. Furthermore, the proposed algorithmis evaluated using several datasets.The experimental results indicate that the algorithm is accurate and effective in filtering several non-high-utilityperiodic frequent patterns.展开更多
Conspecific negative density dependence(CNDD)is a potentially important mechanism in maintaining species diversity.While previous evidence showed habitat heterogeneity and species’dispersal modes affect the strength ...Conspecific negative density dependence(CNDD)is a potentially important mechanism in maintaining species diversity.While previous evidence showed habitat heterogeneity and species’dispersal modes affect the strength of CNDD at early life stages of trees(e.g.,seedlings),it remains unclear how they affect the strength of CNDD at later life stages.We examined the degree of spatial aggregation between saplings and trees for species dispersed by wind and gravity in four topographic habitats within a 25-ha temperate forest dynamic plot in the Qinling Mountains of central China.We used the replicated spatial point pattern(RSPP)analysis and bivariate paircorrelation function(PCF)to detect the spatial distribution of saplings around trees at two scales,15 and 50 m,respectively.Although the signal was not apparent across the whole study region(or 25-ha),it is distinct on isolated areas with specific characteristics,suggesting that these characteristics could be important factors in CNDD.Further,we found that the gravity-dispersed tree species experienced CNDD across habitats,while for wind-dispersed species CNDD was found in gully,terrace and low-ridge habitats.Our study suggests that neglecting the habitat heterogeneity and dispersal mode can distort the signal of CNDD and community assembly in temperate forests.展开更多
针对传统序列模式挖掘(SPM)不考虑模式重复性且忽略各项的效用(单价或利润)与模式长度对用户兴趣度影响的问题,提出一次性条件下top-k高平均效用序列模式挖掘(TOUP)算法。TOUP算法主要包括两个核心步骤:平均效用计算和候选模式生成。首...针对传统序列模式挖掘(SPM)不考虑模式重复性且忽略各项的效用(单价或利润)与模式长度对用户兴趣度影响的问题,提出一次性条件下top-k高平均效用序列模式挖掘(TOUP)算法。TOUP算法主要包括两个核心步骤:平均效用计算和候选模式生成。首先,提出基于各项出现位置与项重复关系数组的CSP(Calculation Support of Pattern)算法计算模式支持度,从而实现模式平均效用的快速计算;其次,采用项集扩展和序列扩展生成候选模式,并提出了最大平均效用上界,基于该上界实现对候选模式的有效剪枝。在5个真实数据集和1个合成数据集上的实验结果表明,相较于TOUP-dfs和HAOP-ms算法,TOUP算法的候选模式数分别降低了38.5%~99.8%和0.9%~77.6%;运行时间分别降低了33.6%~97.1%和57.9%~97.2%。TOUP的算法性能更优,能更高效地挖掘用户感兴趣的模式。展开更多
当前的位置预测方法大多没有考虑到用户行为信息,由于用户的访问时间、行为模式等能够在很大程度上反映所处位置,因此在对位置潜在向量进行预训练时有必要使用该信息。进行位置预测时,采样粒度较细的序列长度较长,难以捕获长距离依赖。...当前的位置预测方法大多没有考虑到用户行为信息,由于用户的访问时间、行为模式等能够在很大程度上反映所处位置,因此在对位置潜在向量进行预训练时有必要使用该信息。进行位置预测时,采样粒度较细的序列长度较长,难以捕获长距离依赖。针对这2个问题,提出了基于用户行为和上下文语义的分层时空长短期记忆网络(Hierarchical Spatiotemporal Long Short-Term Memory Based on User Behavior and Contextual Semantics,CHST-LSTM)模型。该模型通过Transformer编码层处理轨迹数据,将用户相关行为信息考虑在内,融合位置的上下文语义信息,通过预训练得到位置的嵌入表征。根据用户的行为状态分割轨迹阶段,采用编码器-解码器方式对ST-LSTM进行分段分层扩展,利用BiLSTM对全局信息建模,同时处理轨迹的长短期变化,解决长序列的长距离依赖问题。对外卖员用户群体的真实移动轨迹数据进行分析和实验,通过聚类发现其特有的工作模式,在预训练时加入工作模式信息与到访时间信息,得到位置的特征向量并用于预测模型。结果表明CHST-LSTM模型在预测用户下一位置时精度更高。展开更多
基金supported in part by the National Natural Science Foundation of China(62373348)the Natural Science Foundation of Xinjiang Uygur Autonomous Region(2021D01D05)+1 种基金the Tianshan Talent Training Program(2023TSYCLJ0021)the Pioneer Hundred Talents Program of Chinese Academy of Sciences.
文摘N6-methyladenosine(m6A)is an important RNA methylation modification involved in regulating diverse biological processes across multiple species.Hence,the identification of m6A modification sites provides valuable insight into the biological mechanisms of complex diseases at the post-transcriptional level.Although a variety of identification algorithms have been proposed recently,most of them capture the features of m6A modification sites by focusing on the sequential dependencies of nucleotides at different positions in RNA sequences,while ignoring the structural dependencies of nucleotides in their threedimensional structures.To overcome this issue,we propose a cross-species end-to-end deep learning model,namely CR-NSSD,which conduct a cross-domain representation learning process integrating nucleotide structural and sequential dependencies for RNA m6A site identification.Specifically,CR-NSSD first obtains the pre-coded representations of RNA sequences by incorporating the position information into single-nucleotide states with chaos game representation theory.It then constructs a crossdomain reconstruction encoder to learn the sequential and structural dependencies between nucleotides.By minimizing the reconstruction and binary cross-entropy losses,CR-NSSD is trained to complete the task of m6A site identification.Extensive experiments have demonstrated the promising performance of CR-NSSD by comparing it with several state-of-the-art m6A identification algorithms.Moreover,the results of cross-species prediction indicate that the integration of sequential and structural dependencies allows CR-NSSD to capture general features of m6A modification sites among different species,thus improving the accuracy of cross-species identification.
基金supported by National Science Council under Grant No. NSC98-2221-E-218-046
文摘Visual fire detection technologies can detect fire and alarm warnings earlier than conventional fire detectors. This study proposes an effective visual fire detection method that combines the statistical fire color model and sequential pattern mining technology to detect fire in an image. Furthermore, the proposed method also supports real-time fire detection by integrating adaptive background subtraction technologies. Experimental results show that the proposed method can effectively detect fire in test images and videos. The detection accuracy of the proposed hybrid method is better than that of Celik's method.
基金Supported by the National Natural Science Funda-tion of China (70371015) andthe Natural Science Foundation of Jian-gsu Province (BK2004058)
文摘In order to reduce the computational and spatial complexity in rerunning algorithm of sequential patterns query, this paper proposes sequential patterns based and projection database based algorithm for fast interactive sequential patterns mining algorithm (FISP), in which the number of frequent items of the projection databases constructed by the correct mining which based on the previously mined sequences has been reduced. Furthermore, the algorithm's iterative running times are reduced greatly by using global-threshold. The results of experiments testify that FISP outperforms PrefixSpan in interactive mining
文摘This paper aims to propose the sequential pattern discovery method of Deoxyribonucleic Acid (DNA) sequence database in order to identify cancer disease. The DNA which is composed of amino acids of gene P53 is mutated. It effects to change of P53 formation. Sequential pattern discovery is a process of extracting data to generate knowledge about the series of events that has the sequences in a certain frequency so that it creates a pattern. PrefixSpan is to propose method to find a pattern of DNA sequence database. As a result, there are various selected patterns of DNA sequence. The pattem which has high similarity is used as biomarker to identify the breast cancer disease. The performance measure of support value average is 0.8. It means that the frequent sequence pattern is high. Another measure is confidence. All of the confidence values are 1. Then, the last performance measure is lift ratio at average more than 1. It means that the composed sequence items in the pattern has high dependency and relatedness. Futhermore, the selected patterns are applied as biomarker with accuracy as 100%.
基金supported by the Natural Science Foundation of Zhejiang Province of China (Grant No.Y7080041)
文摘This paper presents a theoretical analysis of evolutionary process that involves organisms distribution and their interaction of spatially distributed population with diffusion in a Holling-III ratio-dependent predator-prey model, the sufficient conditions for diffusion-driven instability with Neumann boundary conditions are obtained. Furthermore, it presents novel numerical evidence of time evolution of patterns controlled by diffusion in the model, and finds that the model dynamics exhibits complex pattern replication, and the pattern formation depends on the choice of the initial conditions. The ideas in this paper may provide a better understanding of the pattern formation in ecosystems.
基金Project supported by the National Natural Science Foundation of China(Grant No.U1532261)
文摘The pattern dependence in synergistic effects was studied in a 0.18 μm static random access memory(SRAM) circuit.Experiments were performed under two SEU test environments:3 Me V protons and heavy ions.Measured results show different trends.In heavy ion SEU test,the degradation in the peripheral circuitry also existed because the measured SEU cross section decreased regardless of the patterns written to the SRAM array.TCAD simulation was performed.TIDinduced degradation in n MOSFETs mainly induced the imprint effect in the SRAM cell,which is consistent with the measured results under the proton environment,but cannot explain the phenomena observed under heavy ion environment.A possible explanation could be the contribution from the radiation-induced GIDL in pMOSFETs.
文摘Periodic patternmining has become a popular research subject in recent years;this approach involves the discoveryof frequently recurring patterns in a transaction sequence. However, previous algorithms for periodic patternmining have ignored the utility (profit, value) of patterns. Additionally, these algorithms only identify periodicpatterns in a single sequence. However, identifying patterns of high utility that are common to a set of sequencesis more valuable. In several fields, identifying high-utility periodic frequent patterns in multiple sequences isimportant. In this study, an efficient algorithm called MHUPFPS was proposed to identify such patterns. To addressexisting problems, three new measures are defined: the utility, high support, and high-utility period sequenceratios. Further, a new upper bound, upSeqRa, and two new pruning properties were proposed. MHUPFPS usesa newly defined HUPFPS-list structure to significantly accelerate the reduction of the search space and improvethe overall performance of the algorithm. Furthermore, the proposed algorithmis evaluated using several datasets.The experimental results indicate that the algorithm is accurate and effective in filtering several non-high-utilityperiodic frequent patterns.
基金Shihong Jia was financially supported by the National Natural Science Foundation of China(Grant No.32001120)the Fundamental Research Funds for the Central Universities(Grant No.31020200QD026)+1 种基金Qiulong Yin was supported by the National Natural Science Foundation of China(Grant No.32001171)Ying Luo was supported by the Innovation Capability Support Program of Shaanxi(Grant No.2022KRM090).
文摘Conspecific negative density dependence(CNDD)is a potentially important mechanism in maintaining species diversity.While previous evidence showed habitat heterogeneity and species’dispersal modes affect the strength of CNDD at early life stages of trees(e.g.,seedlings),it remains unclear how they affect the strength of CNDD at later life stages.We examined the degree of spatial aggregation between saplings and trees for species dispersed by wind and gravity in four topographic habitats within a 25-ha temperate forest dynamic plot in the Qinling Mountains of central China.We used the replicated spatial point pattern(RSPP)analysis and bivariate paircorrelation function(PCF)to detect the spatial distribution of saplings around trees at two scales,15 and 50 m,respectively.Although the signal was not apparent across the whole study region(or 25-ha),it is distinct on isolated areas with specific characteristics,suggesting that these characteristics could be important factors in CNDD.Further,we found that the gravity-dispersed tree species experienced CNDD across habitats,while for wind-dispersed species CNDD was found in gully,terrace and low-ridge habitats.Our study suggests that neglecting the habitat heterogeneity and dispersal mode can distort the signal of CNDD and community assembly in temperate forests.
文摘针对传统序列模式挖掘(SPM)不考虑模式重复性且忽略各项的效用(单价或利润)与模式长度对用户兴趣度影响的问题,提出一次性条件下top-k高平均效用序列模式挖掘(TOUP)算法。TOUP算法主要包括两个核心步骤:平均效用计算和候选模式生成。首先,提出基于各项出现位置与项重复关系数组的CSP(Calculation Support of Pattern)算法计算模式支持度,从而实现模式平均效用的快速计算;其次,采用项集扩展和序列扩展生成候选模式,并提出了最大平均效用上界,基于该上界实现对候选模式的有效剪枝。在5个真实数据集和1个合成数据集上的实验结果表明,相较于TOUP-dfs和HAOP-ms算法,TOUP算法的候选模式数分别降低了38.5%~99.8%和0.9%~77.6%;运行时间分别降低了33.6%~97.1%和57.9%~97.2%。TOUP的算法性能更优,能更高效地挖掘用户感兴趣的模式。
文摘当前的位置预测方法大多没有考虑到用户行为信息,由于用户的访问时间、行为模式等能够在很大程度上反映所处位置,因此在对位置潜在向量进行预训练时有必要使用该信息。进行位置预测时,采样粒度较细的序列长度较长,难以捕获长距离依赖。针对这2个问题,提出了基于用户行为和上下文语义的分层时空长短期记忆网络(Hierarchical Spatiotemporal Long Short-Term Memory Based on User Behavior and Contextual Semantics,CHST-LSTM)模型。该模型通过Transformer编码层处理轨迹数据,将用户相关行为信息考虑在内,融合位置的上下文语义信息,通过预训练得到位置的嵌入表征。根据用户的行为状态分割轨迹阶段,采用编码器-解码器方式对ST-LSTM进行分段分层扩展,利用BiLSTM对全局信息建模,同时处理轨迹的长短期变化,解决长序列的长距离依赖问题。对外卖员用户群体的真实移动轨迹数据进行分析和实验,通过聚类发现其特有的工作模式,在预训练时加入工作模式信息与到访时间信息,得到位置的特征向量并用于预测模型。结果表明CHST-LSTM模型在预测用户下一位置时精度更高。