With the rapid advancement of social economies,intelligent transportation systems are gaining increasing atten-tion.Central to these systems is the detection of abnormal vehicle behavior,which remains a critical chall...With the rapid advancement of social economies,intelligent transportation systems are gaining increasing atten-tion.Central to these systems is the detection of abnormal vehicle behavior,which remains a critical challenge due to the complexity of urban roadways and the variability of external conditions.Current research on detecting abnormal traffic behaviors is still nascent,with significant room for improvement in recognition accuracy.To address this,this research has developed a new model for recognizing abnormal traffic behaviors.This model employs the R3D network as its core architecture,incorporating a dense block to facilitate feature reuse.This approach not only enhances performance with fewer parameters and reduced computational demands but also allows for the acquisition of new features while simplifying the overall network structure.Additionally,this research integrates a self-attentive method that dynamically adjusts to the prevailing traffic conditions,optimizing the relevance of features for the task at hand.For temporal analysis,a Bi-LSTM layer is utilized to extract and learn from time-based data nuances.This research conducted a series of comparative experiments using the UCF-Crime dataset,achieving a notable accuracy of 89.30%on our test set.Our results demonstrate that our model not only operates with fewer parameters but also achieves superior recognition accuracy compared to previous models.展开更多
The main task of magnetic resonance imaging (MRI) automatic brain tumor segmentation is to automaticallysegment the brain tumor edema, peritumoral edema, endoscopic core, enhancing tumor core and nonenhancingtumor cor...The main task of magnetic resonance imaging (MRI) automatic brain tumor segmentation is to automaticallysegment the brain tumor edema, peritumoral edema, endoscopic core, enhancing tumor core and nonenhancingtumor core from 3D MR images. Because the location, size, shape and intensity of brain tumors vary greatly, itis very difficult to segment these brain tumor regions automatically. In this paper, by combining the advantagesof DenseNet and ResNet, we proposed a new 3D U-Net with dense encoder blocks and residual decoder blocks.We used dense blocks in the encoder part and residual blocks in the decoder part. The number of output featuremaps increases with the network layers in contracting path of encoder, which is consistent with the characteristicsof dense blocks. Using dense blocks can decrease the number of network parameters, deepen network layers,strengthen feature propagation, alleviate vanishing-gradient and enlarge receptive fields. The residual blockswere used in the decoder to replace the convolution neural block of original U-Net, which made the networkperformance better. Our proposed approach was trained and validated on the BraTS2019 training and validationdata set. We obtained dice scores of 0.901, 0.815 and 0.766 for whole tumor, tumor core and enhancing tumorcore respectively on the BraTS2019 validation data set. Our method has the better performance than the original3D U-Net. The results of our experiment demonstrate that compared with some state-of-the-art methods, ourapproach is a competitive automatic brain tumor segmentation method.展开更多
Masking-based and spectrum mapping-based methods are the two main algorithms of speech enhancement with deep neural network(DNN).But the mapping-based methods only utilizes the phase of noisy speech,which limits the u...Masking-based and spectrum mapping-based methods are the two main algorithms of speech enhancement with deep neural network(DNN).But the mapping-based methods only utilizes the phase of noisy speech,which limits the upper bound of speech enhancement performance.Maskingbased methods need to accurately estimate the masking which is still the key problem.Combining the advantages of above two types of methods,this paper proposes the speech enhancement algorithm MM-RDN(maskingmapping residual dense network)based on masking-mapping(MM)and residual dense network(RDN).Using the logarithmic power spectrogram(LPS)of consecutive frames,MM estimates the ideal ratio masking(IRM)matrix of consecutive frames.RDN can make full use of feature maps of all layers.Meanwhile,using the global residual learning to combine the shallow features and deep features,RDN obtains the global dense features from the LPS,thereby improves estimated accuracy of the IRM matrix.Simulations show that the proposed method achieves attractive speech enhancement performance in various acoustic environments.Specifically,in the untrained acoustic test with limited priors,e.g.,unmatched signal-to-noise ratio(SNR)and unmatched noise category,MM-RDN can still outperform the existing convolutional recurrent network(CRN)method in themeasures of perceptual evaluation of speech quality(PESQ)and other evaluation indexes.It indicates that the proposed algorithm is more generalized in untrained conditions.展开更多
Fall behavior is closely related to high mortality in the elderly,so fall detection becomes an important and urgent research area.However,the existing fall detection methods are difficult to be applied in daily life d...Fall behavior is closely related to high mortality in the elderly,so fall detection becomes an important and urgent research area.However,the existing fall detection methods are difficult to be applied in daily life due to a large amount of calculation and poor detection accuracy.To solve the above problems,this paper proposes a dense spatial-temporal graph convolutional network based on lightweight OpenPose.Lightweight OpenPose uses MobileNet as a feature extraction network,and the prediction layer uses bottleneck-asymmetric structure,thus reducing the amount of the network.The bottleneck-asymmetrical structure compresses the number of input channels of feature maps by 1×1 convolution and replaces the 7×7 convolution structure with the asymmetric structure of 1×7 convolution,7×1 convolution,and 7×7 convolution in parallel.The spatial-temporal graph convolutional network divides the multi-layer convolution into dense blocks,and the convolutional layers in each dense block are connected,thus improving the feature transitivity,enhancing the network’s ability to extract features,thus improving the detection accuracy.Two representative datasets,Multiple Cameras Fall dataset(MCF),and Nanyang Technological University Red Green Blue+Depth Action Recognition dataset(NTU RGB+D),are selected for our experiments,among which NTU RGB+D has two evaluation benchmarks.The results show that the proposed model is superior to the current fall detection models.The accuracy of this network on the MCF dataset is 96.3%,and the accuracies on the two evaluation benchmarks of the NTU RGB+D dataset are 85.6%and 93.5%,respectively.展开更多
Generative adversarial networks(GANs)are paid more attention to dealing with the end-to-end speech enhancement in recent years.Various GANbased enhancement methods are presented to improve the quality of reconstructed...Generative adversarial networks(GANs)are paid more attention to dealing with the end-to-end speech enhancement in recent years.Various GANbased enhancement methods are presented to improve the quality of reconstructed speech.However,the performance of these GAN-based methods is worse than those of masking-based methods.To tackle this problem,we propose speech enhancement method with a residual dense generative adversarial network(RDGAN)contributing to map the log-power spectrum(LPS)of degraded speech to the clean one.In detail,a residual dense block(RDB)architecture is designed to better estimate the LPS of clean speech,which can extract rich local features of LPS through densely connected convolution layers.Meanwhile,sequential RDB connections are incorporated on various scales of LPS.It significantly increases the feature learning flexibility and robustness in the time-frequency domain.Simulations show that the proposed method achieves attractive speech enhancement performance in various acoustic environments.Specifically,in the untrained acoustic test with limited priors,e.g.,unmatched signal-to-noise ratio(SNR)and unmatched noise category,RDGAN can still outperform the existing GAN-based methods and masking-based method in the measures of PESQ and other evaluation indexes.It indicates that our method is more generalized in untrained conditions.展开更多
Carbon-free Al_(2)O_(3)-MgO dense bricks were produced by the pressing method,using tabular alumina,white fused alumina,alumina micro-powder as main raw materials,and inorganic powder as the binder.The comprehensive p...Carbon-free Al_(2)O_(3)-MgO dense bricks were produced by the pressing method,using tabular alumina,white fused alumina,alumina micro-powder as main raw materials,and inorganic powder as the binder.The comprehensive properties and performance in steel ladle side wall were made a comparison between Al_(2)O_(3)-MgO dense bricks and precast blocks.The results show that Al_(2)O_(3)-MgO dense bricks exhibit high dense structure and strength,as well as superior thermal shock resistance and better penetration and corrosion resistance to slag than precast blocks.While replacing precast blocks with dense bricks in 250 t steel ladle side wall in some domestic steel mills,the thickness of the metamorphic layer from slag penetration and the corrosion rate decrease evidently.The damage of dense bricks during service is mainly caused by the corrosion from molten steel and slag,and the structure spalling of the metamorphic layer also plays an important role.展开更多
基金supported by the National Natural Science Foundation of China(61971007&61571013).
文摘With the rapid advancement of social economies,intelligent transportation systems are gaining increasing atten-tion.Central to these systems is the detection of abnormal vehicle behavior,which remains a critical challenge due to the complexity of urban roadways and the variability of external conditions.Current research on detecting abnormal traffic behaviors is still nascent,with significant room for improvement in recognition accuracy.To address this,this research has developed a new model for recognizing abnormal traffic behaviors.This model employs the R3D network as its core architecture,incorporating a dense block to facilitate feature reuse.This approach not only enhances performance with fewer parameters and reduced computational demands but also allows for the acquisition of new features while simplifying the overall network structure.Additionally,this research integrates a self-attentive method that dynamically adjusts to the prevailing traffic conditions,optimizing the relevance of features for the task at hand.For temporal analysis,a Bi-LSTM layer is utilized to extract and learn from time-based data nuances.This research conducted a series of comparative experiments using the UCF-Crime dataset,achieving a notable accuracy of 89.30%on our test set.Our results demonstrate that our model not only operates with fewer parameters but also achieves superior recognition accuracy compared to previous models.
基金This was supported partially by Sichuan Science and Technology Program under Grants 2019YJ0356,21ZDYF2484,21GJHZ0061Scientific Research Foundation of Education Department of Sichuan Province under Grant 18ZB0117.
文摘The main task of magnetic resonance imaging (MRI) automatic brain tumor segmentation is to automaticallysegment the brain tumor edema, peritumoral edema, endoscopic core, enhancing tumor core and nonenhancingtumor core from 3D MR images. Because the location, size, shape and intensity of brain tumors vary greatly, itis very difficult to segment these brain tumor regions automatically. In this paper, by combining the advantagesof DenseNet and ResNet, we proposed a new 3D U-Net with dense encoder blocks and residual decoder blocks.We used dense blocks in the encoder part and residual blocks in the decoder part. The number of output featuremaps increases with the network layers in contracting path of encoder, which is consistent with the characteristicsof dense blocks. Using dense blocks can decrease the number of network parameters, deepen network layers,strengthen feature propagation, alleviate vanishing-gradient and enlarge receptive fields. The residual blockswere used in the decoder to replace the convolution neural block of original U-Net, which made the networkperformance better. Our proposed approach was trained and validated on the BraTS2019 training and validationdata set. We obtained dice scores of 0.901, 0.815 and 0.766 for whole tumor, tumor core and enhancing tumorcore respectively on the BraTS2019 validation data set. Our method has the better performance than the original3D U-Net. The results of our experiment demonstrate that compared with some state-of-the-art methods, ourapproach is a competitive automatic brain tumor segmentation method.
基金supported by the National Key Research and Development Program of China under Grant 2020YFC2004003 and Grant 2020YFC2004002the National Nature Science Foundation of China(NSFC)under Grant No.61571106.
文摘Masking-based and spectrum mapping-based methods are the two main algorithms of speech enhancement with deep neural network(DNN).But the mapping-based methods only utilizes the phase of noisy speech,which limits the upper bound of speech enhancement performance.Maskingbased methods need to accurately estimate the masking which is still the key problem.Combining the advantages of above two types of methods,this paper proposes the speech enhancement algorithm MM-RDN(maskingmapping residual dense network)based on masking-mapping(MM)and residual dense network(RDN).Using the logarithmic power spectrogram(LPS)of consecutive frames,MM estimates the ideal ratio masking(IRM)matrix of consecutive frames.RDN can make full use of feature maps of all layers.Meanwhile,using the global residual learning to combine the shallow features and deep features,RDN obtains the global dense features from the LPS,thereby improves estimated accuracy of the IRM matrix.Simulations show that the proposed method achieves attractive speech enhancement performance in various acoustic environments.Specifically,in the untrained acoustic test with limited priors,e.g.,unmatched signal-to-noise ratio(SNR)and unmatched noise category,MM-RDN can still outperform the existing convolutional recurrent network(CRN)method in themeasures of perceptual evaluation of speech quality(PESQ)and other evaluation indexes.It indicates that the proposed algorithm is more generalized in untrained conditions.
基金supported,in part,by the National Nature Science Foundation of China under Grant Numbers 62272236,62376128in part,by the Natural Science Foundation of Jiangsu Province under Grant Numbers BK20201136,BK20191401.
文摘Fall behavior is closely related to high mortality in the elderly,so fall detection becomes an important and urgent research area.However,the existing fall detection methods are difficult to be applied in daily life due to a large amount of calculation and poor detection accuracy.To solve the above problems,this paper proposes a dense spatial-temporal graph convolutional network based on lightweight OpenPose.Lightweight OpenPose uses MobileNet as a feature extraction network,and the prediction layer uses bottleneck-asymmetric structure,thus reducing the amount of the network.The bottleneck-asymmetrical structure compresses the number of input channels of feature maps by 1×1 convolution and replaces the 7×7 convolution structure with the asymmetric structure of 1×7 convolution,7×1 convolution,and 7×7 convolution in parallel.The spatial-temporal graph convolutional network divides the multi-layer convolution into dense blocks,and the convolutional layers in each dense block are connected,thus improving the feature transitivity,enhancing the network’s ability to extract features,thus improving the detection accuracy.Two representative datasets,Multiple Cameras Fall dataset(MCF),and Nanyang Technological University Red Green Blue+Depth Action Recognition dataset(NTU RGB+D),are selected for our experiments,among which NTU RGB+D has two evaluation benchmarks.The results show that the proposed model is superior to the current fall detection models.The accuracy of this network on the MCF dataset is 96.3%,and the accuracies on the two evaluation benchmarks of the NTU RGB+D dataset are 85.6%and 93.5%,respectively.
基金This work is supported by the National Key Research and Development Program of China under Grant 2020YFC2004003 and Grant 2020YFC2004002the National Nature Science Foundation of China(NSFC)under Grant No.61571106。
文摘Generative adversarial networks(GANs)are paid more attention to dealing with the end-to-end speech enhancement in recent years.Various GANbased enhancement methods are presented to improve the quality of reconstructed speech.However,the performance of these GAN-based methods is worse than those of masking-based methods.To tackle this problem,we propose speech enhancement method with a residual dense generative adversarial network(RDGAN)contributing to map the log-power spectrum(LPS)of degraded speech to the clean one.In detail,a residual dense block(RDB)architecture is designed to better estimate the LPS of clean speech,which can extract rich local features of LPS through densely connected convolution layers.Meanwhile,sequential RDB connections are incorporated on various scales of LPS.It significantly increases the feature learning flexibility and robustness in the time-frequency domain.Simulations show that the proposed method achieves attractive speech enhancement performance in various acoustic environments.Specifically,in the untrained acoustic test with limited priors,e.g.,unmatched signal-to-noise ratio(SNR)and unmatched noise category,RDGAN can still outperform the existing GAN-based methods and masking-based method in the measures of PESQ and other evaluation indexes.It indicates that our method is more generalized in untrained conditions.
文摘Carbon-free Al_(2)O_(3)-MgO dense bricks were produced by the pressing method,using tabular alumina,white fused alumina,alumina micro-powder as main raw materials,and inorganic powder as the binder.The comprehensive properties and performance in steel ladle side wall were made a comparison between Al_(2)O_(3)-MgO dense bricks and precast blocks.The results show that Al_(2)O_(3)-MgO dense bricks exhibit high dense structure and strength,as well as superior thermal shock resistance and better penetration and corrosion resistance to slag than precast blocks.While replacing precast blocks with dense bricks in 250 t steel ladle side wall in some domestic steel mills,the thickness of the metamorphic layer from slag penetration and the corrosion rate decrease evidently.The damage of dense bricks during service is mainly caused by the corrosion from molten steel and slag,and the structure spalling of the metamorphic layer also plays an important role.
文摘针对超密集网络(ultra dense network,UDN)中基站密集部署导致的严重层间干扰问题,构建了考虑频谱复用和共信道干扰条件下最大化系统总吞吐量问题模型,提出了一种基于块坐标下降(block coordinate descent,BCD)法的联合频谱资源优化(joint resource optimization based on BCD,JROBB)方法。该方法将原问题分解为分簇、子信道分配和功率分配三个子问题,通过BCD法迭代优化子信道分配和功率分配,逼近原问题的最优解。仿真分析表明,在复杂度提升有限的情况下,系统总吞吐量比现有典型算法平均至少提升22%,可以有效提升频谱利用率。
文摘针对工业场景下图像模糊、分辨率低、边缘细节不明显等问题,提出一种基于生成对抗网络的低质图像增强算法。首先,设计退化网络获得与真实场景更为接近的低质图像,以此与现实高清图像获得特征映射关系;其次,在使用密集残差块(residual in residual dense block,RRDB)的基础上添加卷积注意力模块,增强RRDB网络的特征表达能力,以有效地捕获关键特征信息;最后,设计边缘增强网络模块结合改进的RRDB作为生成器,图像细节信息的捕捉与还原能力得到显著提升,并与判别器对抗生成更高质量的图像。实验结果表明,相较于现有常用的图像增强算法,所提算法能有效提升工业场景图像清晰度、保留图像细节并减少失真。定量指标峰值信噪比平均提升10.45%,结构相似性平均提升15.92%,运行速度快,能满足工业生产需求。