Acoustic scene classification(ASC)is a method of recognizing and classifying environments that employ acoustic signals.Various ASC approaches based on deep learning have been developed,with convolutional neural networ...Acoustic scene classification(ASC)is a method of recognizing and classifying environments that employ acoustic signals.Various ASC approaches based on deep learning have been developed,with convolutional neural networks(CNNs)proving to be the most reliable and commonly utilized in ASC systems due to their suitability for constructing lightweight models.When using ASC systems in the real world,model complexity and device robustness are essential considerations.In this paper,we propose a two-pass mobile network for low-complexity classification of the acoustic scene,named TP-MobNet.With inverse residuals and linear bottlenecks,TPMobNet is based on MobileNetV2,and following mobile blocks,coordinate attention and two-pass fusion approaches are utilized.The log-range dependencies and precise position information in feature maps can be trained via coordinate attention.By capturing more diverse feature resolutions at the network’s end sides,two-pass fusions can also train generalization.Also,the model size is reduced by applying weight quantization to the trained model.By adding weight quantization to the trained model,the model size is also lowered.The TAU Urban Acoustic Scenes 2020 Mobile development set was used for all of the experiments.It has been confirmed that the proposed model,with a model size of 219.6 kB,achieves an accuracy of 73.94%.展开更多
The results of experiments on the synthesis of the off-axis quantized kinoforms of binary objects with the use of the weighting iterative Fourier transform (WIFT) algorithm are presented. Kinoforms are registered wi...The results of experiments on the synthesis of the off-axis quantized kinoforms of binary objects with the use of the weighting iterative Fourier transform (WIFT) algorithm are presented. Kinoforms are registered with a liquid-crystal spatial light modulator (SLM). A simple procedure to introduce the carrier frequency into the structure of an axial kinoform is proposed. An image reconstructed by an off-axis kinoform is free from the noises with the zero and close frequencies caused by the imperfection of both the phase mode of operation of the SLM and the effects of quantization of the registered phase. Data on the diffraction efficiency are also given.展开更多
基金This work was supported by Institute of Information&communications Technology Planning&Evaluation(IITP)grant funded by the Korea government(MSIT)[No.2021-0-0268,Artificial Intelligence Innovation Hub(Artificial Intelligence Institute,Seoul National University)]。
文摘Acoustic scene classification(ASC)is a method of recognizing and classifying environments that employ acoustic signals.Various ASC approaches based on deep learning have been developed,with convolutional neural networks(CNNs)proving to be the most reliable and commonly utilized in ASC systems due to their suitability for constructing lightweight models.When using ASC systems in the real world,model complexity and device robustness are essential considerations.In this paper,we propose a two-pass mobile network for low-complexity classification of the acoustic scene,named TP-MobNet.With inverse residuals and linear bottlenecks,TPMobNet is based on MobileNetV2,and following mobile blocks,coordinate attention and two-pass fusion approaches are utilized.The log-range dependencies and precise position information in feature maps can be trained via coordinate attention.By capturing more diverse feature resolutions at the network’s end sides,two-pass fusions can also train generalization.Also,the model size is reduced by applying weight quantization to the trained model.By adding weight quantization to the trained model,the model size is also lowered.The TAU Urban Acoustic Scenes 2020 Mobile development set was used for all of the experiments.It has been confirmed that the proposed model,with a model size of 219.6 kB,achieves an accuracy of 73.94%.
文摘The results of experiments on the synthesis of the off-axis quantized kinoforms of binary objects with the use of the weighting iterative Fourier transform (WIFT) algorithm are presented. Kinoforms are registered with a liquid-crystal spatial light modulator (SLM). A simple procedure to introduce the carrier frequency into the structure of an axial kinoform is proposed. An image reconstructed by an off-axis kinoform is free from the noises with the zero and close frequencies caused by the imperfection of both the phase mode of operation of the SLM and the effects of quantization of the registered phase. Data on the diffraction efficiency are also given.