In this Letter,we report a Golay3 sparse-aperture telescope newly built in the Key Laboratory of Optical Engineering,Chinese Academy of Sciences and present the experimental results of enhanced resolution.The telescop...In this Letter,we report a Golay3 sparse-aperture telescope newly built in the Key Laboratory of Optical Engineering,Chinese Academy of Sciences and present the experimental results of enhanced resolution.The telescope consisting of 3 collector telescopes of 127 mm diameter can achieve a theoretical resolution corresponding to a monolithic aperture of 245 mm diameter.It is shown by the experimental results that the resolution is improved to 3.33μrad with respect to the diffraction limit of 6.07μrad for a single aperture using the Rayleigh criteria at 632 nm.The compact optical configuration and cophasing approach are also described.展开更多
The cocktail party problem,i.e.,tracing and recognizing the speech of a specific speaker when multiple speakers talk simultaneously,is one of the critical problems yet to be solved to enable the wide application of au...The cocktail party problem,i.e.,tracing and recognizing the speech of a specific speaker when multiple speakers talk simultaneously,is one of the critical problems yet to be solved to enable the wide application of automatic speech recognition(ASR) systems.In this overview paper,we review the techniques proposed in the last two decades in attacking this problem.We focus our discussions on the speech separation problem given its central role in the cocktail party environment,and describe the conventional single-channel techniques such as computational auditory scene analysis(CASA),non-negative matrix factorization(NMF) and generative models,the conventional multi-channel techniques such as beamforming and multi-channel blind source separation,and the newly developed deep learning-based techniques,such as deep clustering(DPCL),the deep attractor network(DANet),and permutation invariant training(PIT).We also present techniques developed to improve ASR accuracy and speaker identification in the cocktail party environment.We argue effectively exploiting information in the microphone array,the acoustic training set,and the language itself using a more powerful model.Better optimization ob jective and techniques will be the approach to solving the cocktail party problem.展开更多
Because texture images cannot be directly processed by the gray level information of individual pixel,we propose a new texture descriptor which reflects the intensity distribution of the patch centered at each pixel.T...Because texture images cannot be directly processed by the gray level information of individual pixel,we propose a new texture descriptor which reflects the intensity distribution of the patch centered at each pixel.Then the general multiphase image segmentation model of Potts model is extended for texture segmentation by adding the region information of the texture descriptor.A fast numerical scheme based on the split Bregman method is designed to speed up the computational process.The algorithm is efficient,and both the texture descriptor and the characteristic functions can be implemented easily.Experiments using synthetic texture images,real natural scene images and synthetic aperture radar images are presented to give qualitative comparisons between our method and other state-of-the-art techniques.The results show that our method can accurately segment object regions and is competitive compared with other methods especially in segmenting natural images.展开更多
基金supported by the National Natural Science Foundation of China(No.61205144)the Research Project of National University of Defense Technology(No.JC13-07-01)the Key Laboratory of High Power Laser and Physics,CAS
文摘In this Letter,we report a Golay3 sparse-aperture telescope newly built in the Key Laboratory of Optical Engineering,Chinese Academy of Sciences and present the experimental results of enhanced resolution.The telescope consisting of 3 collector telescopes of 127 mm diameter can achieve a theoretical resolution corresponding to a monolithic aperture of 245 mm diameter.It is shown by the experimental results that the resolution is improved to 3.33μrad with respect to the diffraction limit of 6.07μrad for a single aperture using the Rayleigh criteria at 632 nm.The compact optical configuration and cophasing approach are also described.
基金supported by the Tencent and Shanghai Jiao Tong University Joint Project
文摘The cocktail party problem,i.e.,tracing and recognizing the speech of a specific speaker when multiple speakers talk simultaneously,is one of the critical problems yet to be solved to enable the wide application of automatic speech recognition(ASR) systems.In this overview paper,we review the techniques proposed in the last two decades in attacking this problem.We focus our discussions on the speech separation problem given its central role in the cocktail party environment,and describe the conventional single-channel techniques such as computational auditory scene analysis(CASA),non-negative matrix factorization(NMF) and generative models,the conventional multi-channel techniques such as beamforming and multi-channel blind source separation,and the newly developed deep learning-based techniques,such as deep clustering(DPCL),the deep attractor network(DANet),and permutation invariant training(PIT).We also present techniques developed to improve ASR accuracy and speaker identification in the cocktail party environment.We argue effectively exploiting information in the microphone array,the acoustic training set,and the language itself using a more powerful model.Better optimization ob jective and techniques will be the approach to solving the cocktail party problem.
基金supported by the National Natural Science Foundation of China(No.61170106)
文摘Because texture images cannot be directly processed by the gray level information of individual pixel,we propose a new texture descriptor which reflects the intensity distribution of the patch centered at each pixel.Then the general multiphase image segmentation model of Potts model is extended for texture segmentation by adding the region information of the texture descriptor.A fast numerical scheme based on the split Bregman method is designed to speed up the computational process.The algorithm is efficient,and both the texture descriptor and the characteristic functions can be implemented easily.Experiments using synthetic texture images,real natural scene images and synthetic aperture radar images are presented to give qualitative comparisons between our method and other state-of-the-art techniques.The results show that our method can accurately segment object regions and is competitive compared with other methods especially in segmenting natural images.