Computed Tomography(CT)is a commonly used technology in Printed Circuit Boards(PCB)non-destructive testing,and element segmentation of CT images is a key subsequent step.With the development of deep learning,researche...Computed Tomography(CT)is a commonly used technology in Printed Circuit Boards(PCB)non-destructive testing,and element segmentation of CT images is a key subsequent step.With the development of deep learning,researchers began to exploit the“pre-training and fine-tuning”training process for multi-element segmentation,reducing the time spent on manual annotation.However,the existing element segmentation model only focuses on the overall accuracy at the pixel level,ignoring whether the element connectivity relationship can be correctly identified.To this end,this paper proposes a PCB CT image element segmentation model optimizing the semantic perception of connectivity relationship(OSPC-seg).The overall training process adopts a“pre-training and fine-tuning”training process.A loss function that optimizes the semantic perception of circuit connectivity relationship(OSPC Loss)is designed from the aspect of alleviating the class imbalance problem and improving the correct connectivity rate.Also,the correct connectivity rate index(CCR)is proposed to evaluate the model’s connectivity relationship recognition capabilities.Experiments show that mIoU and CCR of OSPC-seg on our datasets are 90.1%and 97.0%,improved by 1.5%and 1.6%respectively compared with the baseline model.From visualization results,it can be seen that the segmentation performance of connection positions is significantly improved,which also demonstrates the effectiveness of OSPC-seg.展开更多
Image enhancement technology plays a very important role to improve image quality in image processing. By enhancing some information and restraining other information selectively, it can improve image visual effect. T...Image enhancement technology plays a very important role to improve image quality in image processing. By enhancing some information and restraining other information selectively, it can improve image visual effect. The objective of this work is to implement the image enhancement to gray scale images using different techniques. After the fundamental methods of image enhancement processing are demonstrated, image enhancement algorithms based on space and frequency domains are systematically investigated and compared. The advantage and defect of the above-mentioned algorithms are analyzed. The algorithms of wavelet based image enhancement are also deduced and generalized. Wavelet transform modulus maxima(WTMM) is a method for detecting the fractal dimension of a signal, it is well used for image enhancement. The image techniques are compared by using the mean(μ),standard deviation(?), mean square error(MSE) and PSNR(peak signal to noise ratio). A group of experimental results demonstrate that the image enhancement algorithm based on wavelet transform is effective for image de-noising and enhancement. Wavelet transform modulus maxima method is one of the best methods for image enhancement.展开更多
We present a masked vision-language transformer(MVLT)for fashion-specific multi-modal representation.Technically,we simply utilize the vision transformer architecture for replacing the bidirectional encoder representa...We present a masked vision-language transformer(MVLT)for fashion-specific multi-modal representation.Technically,we simply utilize the vision transformer architecture for replacing the bidirectional encoder representations from Transformers(BERT)in the pre-training model,making MVLT the first end-to-end framework for the fashion domain.Besides,we designed masked image reconstruction(MIR)for a fine-grained understanding of fashion.MVLT is an extensible and convenient architecture that admits raw multimodal inputs without extra pre-processing models(e.g.,ResNet),implicitly modeling the vision-language alignments.More importantly,MVLT can easily generalize to various matching and generative tasks.Experimental results show obvious improvements in retrieval(rank@5:17%)and recognition(accuracy:3%)tasks over the Fashion-Gen 2018 winner,Kaleido-BERT.The code is available at https://github.com/GewelsJI/MVLT.展开更多
A novel single color camera trichromatic mask 3D-PIV technique suitable for measurement of complex flow fields in confined spaces is presented in this paper.By using a trichromatic mask to modulate the imaging optical...A novel single color camera trichromatic mask 3D-PIV technique suitable for measurement of complex flow fields in confined spaces is presented in this paper.By using a trichromatic mask to modulate the imaging optical path of a color camera,the RGB(Red,Green,and Blue)channels of the photosensitive chip were used to record full-frame full-resolution images of tracer particles from three viewing angles.The MLOS-SMART particle reconstruction algorithm was used to obtain three-dimensional particle distribution matrix from particle trichromatic mask images.The impact of parameters such as the inter-hole spacing and hole diameter of the trichromatic mask on the quality of particle reconstruction was analyzed.Through numerical simulation experiments on artificially synthesized three-dimensional flow fields of Gaussian vortex rings,the practicality of this technique in measuring three-dimensional transient velocity fields and the accuracy of velocity measurements were examined.The accuracy and feasibility of the technique are illustrated based on experimental measurements of a zero-net-mass-flux jet.展开更多
文摘Computed Tomography(CT)is a commonly used technology in Printed Circuit Boards(PCB)non-destructive testing,and element segmentation of CT images is a key subsequent step.With the development of deep learning,researchers began to exploit the“pre-training and fine-tuning”training process for multi-element segmentation,reducing the time spent on manual annotation.However,the existing element segmentation model only focuses on the overall accuracy at the pixel level,ignoring whether the element connectivity relationship can be correctly identified.To this end,this paper proposes a PCB CT image element segmentation model optimizing the semantic perception of connectivity relationship(OSPC-seg).The overall training process adopts a“pre-training and fine-tuning”training process.A loss function that optimizes the semantic perception of circuit connectivity relationship(OSPC Loss)is designed from the aspect of alleviating the class imbalance problem and improving the correct connectivity rate.Also,the correct connectivity rate index(CCR)is proposed to evaluate the model’s connectivity relationship recognition capabilities.Experiments show that mIoU and CCR of OSPC-seg on our datasets are 90.1%and 97.0%,improved by 1.5%and 1.6%respectively compared with the baseline model.From visualization results,it can be seen that the segmentation performance of connection positions is significantly improved,which also demonstrates the effectiveness of OSPC-seg.
基金Projects(61376076,61274026,61377024)supported by the National Natural Science Foundation of ChinaProjects(12C0108,13C321)supported by the Scientific Research Fund of Hunan Provincial Education Department,ChinaProjects(2013FJ2011,2014FJ2017,2013FJ4232)supported by the Science and Technology Plan Foundation of Hunan Province,China
文摘Image enhancement technology plays a very important role to improve image quality in image processing. By enhancing some information and restraining other information selectively, it can improve image visual effect. The objective of this work is to implement the image enhancement to gray scale images using different techniques. After the fundamental methods of image enhancement processing are demonstrated, image enhancement algorithms based on space and frequency domains are systematically investigated and compared. The advantage and defect of the above-mentioned algorithms are analyzed. The algorithms of wavelet based image enhancement are also deduced and generalized. Wavelet transform modulus maxima(WTMM) is a method for detecting the fractal dimension of a signal, it is well used for image enhancement. The image techniques are compared by using the mean(μ),standard deviation(?), mean square error(MSE) and PSNR(peak signal to noise ratio). A group of experimental results demonstrate that the image enhancement algorithm based on wavelet transform is effective for image de-noising and enhancement. Wavelet transform modulus maxima method is one of the best methods for image enhancement.
文摘We present a masked vision-language transformer(MVLT)for fashion-specific multi-modal representation.Technically,we simply utilize the vision transformer architecture for replacing the bidirectional encoder representations from Transformers(BERT)in the pre-training model,making MVLT the first end-to-end framework for the fashion domain.Besides,we designed masked image reconstruction(MIR)for a fine-grained understanding of fashion.MVLT is an extensible and convenient architecture that admits raw multimodal inputs without extra pre-processing models(e.g.,ResNet),implicitly modeling the vision-language alignments.More importantly,MVLT can easily generalize to various matching and generative tasks.Experimental results show obvious improvements in retrieval(rank@5:17%)and recognition(accuracy:3%)tasks over the Fashion-Gen 2018 winner,Kaleido-BERT.The code is available at https://github.com/GewelsJI/MVLT.
基金co-supported by the National Natural Science Foundation of China(Nos.12102284,12172242,12332017)the Shanxi Province Science Foundation for Youths,China(No.20210302124262)the Chunhui Project Foundation of the Education Department of China(No.202200257)。
文摘A novel single color camera trichromatic mask 3D-PIV technique suitable for measurement of complex flow fields in confined spaces is presented in this paper.By using a trichromatic mask to modulate the imaging optical path of a color camera,the RGB(Red,Green,and Blue)channels of the photosensitive chip were used to record full-frame full-resolution images of tracer particles from three viewing angles.The MLOS-SMART particle reconstruction algorithm was used to obtain three-dimensional particle distribution matrix from particle trichromatic mask images.The impact of parameters such as the inter-hole spacing and hole diameter of the trichromatic mask on the quality of particle reconstruction was analyzed.Through numerical simulation experiments on artificially synthesized three-dimensional flow fields of Gaussian vortex rings,the practicality of this technique in measuring three-dimensional transient velocity fields and the accuracy of velocity measurements were examined.The accuracy and feasibility of the technique are illustrated based on experimental measurements of a zero-net-mass-flux jet.