Fine-grained image classification is a challenging research topic because of the high degree of similarity among categories and the high degree of dissimilarity for a specific category caused by different poses and scal...Fine-grained image classification is a challenging research topic because of the high degree of similarity among categories and the high degree of dissimilarity for a specific category caused by different poses and scales.A cul-tural heritage image is one of thefine-grained images because each image has the same similarity in most cases.Using the classification technique,distinguishing cultural heritage architecture may be difficult.This study proposes a cultural heri-tage content retrieval method using adaptive deep learning forfine-grained image retrieval.The key contribution of this research was the creation of a retrieval mod-el that could handle incremental streams of new categories while maintaining its past performance in old categories and not losing the old categorization of a cul-tural heritage image.The goal of the proposed method is to perform a retrieval task for classes.Incremental learning for new classes was conducted to reduce the re-training process.In this step,the original class is not necessary for re-train-ing which we call an adaptive deep learning technique.Cultural heritage in the case of Thai archaeological site architecture was retrieved through machine learn-ing and image processing.We analyze the experimental results of incremental learning forfine-grained images with images of Thai archaeological site architec-ture from world heritage provinces in Thailand,which have a similar architecture.Using afine-grained image retrieval technique for this group of cultural heritage images in a database can solve the problem of a high degree of similarity among categories and a high degree of dissimilarity for a specific category.The proposed method for retrieving the correct image from a database can deliver an average accuracy of 85 percent.Adaptive deep learning forfine-grained image retrieval was used to retrieve cultural heritage content,and it outperformed state-of-the-art methods infine-grained image retrieval.展开更多
The remote sensing ships’fine-grained classification technology makes it possible to identify certain ship types in remote sensing images,and it has broad application prospects in civil and military fields.However,th...The remote sensing ships’fine-grained classification technology makes it possible to identify certain ship types in remote sensing images,and it has broad application prospects in civil and military fields.However,the current model does not examine the properties of ship targets in remote sensing images with mixed multi-granularity features and a complicated backdrop.There is still an opportunity for future enhancement of the classification impact.To solve the challenges brought by the above characteristics,this paper proposes a Metaformer and Residual fusion network based on Visual Attention Network(VAN-MR)for fine-grained classification tasks.For the complex background of remote sensing images,the VAN-MR model adopts the parallel structure of large kernel attention and spatial attention to enhance the model’s feature extraction ability of interest targets and improve the classification performance of remote sensing ship targets.For the problem of multi-grained feature mixing in remote sensing images,the VAN-MR model uses a Metaformer structure and a parallel network of residual modules to extract ship features.The parallel network has different depths,considering both high-level and lowlevel semantic information.The model achieves better classification performance in remote sensing ship images with multi-granularity mixing.Finally,the model achieves 88.73%and 94.56%accuracy on the public fine-grained ship collection-23(FGSC-23)and FGSCR-42 datasets,respectively,while the parameter size is only 53.47 M,the floating point operations is 9.9 G.The experimental results show that the classification effect of VAN-MR is superior to that of traditional CNNs model and visual model with Transformer structure under the same parameter quantity.展开更多
Image captioning involves two different major modalities(image and sentence)that convert a given image into a language that adheres to visual semantics.Almost all methods first extract image features to reduce the dif...Image captioning involves two different major modalities(image and sentence)that convert a given image into a language that adheres to visual semantics.Almost all methods first extract image features to reduce the difficulty of visual semantic embedding and then use the caption model to generate fluent sentences.The Convolutional Neural Network(CNN)is often used to extract image features in image captioning,and the use of object detection networks to extract region features has achieved great success.However,the region features retrieved by this method are object-level and do not pay attention to fine-grained details because of the detection model’s limitation.We offer an approach to address this issue that more properly generates captions by fusing fine-grained features and region features.First,we extract fine-grained features using a panoramic segmentation algorithm.Second,we suggest two fusion methods and contrast their fusion outcomes.An X-linear Attention Network(X-LAN)serves as the foundation for both fusion methods.According to experimental findings on the COCO dataset,the two-branch fusion approach is superior.It is important to note that on the COCO Karpathy test split,CIDEr is increased up to 134.3%in comparison to the baseline,highlighting the potency and viability of our method.展开更多
Fine-grained image search is one of the most challenging tasks in computer vision that aims to retrieve similar images at the fine-grained level for a given query image.The key objective is to learn discriminative fin...Fine-grained image search is one of the most challenging tasks in computer vision that aims to retrieve similar images at the fine-grained level for a given query image.The key objective is to learn discriminative fine-grained features by training deep models such that similar images are clustered,and dissimilar images are separated in the low embedding space.Previous works primarily focused on defining local structure loss functions like triplet loss,pairwise loss,etc.However,training via these approaches takes a long training time,and they have poor accuracy.Additionally,representations learned through it tend to tighten up in the embedded space and lose generalizability to unseen classes.This paper proposes a noise-assisted representation learning method for fine-grained image retrieval to mitigate these issues.In the proposed work,class manifold learning is performed in which positive pairs are created with noise insertion operation instead of tightening class clusters.And other instances are treated as negatives within the same cluster.Then a loss function is defined to penalize when the distance between instances of the same class becomes too small relative to the noise pair in that class in embedded space.The proposed approach is validated on CARS-196 and CUB-200 datasets and achieved better retrieval results(85.38%recall@1 for CARS-196%and 70.13%recall@1 for CUB-200)compared to other existing methods.展开更多
The fine-grained ship image recognition task aims to identify various classes of ships.However,small inter-class,large intra-class differences between ships,and lacking of training samples are the reasons that make th...The fine-grained ship image recognition task aims to identify various classes of ships.However,small inter-class,large intra-class differences between ships,and lacking of training samples are the reasons that make the task difficult.Therefore,to enhance the accuracy of the fine-grained ship image recognition,we design a fine-grained ship image recognition network based on bilinear convolutional neural network(BCNN)with Inception and additive margin Softmax(AM-Softmax).This network improves the BCNN in two aspects.Firstly,by introducing Inception branches to the BCNN network,it is helpful to enhance the ability of extracting comprehensive features from ships.Secondly,by adding margin values to the decision boundary,the AM-Softmax function can better extend the inter-class differences and reduce the intra-class differences.In addition,as there are few publicly available datasets for fine-grained ship image recognition,we construct a Ship-43 dataset containing 47,300 ship images belonging to 43 categories.Experimental results on the constructed Ship-43 dataset demonstrate that our method can effectively improve the accuracy of ship image recognition,which is 4.08%higher than the BCNN model.Moreover,comparison results on the other three public fine-grained datasets(Cub,Cars,and Aircraft)further validate the effectiveness of the proposed method.展开更多
For the task of content retrieval,analysis and generation of film and television scene images in the field of intelligent editing,fine-grained emotion recognition and prediction of images is of great significance.In t...For the task of content retrieval,analysis and generation of film and television scene images in the field of intelligent editing,fine-grained emotion recognition and prediction of images is of great significance.In this paper,the fusion of traditional perceptual features,art features and multi-channel deep learning features are used to reflect the emotion expression of different levels of the image.In addition,the integrated learning model with stacking architecture based on linear regression coefficient and sentiment correlations,which is called the LS-stacking model,is proposed according to the factor association between multi-dimensional emotions.The experimental results prove that the mixed feature and LS-stacking model can predict well on the 16 emotion categories of the self-built image dataset.This study improves the fine-grained recognition ability of image emotion by computers,which helps to increase the intelligence and automation degree of visual retrieval and post-production system.展开更多
Core,thin section,conventional and image logs are used to provide insights into distribution of fractures in fine grained sedimentary rocks of Permian Lucaogou Formation in Jimusar Sag.Bedding parallel fractures are c...Core,thin section,conventional and image logs are used to provide insights into distribution of fractures in fine grained sedimentary rocks of Permian Lucaogou Formation in Jimusar Sag.Bedding parallel fractures are common in fine grained sedimentary rocks which are characterized by layered structures.Core and thin section analysis reveal that fractures in Lucaogou Formation include tectonic inclined fracture,bedding parallel fracture,and abnormal high pressure fracture.Bedding parallel fractures are abundant,but only minor amounts of them remain open,and most of them are partly to fully sealed by carbonate minerals(calcite)and bitumen.Bedding parallel fractures result in a rapid decrease in resistivity,and they are recognized on image logs to extend along bedding planes and have discontinuous surfaces due to partly-fully filled resistive carbonate minerals as well as late stage dissolution.A comprehensive interpretation of distribution of bedding parallel fractures is performed with green line,red line,yellow line and blue line representing bedding planes,induced fractures,resistive fractures,and open(bedding and inclined)fractures,respectively.The strike of bedding parallel fractures is coinciding with bedding planes.Bedding parallel fractures are closely associated with the amounts of bedding planes,and high density of bedding planes favor the formation of bedding parallel fractures.Alternating dark and bright layers have the most abundant bedding parallel fractures on the image logs,and the bedding parallel fractures are always associated with low resistivity zones.The results above may help optimize sweet spots in fine grained sedimentary rocks,and improve future fracturing design and optimize well spacing.展开更多
Fine-grained sedimentary rocks have become a research focus as important reservoirs and source rocks for tight and shale oil and gas.Laminae development determines the accumulation and production of tight and shale oi...Fine-grained sedimentary rocks have become a research focus as important reservoirs and source rocks for tight and shale oil and gas.Laminae development determines the accumulation and production of tight and shale oil and gas in fine-grained rocks.However,due to the resolution limit of conventional logs,it is challenging to recognize the features of centimeter-scale laminae.To close this gap,complementary studies,including core observation,thin section,X-ray diffraction(XRD),conventional log analysis,and slabs of image logs,were conducted to unravel the centimeter-scale laminae.The laminae recognition models were built using well logs.The fine-grained rocks can be divided into laminated rocks(lamina thickness of<0.01 m),layered rocks(0.01-0.1 m),and massive rocks(no layer or layer spacing of>0.1 m)according to the laminae scale from core observations.According to the mineral superposition assemblages from thin-section observations,the laminated rocks can be further divided into binary,ternary,and multiple structures.The typical mineral components,slabs,and T2spectrum distributions of various lamina types are unraveled.The core can identify the centimeter-millimeter-scale laminae,and the thin section can identify the millimeter-micrometer-scale laminae.Furthermore,they can detect mineral types and their superposition sequence.Conventional logs can identify the meter-scale layers,whereas image logs and related slabs can identify the laminae variations at millimeter-centimeter scales.Therefore,the slab of image logs combined with thin sections can identify laminae assemblage characteristics,including the thickness and vertical assemblage.The identification and classification of lamina structure of various scales on a single well can be predicted using conventional logs,image logs,and slabs combined with thin sections.The layered rocks have better reservoir quality and oil-bearing potential than the massive and laminated rocks.The laminated rocks’binary lamina is better than the ternary and multiple layers due to the high content of felsic minerals.The abovementioned results build the prediction model for multiscale laminae structure using well logs,helping sweet spots prediction in the Permian Lucaogou Formation in the Jimusar Sag and fine-grained sedimentary rocks worldwide.展开更多
To protect personal privacy and confidential preservation,access control is used to authorize legal users for safe browsing the authorized contents on photos.The access control generates an authorization rule accordin...To protect personal privacy and confidential preservation,access control is used to authorize legal users for safe browsing the authorized contents on photos.The access control generates an authorization rule according to each permission assignment.However,the general access control is inappropriate to apply in some social services(e.g.,photos posted on Flickr and Instagram,personal image management in mobile phone) because of the increasing popularity of digital images being stored and managed.With low maintenance loads,this paper integrates the data hiding technique to propose an access control mechanism for privacy preservation.The proposed scheme changes the partial regions of a given image as random pads (called selective image encryption) and only allows the authorized people to remedy the random pads back to meaningful ones which are with similar visual qualities of original ones.展开更多
The task of food image recognition,a nuanced subset of fine-grained image recognition,grapples with substantial intra-class variation and minimal inter-class differences.These challenges are compounded by the irregula...The task of food image recognition,a nuanced subset of fine-grained image recognition,grapples with substantial intra-class variation and minimal inter-class differences.These challenges are compounded by the irregular and multi-scale nature of food images.Addressing these complexities,our study introduces an advanced model that leverages multiple attention mechanisms and multi-stage local fusion,grounded in the ConvNeXt architecture.Our model employs hybrid attention(HA)mechanisms to pinpoint critical discriminative regions within images,substantially mitigating the influence of background noise.Furthermore,it introduces a multi-stage local fusion(MSLF)module,fostering long-distance dependencies between feature maps at varying stages.This approach facilitates the assimilation of complementary features across scales,significantly bolstering the model’s capacity for feature extraction.Furthermore,we constructed a dataset named Roushi60,which consists of 60 different categories of common meat dishes.Empirical evaluation of the ETH Food-101,ChineseFoodNet,and Roushi60 datasets reveals that our model achieves recognition accuracies of 91.12%,82.86%,and 92.50%,respectively.These figures not only mark an improvement of 1.04%,3.42%,and 1.36%over the foundational ConvNeXt network but also surpass the performance of most contemporary food image recognition methods.Such advancements underscore the efficacy of our proposed model in navigating the intricate landscape of food image recognition,setting a new benchmark for the field.展开更多
Hierarchical multi-granularity image classification is a challenging task that aims to tag each given image with multiple granularity labels simultaneously.Existing methods tend to overlook that different image region...Hierarchical multi-granularity image classification is a challenging task that aims to tag each given image with multiple granularity labels simultaneously.Existing methods tend to overlook that different image regions contribute differently to label prediction at different granularities,and also insufficiently consider relationships between the hierarchical multi-granularity labels.We introduce a sequence-to-sequence mechanism to overcome these two problems and propose a multi-granularity sequence generation(MGSG)approach for the hierarchical multi-granularity image classification task.Specifically,we introduce a transformer architecture to encode the image into visual representation sequences.Next,we traverse the taxonomic tree and organize the multi-granularity labels into sequences,and vectorize them and add positional information.The proposed multi-granularity sequence generation method builds a decoder that takes visual representation sequences and semantic label embedding as inputs,and outputs the predicted multi-granularity label sequence.The decoder models dependencies and correlations between multi-granularity labels through a masked multi-head self-attention mechanism,and relates visual information to the semantic label information through a crossmodality attention mechanism.In this way,the proposed method preserves the relationships between labels at different granularity levels and takes into account the influence of different image regions on labels with different granularities.Evaluations on six public benchmarks qualitatively and quantitatively demonstrate the advantages of the proposed method.Our project is available at https://github.com/liuxindazz/mgs.展开更多
As one of the most classic fields in computer vi- sion, image categorization has attracted widespread interests. Numerous algorithms have been proposed in the community, and many of them have advanced the state-of-the...As one of the most classic fields in computer vi- sion, image categorization has attracted widespread interests. Numerous algorithms have been proposed in the community, and many of them have advanced the state-of-the-art. How- ever, most existing algorithms are designed without consider- ation for the supply of computing resources. Therefore, when dealing with resource constrained tasks, these algorithms will fail to give satisfactory results. In this paper, we provide a comprehensive and in-depth introduction of recent develop- ments of the research in image categorization with resource constraints. While a large portion is based on our own work, we will also give a brief description of other elegant algo- rithms. Furthermore, we make an investigation into the re- cent developments of deep neural networks, with a focus on resource constrained deep nets.展开更多
Image categorization in massive image database is an important problem. This paper proposes an approach for image categorization, using sparse set of salient semantic information and hierarchy semantic label tree (H...Image categorization in massive image database is an important problem. This paper proposes an approach for image categorization, using sparse set of salient semantic information and hierarchy semantic label tree (HSLT) model. First, to provide more critical image semantics, the proposed sparse set of salient regions only at the focuses of visual attention instead of the entire scene was formed by our proposed saliency detection model with incorporating low and high level feature and Shotton's semantic texton forests (STFs) method. Second, we also propose a new HSLT model in terms of the sparse regional semantic information to automatically build a semantic image hierarchy, which explicitly encodes a general to specific image relationship. And last, we archived image dataset using image hierarchical semantic, which is help to improve the performance of image organizing and browsing. Extension experimefital results showed that the use of semantic hierarchies as a hierarchical organizing frame- work provides a better image annotation and organization, improves the accuracy and reduces human's effort.展开更多
Background:Dengue fever(DF)is a mosquito-borne infectious disease that has threatened tropical and subtropical regions in recent decades.An early and targeted warning of a dengue epidemic is important for vector contr...Background:Dengue fever(DF)is a mosquito-borne infectious disease that has threatened tropical and subtropical regions in recent decades.An early and targeted warning of a dengue epidemic is important for vector control.Current studies have primarily determined weather conditions to be the main factor for dengue forecasting,thereby neglecting that environmental suitability for mosquito breeding is also an important factor,especially in fine-grained intra-urban settings.Considering that street-view images are promising for depicting physical environments,this study proposes a framework for facilitating fine-grained intra-urban dengue forecasting by integrating the urban environments measured from street-view images.展开更多
基金This research was funded by King Mongkut’s University of Technology North Bangkok(Contract no.KMUTNB-62-KNOW-026).
文摘Fine-grained image classification is a challenging research topic because of the high degree of similarity among categories and the high degree of dissimilarity for a specific category caused by different poses and scales.A cul-tural heritage image is one of thefine-grained images because each image has the same similarity in most cases.Using the classification technique,distinguishing cultural heritage architecture may be difficult.This study proposes a cultural heri-tage content retrieval method using adaptive deep learning forfine-grained image retrieval.The key contribution of this research was the creation of a retrieval mod-el that could handle incremental streams of new categories while maintaining its past performance in old categories and not losing the old categorization of a cul-tural heritage image.The goal of the proposed method is to perform a retrieval task for classes.Incremental learning for new classes was conducted to reduce the re-training process.In this step,the original class is not necessary for re-train-ing which we call an adaptive deep learning technique.Cultural heritage in the case of Thai archaeological site architecture was retrieved through machine learn-ing and image processing.We analyze the experimental results of incremental learning forfine-grained images with images of Thai archaeological site architec-ture from world heritage provinces in Thailand,which have a similar architecture.Using afine-grained image retrieval technique for this group of cultural heritage images in a database can solve the problem of a high degree of similarity among categories and a high degree of dissimilarity for a specific category.The proposed method for retrieving the correct image from a database can deliver an average accuracy of 85 percent.Adaptive deep learning forfine-grained image retrieval was used to retrieve cultural heritage content,and it outperformed state-of-the-art methods infine-grained image retrieval.
文摘The remote sensing ships’fine-grained classification technology makes it possible to identify certain ship types in remote sensing images,and it has broad application prospects in civil and military fields.However,the current model does not examine the properties of ship targets in remote sensing images with mixed multi-granularity features and a complicated backdrop.There is still an opportunity for future enhancement of the classification impact.To solve the challenges brought by the above characteristics,this paper proposes a Metaformer and Residual fusion network based on Visual Attention Network(VAN-MR)for fine-grained classification tasks.For the complex background of remote sensing images,the VAN-MR model adopts the parallel structure of large kernel attention and spatial attention to enhance the model’s feature extraction ability of interest targets and improve the classification performance of remote sensing ship targets.For the problem of multi-grained feature mixing in remote sensing images,the VAN-MR model uses a Metaformer structure and a parallel network of residual modules to extract ship features.The parallel network has different depths,considering both high-level and lowlevel semantic information.The model achieves better classification performance in remote sensing ship images with multi-granularity mixing.Finally,the model achieves 88.73%and 94.56%accuracy on the public fine-grained ship collection-23(FGSC-23)and FGSCR-42 datasets,respectively,while the parameter size is only 53.47 M,the floating point operations is 9.9 G.The experimental results show that the classification effect of VAN-MR is superior to that of traditional CNNs model and visual model with Transformer structure under the same parameter quantity.
基金supported in part by the National Natural Science Foundation of China(NSFC)under Grant 6150140in part by the Youth Innovation Project(21032158-Y)of Zhejiang Sci-Tech University.
文摘Image captioning involves two different major modalities(image and sentence)that convert a given image into a language that adheres to visual semantics.Almost all methods first extract image features to reduce the difficulty of visual semantic embedding and then use the caption model to generate fluent sentences.The Convolutional Neural Network(CNN)is often used to extract image features in image captioning,and the use of object detection networks to extract region features has achieved great success.However,the region features retrieved by this method are object-level and do not pay attention to fine-grained details because of the detection model’s limitation.We offer an approach to address this issue that more properly generates captions by fusing fine-grained features and region features.First,we extract fine-grained features using a panoramic segmentation algorithm.Second,we suggest two fusion methods and contrast their fusion outcomes.An X-linear Attention Network(X-LAN)serves as the foundation for both fusion methods.According to experimental findings on the COCO dataset,the two-branch fusion approach is superior.It is important to note that on the COCO Karpathy test split,CIDEr is increased up to 134.3%in comparison to the baseline,highlighting the potency and viability of our method.
文摘Fine-grained image search is one of the most challenging tasks in computer vision that aims to retrieve similar images at the fine-grained level for a given query image.The key objective is to learn discriminative fine-grained features by training deep models such that similar images are clustered,and dissimilar images are separated in the low embedding space.Previous works primarily focused on defining local structure loss functions like triplet loss,pairwise loss,etc.However,training via these approaches takes a long training time,and they have poor accuracy.Additionally,representations learned through it tend to tighten up in the embedded space and lose generalizability to unseen classes.This paper proposes a noise-assisted representation learning method for fine-grained image retrieval to mitigate these issues.In the proposed work,class manifold learning is performed in which positive pairs are created with noise insertion operation instead of tightening class clusters.And other instances are treated as negatives within the same cluster.Then a loss function is defined to penalize when the distance between instances of the same class becomes too small relative to the noise pair in that class in embedded space.The proposed approach is validated on CARS-196 and CUB-200 datasets and achieved better retrieval results(85.38%recall@1 for CARS-196%and 70.13%recall@1 for CUB-200)compared to other existing methods.
基金This work is supported by the National Natural Science Foundation of China(61806013,61876010,62176009,and 61906005)General project of Science and Technology Planof Beijing Municipal Education Commission(KM202110005028)+2 种基金Beijing Municipal Education Commission Project(KZ201910005008)Project of Interdisciplinary Research Institute of Beijing University of Technology(2021020101)International Research Cooperation Seed Fund of Beijing University of Technology(2021A01).
文摘The fine-grained ship image recognition task aims to identify various classes of ships.However,small inter-class,large intra-class differences between ships,and lacking of training samples are the reasons that make the task difficult.Therefore,to enhance the accuracy of the fine-grained ship image recognition,we design a fine-grained ship image recognition network based on bilinear convolutional neural network(BCNN)with Inception and additive margin Softmax(AM-Softmax).This network improves the BCNN in two aspects.Firstly,by introducing Inception branches to the BCNN network,it is helpful to enhance the ability of extracting comprehensive features from ships.Secondly,by adding margin values to the decision boundary,the AM-Softmax function can better extend the inter-class differences and reduce the intra-class differences.In addition,as there are few publicly available datasets for fine-grained ship image recognition,we construct a Ship-43 dataset containing 47,300 ship images belonging to 43 categories.Experimental results on the constructed Ship-43 dataset demonstrate that our method can effectively improve the accuracy of ship image recognition,which is 4.08%higher than the BCNN model.Moreover,comparison results on the other three public fine-grained datasets(Cub,Cars,and Aircraft)further validate the effectiveness of the proposed method.
基金Supported by the Open Project of Key Laboratory of Audio and Video Restoration and Evaluation(2021KFKT005)。
文摘For the task of content retrieval,analysis and generation of film and television scene images in the field of intelligent editing,fine-grained emotion recognition and prediction of images is of great significance.In this paper,the fusion of traditional perceptual features,art features and multi-channel deep learning features are used to reflect the emotion expression of different levels of the image.In addition,the integrated learning model with stacking architecture based on linear regression coefficient and sentiment correlations,which is called the LS-stacking model,is proposed according to the factor association between multi-dimensional emotions.The experimental results prove that the mixed feature and LS-stacking model can predict well on the 16 emotion categories of the self-built image dataset.This study improves the fine-grained recognition ability of image emotion by computers,which helps to increase the intelligence and automation degree of visual retrieval and post-production system.
基金financially supported by the National Natural Science Foundation of China(No.42002133,42072150)Natural Science Foundation of Beijing(8204069)+1 种基金Strategic Cooperation Project of PetroChina and CUPB(ZLZX2020-01-06-01)Science Foundation of China University of Petroleum,Beijing(No.2462021YXZZ003)
文摘Core,thin section,conventional and image logs are used to provide insights into distribution of fractures in fine grained sedimentary rocks of Permian Lucaogou Formation in Jimusar Sag.Bedding parallel fractures are common in fine grained sedimentary rocks which are characterized by layered structures.Core and thin section analysis reveal that fractures in Lucaogou Formation include tectonic inclined fracture,bedding parallel fracture,and abnormal high pressure fracture.Bedding parallel fractures are abundant,but only minor amounts of them remain open,and most of them are partly to fully sealed by carbonate minerals(calcite)and bitumen.Bedding parallel fractures result in a rapid decrease in resistivity,and they are recognized on image logs to extend along bedding planes and have discontinuous surfaces due to partly-fully filled resistive carbonate minerals as well as late stage dissolution.A comprehensive interpretation of distribution of bedding parallel fractures is performed with green line,red line,yellow line and blue line representing bedding planes,induced fractures,resistive fractures,and open(bedding and inclined)fractures,respectively.The strike of bedding parallel fractures is coinciding with bedding planes.Bedding parallel fractures are closely associated with the amounts of bedding planes,and high density of bedding planes favor the formation of bedding parallel fractures.Alternating dark and bright layers have the most abundant bedding parallel fractures on the image logs,and the bedding parallel fractures are always associated with low resistivity zones.The results above may help optimize sweet spots in fine grained sedimentary rocks,and improve future fracturing design and optimize well spacing.
基金National Natural Science Foundation of China(Grant No.42002133,42072150)Science Foundation of China University of Petroleum,Beijing(No.2462021YXZZ003)the Strategic Cooperation Technology Projects of CNPC and CUPB(ZLZX2020-01-06)for the financial supports and permissions to publish this paper
文摘Fine-grained sedimentary rocks have become a research focus as important reservoirs and source rocks for tight and shale oil and gas.Laminae development determines the accumulation and production of tight and shale oil and gas in fine-grained rocks.However,due to the resolution limit of conventional logs,it is challenging to recognize the features of centimeter-scale laminae.To close this gap,complementary studies,including core observation,thin section,X-ray diffraction(XRD),conventional log analysis,and slabs of image logs,were conducted to unravel the centimeter-scale laminae.The laminae recognition models were built using well logs.The fine-grained rocks can be divided into laminated rocks(lamina thickness of<0.01 m),layered rocks(0.01-0.1 m),and massive rocks(no layer or layer spacing of>0.1 m)according to the laminae scale from core observations.According to the mineral superposition assemblages from thin-section observations,the laminated rocks can be further divided into binary,ternary,and multiple structures.The typical mineral components,slabs,and T2spectrum distributions of various lamina types are unraveled.The core can identify the centimeter-millimeter-scale laminae,and the thin section can identify the millimeter-micrometer-scale laminae.Furthermore,they can detect mineral types and their superposition sequence.Conventional logs can identify the meter-scale layers,whereas image logs and related slabs can identify the laminae variations at millimeter-centimeter scales.Therefore,the slab of image logs combined with thin sections can identify laminae assemblage characteristics,including the thickness and vertical assemblage.The identification and classification of lamina structure of various scales on a single well can be predicted using conventional logs,image logs,and slabs combined with thin sections.The layered rocks have better reservoir quality and oil-bearing potential than the massive and laminated rocks.The laminated rocks’binary lamina is better than the ternary and multiple layers due to the high content of felsic minerals.The abovementioned results build the prediction model for multiscale laminae structure using well logs,helping sweet spots prediction in the Permian Lucaogou Formation in the Jimusar Sag and fine-grained sedimentary rocks worldwide.
基金supported by MOST under Grant No. 107-2221-E-182-081-MY3。
文摘To protect personal privacy and confidential preservation,access control is used to authorize legal users for safe browsing the authorized contents on photos.The access control generates an authorization rule according to each permission assignment.However,the general access control is inappropriate to apply in some social services(e.g.,photos posted on Flickr and Instagram,personal image management in mobile phone) because of the increasing popularity of digital images being stored and managed.With low maintenance loads,this paper integrates the data hiding technique to propose an access control mechanism for privacy preservation.The proposed scheme changes the partial regions of a given image as random pads (called selective image encryption) and only allows the authorized people to remedy the random pads back to meaningful ones which are with similar visual qualities of original ones.
基金The support of this research was by Hubei Provincial Natural Science Foundation(2022CFB449)Science Research Foundation of Education Department of Hubei Province(B2020061),are gratefully acknowledged.
文摘The task of food image recognition,a nuanced subset of fine-grained image recognition,grapples with substantial intra-class variation and minimal inter-class differences.These challenges are compounded by the irregular and multi-scale nature of food images.Addressing these complexities,our study introduces an advanced model that leverages multiple attention mechanisms and multi-stage local fusion,grounded in the ConvNeXt architecture.Our model employs hybrid attention(HA)mechanisms to pinpoint critical discriminative regions within images,substantially mitigating the influence of background noise.Furthermore,it introduces a multi-stage local fusion(MSLF)module,fostering long-distance dependencies between feature maps at varying stages.This approach facilitates the assimilation of complementary features across scales,significantly bolstering the model’s capacity for feature extraction.Furthermore,we constructed a dataset named Roushi60,which consists of 60 different categories of common meat dishes.Empirical evaluation of the ETH Food-101,ChineseFoodNet,and Roushi60 datasets reveals that our model achieves recognition accuracies of 91.12%,82.86%,and 92.50%,respectively.These figures not only mark an improvement of 1.04%,3.42%,and 1.36%over the foundational ConvNeXt network but also surpass the performance of most contemporary food image recognition methods.Such advancements underscore the efficacy of our proposed model in navigating the intricate landscape of food image recognition,setting a new benchmark for the field.
基金supported by National Key R&D Program of China(2019YFC1521102)the National Natural Science Foundation of China(61932003)Beijing Science and Technology Plan(Z221100007722004).
文摘Hierarchical multi-granularity image classification is a challenging task that aims to tag each given image with multiple granularity labels simultaneously.Existing methods tend to overlook that different image regions contribute differently to label prediction at different granularities,and also insufficiently consider relationships between the hierarchical multi-granularity labels.We introduce a sequence-to-sequence mechanism to overcome these two problems and propose a multi-granularity sequence generation(MGSG)approach for the hierarchical multi-granularity image classification task.Specifically,we introduce a transformer architecture to encode the image into visual representation sequences.Next,we traverse the taxonomic tree and organize the multi-granularity labels into sequences,and vectorize them and add positional information.The proposed multi-granularity sequence generation method builds a decoder that takes visual representation sequences and semantic label embedding as inputs,and outputs the predicted multi-granularity label sequence.The decoder models dependencies and correlations between multi-granularity labels through a masked multi-head self-attention mechanism,and relates visual information to the semantic label information through a crossmodality attention mechanism.In this way,the proposed method preserves the relationships between labels at different granularity levels and takes into account the influence of different image regions on labels with different granularities.Evaluations on six public benchmarks qualitatively and quantitatively demonstrate the advantages of the proposed method.Our project is available at https://github.com/liuxindazz/mgs.
基金This research was supported by the National Natural Science Foundation of China (Grant No. 61422203).
文摘As one of the most classic fields in computer vi- sion, image categorization has attracted widespread interests. Numerous algorithms have been proposed in the community, and many of them have advanced the state-of-the-art. How- ever, most existing algorithms are designed without consider- ation for the supply of computing resources. Therefore, when dealing with resource constrained tasks, these algorithms will fail to give satisfactory results. In this paper, we provide a comprehensive and in-depth introduction of recent develop- ments of the research in image categorization with resource constraints. While a large portion is based on our own work, we will also give a brief description of other elegant algo- rithms. Furthermore, we make an investigation into the re- cent developments of deep neural networks, with a focus on resource constrained deep nets.
基金Acknowledgements This work was supported by National Natural Science Foundation of China (Grant Nos. 61272258, 61170124, 61170020, 61070223), and Application Foundation Research Plan of Suzhou City, China (SYG201116).
文摘Image categorization in massive image database is an important problem. This paper proposes an approach for image categorization, using sparse set of salient semantic information and hierarchy semantic label tree (HSLT) model. First, to provide more critical image semantics, the proposed sparse set of salient regions only at the focuses of visual attention instead of the entire scene was formed by our proposed saliency detection model with incorporating low and high level feature and Shotton's semantic texton forests (STFs) method. Second, we also propose a new HSLT model in terms of the sparse regional semantic information to automatically build a semantic image hierarchy, which explicitly encodes a general to specific image relationship. And last, we archived image dataset using image hierarchical semantic, which is help to improve the performance of image organizing and browsing. Extension experimefital results showed that the use of semantic hierarchies as a hierarchical organizing frame- work provides a better image annotation and organization, improves the accuracy and reduces human's effort.
文摘Background:Dengue fever(DF)is a mosquito-borne infectious disease that has threatened tropical and subtropical regions in recent decades.An early and targeted warning of a dengue epidemic is important for vector control.Current studies have primarily determined weather conditions to be the main factor for dengue forecasting,thereby neglecting that environmental suitability for mosquito breeding is also an important factor,especially in fine-grained intra-urban settings.Considering that street-view images are promising for depicting physical environments,this study proposes a framework for facilitating fine-grained intra-urban dengue forecasting by integrating the urban environments measured from street-view images.