Recent developments in Multimedia Internet of Things (MIoT) devices empowered with Natural Language Processing (NLP) models point to a promising future for smart devices. This technology plays an important role in industrial models such as speech understanding, emotion detection, and home automation. If an image needs to be captioned, then the objects in that image, their actions and connections, and any salient feature that remains under-projected or missing from the image should be identified. The aim of the image captioning process is to generate a caption for an image. In the next step, the image should be provided with one of the most significant and detailed descriptions that is syntactically as well as semantically correct. In this scenario, a computer vision model is used to identify the objects, and NLP approaches are followed to describe the image. The current study develops a Natural Language Processing with Optimal Deep Learning Enabled Intelligent Image Captioning System (NLPODL-IICS). The aim of the presented NLPODL-IICS model is to produce a proper description for an input image. To attain this, the proposed NLPODL-IICS follows two stages: encoding and decoding. At the encoding side, the model uses Hunger Games Search (HGS) with the Neural Architecture Search Network (NASNet) model, which represents the input data by embedding it into a vector of predefined length. During the decoding phase, the Chimp Optimization Algorithm (COA) with a deeper Long Short-Term Memory (LSTM) approach is followed to concatenate the description sentences produced by the method. The HGS and COA algorithms tune the parameters of the NASNet and LSTM models, respectively. The proposed NLPODL-IICS model was experimentally validated on two benchmark datasets. A wide-ranging comparative analysis confirmed the superior performance of the NLPODL-IICS model over other models. (CMC, 2023, vol. 74, no. 2.)
Operation control of power systems has become challenging with the increase in the scale and complexity of power distribution systems and the extensive integration of renewable energy. Improved capabilities for data-driven operation management, intelligent analysis, and mining are therefore urgently required. To explore similar regularities across historical operating sections of the power distribution system, and to help the power grid obtain high-value historical operation and maintenance experience and knowledge by rule and line, a neural information retrieval model with an attention mechanism is proposed based on graph data computing technology. Based on the processing flow of the operating data of the power distribution system, a technical framework for neural information retrieval is established. Combined with the natural graph characteristics of the power distribution system, a unified graph data structure and a data fusion method covering data access, data completion, and multi-source data are constructed. Further, a graph node feature-embedding representation learning algorithm and a neural information retrieval algorithm model are constructed. The neural information retrieval model is trained and tested using the generated set of graph node feature representation vectors. The model is verified on the operating section of the power distribution system of a provincial grid area. The results show that the proposed method achieves high accuracy in the similarity matching of historical operation characteristics and effectively supports intelligent fault diagnosis and elimination in power distribution systems.
In crowded cities, searching for an available parking lot is a herculean task that wastes drivers' time and increases air pollution and traffic congestion. Smart parking systems let drivers obtain information about parking lots in real time and book them as required. However, existing smart parking systems require drivers to reveal sensitive information, including their mobile number, personal identity, and desired destination. This disclosure makes existing centralized smart parking systems vulnerable to service providers' security breaches, single points of failure, and bottlenecks. In this paper, an Improved Asymmetric Consortium Blockchain and Homomorphically Computing Univariate Polynomial-based private information retrieval (IACB-HCUPPIR) scheme is proposed to ensure parking lot availability with transparency and security in a privacy-preserving smart parking system. Specifically, an improved Asymmetric Consortium Blockchain is used to achieve secure transactions between the parties interacting in the smart parking environment. The scheme further adopts the Homomorphically Computing Univariate Polynomial-based private information retrieval (HCUPPIR) method to preserve the location privacy of drivers. IACB-HCUPPIR achieved better results in terms of reduced computation and communication overhead, throughput, latency, and response time, with maximized preservation of drivers' privacy. Moreover, the proposed fully homomorphic encryption (FHE) algorithm was compared against partial homomorphic encryption (PHE) and a technique without encryption. The proposed model begins allocating parking slots at 24.3 s, whereas PHE begins at 24.7 s and the technique without encryption begins at 27.4 s. Thus, the proposed model allocates parking slots in less time while providing high security and privacy preservation.
With the development of Global Navigation Satellite Systems (GNSS), geodetic GNSS receivers have been utilized to monitor sea levels using GNSS Interferometric Reflectometry (GNSS-IR) technology. The multi-mode, multi-frequency signals of GPS, GLONASS, Galileo, and Beidou can be used for GNSS-IR sea level retrieval, but combining these retrievals remains problematic. To address this issue, a GNSS-IR sea level retrieval combination system has been developed, which begins by analyzing error sources in GNSS-IR sea level retrieval and then establishes and solves the GNSS-IR retrieval equation. This paper focuses on two key points: time window selection and equation stability. The stability of the retrieval combination equations is determined by the condition number of the coefficient matrix within the time window. The impact of ill-conditioned coefficient matrices on the retrieval results is demonstrated using an extreme case of SNR data with only ascending or only descending trajectories. After the time window is determined and ill-conditioned equations are removed, the multi-mode, multi-frequency GNSS-IR retrieval is performed. Results from three International GNSS Service (IGS) stations show that the combination method produces high-precision, high-resolution, and high-reliability sea level retrieval combination sequences.
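The stability check described in this abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes a two-unknown system whose coefficient matrix has rows of the form (1, c), where c is a hypothetical per-retrieval coefficient, and the threshold of 100 is an arbitrary placeholder. When every c in a window has nearly the same value (as with trajectories in one direction only), the columns become almost collinear and the condition number grows.

```python
import math

def condition_number(rows):
    """2-norm condition number of an n x 2 matrix A, from the eigenvalues of A^T A."""
    a = sum(r[0] * r[0] for r in rows)   # (A^T A)[0][0]
    b = sum(r[0] * r[1] for r in rows)   # (A^T A)[0][1]
    d = sum(r[1] * r[1] for r in rows)   # (A^T A)[1][1]
    mean = (a + d) / 2.0
    radius = math.hypot((a - d) / 2.0, b)
    lam_max, lam_min = mean + radius, mean - radius
    if lam_min <= 0.0:
        return math.inf                  # singular (or numerically singular) window
    return math.sqrt(lam_max / lam_min)

def keep_window(rows, threshold=100.0):
    """Keep a time window only if its coefficient matrix is well conditioned.
    The threshold is a hypothetical value chosen for illustration."""
    return condition_number(rows) < threshold

# Hypothetical windows: coefficients of both signs vs. nearly identical coefficients.
mixed_window = [(1.0, -1.0), (1.0, -0.5), (1.0, 0.5), (1.0, 1.0)]
one_sided_window = [(1.0, 1.00), (1.0, 1.01), (1.0, 1.02), (1.0, 1.03)]

print(condition_number(mixed_window), keep_window(mixed_window))
print(condition_number(one_sided_window), keep_window(one_sided_window))
```

A larger system would normally use a linear algebra library for the condition number; the closed form here only works because the sketch has two unknowns.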
This paper aims to develop Machine Learning algorithms to classify electronic articles related to this phenomenon through information retrieval and topic modelling. The methodology of this study is organized into three phases: the Text Classification Approach (TCA), the Proposed Algorithms Interpretation (PAI), and finally the Information Retrieval Approach (IRA). The TCA covers the text preprocessing pipeline that produces a clean corpus. The Global Vectors for Word Representation (GloVe) pre-trained model, FastText, Term Frequency-Inverse Document Frequency (TF-IDF), and Bag-of-Words (BOW) are examined as feature extraction methods. The PAI presents the Bidirectional Long Short-Term Memory (Bi-LSTM) and Convolutional Neural Network (CNN) models used to classify COVID-19 news. The IRA explains the mathematical interpretation of Latent Dirichlet Allocation (LDA), used for topic modelling in Information Retrieval (IR). In this study, 99% accuracy was obtained by performing K-fold cross-validation on Bi-LSTM with GloVe. A comparative analysis between Deep Learning and Machine Learning, based on feature extraction and computational complexity, has been performed. Furthermore, some text analyses and the most influential aspects of each document have been explored. We also used Bidirectional Encoder Representations from Transformers (BERT) as a Deep Learning mechanism in our model training, but its results were not satisfactory. Nevertheless, the proposed system can be adapted to real-time classification of COVID-19 news.
[Objective] The aim was to set up a plant digital information retrieval system. [Method] The plant digital information retrieval system was designed by combining the Microsoft Visual Basic 6.0 Enterprise Edition database management system with Structured Query Language. [Result] The system realized electronic management and retrieval of local plant information. The retrieval keywords included family, genus, formal name, Chinese name, Latin name, morphological characteristics, habitat, collectors, collection places, protection class, and so on. [Conclusion] The system provides a reference for species identification and the digital management of herbaria.
The distinctive conditions present on the north and south slopes of Mount Qomolangma, along with the intricate variations in the underlying surfaces, result in notable differences in the surface energy flux patterns of the two slopes. In this paper, data from TESEBS (Topographical Enhanced Surface Energy Balance System), remote sensing data from eight cloud-free scenarios, and observational data from nine stations are used to examine the fluctuations in the surface heat flux on both slopes. The inclusion of MCD43A3 satellite data improves the surface albedo, contributing to more accurate simulation outcomes. The model results are validated using observational data. The RMSEs of the net radiation, ground heat, sensible heat, and latent heat fluxes are 40.73, 17.09, 33.26, and 30.91 W m⁻², respectively. The net radiation flux is greater on the south slope and declines rapidly from summer to autumn. Due to the influence of the monsoon, on the north slope the maximum sensible heat flux occurs in the pre-monsoon period in summer and the maximum latent heat flux occurs during the monsoon. The south slope experiences its highest latent heat flux in summer. The dominant flux on the north slope is sensible heat, while on the south slope it is latent heat. The seasonal variations in the ground heat flux are more pronounced on the south slope than on the north slope. Except in summer, the ground heat flux on the north slope surpasses that on the south slope.
By analyzing syntactic, semantic, and pragmatic information, the retrieval system ACIS based on comprehensive information was established, which can achieve personalized information extraction to guide the user's information retrieval.
The dual-retrieval (DR) operation sequencing problem in the flow-rack automated storage and retrieval system (AS/RS) is modeled as an assignment problem since it is equivalent to pairing outgoing unit-loads for each DR operation. A recursion symmetry Hungarian method (RSHM), modified from the Hungarian method, is proposed for generating a DR operation sequence with minimal total travel time, in which symmetry marking is introduced to ensure a feasible solution and recursion is adopted to break the endless loop caused by the symmetry marking. Simulation experiments are conducted to evaluate the cost effectiveness and the performance of the proposed method. Experimental results illustrate that compared to the single-shuttle machine, the dual-shuttle machine can reduce more than 40% of the total travel time of retrieval operations, and the RSHM saves about 5% to 10% of the total travel time of retrieval operations compared to the greedy-based heuristic.
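The pairing formulation in this abstract can be illustrated on a toy instance. The sketch below is not RSHM and not the Hungarian method: it is a brute-force search over all pairings, usable only for a handful of unit-loads, and the symmetric cost matrix (travel time of retrieving two loads in one DR operation) is made up for illustration.

```python
def best_pairing(cost):
    """Return (total_cost, pairs) for the cheapest way to split items 0..n-1 into pairs.

    cost[i][j] is the (symmetric) cost of retrieving i and j in one operation.
    Exhaustive search: fine for tiny n, exponential in general.
    """
    n = len(cost)
    if n % 2 != 0:
        raise ValueError("an even number of unit-loads is required")

    def solve(remaining):
        if not remaining:
            return 0.0, []
        first, rest = remaining[0], remaining[1:]
        best_total, best_pairs = float("inf"), []
        for k, partner in enumerate(rest):
            # Pair `first` with `partner`, then pair up whatever is left.
            sub_total, sub_pairs = solve(rest[:k] + rest[k + 1:])
            total = cost[first][partner] + sub_total
            if total < best_total:
                best_total, best_pairs = total, [(first, partner)] + sub_pairs
        return best_total, best_pairs

    return solve(tuple(range(n)))

# Hypothetical travel times for four outgoing unit-loads.
travel_time = [
    [0, 4, 1, 3],
    [4, 0, 2, 1],
    [1, 2, 0, 5],
    [3, 1, 5, 0],
]
total, pairs = best_pairing(travel_time)
print(total, pairs)
```

Because a load cannot be paired with itself and pairing i with j is the same as pairing j with i, a plain assignment solver can return infeasible answers on this matrix, which is presumably the situation the symmetry marking in RSHM is meant to handle.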
The Advanced Geosynchronous Radiation Imager (AGRI) is a mission-critical instrument for the Fengyun series of satellites. AGRI acquires full-disk images every 15 min and views East Asia every 5 min through 14 spectral bands, enabling the detection of highly variable aerosol optical depth (AOD). Quantitative retrieval of AOD has hitherto been challenging, especially over land. In this study, an AOD retrieval algorithm is proposed that combines deep learning and transfer learning. The algorithm uses core concepts from both the Dark Target (DT) and Deep Blue (DB) algorithms to select features for the machine learning (ML) algorithm, allowing for AOD retrieval at 550 nm over both dark and bright surfaces. The algorithm consists of two steps: (1) a baseline deep neural network (DNN) with skip connections is developed using 10 min Advanced Himawari Imager (AHI) AODs as the target variable, and (2) sunphotometer AODs from 89 ground-based stations are used to fine-tune the DNN parameters. Out-of-station validation shows that the retrieved AOD attains high accuracy, characterized by a coefficient of determination (R²) of 0.70, a mean bias error (MBE) of 0.03, and a percentage of data within the expected error (EE) of 70.7%. A sensitivity study reveals that the top-of-atmosphere reflectance at 650 and 470 nm, as well as the surface reflectance at 650 nm, are the two largest sources of uncertainty impacting the retrieval. In a case study of monitoring an extreme aerosol event, the AGRI AOD is found to capture the detailed temporal evolution of the event. This work demonstrates the superiority of the transfer-learning technique in satellite AOD retrievals and the applicability of the retrieved AGRI AOD in monitoring extreme pollution events.
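The three validation scores quoted in this abstract can be computed as in the sketch below. Two assumptions are made that the abstract does not confirm: R² is taken as 1 minus the ratio of residual to total sum of squares, and the expected-error envelope is taken as ±(0.05 + 0.15 × reference AOD), a form commonly used for Dark Target retrievals over land. The sample values are invented.

```python
def mean_bias_error(retrieved, reference):
    """Average of (retrieved - reference); positive means overestimation."""
    return sum(r - t for r, t in zip(retrieved, reference)) / len(reference)

def r_squared(retrieved, reference):
    """Coefficient of determination, assumed here to be 1 - SS_res / SS_tot."""
    mean_ref = sum(reference) / len(reference)
    ss_res = sum((t - r) ** 2 for r, t in zip(retrieved, reference))
    ss_tot = sum((t - mean_ref) ** 2 for t in reference)
    return 1.0 - ss_res / ss_tot

def percent_within_ee(retrieved, reference, offset=0.05, slope=0.15):
    """Percentage of retrievals inside the assumed envelope +/-(offset + slope * reference)."""
    inside = sum(
        1 for r, t in zip(retrieved, reference)
        if abs(r - t) <= offset + slope * t
    )
    return 100.0 * inside / len(reference)

# Invented sunphotometer (reference) and satellite (retrieved) AOD values.
reference_aod = [0.1, 0.2, 0.4, 0.8]
retrieved_aod = [0.12, 0.18, 0.55, 0.85]

print(mean_bias_error(retrieved_aod, reference_aod))
print(r_squared(retrieved_aod, reference_aod))
print(percent_within_ee(retrieved_aod, reference_aod))
```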
With the development of hyperspectral remote sensing techniques, extensive chemical weathering profiles have been identified on Mars. These weathering sequences, formed through precipitation-driven leaching processes, can reflect the paleoenvironments and paleoclimates during pedogenic processes. Their specific composition and stratigraphic profiles mirror the mineralogical and chemical trends observed in weathered basalts on Hainan Island in south China. In this study, we investigated the laboratory reflectance spectra of a 53-m-long drill core of a thick basaltic weathering profile collected from Hainan Island. We established a quantitative spectral model by combining the genetic algorithm and partial least squares regression (GA-PLSR) to predict the chemical properties (SiO₂, Al₂O₃, Fe₂O₃) and the index of laterization (IOL). The entire sample set was divided into a calibration set of 25 samples and a validation set of 12 samples. Specifically, the GA was used to select the spectral subsets for each composition, which were then input into the PLSR model to derive the chemical concentration. The coefficient of determination (R²) values on the validation set for SiO₂, Al₂O₃, Fe₂O₃, and the IOL were greater than 0.9. In addition, the effects of various spectral preprocessing techniques on the model accuracy were evaluated. We found that the spectral derivative treatment boosted the prediction accuracy of the GA-PLSR model, and that the improvement achieved with the second derivative was more pronounced than with the first derivative. The quantitative model developed in this work has the potential to estimate the contents of similar weathered basalt products, and thus to infer the degree of alteration and provide insights into paleoclimatic conditions. Moreover, the informative bands selected by the GA can serve as a guideline for designing spectral channels for the next generation of spectrometers.
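The derivative preprocessing mentioned in this abstract can be sketched with plain finite differences. The abstract does not say how the derivatives were computed (smoothing filters such as Savitzky-Golay are a common alternative), so this is an assumed, simplest-case version for a spectrum sampled at a uniform wavelength step.

```python
def first_derivative(reflectance, step):
    """Central first difference; output has two fewer points than the input."""
    return [
        (reflectance[i + 1] - reflectance[i - 1]) / (2.0 * step)
        for i in range(1, len(reflectance) - 1)
    ]

def second_derivative(reflectance, step):
    """Central second difference; output has two fewer points than the input."""
    return [
        (reflectance[i - 1] - 2.0 * reflectance[i] + reflectance[i + 1]) / (step * step)
        for i in range(1, len(reflectance) - 1)
    ]

# Synthetic check: samples of f(x) = x^2 at x = 0, 2, 4, 6, 8.
# The exact derivatives are f'(x) = 2x and f''(x) = 2.
step = 2.0
spectrum = [x * x for x in (0.0, 2.0, 4.0, 6.0, 8.0)]

print(first_derivative(spectrum, step))
print(second_derivative(spectrum, step))
```

Differencing amplifies noise, which is why real spectra are usually smoothed before or during this step.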
Apricot has a long history of cultivation and has many varieties and types. Traditional variety identification methods are time-consuming and labor-intensive, posing major challenges to apricot resource management. Tool development in this regard will help researchers quickly identify variety information. This study photographed apricot fruits outdoors and indoors and constructed a dataset in which the fruits can be precisely segmented using a U-net model (F-score: 99%), which helps to obtain the fruit's size, shape, and color features. A variety search engine was also constructed, which can search for and identify varieties in the database according to these features. In addition, a mobile and web application (ApricotView) was developed, and its construction mode can also be applied to other varieties of fruit trees. We also collected four difficult-to-identify seed datasets and trained a VGG16 model on them, reaching an accuracy of 97%, which provided an important basis for ApricotView. To address the data collection difficulties that bottleneck apricot phenomics research, we developed the first apricot database platform of its kind (ApricotDIAP, http://apricotdiap.com/) to accumulate, manage, and publicize scientific data on apricot.
The developed system for eye and face detection using Convolutional Neural Network (CNN) models, followed by eye classification and voice-based assistance, has shown promising potential in enhancing accessibility for individuals with visual impairments. The modular approach implemented in this research allows a seamless flow of information and assistance between the different components of the system. This research contributes to the field of accessibility technology by integrating computer vision, natural language processing, and voice technologies. By leveraging these advancements, the developed system offers a practical and efficient solution for assisting blind individuals. The modular design ensures flexibility, scalability, and ease of integration with existing assistive technologies. However, further research and improvements are necessary to enhance the system's accuracy and usability. Fine-tuning the CNN models and expanding the training dataset can improve eye and face detection as well as eye classification. Additionally, incorporating real-time responses through sophisticated natural language understanding techniques and expanding the knowledge base of ChatGPT can enhance the system's ability to provide comprehensive and accurate responses. Overall, this research paves the way for the development of more advanced and robust systems for assisting visually impaired individuals. By integrating cutting-edge technologies into a modular framework, it contributes to creating a more inclusive and accessible society for individuals with visual impairments. Future work can focus on refining the system, addressing its limitations, and conducting user studies to evaluate its effectiveness and impact in real-world scenarios.
Recently, deep learning has yielded transformative success across optics and photonics, especially in optical metrology. Deep neural networks (DNNs) with a fully convolutional architecture (e.g., U-Net and its derivatives) have been widely implemented in an end-to-end manner to accomplish various optical metrology tasks, such as fringe denoising, phase unwrapping, and fringe analysis. However, the task of training a DNN to accurately identify an image-to-image transform from massive input and output data pairs seems at best naive, as the physical laws governing the image formation and other domain expertise pertaining to the measurement have not yet been fully exploited in current deep learning practice. To this end, we introduce a physics-informed deep learning method for fringe pattern analysis (PI-FPA) that overcomes this limit by integrating a lightweight DNN with a learning-enhanced Fourier transform profilometry (LeFTP) module. By parameterizing conventional phase retrieval methods, the LeFTP module embeds the prior knowledge in the network structure and the loss function to directly provide reliable phase results for new types of samples, while circumventing the requirement of collecting a large amount of high-quality data in supervised learning methods. Guided by the initial phase from LeFTP, the phase recovery ability of the lightweight DNN is enhanced to further improve the phase accuracy at a low computational cost compared with existing end-to-end networks. Experimental results demonstrate that PI-FPA enables more accurate and computationally efficient single-shot phase retrieval, and that it generalizes well to various objects unseen during training. The proposed PI-FPA shows that challenging issues in optical metrology can potentially be overcome through the synergy of physics-prior-based traditional tools and data-driven learning approaches, opening new avenues to achieve fast and accurate single-shot 3D imaging.
This study introduces the Orbit Weighting Scheme (OWS), a novel approach aimed at enhancing the precision and efficiency of Vector Space information retrieval (IR) models, which have traditionally relied on weighting schemes like tf-idf and BM25. These conventional methods often struggle to capture document relevance accurately, leading to inefficiencies in both retrieval performance and index size management. OWS proposes a dynamic weighting mechanism that evaluates the significance of terms based on their orbital position within the vector space, emphasizing term relationships and distribution patterns overlooked by existing models. Our research focuses on evaluating OWS's impact on model accuracy using Information Retrieval metrics such as Recall, Precision, Interpolated Average Precision (IAP), and Mean Average Precision (MAP). Additionally, we assess OWS's effectiveness in reducing the inverted index size, which is crucial for model efficiency. We compare OWS-based retrieval models against others using different schemes, including tf-idf variations and BM25Delta. Results reveal OWS's superiority, achieving 54% Recall and 81% MAP, and a notable 38% reduction in the inverted index size. This highlights OWS's potential in optimizing retrieval processes and underscores the need for further research in this underrepresented area to fully leverage OWS's capabilities in information retrieval methodologies.
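For readers unfamiliar with the MAP figure quoted in this abstract, the sketch below shows the standard (non-interpolated) definition of average precision and its mean over queries. It does not implement OWS, whose weighting formula the abstract does not give; the document identifiers and relevance judgments are invented.

```python
def average_precision(ranked_docs, relevant_docs):
    """Mean of precision@rank taken at each rank where a relevant document appears."""
    if not relevant_docs:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, doc in enumerate(ranked_docs, start=1):
        if doc in relevant_docs:
            hits += 1
            precision_sum += hits / rank
    # Relevant documents that were never retrieved contribute zero.
    return precision_sum / len(relevant_docs)

def mean_average_precision(runs):
    """runs is a list of (ranked_docs, relevant_docs) pairs, one per query."""
    return sum(average_precision(r, rel) for r, rel in runs) / len(runs)

# Invented results for two queries.
query_runs = [
    (["d1", "d2", "d3", "d4"], {"d1", "d3"}),  # AP = (1/1 + 2/3) / 2
    (["d5", "d6"], {"d6"}),                    # AP = (1/2) / 1
]
print(mean_average_precision(query_runs))
```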
This study introduces an approach to custom chatbot development that emphasizes efficiency alongside effectiveness. We achieve this by combining three key technologies: LangChain, Retrieval Augmented Generation (RAG), and large language models (LLMs) fine-tuned with performance-efficient strategies such as LoRA and QLoRA. LangChain allows chatbots to be carefully tailored to specific purposes, ensuring focused and meaningful interactions with users. RAG's web scraping capabilities let these chatbots access a vast store of data, enabling them to give comprehensive and informative responses to queries. The retrieved information is then woven into response generation using LLMs that have been fine-tuned with an emphasis on performance efficiency. This combined approach offers a threefold benefit: improved effectiveness, enhanced user experience, and expanded access to information. Chatbots become proficient at handling user questions accurately and efficiently, while informative and contextually relevant responses make for a more natural and engaging interaction. Finally, web scraping enables chatbots to address a wider variety of queries by granting them access to a broader knowledge base. By examining the details of performance-efficient LLM fine-tuning and underlining the essential role of web-scraped data, this study contributes to advancing custom chatbot design and implementation. The resulting chatbots showcase the potential of these technologies for building informative, user-friendly, and efficient conversational agents, ultimately changing the way users interact with chatbots.
To address the lack of semantic interoperability in traditional knowledge retrieval approaches, a semantic-based networked manufacturing (NM) knowledge retrieval architecture is proposed, which offers a series of tools for supporting the sharing of knowledge and promoting NM collaboration. A 5-tuple based semantic information retrieval model is proposed, which includes interoperation on the semantic layer, and a test process is given for this model. Evaluation shows that the recall ratio and the precision ratio of manufacturing knowledge retrieval are greatly improved. Thus, a practical and reliable approach based on the semantic web is provided for solving the correlated concrete problems in regional networked manufacturing.
Under the influence of anthropogenic activity and climate change, the problems caused by the urban heat island (UHI) have become increasingly prominent. To promote sustainable urban development and improve the quality of human settlements, it is important to explore the evolution characteristics of the urban thermal environment and analyze its driving forces. Taking Landsat series images as the basic data source, the winter land surface temperature (LST) of the rapidly urbanizing area of Fuzhou City in China was quantitatively retrieved for 2001 to 2021. Combining the standard deviation ellipse model, profile analysis, and the GeoDetector model, the spatio-temporal evolution characteristics and influencing factors of the winter urban thermal environment were systematically analyzed. The results showed that the winter LST presented an increasing trend in the study area during 2001–2021, and the winter LST of the central urban regions was significantly higher than that of the suburbs. There was a strong UHI effect from 2001 to 2021, with a spatial expansion trend from the central urban regions to the suburbs and coastal areas. The LST of green lands and wetlands was significantly lower than that of croplands, artificial surfaces, and unvegetated lands. Vegetation and water bodies had a significant mitigation effect on the UHI, especially at the micro-scale. The winter UHI was jointly driven by the underlying surface and socio-economic factors in a nonlinear or two-factor interactive enhancement mode, with socio-economic factors playing the leading role. This research could provide data support and decision-making references for rationally planning urban layout and promoting sustainable urban development.
Clothing attribute recognition has become an essential technology, which enables users to automatically identify the characteristics of clothes and search for clothing images with similar attributes. However, existing methods cannot recognize newly added attributes and may fail to capture region-level visual features. To address these issues, a region-aware fashion contrastive language-image pre-training (RaF-CLIP) model was proposed. This model aligned cropped and segmented images with category texts and multiple fine-grained attribute texts, matching fashion regions with their corresponding texts through contrastive learning. Clothing retrieval finds suitable clothing based on user-specified clothing categories and attributes. To further improve retrieval accuracy, an attribute-guided composed network (AGCN) was introduced as an additional component on RaF-CLIP, specifically designed for composed image retrieval. This task aims to modify the reference image based on textual expressions in order to retrieve the expected target. By adopting a transformer-based bidirectional attention and gating mechanism, AGCN fuses and selects image features and attribute text features. Experimental results show that the proposed model achieves a mean precision of 0.6633 on attribute recognition tasks and a recall@10 of 39.18 on the composed image retrieval task (recall@k is defined as the percentage of correct samples appearing in the top k retrieval results), satisfying user needs for freely searching for clothing through images and texts.
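The recall@k definition given in this abstract can be made concrete with a short sketch. It assumes the usual composed-retrieval setup in which each query has exactly one correct target image; the abstract does not state this, and the rankings below are invented.

```python
def recall_at_k(rankings, targets, k):
    """Percentage of queries whose correct target appears among the top k results.

    rankings[i] is the ranked list of retrieved items for query i,
    targets[i] is the single correct item for query i (an assumption of this sketch).
    """
    if len(rankings) != len(targets):
        raise ValueError("one target is required per query")
    hits = sum(
        1 for ranked, target in zip(rankings, targets)
        if target in ranked[:k]
    )
    return 100.0 * hits / len(targets)

# Invented retrieval results for two queries.
example_rankings = [["a", "b", "c"], ["d", "e", "f"]]
example_targets = ["b", "f"]

print(recall_at_k(example_rankings, example_targets, k=2))
print(recall_at_k(example_rankings, example_targets, k=3))
```

Under this reading, the reported recall@10 of 39.18 would mean that the correct target was among the first ten results for about 39% of the queries.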
Crime scene investigation (CSI) images are a key carrier of evidence during criminal investigation, and CSI image retrieval can help the police obtain criminal clues. With the rapid development of deep learning, the data-driven paradigm has become the mainstream method of CSI image feature extraction and representation, and in this process datasets provide effective support for CSI retrieval performance. However, there is a lack of systematic research on CSI image retrieval methods and datasets. Therefore, we present an overview of existing works on one-class and multi-class CSI image retrieval based on deep learning. Based on their technical functionalities and implementation methods, CSI image retrieval approaches are roughly classified into five categories: feature representation, metric learning, generative adversarial networks, autoencoder networks, and attention networks. Furthermore, we analyze the remaining challenges and discuss future work directions in this field.
Funding: Princess Nourah bint Abdulrahman University Researchers Supporting Project number (PNURSP2022R161), Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia. The authors would like to thank the Deanship of Scientific Research at Umm Al-Qura University for supporting this work by Grant Code: (22UQU4310373DSR33).
Abstract: Recent developments in Multimedia Internet of Things (MIoT) devices, empowered with Natural Language Processing (NLP) models, point to a promising future for smart devices. NLP plays an important role in industrial applications such as speech understanding, emotion detection, home automation, and so on. To caption an image, the objects in that image, their actions and connections, and any salient feature that remains under-projected or missing from the image should be identified. The aim of the image captioning process is to generate a caption for an image: the image should be given one of the most significant and detailed descriptions, and that description should be syntactically as well as semantically correct. In this scenario, a computer vision model is used to identify the objects and NLP approaches are followed to describe the image. The current study develops a Natural Language Processing with Optimal Deep Learning Enabled Intelligent Image Captioning System (NLPODL-IICS). The aim of the presented NLPODL-IICS model is to produce a proper description for an input image. To attain this, the proposed NLPODL-IICS follows two stages, namely encoding and decoding. Initially, on the encoding side, the proposed NLPODL-IICS model makes use of Hunger Games Search (HGS) with the Neural Architecture Search Network (NASNet) model. This model represents the input data appropriately by embedding it into a vector of predefined length. During the decoding phase, the Chimp Optimization Algorithm (COA) with a deeper Long Short-Term Memory (LSTM) approach is followed to concatenate the description sentences produced by the method. The application of the HGS and COA algorithms helps in accomplishing proper parameter tuning for the NASNet and LSTM models, respectively. The proposed NLPODL-IICS model was experimentally validated with the help of two benchmark datasets. A wide-ranging comparative analysis confirmed the superior performance of the NLPODL-IICS model over other models. (CMC, 2023, vol. 74, no. 2.)
Funding: Supported by the National Key R&D Program of China (2020YFB0905900).
Abstract: Operation control of power systems has become challenging with the increase in the scale and complexity of power distribution systems and extensive access to renewable energy. Therefore, improving the capability for data-driven operation management, intelligent analysis, and mining is urgently required. To investigate and explore similar regularities of the historical operating sections of the power distribution system, and to assist the power grid in obtaining high-value historical operation, maintenance experience, and knowledge by rule and line, a neural information retrieval model with an attention mechanism is proposed based on graph data computing technology. Based on the processing flow of the operating data of the power distribution system, a technical framework for neural information retrieval is established. Combined with the natural graph characteristics of the power distribution system, a unified graph data structure and a data fusion method covering data access, data complement, and multi-source data are constructed. Further, a graph node feature-embedding representation learning algorithm and a neural information retrieval algorithm model are constructed. The neural information retrieval algorithm model is trained and tested using the generated set of graph node feature representation vectors. The model is verified on the operating section of the power distribution system of a provincial grid area. The results show that the proposed method achieves high accuracy in the similarity matching of historical operation characteristics and effectively supports intelligent fault diagnosis and elimination in power distribution systems.
Funding: The research was funded by the School of Information Technology and Engineering, Vellore Institute of Technology, Vellore 632014, Tamil Nadu, India.
Abstract: In crowded cities, searching for available parking lots is a herculean task, as it wastes drivers’ time and increases air pollution and traffic congestion. Smart parking systems help drivers obtain information about parking lots in real time and book them as required. However, existing smart parking systems require drivers to reveal sensitive information, including their mobile number, personal identity, and desired destination. This disclosure of sensitive information makes existing centralized smart parking systems more vulnerable to service providers’ security breaches, single points of failure, and bottlenecks. In this paper, an Improved Asymmetric Consortium Blockchain and Homomorphically Computing Univariate Polynomial-based private information retrieval (IACB-HCUPPIR) scheme is proposed to ensure parking lots’ availability with transparency and security in a privacy-preserving smart parking system. Specifically, an improved Asymmetric Consortium Blockchain is used to achieve secure transactions between the different parties interacting in the smart parking environment. The scheme further adopts the Homomorphically Computing Univariate Polynomial-based private information retrieval (HCUPPIR) method to preserve the location privacy of drivers. The IACB-HCUPPIR scheme achieved lower computation and communication overhead, with better throughput, latency, and response time, while maximizing drivers’ privacy preservation. Moreover, the proposed fully homomorphic encryption (FHE) algorithm was compared against partially homomorphic encryption (PHE) and a technique without encryption. The proposed model communicates quickly when allocating parking slots, starting at 24.3 s, whereas PHE starts allocating at 24.7 s and the technique without encryption starts at 27.4 s. Thus, the proposed model allocates parking slots in less time and with high security and privacy preservation.
Funding: National Natural Science Foundation of China (No. 42004018).
Abstract: With the development of Global Navigation Satellite Systems (GNSS), geodetic GNSS receivers have been utilized to monitor sea levels using GNSS Interferometric Reflectometry (GNSS-IR) technology. The multi-mode, multi-frequency signals of GPS, GLONASS, Galileo, and Beidou can be used for GNSS-IR sea level retrieval, but combining these retrievals remains problematic. To address this issue, a GNSS-IR sea level retrieval combination system has been developed, which begins by analyzing error sources in GNSS-IR sea level retrieval and then establishes and solves the GNSS-IR retrieval equation. This paper focuses on two key points: time window selection and equation stability. The stability of the retrieval combination equations is determined by the condition number of the coefficient matrix within the time window. The impact of ill-conditioned coefficient matrices on the retrieval results is demonstrated using an extreme case of SNR data with only ascending or only descending trajectories. After the time window is determined and ill-conditioned equations are removed, the multi-mode, multi-frequency GNSS-IR retrieval is performed. Results from three International GNSS Service (IGS) stations show that the combination method produces high-precision, high-resolution, and high-reliability sea level retrieval combination sequences.
Abstract: This paper aims to develop Machine Learning algorithms to classify electronic articles related to COVID-19 through information retrieval and topic modelling. The methodology of this study is organized into three phases: the Text Classification Approach (TCA), the Proposed Algorithms Interpretation (PAI), and finally, the Information Retrieval Approach (IRA). The TCA covers the text preprocessing pipeline that produces a clean corpus. The Global Vectors for Word Representation (GloVe) pre-trained model, FastText, Term Frequency-Inverse Document Frequency (TF-IDF), and Bag-of-Words (BOW) are interpreted as feature extraction methods in this research. The PAI presents the Bidirectional Long Short-Term Memory (Bi-LSTM) and Convolutional Neural Network (CNN) models used to classify the COVID-19 news. The IRA explains the mathematical interpretation of Latent Dirichlet Allocation (LDA), used for topic modelling in Information Retrieval (IR). In this study, 99% accuracy was obtained by performing K-fold cross-validation on Bi-LSTM with GloVe. A comparative analysis between Deep Learning and Machine Learning, based on feature extraction and computational complexity, has been performed in this research. Furthermore, some text analyses and the most influential aspects of each document have been explored in this study. We also used Bidirectional Encoder Representations from Transformers (BERT) as a Deep Learning mechanism in our model training, but the results were not satisfactory. However, the proposed system can be adapted to real-time classification of COVID-19 news.
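To make the TF-IDF feature extraction step concrete, here is a minimal sketch in plain Python. It uses one common formulation (term frequency normalized by document length, unsmoothed logarithmic inverse document frequency); the abstract does not state which variant the authors used, and the function name `tf_idf` and the sample headlines are illustrative assumptions.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return one {term: weight} dict per document.

    Assumed formulation (not taken from the paper):
    tf = count / document length, idf = ln(N / document frequency).
    """
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)

    # Document frequency: in how many documents each term occurs.
    df = Counter()
    for tokens in tokenized:
        df.update(set(tokens))

    weights = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        weights.append(
            {term: (count / total) * math.log(n_docs / df[term])
             for term, count in counts.items()}
        )
    return weights


# Hypothetical headlines, for illustration only.
docs = ["covid vaccine news", "covid lockdown news", "football match news"]
weights = tf_idf(docs)
```

A term that occurs in every document ("news" here) receives weight zero, while a term confined to one document ("vaccine") is weighted highest, which is why TF-IDF is useful for separating topical articles from generic ones.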
Funding: Supported by the Inner Mongolia Natural Science Fund (20080404MS0507); the National Natural Science Fund (30660150); the Education Ministry Higher Education School Science Innovation Project Major Program Cultivation Fund Program (707014); and the Inner Mongolia Natural Scientific Fund Major Program (200607010501).
Abstract: [Objective] The aim was to set up a plant digital information retrieval system. [Method] The plant digital information retrieval system was designed by combining the Microsoft Visual Basic 6.0 Enterprise Edition database management system with Structured Query Language. [Result] The system realized electronic management and retrieval of local plant information. The retrieval keywords included family, genus, formal name, Chinese name, Latin name, morphological characteristics, habitat, collector, collection place, protection class, and so on. [Conclusion] The system provides a reference for species identification and the digital management of herbaria.
Funding: Financially supported by the National Natural Science Foundation of China [grant number 42230610]; the Second Tibetan Plateau Scientific Expedition and Research (STEP) program [grant number 2019QZKK0103]; the Natural Science Foundation of Sichuan Province [grant number 2022NSFSC0217]; and the Scientific Research Project of Chengdu University of Information Technology [grant number KYTZ201721].
Abstract: The distinctive conditions present on the north and south slopes of Mount Qomolangma, along with the intricate variations in the underlying surfaces, result in notable differences in the surface energy flux patterns of the two slopes. In this paper, data from TESEBS (Topographically Enhanced Surface Energy Balance System), remote sensing data from eight cloud-free scenarios, and observational data from nine stations are utilized to examine the fluctuations in the surface heat flux on both slopes. The inclusion of MCD43A3 satellite data improves the surface albedo, contributing to more accurate simulation outcomes. The model results are validated using observational data. The RMSEs of the net radiation, ground heat, sensible heat, and latent heat fluxes are 40.73, 17.09, 33.26, and 30.91 W m⁻², respectively. The net radiation flux is greater on the south slope and exhibits a rapid decline from summer to autumn. Due to the influence of the monsoon, on the north slope the maximum sensible heat flux occurs in the pre-monsoon period in summer and the maximum latent heat flux occurs during the monsoon. The south slope experiences the highest latent heat flux in summer. The dominant flux on the north slope is sensible heat, while it is latent heat on the south slope. The seasonal variations in the ground heat flux are more pronounced on the south slope than on the north slope. Except in summer, the ground heat flux on the north slope surpasses that on the south slope.
Funding: Supported by the National Natural Science Foundation of China (60575034) and the Science Foundation of Guangxi Provincial Education Department (200708LX322).
Abstract: By analyzing syntactic, semantic, and pragmatic information, the retrieval system ACIS, based on comprehensive information, was established. It can perform personalized information extraction to guide the user's information retrieval.
Funding: The National Natural Science Foundation of China (No. 61003158, 61272377); the Specialized Research Fund for the Doctoral Program of Higher Education (No. 20120092110027).
Abstract: The dual-retrieval (DR) operation sequencing problem in the flow-rack automated storage and retrieval system (AS/RS) is modeled as an assignment problem, since it is equivalent to pairing outgoing unit-loads for each DR operation. A recursion symmetry Hungarian method (RSHM), modified from the Hungarian method, is proposed for generating a DR operation sequence with minimal total travel time, in which symmetry marking is introduced to ensure a feasible solution and recursion is adopted to break the endless loop caused by the symmetry marking. Simulation experiments are conducted to evaluate the cost effectiveness and the performance of the proposed method. Experimental results illustrate that, compared to the single-shuttle machine, the dual-shuttle machine can reduce the total travel time of retrieval operations by more than 40%, and the RSHM saves about 5% to 10% of the total travel time of retrieval operations compared to the greedy-based heuristic.
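The pairing view of the problem can be illustrated with a small sketch. The code below is not the RSHM described in the abstract; it is an exhaustive search over all pairings, usable only for tiny instances, and it is included purely to show the objective being minimized. The function name `best_pairing` and the symmetric travel-time matrix `cost` are hypothetical.

```python
def best_pairing(cost):
    """Exhaustively pair unit-loads so that total travel time is minimal.

    cost[i][j] is the (assumed symmetric) travel time of one dual-retrieval
    operation that retrieves loads i and j together. This is brute force,
    not the recursion symmetry Hungarian method from the abstract.
    """
    n = len(cost)
    if n % 2:
        raise ValueError("an even number of unit-loads is required")

    def solve(remaining):
        if not remaining:
            return 0.0, []
        first, rest = remaining[0], remaining[1:]
        best_total, best_pairs = float("inf"), []
        # Try every partner for the first unpaired load, then recurse.
        for idx, partner in enumerate(rest):
            others = rest[:idx] + rest[idx + 1:]
            sub_total, sub_pairs = solve(others)
            total = cost[first][partner] + sub_total
            if total < best_total:
                best_total = total
                best_pairs = [(first, partner)] + sub_pairs
        return best_total, best_pairs

    return solve(tuple(range(n)))


# Hypothetical travel times (seconds) for four outgoing unit-loads.
cost = [
    [0, 4, 9, 7],
    [4, 0, 6, 8],
    [9, 6, 0, 3],
    [7, 8, 3, 0],
]
total_time, pairs = best_pairing(cost)
```

Because each load must appear in exactly one pair, a valid solution is symmetric (if load i is paired with j, then j is paired with i). A plain assignment solver does not guarantee this on its own, which is presumably the role of the symmetry marking mentioned above.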
Funding: Supported by the National Natural Science Foundation of China (41825011, 42030608, 42105128, and 42075079) and the Opening Foundation of the Key Laboratory of Atmospheric Sounding, the CMA, and the CMA Research Center on Meteorological Observation Engineering Technology (U2021Z03).
Abstract: The Advanced Geosynchronous Radiation Imager (AGRI) is a mission-critical instrument for the Fengyun series of satellites. AGRI acquires full-disk images every 15 min and views East Asia every 5 min through 14 spectral bands, enabling the detection of highly variable aerosol optical depth (AOD). Quantitative retrieval of AOD has hitherto been challenging, especially over land. In this study, an AOD retrieval algorithm is proposed that combines deep learning and transfer learning. The algorithm uses core concepts from both the Dark Target (DT) and Deep Blue (DB) algorithms to select features for the machine learning (ML) algorithm, allowing for AOD retrieval at 550 nm over both dark and bright surfaces. The algorithm consists of two steps: (1) a baseline deep neural network (DNN) with skip connections is developed using 10 min Advanced Himawari Imager (AHI) AODs as the target variable, and (2) sunphotometer AODs from 89 ground-based stations are used to fine-tune the DNN parameters. Out-of-station validation shows that the retrieved AOD attains high accuracy, characterized by a coefficient of determination (R²) of 0.70, a mean bias error (MBE) of 0.03, and a percentage of data within the expected error (EE) of 70.7%. A sensitivity study reveals that the top-of-atmosphere reflectance at 650 and 470 nm, as well as the surface reflectance at 650 nm, are the two largest sources of uncertainty impacting the retrieval. In a case study of monitoring an extreme aerosol event, the AGRI AOD is found to capture the detailed temporal evolution of the event. This work demonstrates the superiority of the transfer-learning technique in satellite AOD retrievals and the applicability of the retrieved AGRI AOD in monitoring extreme pollution events.
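The three validation statistics quoted above can be computed as in the following sketch. Two points are assumptions rather than facts from the abstract: R² is computed here as 1 − SS_res/SS_tot (some AOD studies instead report the squared correlation coefficient), and the expected-error envelope is taken as ±(0.05 + 0.15 × reference AOD), a form widely used in satellite AOD validation but not stated in the abstract. The function name and sample values are illustrative.

```python
def validation_metrics(retrieved, reference, ee_abs=0.05, ee_rel=0.15):
    """Return (R^2, mean bias error, percent of retrievals within EE).

    ee_abs and ee_rel define an ASSUMED expected-error envelope
    +/-(ee_abs + ee_rel * reference); the abstract does not give one.
    """
    n = len(reference)
    if n == 0 or n != len(retrieved):
        raise ValueError("inputs must be non-empty and of equal length")

    mean_ref = sum(reference) / n
    ss_res = sum((r - o) ** 2 for r, o in zip(retrieved, reference))
    ss_tot = sum((o - mean_ref) ** 2 for o in reference)
    if ss_tot == 0:
        raise ValueError("reference values must not all be identical")

    r2 = 1.0 - ss_res / ss_tot
    mbe = sum(r - o for r, o in zip(retrieved, reference)) / n
    within = sum(
        1 for r, o in zip(retrieved, reference)
        if abs(r - o) <= ee_abs + ee_rel * o
    )
    return r2, mbe, 100.0 * within / n


# Hypothetical sunphotometer (reference) and satellite (retrieved) AODs.
reference = [0.1, 0.2, 0.4, 0.8]
retrieved = [0.12, 0.18, 0.55, 0.85]
r2, mbe, pct_within_ee = validation_metrics(retrieved, reference)
```

In this toy sample, three of the four retrievals fall inside the assumed envelope, and the positive MBE indicates a slight overestimate on average.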
Funding: National Key Research and Development Project (Grant No. 2019YFE0123300); National Natural Science Foundation of China (Grant Nos. 42072337, 42241111, and 42241129); Pandeng Program of the National Space Science Center, Chinese Academy of Sciences. Xing Wu also acknowledges support from the Young Elite Scientists Sponsorship Program by the China Association for Science and Technology (Grant No. 2022QNRC001) and the China Postdoctoral Science Foundation (Grant No. 2021M700149).
Abstract: With the development of hyperspectral remote sensing techniques, extensive chemical weathering profiles have been identified on Mars. These weathering sequences, formed through precipitation-driven leaching processes, can reflect the paleoenvironments and paleoclimates during pedogenic processes. Their specific composition and stratigraphic profiles mirror the mineralogical and chemical trends observed in weathered basalts on Hainan Island in south China. In this study, we investigated the laboratory reflectance spectra of a 53-m-long drilling core of a thick basaltic weathering profile collected from Hainan Island. We established a quantitative spectral model by combining the genetic algorithm and partial least squares regression (GA-PLSR) to predict the chemical properties (SiO₂, Al₂O₃, Fe₂O₃) and the index of laterization (IOL). The entire sample set was divided into a calibration set of 25 samples and a validation set of 12 samples. Specifically, the GA was used to select the spectral subsets for each composition, which were then input into the PLSR model to derive the chemical concentration. The coefficient of determination (R²) values on the validation set for SiO₂, Al₂O₃, Fe₂O₃, and the IOL were greater than 0.9. In addition, the effects of various spectral preprocessing techniques on the model accuracy were evaluated. We found that the spectral derivative treatment boosted the prediction accuracy of the GA-PLSR model, and the improvement achieved with the second derivative was more pronounced than that with the first derivative. The quantitative model developed in this work has the potential to estimate the contents of similar weathered basalt products, and thus to infer the degree of alteration and provide insights into paleoclimatic conditions. Moreover, the informative bands selected by the GA can serve as a guideline for designing spectral channels for the next generation of spectrometers.
Funding: Supported by the Fundamental Research Funds for the Central Non-profit Research Institution of the Chinese Academy of Forestry (Grant No. CAFYBB2020ZY003), the Key S&T Project of Inner Mongolia (Grant No. 2021ZD0041-001-002), and the Central Public-interest Scientific Institution Basal Research Fund (Grant No. 11024316000202300001).
Abstract: Apricot has a long history of cultivation and has many varieties and types. Traditional variety identification methods are time-consuming and labor-intensive, posing major challenges to apricot resource management. Tool development in this regard will help researchers quickly identify variety information. This study photographed apricot fruits outdoors and indoors and constructed a dataset on which a U-net model can precisely classify the fruits (F-score: 99%), which helps to obtain the fruit's size, shape, and color features. Meanwhile, a variety search engine was constructed, which can search for and identify varieties in the database according to the above features. In addition, a mobile and web application (ApricotView) was developed, and the construction mode can also be applied to other varieties of fruit trees. Additionally, we collected four difficult-to-identify seed datasets and used the VGG16 model for training, achieving an accuracy of 97%, which provided an important basis for ApricotView. To address the data collection difficulties that bottleneck apricot phenomics research, we developed the first apricot database platform of its kind (ApricotDIAP, http://apricotdiap.com/) to accumulate, manage, and publicize scientific data on apricot.
Abstract: The developed system for eye and face detection using Convolutional Neural Network (CNN) models, followed by eye classification and voice-based assistance, has shown promising potential in enhancing accessibility for individuals with visual impairments. The modular approach implemented in this research allows for a seamless flow of information and assistance between the different components of the system. This research contributes to the field of accessibility technology by integrating computer vision, natural language processing, and voice technologies. By leveraging these advancements, the developed system offers a practical and efficient solution for assisting blind individuals. The modular design ensures flexibility, scalability, and ease of integration with existing assistive technologies. However, it is important to acknowledge that further research and improvements are necessary to enhance the system’s accuracy and usability. Fine-tuning the CNN models and expanding the training dataset can improve eye and face detection as well as eye classification capabilities. Additionally, incorporating real-time responses through sophisticated natural language understanding techniques and expanding the knowledge base of ChatGPT can enhance the system’s ability to provide comprehensive and accurate responses. Overall, this research paves the way for the development of more advanced and robust systems for assisting visually impaired individuals. By integrating cutting-edge technologies into a modular framework, this research contributes to creating a more inclusive and accessible society for individuals with visual impairments. Future work can focus on refining the system, addressing its limitations, and conducting user studies to evaluate its effectiveness and impact in real-world scenarios.
Funding: Funded by the National Key Research and Development Program of China (2022YFB2804603, 2022YFB2804604); National Natural Science Foundation of China (62075096, 62205147, U21B2033); China Postdoctoral Science Foundation (2023T160318, 2022M711630, 2022M721619); Jiangsu Funding Program for Excellent Postdoctoral Talent (2022ZB254); the Leading Technology of Jiangsu Basic Research Plan (BK20192003); the “333 Engineering” Research Project of Jiangsu Province (BRA2016407); the Jiangsu Provincial “One belt and one road” innovation cooperation project (BZ2020007); Open Research Fund of Jiangsu Key Laboratory of Spectral Imaging & Intelligent Sense (JSGP202105); Fundamental Research Funds for the Central Universities (30922010405, 30921011208, 30920032101, 30919011222); and National Major Scientific Instrument Development Project (62227818).
Abstract: Recently, deep learning has yielded transformative success across optics and photonics, especially in optical metrology. Deep neural networks (DNNs) with a fully convolutional architecture (e.g., U-Net and its derivatives) have been widely implemented in an end-to-end manner to accomplish various optical metrology tasks, such as fringe denoising, phase unwrapping, and fringe analysis. However, the task of training a DNN to accurately identify an image-to-image transform from massive input and output data pairs seems at best naive, as the physical laws governing the image formation or other domain expertise pertaining to the measurement have not yet been fully exploited in current deep learning practice. To this end, we introduce a physics-informed deep learning method for fringe pattern analysis (PI-FPA) to overcome this limit by integrating a lightweight DNN with a learning-enhanced Fourier transform profilometry (LeFTP) module. By parameterizing conventional phase retrieval methods, the LeFTP module embeds the prior knowledge in the network structure and the loss function to directly provide reliable phase results for new types of samples, while circumventing the requirement of collecting a large amount of high-quality data in supervised learning methods. Guided by the initial phase from LeFTP, the phase recovery ability of the lightweight DNN is enhanced to further improve the phase accuracy at a low computational cost compared with existing end-to-end networks. Experimental results demonstrate that PI-FPA enables more accurate and computationally efficient single-shot phase retrieval and generalizes well to various objects unseen during training. The proposed PI-FPA shows that challenging issues in optical metrology can potentially be overcome through the synergy of physics-priors-based traditional tools and data-driven learning approaches, opening new avenues to achieve fast and accurate single-shot 3D imaging.
Abstract: This study introduces the Orbit Weighting Scheme (OWS), a novel approach aimed at enhancing the precision and efficiency of Vector Space information retrieval (IR) models, which have traditionally relied on weighting schemes like tf-idf and BM25. These conventional methods often struggle to capture document relevance accurately, leading to inefficiencies in both retrieval performance and index size management. OWS proposes a dynamic weighting mechanism that evaluates the significance of terms based on their orbital position within the vector space, emphasizing term relationships and distribution patterns overlooked by existing models. Our research focuses on evaluating OWS’s impact on model accuracy using Information Retrieval metrics such as Recall, Precision, Interpolated Average Precision (IAP), and Mean Average Precision (MAP). Additionally, we assess OWS’s effectiveness in reducing the inverted index size, which is crucial for model efficiency. We compare OWS-based retrieval models against others using different schemes, including tf-idf variations and BM25Delta. Results reveal OWS’s superiority, achieving a 54% Recall and 81% MAP, and a notable 38% reduction in the inverted index size. This highlights OWS’s potential in optimizing retrieval processes and underscores the need for further research in this underrepresented area to fully leverage OWS’s capabilities in information retrieval methodologies.
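As a reminder of how the reported metrics are defined, here is a minimal sketch of Mean Average Precision using the standard non-interpolated definition. It says nothing about OWS itself; the document identifiers and rankings are hypothetical.

```python
def average_precision(ranked, relevant):
    """Non-interpolated average precision for a single query.

    ranked: document ids in retrieval order; relevant: set of relevant ids.
    """
    if not relevant:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, doc in enumerate(ranked, start=1):
        if doc in relevant:
            hits += 1
            # Precision measured at each rank where a relevant doc appears.
            precision_sum += hits / rank
    return precision_sum / len(relevant)


def mean_average_precision(runs):
    """Mean of per-query average precision over (ranked, relevant) pairs."""
    return sum(average_precision(r, rel) for r, rel in runs) / len(runs)


# Hypothetical rankings for two queries.
runs = [
    (["d1", "d2", "d3", "d4"], {"d1", "d3"}),
    (["d5", "d6"], {"d6"}),
]
ap_first = average_precision(*runs[0])
map_score = mean_average_precision(runs)
```

Relevant documents that are never retrieved still count in the denominator, so a system with low Recall is also penalized in MAP.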
Abstract: This research introduces a new approach to custom chatbot development that prioritizes efficiency alongside effectiveness. We achieve this by combining three key technologies: LangChain, Retrieval Augmented Generation (RAG), and large language models (LLMs) fine-tuned with efficient techniques such as LoRA and QLoRA. LangChain allows chatbots to be tailored precisely to specific purposes, ensuring focused and meaningful interactions with users. RAG’s web scraping capabilities give these chatbots access to a vast store of information, enabling them to provide comprehensive and informative responses to queries. The retrieved information is then woven into response generation by LLMs that have been fine-tuned with an emphasis on efficiency. This combined approach offers a threefold benefit: improved effectiveness, enhanced user experience, and expanded access to information. The chatbots become adept at handling user queries accurately and efficiently, while informative and contextually relevant responses make the interaction more natural and engaging for users. Finally, web scraping enables the chatbots to address a wider variety of queries by giving them access to a broader knowledge base. By examining the intricacies of efficient LLM fine-tuning and emphasizing the essential role of web-scraped data, this research contributes to advancing custom chatbot design and implementation. The resulting chatbots showcase the potential of these technologies for creating informative, user-friendly, and efficient conversational agents, ultimately changing the way users interact with chatbots.
Funding: The National High Technology Research and Development Program of China (863 Program) (No. 2003AA1Z2560, 2002AA414060); the Key Science and Technology Program of Shaanxi Province (No. 2006K04-G10).
Abstract: To deal with the lack of semantic interoperability in traditional knowledge retrieval approaches, a semantic-based networked manufacturing (NM) knowledge retrieval architecture is proposed, which offers a series of tools for supporting the sharing of knowledge and promoting NM collaboration. A 5-tuple based semantic information retrieval model is proposed, which includes interoperation on the semantic layer, and a test process is given for this model. Evaluation shows that the recall ratio and the precision ratio of manufacturing knowledge retrieval are greatly improved. Thus, a practical and reliable approach based on the semantic web is provided for solving the related concrete problems in regional networked manufacturing.
Funding: Under the auspices of the Social Science and Humanity on Young Fund of the Ministry of Education of China (No. 21YJCZH100); the Scientific Research Project on Outstanding Young of the Fujian Agriculture and Forestry University (No. XJQ201920); the Science and Technology Innovation Special Fund Project of Fujian Agriculture and Forestry University (No. CXZX2021032); and the Forestry Peak Discipline Construction Project of Fujian Agriculture and Forestry University (No. 72202200205).
Abstract: Under the influence of anthropogenic activity and climate change, the problems caused by the urban heat island (UHI) have become increasingly prominent. To promote sustainable urban development and improve the quality of human settlements, it is important to explore the evolution characteristics of the urban thermal environment and analyze its driving forces. Taking Landsat series images as the basic data source, the winter land surface temperature (LST) of the rapidly urbanizing area of Fuzhou City in China was quantitatively retrieved from 2001 to 2021. By combining the standard deviation ellipse model, profile analysis, and the GeoDetector model, the spatio-temporal evolution characteristics and influencing factors of the winter urban thermal environment were systematically analyzed. The results showed that the winter LST presented an increasing trend in the study area during 2001–2021, and the winter LST of the central urban regions was significantly higher than that of the suburbs. There was a strong UHI effect from 2001 to 2021, with a spatial expansion trend from the central urban regions to the suburbs and coastal areas. The LST of green lands and wetlands was significantly lower than that of croplands, artificial surfaces, and unvegetated lands. Vegetation and water bodies had a significant mitigation effect on the UHI, especially at the micro-scale. The winter UHI was jointly driven by the underlying surface and socio-economic factors in a nonlinear or two-factor interactive enhancement mode, and socio-economic factors played a leading role. This research could provide data support and decision-making references for rationally planning urban layout and promoting sustainable urban development.
Funding: National Natural Science Foundation of China (No. 61971121).
Abstract: Clothing attribute recognition has become an essential technology, which enables users to automatically identify the characteristics of clothes and search for clothing images with similar attributes. However, existing methods cannot recognize newly added attributes and may fail to capture region-level visual features. To address these issues, a region-aware fashion contrastive language-image pre-training (RaF-CLIP) model was proposed. This model aligned cropped and segmented images with category texts and multiple fine-grained attribute texts, matching fashion regions with their corresponding texts through contrastive learning. Clothing retrieval finds suitable clothing based on user-specified clothing categories and attributes. To further improve retrieval accuracy, an attribute-guided composed network (AGCN) was introduced as an additional component on RaF-CLIP, specifically designed for composed image retrieval. This task aims to modify the reference image based on textual expressions in order to retrieve the expected target. By adopting a transformer-based bidirectional attention and gating mechanism, the network fuses and selects image features and attribute text features. Experimental results show that the proposed model achieves a mean precision of 0.6633 for attribute recognition tasks and a recall@10 of 39.18 for the composed image retrieval task (recall@k is defined as the percentage of correct samples appearing in the top k retrieval results), satisfying user needs for freely searching for clothing through images and texts.
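The recall@k definition given above can be written out as a short sketch. It assumes one correct target per query, which is the usual setup for composed image retrieval but is not stated explicitly in the abstract; the function name and the toy rankings are illustrative.

```python
def recall_at_k(ranked_results, targets, k):
    """Percentage of queries whose correct target is in the top-k results.

    ranked_results: one ranked list of candidate ids per query.
    targets: the single correct id for each query (an assumption here).
    """
    if len(ranked_results) != len(targets):
        raise ValueError("need exactly one target per query")
    if not targets:
        return 0.0
    hits = sum(
        1 for ranked, target in zip(ranked_results, targets)
        if target in ranked[:k]
    )
    return 100.0 * hits / len(targets)


# Hypothetical retrieval results for four queries.
ranked = [
    ["a", "b", "c"],
    ["d", "e", "f"],
    ["g", "h", "i"],
    ["j", "k", "l"],
]
targets = ["b", "f", "x", "j"]
scores = {k: recall_at_k(ranked, targets, k) for k in (1, 2, 3)}
```

Recall@k can only stay the same or grow as k increases, so a reported recall@10 is an upper bound on recall@1 for the same system.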
Abstract: Crime scene investigation (CSI) images are key evidence carriers during criminal investigation, and CSI image retrieval can assist the police in obtaining criminal clues. Moreover, with the rapid development of deep learning, the data-driven paradigm has become the mainstream method of CSI image feature extraction and representation, and in this process datasets provide effective support for CSI retrieval performance. However, there is a lack of systematic research on CSI image retrieval methods and datasets. Therefore, we present an overview of existing work on one-class and multi-class CSI image retrieval based on deep learning. Based on their technical functionalities and implementation methods, CSI image retrieval methods are roughly classified into five categories: feature representation, metric learning, generative adversarial networks, autoencoder networks, and attention networks. Furthermore, we analyze the remaining challenges and discuss future work directions in this field.