Hypoxia is a typical feature of the tumor microenvironment and one of the most critical factors affecting cell behavior and tumor progression. However, the lack of tumor models able to precisely emulate natural brain tumor tissue has impeded the study of the effects of hypoxia on the progression and growth of tumor cells. This study reports a three-dimensional (3D) brain tumor model obtained by encapsulating U87MG (U87) cells in a hydrogel containing type I collagen. It also documents the effect of various oxygen concentrations (1%, 7%, and 21%) in the culture environment on U87 cell morphology, proliferation, viability, cell cycle, apoptosis rate, and migration. Finally, it compares two-dimensional (2D) and 3D cultures. For comparison purposes, cells cultured in flat culture dishes were used as the control (2D model). Cells cultured in the 3D model proliferated more slowly but had a higher apoptosis rate and a higher proportion of cells in the resting phase (G0 phase)/gap I phase (G1 phase) than those cultured in the 2D model. In addition, the two models yielded significantly different cell morphologies. Finally, hypoxia (e.g., 1% O2) affected cell morphology, slowed cell growth, reduced cell viability, and increased the apoptosis rate in the 3D model. These results indicate that the constructed 3D model is effective for investigating the effects of biological and chemical factors on cell morphology and function, and can be more representative of the tumor microenvironment than 2D culture systems. The developed 3D glioblastoma tumor model is equally applicable to other studies in pharmacology and pathology.
Cyberspace is extremely dynamic, with new attacks arising daily. Protecting cybersecurity controls is vital for network security. Deep Learning (DL) models find widespread use across various fields, with cybersecurity being one of the most crucial due to their rapid cyberattack detection capabilities on networks and hosts. The capabilities of DL in feature learning and analyzing extensive data volumes lead to the recognition of network traffic patterns. This study presents novel lightweight DL models, known as Cybernet models, for the detection and recognition of various cyber Distributed Denial of Service (DDoS) attacks. These models were constructed to have a reasonable number of learnable parameters, i.e., fewer than 225,000, hence the name “lightweight.” This not only helps reduce the number of computations required but also results in faster training and inference times. Additionally, these models were designed to extract features in parallel from 1D Convolutional Neural Networks (CNN) and Long Short-Term Memory (LSTM), which makes them unique compared to earlier architectures and results in better performance measures. To validate their robustness and effectiveness, they were tested on the CIC-DDoS2019 dataset, which is a large, imbalanced dataset that contains different types of DDoS attacks. Experimental results revealed that both models yielded promising results, with 99.99% for the detection model and 99.76% for the recognition model in terms of accuracy, precision, recall, and F1 score. Furthermore, they outperformed the existing state-of-the-art models proposed for the same task. Thus, the proposed models can be used in cybersecurity research domains to successfully identify different types of attacks with a high detection and recognition rate.
Named Entity Recognition (NER) is a fundamental task in biomedical text mining, aiming to extract specific types of entities such as genes, proteins, and diseases from complex biomedical texts and categorize them into predefined entity types. This process can provide basic support for the automatic construction of knowledge bases. In contrast to general texts, biomedical texts frequently contain numerous nested entities and local dependencies among these entities, presenting significant challenges to prevailing NER models. To address these issues, we propose a novel Chinese nested biomedical NER model based on RoBERTa and Global Pointer (RoBGP). Our model first uses the RoBERTa-wwm-ext-large pretrained language model to dynamically generate word-level initial vectors. It then incorporates a Bidirectional Long Short-Term Memory network to capture bidirectional semantic information, effectively addressing the issue of long-distance dependencies. Furthermore, the Global Pointer model is employed to comprehensively recognize all nested entities in the text. We conduct extensive experiments on the Chinese medical dataset CMeEE, and the results demonstrate the superior performance of RoBGP over several baseline models. This research confirms the effectiveness of RoBGP in Chinese biomedical NER, providing reliable technical support for biomedical information extraction and knowledge base construction.
As important geological data, a geological report contains rich expert and geological knowledge, but the challenge facing current research into geological knowledge extraction and mining is how to achieve an accurate understanding of geological reports guided by domain knowledge. While generic named entity recognition models and tools can be used to process geoscience reports and documents, their effectiveness is hampered by a dearth of domain-specific knowledge, which in turn leads to a pronounced decline in recognition accuracy. This study summarizes six types of typical geological entities with reference to the ontological system of geological domains and builds a high-quality corpus for the task of geological named entity recognition (GNER). In addition, GeoWoBERT-advBGP (Geological Word-base BERT-adversarial training Bi-directional Long Short-Term Memory Global Pointer) is proposed to address the ambiguity, diversity, and nesting of geological entities. The model first uses the fine-tuned, word-granularity-based pre-training model GeoWoBERT (Geological Word-base BERT) and combines it with the text features extracted by the BiLSTM (Bi-directional Long Short-Term Memory). An adversarial training algorithm is then applied to improve the robustness of the model and enhance its resistance to interference, and decoding is finally performed using a global association pointer algorithm. The experimental results show that the proposed model achieves high performance on the constructed dataset and is capable of mining the rich geological information.
The Internet of Vehicles (IoV) is a new system that enables individual vehicles to connect with nearby vehicles, people, transportation infrastructure, and networks, thereby realizing a more intelligent and efficient transportation system. The movement of vehicles and the three-dimensional (3D) nature of the road network give the topological structure of the IoV high space and time complexity. Network modeling and structure recognition for 3D roads can benefit the description of topological changes in the IoV. This paper proposes a general 3D road model based on discrete points of roads obtained from GIS. First, the constraints imposed by 3D roads on moving vehicles are analyzed. Then the effects of road curvature radius (Ra), longitudinal slope (Slo), and length (Len) on speed and acceleration are studied. Finally, a general 3D road network model based on road section features is established. This paper also presents intersection and road section recognition methods based on the structural features of the 3D road network model and the road features. Real GIS data from a specific region of Beijing is adopted to create the simulation scenario, and the simulation results validate the general 3D road network model and the recognition method. This work therefore contributes to the field of intelligent transportation by providing a comprehensive approach to modeling the 3D road network and its topological changes, in support of efficient traffic flow and improved road safety.
Expanding photovoltaic (PV) resources in rural-grid areas is an essential means to augment the share of solar energy in the energy landscape, aligning with the “carbon peaking and carbon neutrality” objectives. However, rural power grids often lack digitalization; thus, the load distribution within these areas is not fully known. This hinders the calculation of the available PV capacity and the deduction of node voltages. This study proposes a load-distribution modeling approach based on remote-sensing image recognition in pursuit of a scientific framework for developing distributed PV resources in rural grid areas. First, houses in remote-sensing images are accurately recognized using deep-learning techniques based on the YOLOv5 model. The distribution of the houses is then used to estimate the load distribution in the grid area. Next, equally spaced and clustered distribution models are used to adaptively determine the location of the nodes and the load power in the distribution lines. Finally, by calculating the connectivity matrix of the nodes, a minimum spanning tree is extracted, the topology of the network is constructed, and the node parameters of the load-distribution model are calculated. The proposed scheme is implemented in a software package, and its efficacy is demonstrated by analyzing typical remote-sensing images of rural grid areas. The results underscore the ability of the proposed approach to effectively discern the distribution-line structure and compute the node parameters, thereby offering vital support for determining PV access capability.
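The minimum-spanning-tree step in the abstract above can be illustrated with a small sketch. It is not the authors' implementation: the node coordinates, the use of Euclidean distance as the edge weight, and the function names `distance_matrix` and `minimum_spanning_tree` are all assumptions made for illustration, and Prim's algorithm is simply one standard way to extract a minimum spanning tree from a weight matrix.

```python
import math


def distance_matrix(points):
    """Build a symmetric weight matrix of Euclidean distances (assumed edge weight)."""
    return [[math.hypot(ax - bx, ay - by) for (bx, by) in points]
            for (ax, ay) in points]


def minimum_spanning_tree(weights):
    """Prim's algorithm on a symmetric weight matrix; math.inf marks 'no edge'.

    Returns a list of (parent, child, weight) edges in the order they are added.
    """
    n = len(weights)
    in_tree = [False] * n
    best_cost = [math.inf] * n   # cheapest known edge linking each node to the tree
    parent = [None] * n
    best_cost[0] = 0.0           # grow the tree from node 0
    edges = []
    for _ in range(n):
        # Pick the cheapest node that is not yet in the tree.
        u = min((i for i in range(n) if not in_tree[i]), key=lambda i: best_cost[i])
        if best_cost[u] == math.inf:
            raise ValueError("graph is not connected")
        in_tree[u] = True
        if parent[u] is not None:
            edges.append((parent[u], u, weights[parent[u]][u]))
        # Relax the edges leaving the newly added node.
        for v in range(n):
            if not in_tree[v] and weights[u][v] < best_cost[v]:
                best_cost[v] = weights[u][v]
                parent[v] = u
    return edges


# Hypothetical load-node coordinates (arbitrary units), for illustration only.
nodes = [(0.0, 0.0), (1.0, 0.0), (1.0, 1.0), (5.0, 1.0)]
tree = minimum_spanning_tree(distance_matrix(nodes))
for a, b, w in tree:
    print(f"node {a} -- node {b}: length {w:.2f}")
```

In a real grid model the weight matrix would presumably encode only feasible line routes rather than all pairwise distances; infeasible pairs can be marked with `math.inf`.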
Using drilling and seismic data from the transtensional (strike-slip) fault system in the Ziyang area of the central Sichuan Basin, SW China, together with plane-section integrated structural interpretation, 3-D fault framework modeling, fault throw analysis, and balanced profile restoration, this study shows that the transtensional fault system in the Ziyang 3-D seismic survey consists of the northeast-trending F_(I)19 and F_(I)20 fault zones, dominated by extensional deformation, and three sets of northwest-trending en echelon normal faults that experienced dextral shear deformation. The F_(I)19 and F_(I)20 fault zones cut through the Neoproterozoic to the Lower Triassic Jialingjiang Formation, presenting a 3-D structure shaped like an “S”-shaped ribbon. Before the Permian and during the Early Triassic, the F_(I)19 and F_(I)20 fault zones underwent at least two periods of structural superimposition. The three sets of northwest-trending en echelon normal faults are composed of small normal faults arranged in pairs, with opposite dip directions and a partially left-stepped arrangement. They had largely formed before the Permian, restricting the eastward growth and propagation of the F_(I)19 fault zone. The F_(I)19 and F_(I)20 fault zones connect multiple sets of source rocks and reservoirs from deep to shallow, and the timing of fault activity matches well with the oil and gas generation peaks. If favorable Cambrian-Triassic sedimentary facies and reservoirs are developed on the local anticlinal belts on both sides of the F_(I)19 and F_(I)20 fault zones, the major reservoirs in this area are expected to yield breakthroughs in oil and gas exploration.
Named Entity Recognition (NER) is crucial for extracting structured information from text. While traditional methods rely on rules, Conditional Random Fields (CRFs), or deep learning, the advent of large-scale Pre-trained Language Models (PLMs) offers new possibilities. PLMs excel at contextual learning, potentially simplifying many natural language processing tasks. However, their application to NER remains underexplored. This paper investigates leveraging the GPT-3 PLM for NER without fine-tuning. We propose a novel scheme that uses carefully crafted templates and context examples selected based on semantic similarity. Our experimental results demonstrate the feasibility of this approach, suggesting a promising direction for harnessing PLMs in NER.
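The idea of selecting context examples by semantic similarity and placing them in a template can be sketched as follows. This is only an illustration under stated assumptions: the abstract does not give the embedding model, the similarity measure, or the template, so the hand-written toy vectors, the cosine similarity, the prompt wording, and the names `select_examples` and `build_prompt` are all hypothetical. No language-model API is called.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def select_examples(query_vec, pool, k):
    """Return the k annotated examples whose vectors are most similar to the query."""
    ranked = sorted(pool,
                    key=lambda item: cosine_similarity(query_vec, item["vector"]),
                    reverse=True)
    return ranked[:k]


def build_prompt(sentence, examples):
    """Fill a simple, hypothetical few-shot template with the selected examples."""
    lines = ["Extract the named entities from each sentence."]
    for ex in examples:
        lines.append(f"Sentence: {ex['text']}")
        lines.append(f"Entities: {ex['entities']}")
    lines.append(f"Sentence: {sentence}")
    lines.append("Entities:")
    return "\n".join(lines)


# Toy pool: the vectors stand in for sentence embeddings and are made up.
pool = [
    {"text": "Marie Curie was born in Warsaw.",
     "entities": "Marie Curie (PERSON); Warsaw (LOCATION)",
     "vector": [0.9, 0.1, 0.0]},
    {"text": "Apple released a new phone.",
     "entities": "Apple (ORGANIZATION)",
     "vector": [0.1, 0.9, 0.1]},
    {"text": "The Danube flows through Vienna.",
     "entities": "Danube (LOCATION); Vienna (LOCATION)",
     "vector": [0.7, 0.0, 0.6]},
]

query_sentence = "Albert Einstein lived in Bern."
query_vector = [0.8, 0.2, 0.1]  # made-up embedding of the query sentence

selected = select_examples(query_vector, pool, k=2)
prompt = build_prompt(query_sentence, selected)
print(prompt)
```

The resulting prompt string would then be sent to the language model; in practice the vectors would come from a sentence-embedding model rather than being written by hand.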
This paper presents a new method for extracting three-dimensional (3D) discrete spherical Fourier descriptors based on surface curvature voxels for pollen particle recognition. To reduce the large amount of pollen information and the noise disturbance, the geometrically normalized curvature voxels with the principal curvedness are first extracted to represent the intrinsic pollen volumetric data. The curvature voxels are then decomposed into radial and angular components with the spherical harmonic transform in spherical coordinates. Finally, the 3D discrete Fourier transform is applied to the decomposed curvature voxels to obtain the 3D spherical Fourier descriptors for pollen recognition. Experimental results show that the presented descriptors are invariant to different geometric transformations of pollen particles, such as pose change and spatial rotation, and can achieve high recognition accuracy and speed simultaneously.
A cascaded projection of the Gaussian mixture model algorithm is proposed. First, the marginal distribution of the Gaussian mixture model is computed for different feature dimensions, and a number of sub-classifiers are generated using the marginal distribution model. Each sub-classifier is based on a different feature set. The cascaded structure is adopted to fuse the sub-classifiers dynamically to achieve sample adaptation ability. Second, the effectiveness of the proposed algorithm is verified on electrocardiogram emotional signals and speech emotional signals. Emotional data including fidgetiness, happiness, and sadness are collected by induction experiments. Finally, the emotion feature extraction method is discussed, including heart rate variability, the chaotic electrocardiogram feature, and the utterance-level static feature. The emotional feature reduction methods are studied, including principal component analysis, sequential forward selection, the Fisher discriminant ratio, and the maximal information coefficient. The experimental results show that the proposed classification algorithm can effectively improve recognition accuracy in two different scenarios.
Aim: To use animal experiments to characterize the patterns of low cerebral blood pressure and reduced blood capacity in the cerebral tissues of astronauts under acceleration load. Methods: The isotope tracking technique was applied to mark the blood and record the dynamic curves of cerebral blood flow changes under various accelerations, and the relevant mathematical model was set up using the method of system recognition. Factor analysis was also used to select two of the data channels collected by eight sensors as two factors. Results: One of the two factors reflects the patterns in the astronaut's upper body, the other those in the lower body. The rise time, delay time, and steady-state value reflect the results under different accelerations. Conclusion: For both the upper body and the lower body, blood flow changes can be described by a second-order system model. This method provides a new technique for studying astronauts' endurance of acceleration and for selecting astronauts.
Dynamic causal modeling of functional magnetic resonance imaging (fMRI) signals is employed to explore critical emotional neurocircuitry under sad stimuli. The intrinsic model of emotional loops is built on the basis of Papez's circuit and related prior knowledge, and three modulatory connection models are then established. In these models, stimuli are placed at different points, representing different ways in which they affect and modulate the neural activity between brain regions. The optimal model is then selected by Bayesian model comparison. In the group analysis, patients' intrinsic and modulatory connections from the anterior cingulate cortex (ACC) to the right inferior frontal gyrus (rIFG) are significantly higher than those of the control group. The functional connection parameters of the model are then selected as classifier features. The classification accuracy of the support vector machine (SVM) classifier is 80.73%, which, to some extent, validates the effectiveness of the regional connectivity parameters for depression recognition and provides a new approach for the clinical diagnosis of depression.
Two discriminative methods for solving tone problems in Mandarin speech recognition are presented. First, discriminative training of the HMM (hidden Markov model) based tone models is proposed. Then a technique for integrating tone models into a large vocabulary continuous speech recognition system is presented. Discriminative model weight training based on the minimum phone error criterion is adopted, aiming at optimal integration of the tone models. The extended Baum-Welch algorithm is applied to find the model-dependent weights that scale the acoustic scores and tone scores. Experimental results show that tone recognition rates and continuous speech recognition accuracy can be improved by the discriminatively trained tone model. The performance of a large vocabulary continuous Mandarin speech recognition system can be further enhanced by the discriminatively trained weight combinations, owing to a better interpolation of the given models.
This paper presents a hybrid model for three-dimensional Geographical Information Systems that integrates surface-based and volume-based models. The Triangulated Irregular Network (TIN) and octree models are integrated in this hybrid model. The TIN model works as a surface-based model that mainly serves surface presentation and visualization, while the octree encoding supports volumetric analysis. The designed data structure brings a major advantage in three-dimensional selective retrieval, which increases the efficiency of three-dimensional data operations.
Purpose: Move recognition in scientific abstracts is an NLP task of classifying the sentences of abstracts into different types of language units. To improve the performance of move recognition in scientific abstracts, a novel move recognition model is proposed that outperforms the BERT-based method. Design/methodology/approach: Prevalent BERT-based models for sentence classification often classify sentences without considering their context. In this paper, inspired by the BERT masked language model (MLM), we propose a novel model called the masked sentence model, which integrates the content and contextual information of sentences in move recognition. Experiments are conducted on the benchmark dataset PubMed 20K RCT in three steps. We then compare our model with HSLN-RNN, BERT-based, and SciBERT models using the same dataset. Findings: The F1 score of our model is higher than those of the BERT-based and SciBERT models by 4.96% and 4.34%, respectively, which shows the feasibility and effectiveness of the novel model, and its result comes closest to the current state-of-the-art result of HSLN-RNN. Research limitations: The sequential features of move labels are not considered, which might be one of the reasons why HSLN-RNN performs better. Our model is restricted to biomedical English literature because we use a dataset from PubMed, a typical biomedical database, to fine-tune it. Practical implications: The proposed model is better and simpler in identifying move structures in scientific abstracts and is worth applying in text classification experiments that need to capture the contextual features of sentences. Originality/value: The study proposes a masked sentence model based on BERT that considers the contextual features of the sentences in abstracts in a new way. The performance of this classification model is significantly improved by rebuilding the input layer without changing the structure of the neural networks.
A new three-dimensional semi-implicit finite-volume ocean model has been developed for simulating coastal ocean circulation. It is based on a staggered C-unstructured non-orthogonal grid in the horizontal direction and a z-level grid in the vertical direction. The three-dimensional model is discretized by the semi-implicit finite-volume method, in which the free-surface and vertical diffusion terms are treated semi-implicitly, thereby removing the stability limitations associated with the surface gravity wave and vertical diffusion terms. The remaining terms in the momentum equations are discretized explicitly by an integral method. The partial cell method is used for resolving topography, which enables the model to better represent irregular topography. The model has been tested against analytical cases for wind-driven and tidal oscillation circulation and is applied to simulating the tidal flow in the Bohai Sea. The results are in good agreement with both the analytical solutions and the measurements.
In gravity-anomaly-based prospecting, the computational and memory requirements for practical numerical modeling are potentially enormous. Achieving an efficient and precise inversion for gravity anomaly imaging over large-scale and complex terrain requires additional methods. To this end, we have proposed a new topography-capable [...] By performing a two-dimensional Fourier transform in the horizontal directions, three-dimensional partial differential equations in the spatial domain were transformed into a group of independent, one-dimensional differential equations associated with different wave numbers. These independent differential equations are highly parallel across different wave numbers. [...] differential equations with different wave numbers, and the efficiency of solving fixed-bandwidth linear equations was further improved by a chasing method. In a synthetic test, a prism model was used to verify the accuracy and reliability of the proposed algorithm by comparing the numerical solution with the analytical solution. We studied the computational precision and efficiency with and without topography using different Fourier transform methods. The results showed that the Gauss-FFT method has higher numerical precision, while the standard FFT method is superior, in terms of computation time, for inversion and quantitative interpretation under complicated terrain.
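The "chasing method" mentioned in the abstract above is commonly understood as the Thomas algorithm for banded linear systems. The sketch below shows the tridiagonal case only, which is an assumption: the abstract speaks of fixed-bandwidth systems without stating the bandwidth, and the function name `solve_tridiagonal` and the test system are illustrative. The algorithm does no pivoting, so it assumes a well-conditioned (for example diagonally dominant) matrix.

```python
def solve_tridiagonal(lower, diag, upper, rhs):
    """Solve a tridiagonal system with the Thomas (chasing) algorithm.

    Row i reads: lower[i]*x[i-1] + diag[i]*x[i] + upper[i]*x[i+1] = rhs[i].
    All four lists have length n; lower[0] and upper[n-1] are ignored.
    """
    n = len(diag)
    c_prime = [0.0] * n
    d_prime = [0.0] * n

    # Forward sweep: eliminate the sub-diagonal.
    c_prime[0] = upper[0] / diag[0]
    d_prime[0] = rhs[0] / diag[0]
    for i in range(1, n):
        denom = diag[i] - lower[i] * c_prime[i - 1]
        c_prime[i] = upper[i] / denom if i < n - 1 else 0.0
        d_prime[i] = (rhs[i] - lower[i] * d_prime[i - 1]) / denom

    # Back substitution.
    x = [0.0] * n
    x[-1] = d_prime[-1]
    for i in range(n - 2, -1, -1):
        x[i] = d_prime[i] - c_prime[i] * x[i + 1]
    return x


# Illustrative 3x3 system with matrix [[2,-1,0],[-1,2,-1],[0,-1,2]],
# whose right-hand side was built from the known solution [1, 2, 3].
solution = solve_tridiagonal(
    lower=[0.0, -1.0, -1.0],
    diag=[2.0, 2.0, 2.0],
    upper=[-1.0, -1.0, 0.0],
    rhs=[0.0, 0.0, 4.0],
)
print(solution)
```

Because each wave number yields its own independent system, a solver of this kind can be run for all wave numbers in parallel, which is the property the abstract highlights. The cost per system is linear in the number of unknowns.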
Gesture recognition is used in many practical applications such as human-robot interaction, medical rehabilitation, and sign language. With the continuing development of motion sensors, multiple data sources have become available, which has led to the rise of multi-modal gesture recognition. Because our previous approach to gesture recognition depended on a unimodal system, it had difficulty classifying similar motion patterns. To solve this problem, a novel approach that integrates motion, audio, and video models is proposed, using a dataset captured by Kinect. The proposed system recognizes observed gestures with three models. The recognition results of the three models are integrated by the proposed framework, and the output becomes the final result. The motion and audio models are learned with Hidden Markov Models. Random Forest, which serves as the video classifier, is used to learn the video model. In the experiments testing the performance of the proposed system, the motion and audio models most suitable for gesture recognition are chosen by varying the feature vectors and learning methods. Additionally, the unimodal and multi-modal models are compared with respect to recognition accuracy. All experiments are conducted on the dataset provided by the organizer of MMGRC, a workshop for the Multi-Modal Gesture Recognition Challenge. The comparison shows that the multi-modal model composed of the three models achieves the highest recognition rate. This improvement indicates that the complementary relationship among the three models improves the accuracy of gesture recognition. The proposed system provides application technology for understanding human actions in daily life more precisely.
A new digital modulation recognition method is proposed in this paper, based on features extracted from the generalized autoregressive (GAR) model parameters of the received waveform and on a multilayer perceptron (MLP) neural network classifier. Because of the better noise suppression ability of the GAR model and the powerful pattern classification capacity of the MLP neural network classifier, the new method can significantly improve recognition performance at lower SNR with better robustness. Computer simulations are also performed to assess the performance of the new method.
In this study, entropy weight coefficients based on Shannon entropy were determined for an attribute recognition model used to assess the quality of groundwater sources. The model follows the theory previously proposed by Chen Q S. The procedure is as follows: first, the attribute space matrix is established and the weights are determined using Shannon entropy theory; second, the attribute measure is calculated; third, the result is evaluated with the confidence criterion and the score criterion; finally, an application example is given. The results show that the water quality of the groundwater sources for the city meets the grade II or III standard. No pollutant obviously exceeds the standard, and the water can meet people's needs. The results from an evaluation of this model are in basic agreement with the observed situation and with a set pair analysis (SPA) model.
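The entropy-weight step can be illustrated with the commonly used textbook formulation, sketched below. The abstract does not give the authors' exact normalization, so this formulation, the function name `entropy_weights`, and the sample values are assumptions. For m samples and indicator j, the column is normalized to proportions p_ij, the entropy is e_j = -(1/ln m) Σ p_ij ln p_ij, and the weight is proportional to 1 - e_j.

```python
import math


def entropy_weights(matrix):
    """Entropy weights for the columns of a non-negative sample-by-indicator matrix.

    Indicators whose values vary more across samples carry more information
    (lower entropy) and therefore receive larger weights.
    """
    m = len(matrix)        # number of samples (must be at least 2)
    n = len(matrix[0])     # number of indicators
    divergences = []
    for j in range(n):
        column = [row[j] for row in matrix]
        total = sum(column)
        proportions = [value / total for value in column]
        # Convention: 0 * ln(0) is treated as 0.
        entropy = -sum(p * math.log(p) for p in proportions if p > 0) / math.log(m)
        # Clamp to guard against tiny negative values from rounding.
        divergences.append(max(0.0, 1.0 - entropy))
    total_divergence = sum(divergences)
    if total_divergence == 0:
        # Every indicator is constant across samples: fall back to equal weights.
        return [1.0 / n] * n
    return [d / total_divergence for d in divergences]


# Hypothetical data: three water samples, two indicators (values are made up).
# The first indicator is identical in all samples, so it should get ~zero weight.
samples = [
    [2.0, 1.0],
    [2.0, 2.0],
    [2.0, 5.0],
]
weights = entropy_weights(samples)
print(weights)
```

These weights would then feed into the attribute measure calculation described in the abstract; that later step is not sketched here because the abstract does not specify it.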
Funding (3D brain tumor model study): supported by the National Natural Science Foundation of China (No. 52275291), the Fundamental Research Funds for the Central Universities, and the Program for Innovation Team of Shaanxi Province, China (No. 2023-CX-TD-17).
Funding: Supported by the Outstanding Youth Team Project of Central Universities (QNTD202308) and the Ant Group through the CCF-Ant Research Fund (CCF-AFSG 769498 RF20220214).
Abstract: Named Entity Recognition (NER) is a fundamental task in biomedical text mining. It aims to extract specific types of entities such as genes, proteins, and diseases from complex biomedical texts and categorize them into predefined entity types. This process can provide basic support for the automatic construction of knowledge bases. In contrast to general texts, biomedical texts frequently contain numerous nested entities and local dependencies among these entities, presenting significant challenges to prevailing NER models. To address these issues, we propose a novel Chinese nested biomedical NER model based on RoBERTa and Global Pointer (RoBGP). Our model first uses the RoBERTa-wwm-ext-large pretrained language model to dynamically generate word-level initial vectors. It then incorporates a Bidirectional Long Short-Term Memory network to capture bidirectional semantic information, effectively addressing the issue of long-distance dependencies. Furthermore, the Global Pointer model is employed to comprehensively recognize all nested entities in the text. We conduct extensive experiments on the Chinese medical dataset CMeEE, and the results demonstrate the superior performance of RoBGP over several baseline models. This research confirms the effectiveness of RoBGP in Chinese biomedical NER, providing reliable technical support for biomedical information extraction and knowledge base construction.
Funding: Financially supported by the Natural Science Foundation of China (Grant No. 42301492), the National Key R&D Program of China (Grant Nos. 2022YFF0711600, 2022YFF0801201, 2022YFF0801200), the Major Special Project of Xinjiang (Grant No. 2022A03009-3), the Open Fund of Key Laboratory of Urban Land Resources Monitoring and Simulation, Ministry of Natural Resources (Grant No. KF-2022-07014), the Opening Fund of the Key Laboratory of the Geological Survey and Evaluation of the Ministry of Education (Grant No. GLAB 2023ZR01), and the Fundamental Research Funds for the Central Universities.
Abstract: Geological reports are important geological data that contain rich expert and geological knowledge, but the challenge facing current research into geological knowledge extraction and mining is how to understand geological reports accurately under the guidance of domain knowledge. While generic named entity recognition models and tools can be used to process geoscience reports and documents, their effectiveness is hampered by a dearth of domain-specific knowledge, which leads to a pronounced decline in recognition accuracy. This study summarizes six types of typical geological entities with reference to the ontological system of geological domains and builds a high-quality corpus for the task of geological named entity recognition (GNER). In addition, GeoWoBERT-advBGP (Geological Word-base BERT-adversarial training Bi-directional Long Short-Term Memory Global Pointer) is proposed to address the ambiguity, diversity, and nesting of geological entities. The model first uses the fine-tuned, word-granularity-based pre-training model GeoWoBERT (Geological Word-base BERT) and combines it with the text features extracted by the BiLSTM (Bi-directional Long Short-Term Memory). An adversarial training algorithm is then applied to improve the robustness of the model and enhance its resistance to interference. Finally, decoding is performed using a global association pointer algorithm. The experimental results show that the proposed model achieves high performance on the constructed dataset and is capable of mining rich geological information.
Funding: Supported by the National Natural Science Foundation of China (Nos. 62272063, 62072056 and 61902041), the Natural Science Foundation of Hunan Province (Nos. 2022JJ30617 and 2020JJ2029), the Open Research Fund of Key Lab of Broadband Wireless Communication and Sensor Network Technology, Nanjing University of Posts and Telecommunications (No. JZNY202102), the Traffic Science and Technology Project of Hunan Province, China (No. 202042), the Hunan Provincial Key Research and Development Program (No. 2022GK2019), and the Researchers Supporting Project (No. RSPD2023R681), King Saud University, Riyadh, Saudi Arabia.
Abstract: The Internet of Vehicles (IoV) is a new system that enables individual vehicles to connect with nearby vehicles, people, transportation infrastructure, and networks, thereby realizing a more intelligent and efficient transportation system. The movement of vehicles and the three-dimensional (3D) nature of the road network give the topological structure of the IoV high space and time complexity. Network modeling and structure recognition for 3D roads can benefit the description of topological changes in the IoV. This paper proposes a general 3D road model based on discrete points of roads obtained from GIS. First, the constraints imposed by 3D roads on moving vehicles are analyzed. Then the effects of road curvature radius (Ra), longitudinal slope (Slo), and length (Len) on speed and acceleration are studied. Finally, a general 3D road network model based on road section features is established. This paper also presents intersection and road section recognition methods based on the structural features of the 3D road network model and the road features. Real GIS data from a specific region of Beijing is used to create the simulation scenario, and the simulation results validate the general 3D road network model and the recognition method. This work therefore contributes to the field of intelligent transportation by providing a comprehensive approach to modeling the 3D road network and its topological changes, in support of efficient traffic flow and improved road safety.
Funding: Supported by the State Grid Science & Technology Project of China (5400-202224153A-1-1-ZN).
Abstract: Expanding photovoltaic (PV) resources in rural-grid areas is an essential means of increasing the share of solar energy in the energy mix, in line with the "carbon peaking and carbon neutrality" objectives. However, rural power grids often lack digitalization, so the load distribution within these areas is not fully known. This hinders the calculation of the available PV capacity and the deduction of node voltages. This study proposes a load-distribution modeling approach based on remote-sensing image recognition, in pursuit of a scientific framework for developing distributed PV resources in rural grid areas. First, houses in remote-sensing images are recognized using deep-learning techniques based on the YOLOv5 model. The distribution of the houses is then used to estimate the load distribution in the grid area. Next, equally spaced and clustered distribution models are used to adaptively determine the location of the nodes and the load power in the distribution lines. Finally, by calculating the connectivity matrix of the nodes, a minimum spanning tree is extracted, the topology of the network is constructed, and the node parameters of the load-distribution model are calculated. The proposed scheme is implemented in a software package, and its efficacy is demonstrated by analyzing typical remote-sensing images of rural grid areas. The results show that the proposed approach can effectively discern the distribution-line structure and compute the node parameters, thereby offering vital support for determining PV access capability.
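The minimum-spanning-tree step mentioned in this abstract can be illustrated with a minimal sketch. It assumes that load nodes are given as planar coordinates and that every pair of nodes may be connected at a cost equal to their Euclidean distance; the abstract does not state how the connectivity matrix is actually weighted, and the function name and sample coordinates are hypothetical. Prim's algorithm is used here as one standard way to extract the tree.

```python
import math


def minimum_spanning_tree(points):
    """Prim's algorithm on a complete Euclidean graph.

    points: list of (x, y) node coordinates (hypothetical load nodes).
    Returns the tree as a list of (parent_index, child_index) edges.
    """
    n = len(points)
    if n == 0:
        return []
    in_tree = [False] * n
    best_dist = [math.inf] * n   # cheapest known link from the tree to each node
    best_parent = [-1] * n       # tree node that provides that link
    best_dist[0] = 0.0           # grow the tree from node 0
    edges = []
    for _ in range(n):
        # Pick the node outside the tree that is cheapest to attach.
        u = min((i for i in range(n) if not in_tree[i]), key=lambda i: best_dist[i])
        in_tree[u] = True
        if best_parent[u] >= 0:
            edges.append((best_parent[u], u))
        # Update the attachment cost of the remaining nodes.
        for v in range(n):
            if not in_tree[v]:
                d = math.dist(points[u], points[v])
                if d < best_dist[v]:
                    best_dist[v] = d
                    best_parent[v] = u
    return edges


# Four hypothetical house clusters.
nodes = [(0.0, 0.0), (1.0, 0.0), (5.0, 0.0), (1.0, 1.0)]
tree_edges = minimum_spanning_tree(nodes)
```

For n nodes the result always has n - 1 edges, which is what makes it usable as a radial distribution-line topology.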
Funding: Supported by the Key Project of National Natural Science Foundation of China (42330810).
Abstract: Based on drilling and seismic data from the transtensional (strike-slip) fault system in the Ziyang area of the central Sichuan Basin, SW China, this study combines integrated plan-view and cross-section structural interpretation, 3-D fault framework modeling, fault throw analysis, and balanced profile restoration. The results show that the transtensional fault system in the Ziyang 3-D seismic survey consists of the northeast-trending F_(I)19 and F_(I)20 fault zones, which are dominated by extensional deformation, and 3 sets of northwest-trending en echelon normal faults that experienced dextral shear deformation. The F_(I)19 and F_(I)20 fault zones cut through the Neoproterozoic to the Lower Triassic Jialingjiang Formation and present a 3-D structure resembling an "S"-shaped ribbon. They underwent at least two periods of structural superimposition, before the Permian and during the Early Triassic. The 3 sets of northwest-trending en echelon normal faults are composed of small normal faults arranged in pairs, with opposite dip directions and a partially left-stepped arrangement. They had largely formed before the Permian and restricted the eastward growth and propagation of the F_(I)19 fault zone. The F_(I)19 and F_(I)20 fault zones connect multiple sets of source rocks and reservoirs from deep to shallow, and the timing of fault activity matches well with the peaks of oil and gas generation. If favorable Cambrian-Triassic sedimentary facies and reservoirs are developed on the local anticlinal belts on both sides of the F_(I)19 and F_(I)20 fault zones, breakthroughs in oil and gas exploration can be expected for the major reservoirs in this area.
Abstract: Named Entity Recognition (NER) is crucial for extracting structured information from text. While traditional methods rely on rules, Conditional Random Fields (CRFs), or deep learning, the advent of large-scale Pre-trained Language Models (PLMs) offers new possibilities. PLMs excel at contextual learning, potentially simplifying many natural language processing tasks. However, their application to NER remains underexplored. This paper investigates leveraging the GPT-3 PLM for NER without fine-tuning. We propose a novel scheme that uses carefully crafted templates and context examples selected on the basis of semantic similarity. Our experimental results demonstrate the feasibility of this approach, suggesting a promising direction for harnessing PLMs in NER.
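A rough sketch of how context examples might be chosen by semantic similarity and placed into a template is given below. Everything in it is an assumption made for illustration: the abstract does not give the authors' template wording, their embedding model, or their similarity measure. Sentence vectors are taken as precomputed, cosine similarity is used for ranking, and the names `select_examples` and `build_prompt` are hypothetical.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def select_examples(query_vec, pool, k=2):
    """Return the k pool items most similar to the query.

    pool: list of (vector, sentence, entity_annotation) tuples, where the
    vectors are assumed to come from some sentence-embedding model.
    """
    ranked = sorted(pool, key=lambda item: cosine(query_vec, item[0]), reverse=True)
    return ranked[:k]


def build_prompt(sentence, examples):
    """Fill a simple few-shot template (hypothetical wording)."""
    lines = ["Extract the named entities from each sentence."]
    for _, text, entities in examples:
        lines.append(f"Sentence: {text}")
        lines.append(f"Entities: {entities}")
    lines.append(f"Sentence: {sentence}")
    lines.append("Entities:")
    return "\n".join(lines)


# Toy two-dimensional vectors stand in for real sentence embeddings.
pool = [
    ([1.0, 0.0], "Alice joined Acme Corp.", "Alice (PER); Acme Corp (ORG)"),
    ([0.0, 1.0], "It rained all day.", "none"),
    ([0.9, 0.1], "Bob works at Initech.", "Bob (PER); Initech (ORG)"),
]
chosen = select_examples([1.0, 0.0], pool, k=2)
prompt = build_prompt("Carol moved to Globex.", chosen)
```

The resulting prompt string would then be sent to the language model; that call is omitted because it depends on the service being used.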
Funding: Project supported by the National Natural Science Foundation of China (Grant No. 60472061), the Natural Science Foundation of Jiangsu Province, China (Grant No. BK20090149), and the Natural Science Foundation of Higher Education Institutions of Jiangsu Province, China (Grant No. 08KJD520019).
Abstract: This paper presents a new method for extracting three-dimensional (3D) discrete spherical Fourier descriptors based on surface curvature voxels for pollen particle recognition. To reduce the large amount of pollen information and the disturbance from noise, geometrically normalized curvature voxels with the principal curvedness are first extracted to represent the intrinsic pollen volumetric data. The curvature voxels are then decomposed into radial and angular components with the spherical harmonic transform in spherical coordinates. Finally, the 3D discrete Fourier transform is applied to the decomposed curvature voxels to obtain the 3D spherical Fourier descriptors for pollen recognition. Experimental results show that the presented descriptors are invariant to geometric transformations of pollen particles, such as pose change and spatial rotation, and can achieve high recognition accuracy and speed simultaneously.
Funding: The National Natural Science Foundation of China (No. 61231002, 61273266, 51075068, 61271359) and the Doctoral Fund of Ministry of Education of China (No. 20110092130004).
Abstract: A cascaded projection of the Gaussian mixture model algorithm is proposed. First, the marginal distribution of the Gaussian mixture model is computed for different feature dimensions, and a number of sub-classifiers are generated using the marginal distribution model. Each sub-classifier is based on a different feature set. A cascaded structure is adopted to fuse the sub-classifiers dynamically so that the classifier adapts to individual samples. Second, the effectiveness of the proposed algorithm is verified on electrocardiogram emotional signals and speech emotional signals. Emotional data covering fidgetiness, happiness, and sadness are collected through induction experiments. Finally, emotion feature extraction methods are discussed, including heart rate variability, the chaotic electrocardiogram feature, and the utterance-level static feature. Emotional feature reduction methods are also studied, including principal component analysis, sequential forward selection, the Fisher discriminant ratio, and the maximal information coefficient. The experimental results show that the proposed classification algorithm can effectively improve recognition accuracy in two different scenarios.
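The marginalization step can be made concrete with a short sketch. It relies on a standard property of Gaussian mixtures: the marginal over a subset of feature dimensions is again a Gaussian mixture with the same component weights, with each mean restricted to the kept dimensions and each covariance restricted to the corresponding sub-block. How the authors build and cascade their sub-classifiers from these marginals is not described in the abstract, so the function name and the toy parameters below are hypothetical.

```python
def marginalize_gmm(weights, means, covariances, keep):
    """Marginal of a Gaussian mixture over the feature indices in `keep`.

    weights:     list of component weights
    means:       list of mean vectors, one per component
    covariances: list of full covariance matrices (nested lists)
    keep:        indices of the feature dimensions to retain
    """
    sub_means = [[mu[i] for i in keep] for mu in means]
    sub_covs = [[[cov[i][j] for j in keep] for i in keep] for cov in covariances]
    # Component weights are unchanged by marginalization.
    return list(weights), sub_means, sub_covs


# Toy two-component mixture over three features.
w = [0.4, 0.6]
mu = [[0.0, 1.0, 2.0], [3.0, 4.0, 5.0]]
cov = [
    [[1.0, 0.1, 0.2], [0.1, 2.0, 0.3], [0.2, 0.3, 3.0]],
    [[4.0, 0.0, 0.5], [0.0, 5.0, 0.0], [0.5, 0.0, 6.0]],
]
# Keep features 0 and 2, e.g. to build a sub-classifier on that feature set.
sub_w, sub_mu, sub_cov = marginalize_gmm(w, mu, cov, keep=[0, 2])
```

Because no refitting is needed, one trained mixture can supply a marginal model for every feature subset of interest.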
Abstract: Aim: To use animal experiments to characterize the patterns of low cerebral blood pressure and reduced blood volume in cerebral tissue that astronauts experience under acceleration load. Methods: The isotope tracking technique was applied to label the blood and record dynamic curves of the changes in cerebral blood flow under various accelerations, and the corresponding mathematical model was built using system identification. Factor analysis was also used to reduce the data collected by eight sensors to two factors. Results: One of the two factors reflects the patterns in the upper body, and the other reflects those in the lower body. The rise time, delay time, and steady-state value reflect the results under different accelerations. Conclusion: For both the upper body and the lower body, the blood flow changes can be treated as a second-order system model. This method provides a new technique for studying astronauts' tolerance of acceleration and for selecting astronauts.
Funding: The National Natural Science Foundation of China (No. 30900356, 81071135).
Abstract: Dynamic causal modeling of functional magnetic resonance imaging (fMRI) signals is employed to explore critical emotional neurocircuitry under sad stimuli. The intrinsic model of emotional loops is built on the basis of Papez's circuit and related prior knowledge, and three modulatory connection models are then established. In these models, stimuli are placed at different points, representing different ways in which they affect the neural activity between brain regions and in which that activity is modulated. The optimal model is then selected by Bayesian model comparison. In the group analysis, patients' intrinsic and modulatory connections from the anterior cingulate cortex (ACC) to the right inferior frontal gyrus (rIFG) are significantly higher than those of the control group. The functional connection parameters of the model are then selected as classifier features. The classification accuracy of the support vector machine (SVM) classifier is 80.73%, which, to some extent, validates the effectiveness of the regional connectivity parameters for depression recognition and provides a new approach for the clinical diagnosis of depression.
Abstract: Two discriminative methods for solving tone problems in Mandarin speech recognition are presented. First, discriminative training of tone models based on the HMM (hidden Markov model) is proposed. Then a technique for integrating tone models into a large vocabulary continuous speech recognition system is presented. Discriminative model weight training based on the minimum phone error criterion is adopted, aiming at optimal integration of the tone models. The extended Baum-Welch algorithm is applied to find the model-dependent weights used to scale the acoustic scores and tone scores. Experimental results show that tone recognition rates and continuous speech recognition accuracy can be improved by the discriminatively trained tone model. The performance of a large vocabulary continuous Mandarin speech recognition system can be further enhanced by the discriminatively trained weight combinations, owing to a better interpolation of the given models.
Abstract: This paper presents a hybrid model for three-dimensional Geographical Information Systems that integrates surface-based and volume-based models. The Triangulated Irregular Network (TIN) and octree models are integrated in this hybrid model. The TIN model works as a surface-based model that mainly serves surface presentation and visualization, while the octree encoding supports volumetric analysis. The designed data structure brings a major advantage in three-dimensional selective retrieval, which increases the efficiency of three-dimensional data operations.
Funding: Supported by the project "The demonstration system of rich semantic search application in scientific literature" (Grant No. 1734) from the Chinese Academy of Sciences.
Abstract: Purpose: Move recognition in scientific abstracts is an NLP task of classifying the sentences of an abstract into different types of language units. To improve the performance of move recognition in scientific abstracts, a novel move recognition model is proposed that outperforms the BERT-based method. Design/methodology/approach: Prevalent BERT-based models for sentence classification often classify sentences without considering their context. In this paper, inspired by the BERT masked language model (MLM), we propose a novel model called the masked sentence model, which integrates the content and contextual information of the sentences in move recognition. Experiments are conducted on the benchmark dataset PubMed 20K RCT in three steps. We then compare our model with HSLN-RNN, BERT-based, and SciBERT models using the same dataset. Findings: The F1 score of our model exceeds those of the BERT-based and SciBERT models by 4.96% and 4.34%, respectively, which shows the feasibility and effectiveness of the novel model; its result comes closest to the current state-of-the-art result of HSLN-RNN. Research limitations: The sequential features of move labels are not considered, which might be one of the reasons why HSLN-RNN performs better. Our model is restricted to biomedical English literature because we fine-tune it on a dataset from PubMed, a typical biomedical database. Practical implications: The proposed model is better and simpler at identifying move structures in scientific abstracts and is worth trying in text classification experiments that need to capture the contextual features of sentences. Originality/value: The study proposes a masked sentence model based on BERT that considers the contextual features of the sentences in abstracts in a new way. The performance of this classification model is significantly improved by rebuilding the input layer without changing the structure of the neural networks.
Funding: The Major State Basic Research Program of China under contract No. 2012CB417002 and the National Natural Science Foundation of China under contract Nos 50909065 and 51109039.
Abstract: A new three-dimensional semi-implicit finite-volume ocean model has been developed for simulating coastal ocean circulation. It is based on a staggered C-unstructured non-orthogonal grid in the horizontal direction and a z-level grid in the vertical direction. The three-dimensional model is discretized by the semi-implicit finite-volume method: the free surface and the vertical diffusion are treated semi-implicitly, thereby removing the stability limitations associated with the surface gravity wave and vertical diffusion terms. The remaining terms in the momentum equations are discretized explicitly by an integral method. The partial cell method is used to resolve topography, which enables the model to better represent irregular topography. The model has been tested against analytical cases for wind-driven and tidal oscillation circulation, and is applied to simulating the tidal flow in the Bohai Sea. The results are in good agreement with both the analytical solutions and the measurements.
Funding: Supported by the Natural Science Foundation of China (No. 41574127), the China Postdoctoral Science Foundation (No. 2017M622608), and the project for the independent exploration of graduate students at Central South University (No. 2017zzts008).
Abstract: In gravity-anomaly-based prospecting, the computational and memory requirements for practical numerical modeling are potentially enormous. Achieving an efficient and precise inversion for gravity anomaly imaging over large-scale and complex terrain requires additional methods. To this end, we have proposed a new topography-capable [...] By performing a two-dimensional Fourier transform in the horizontal directions, three-dimensional partial differential equations in the spatial domain were transformed into a group of independent, one-dimensional differential equations associated with different wave numbers. These independent differential equations are highly parallel across different wave numbers. [...] differential equations with different wave numbers, and the efficiency of solving fixed-bandwidth linear equations was further improved by a chasing method. In a synthetic test, a prism model was used to verify the accuracy and reliability of the proposed algorithm by comparing the numerical solution with the analytical solution. We studied the computational precision and efficiency with and without topography using different Fourier transform methods. The results showed that the Gauss-FFT method has higher numerical precision, while the standard FFT method is superior in terms of computation time for inversion and quantitative interpretation under complicated terrain.
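The "chasing method" named in this abstract is commonly understood as the Thomas algorithm for tridiagonal systems, and a minimal sketch of it follows. This is an assumption: the abstract only says "fixed-bandwidth linear equations", so the authors' systems may have a wider band, and the function name and test system here are hypothetical. No pivoting is performed, so the sketch is suitable only for well-conditioned (for example, diagonally dominant) systems.

```python
def solve_tridiagonal(lower, diag, upper, rhs):
    """Solve a tridiagonal linear system with the Thomas (chasing) algorithm.

    lower[i]: sub-diagonal entry of row i (lower[0] is unused)
    diag[i]:  main-diagonal entry of row i
    upper[i]: super-diagonal entry of row i (upper[-1] is unused)
    rhs[i]:   right-hand side of row i
    """
    n = len(diag)
    c = [0.0] * n  # modified super-diagonal
    d = [0.0] * n  # modified right-hand side
    c[0] = upper[0] / diag[0]
    d[0] = rhs[0] / diag[0]
    # Forward sweep: eliminate the sub-diagonal.
    for i in range(1, n):
        denom = diag[i] - lower[i] * c[i - 1]
        c[i] = upper[i] / denom if i < n - 1 else 0.0
        d[i] = (rhs[i] - lower[i] * d[i - 1]) / denom
    # Back substitution.
    x = [0.0] * n
    x[-1] = d[-1]
    for i in range(n - 2, -1, -1):
        x[i] = d[i] - c[i] * x[i + 1]
    return x


# One small system standing in for the equations of a single wave number.
solution = solve_tridiagonal(
    lower=[0.0, -1.0, -1.0],
    diag=[2.0, 2.0, 2.0],
    upper=[-1.0, -1.0, 0.0],
    rhs=[1.0, 0.0, 1.0],
)
```

The cost is linear in the number of unknowns, and because each wave number yields its own independent system, the solves can be distributed across wave numbers.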
Funding: Supported by a Grant-in-Aid for Young Scientists (A) (Grant No. 26700021) from the Japan Society for the Promotion of Science and by the Strategic Information and Communications R&D Promotion Programme (Grant No. 142103011) of the Ministry of Internal Affairs and Communications.
Abstract: Gesture recognition is used in many practical applications such as human-robot interaction, medical rehabilitation, and sign language. With the ongoing development of motion sensors, multiple data sources have become available, which has led to the rise of multi-modal gesture recognition. Because our previous approach to gesture recognition depends on a unimodal system, it has difficulty classifying similar motion patterns. To solve this problem, a novel approach that integrates motion, audio, and video models is proposed, using a dataset captured by Kinect. The proposed system recognizes observed gestures with three models, and their recognition results are integrated by the proposed framework to produce the final result. The motion and audio models are learned with Hidden Markov Models, and a Random Forest classifier is used to learn the video model. In the experiments that test the performance of the proposed system, the motion and audio models most suitable for gesture recognition are chosen by varying the feature vectors and learning methods. Additionally, the unimodal and multi-modal models are compared with respect to recognition accuracy. All experiments are conducted on the dataset provided by the organizers of MMGRC, a workshop for the Multi-Modal Gesture Recognition Challenge. The comparison results show that the multi-modal model composed of three models achieves the highest recognition rate. This improvement indicates that the complementary relationship among the three models improves the accuracy of gesture recognition. The proposed system provides an application technology for understanding human actions in daily life more precisely.
Abstract: This paper proposes a new digital modulation recognition method based on features extracted from the generalized autoregressive (GAR) model parameters of the received waveform and on a multilayer perceptron (MLP) neural network classifier. Owing to the better noise suppression of the GAR model and the strong pattern classification capacity of the MLP neural network classifier, the new method can significantly improve recognition performance at low SNR, with better robustness. Computer simulations are also performed to assess the performance of the new method.
Abstract: In our study, entropy weight coefficients based on Shannon entropy were determined for an attribute recognition model used to model the quality of groundwater sources. The model follows the theory previously proposed by Chen Q S. The procedure is as follows: first, the attribute space matrix is established and the weights are determined on the basis of Shannon entropy theory; second, the attribute measure is calculated; third, the attribute measure is evaluated with the confidence criterion and the score criterion; finally, an application example is given. The results show that the water quality of the city's groundwater sources meets the grade II or III standard. No pollutant obviously exceeds the standard, and the water can meet people's needs. The evaluation results from this model are in basic agreement with the observed situation and with a set pair analysis (SPA) model.
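The weighting step can be illustrated with the commonly used entropy weight method, sketched below. The abstract does not give the exact formulas, so this is an assumption: each indicator column is normalized to proportions, its Shannon entropy is scaled to [0, 1], and the weight is proportional to one minus that entropy. The function name and the sample matrix are hypothetical, and the sketch assumes at least two samples, strictly positive values, and at least one indicator that varies across samples.

```python
import math


def entropy_weights(matrix):
    """Entropy weight method: one weight per indicator (column).

    matrix: rows are samples (e.g. water sources), columns are indicators,
            all values strictly positive.
    """
    n = len(matrix)        # number of samples
    m = len(matrix[0])     # number of indicators
    k = 1.0 / math.log(n)  # scales the entropy of each column to [0, 1]
    divergences = []
    for j in range(m):
        column = [row[j] for row in matrix]
        total = sum(column)
        entropy = 0.0
        for value in column:
            p = value / total
            if p > 0:
                entropy -= k * p * math.log(p)
        # Indicators that vary more across samples carry more information.
        divergences.append(1.0 - entropy)
    norm = sum(divergences)
    return [d / norm for d in divergences]


# Hypothetical data: 2 samples, 2 indicators.
# The first indicator is identical for both samples, so it gets almost no weight.
weights = entropy_weights([[1.0, 1.0], [1.0, 3.0]])
```

An indicator whose values are the same for every sample has entropy 1 and therefore a weight of zero, which matches the intuition that it cannot help to distinguish between water-quality grades.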