RiceDB, a web-based integrated database to annotate rice microarray in various biological contexts was developed. It is composed of eight modules. RiceMap module archives the process of Affymetrix probe sets mapping t...RiceDB, a web-based integrated database to annotate rice microarray in various biological contexts was developed. It is composed of eight modules. RiceMap module archives the process of Affymetrix probe sets mapping to different databases about rice, and aims to the genes represented by a microarray set by retrieving annotation information via the identifier or accession number of every database; RiceGO module indicates the association between a microarray set and gene ontology (GO) categories; RiceKO module is used to annotate a microarray set based on the KEGG biochemical pathways; RiceDO module indicates the information of domain associated with a microarray set; RiceUP module is used to obtain promoter sequences for all genes represented by a microarray set; RiceMR module lists potential microRNA which regulated the genes represented by a microarray set; RiceCD and RiceGF are used to annotate the genes represented by a microarray set in the context of chromosome distribution and rice paralogous family distribution. The results of automatic annotation are mostly consistent with manual annotation. Biological interpretation of the microarray data is quickened by the help of RiceDB.展开更多
High-throughput RNAoseq has revolutionized the process of small RNA (sRNA) discovery, leading to a rapid expansion of sRNA categories. In addition to the previously wellcharacterized sRNAs such as microRNAs (miRNAs...High-throughput RNAoseq has revolutionized the process of small RNA (sRNA) discovery, leading to a rapid expansion of sRNA categories. In addition to the previously wellcharacterized sRNAs such as microRNAs (miRNAs), piwi-interacting RNAs (piRNAs), and small nucleolar RNA (snoRNAs), recent emerging studies have spotlighted on tRNA-derived sRNAs (tsRNAs) and rRNA-derived sRNAs (rsRNAs) as new categories of sRNAs that bear versatile functions. Since existing software and pipelines for sRNA annotation are mostly focused on analyzing miRNAs or piRNAs, here we developed the sRNA annotation pipeline _optimized for rRNA- and tRNA-derived s_RNAs (SPORTS 1 .0). SPORTS1.0 is optimized for analyzing tsRNAs and rsRNAs from sRNA-seq data, in addition to its capacity to annotate canonical sRNAs such as miRNAs and piRNAs. Moreover, SPORTS1.0 can predict potential RNA modification sites based on nucleotide mismatches within sRNAs. SPORTS1.0 is precompiled to annotate sRNAs for a wide range of 68 species across bacteria, yeast, plant, and animal kingdoms, while additional species for analyses could be readily expanded upon end users' input. For demonstration, by analyzing sRNA datasets using SPORTS1.0, we reveal that distinct signatures are present in tsRNAs and rsRNAs from different mouse cell types. We also find that compared to other sRNA species, tsRNAs bear the highest mismatch rate, which is consistent with their highly modified nature. SPORTS1.0 is an opensource software and can be publically accessed at https://github.com/junchaoshi/sports1.0.展开更多
Corpus is a kind of important resource for knowledge acquisition in the natural language processing (NLP). However, up to now, in the biomedical domain comparatively fewer corpus focus on semantic association among ...Corpus is a kind of important resource for knowledge acquisition in the natural language processing (NLP). However, up to now, in the biomedical domain comparatively fewer corpus focus on semantic association among all tokens in a sentence. We proposed an annotation scheme based on feature structure theory for enriching biomedical domain corpora with token semantic association (TSA). There are 227 documents of the BioNLP GE ST training data annotated to form TSA corpus in which each annotated item shows a token semantic association that appears as a triple. The annotation of token semantic association has the potential to significantly advance biomedical text mining by providing rich token semantic information for NLP systems especially for the sophisticated IE systems, such as bio-event extraction.展开更多
The success of deep transfer learning in fault diagnosis is attributed to the collection of high-quality labeled data from the source domain.However,in engineering scenarios,achieving such high-quality label annotatio...The success of deep transfer learning in fault diagnosis is attributed to the collection of high-quality labeled data from the source domain.However,in engineering scenarios,achieving such high-quality label annotation is difficult and expensive.The incorrect label annotation produces two negative effects:1)the complex decision boundary of diagnosis models lowers the generalization performance on the target domain,and2)the distribution of target domain samples becomes misaligned with the false-labeled samples.To overcome these negative effects,this article proposes a solution called the label recovery and trajectory designable network(LRTDN).LRTDN consists of three parts.First,a residual network with dual classifiers is to learn features from cross-domain samples.Second,an annotation check module is constructed to generate a label anomaly indicator that could modify the abnormal labels of false-labeled samples in the source domain.With the training of relabeled samples,the complexity of diagnosis model is reduced via semi-supervised learning.Third,the adaptation trajectories are designed for sample distributions across domains.This ensures that the target domain samples are only adapted with the pure-labeled samples.The LRTDN is verified by two case studies,in which the diagnosis knowledge of bearings is transferred across different working conditions as well as different yet related machines.The results show that LRTDN offers a high diagnosis accuracy even in the presence of incorrect annotation.展开更多
Brassica oleracea has been developed into many important crops,including cabbage,kale,cauliflower,broccoli and so on.The genome and gene annotation of cabbage(cultivar JZS),a representative morphotype of B.oleracea,ha...Brassica oleracea has been developed into many important crops,including cabbage,kale,cauliflower,broccoli and so on.The genome and gene annotation of cabbage(cultivar JZS),a representative morphotype of B.oleracea,has been widely used as a common reference in biological research.Although its genome assembly has been updated twice,the current gene annotation still lacks information on untranslated regions(UTRs)and alternative splicing(AS).Here,we constructed a high-quality gene annotation(JZSv3)using a full-length transcriptome acquired by nanopore sequencing,yielding a total of 59452 genes and 75684 transcripts.Additionally,we re-analyzed the previously reported transcriptome data related to the development of different tissues and cold response using JZSv3 as a reference,and found that 3843 out of 11908 differentially expressed genes(DEGs)underwent AS during the development of different tissues and 309 out of 903 cold-related genes underwent AS in response to cold stress.Meanwhile,we also identified many AS genes,including BolLHCB5 and BolHSP70,that displayed distinct expression patterns within variant transcripts of the same gene,highlighting the importance of JZSv3 as a pivotal reference for AS analysis.Overall,JZSv3 provides a valuable resource for exploring gene function,especially for obtaining a deeper understanding of AS regulation mechanisms.展开更多
●AIM:To investigate a pioneering framework for the segmentation of meibomian glands(MGs),using limited annotations to reduce the workload on ophthalmologists and enhance the efficiency of clinical diagnosis.●METHODS...●AIM:To investigate a pioneering framework for the segmentation of meibomian glands(MGs),using limited annotations to reduce the workload on ophthalmologists and enhance the efficiency of clinical diagnosis.●METHODS:Totally 203 infrared meibomian images from 138 patients with dry eye disease,accompanied by corresponding annotations,were gathered for the study.A rectified scribble-supervised gland segmentation(RSSGS)model,incorporating temporal ensemble prediction,uncertainty estimation,and a transformation equivariance constraint,was introduced to address constraints imposed by limited supervision information inherent in scribble annotations.The viability and efficacy of the proposed model were assessed based on accuracy,intersection over union(IoU),and dice coefficient.●RESULTS:Using manual labels as the gold standard,RSSGS demonstrated outcomes with an accuracy of 93.54%,a dice coefficient of 78.02%,and an IoU of 64.18%.Notably,these performance metrics exceed the current weakly supervised state-of-the-art methods by 0.76%,2.06%,and 2.69%,respectively.Furthermore,despite achieving a substantial 80%reduction in annotation costs,it only lags behind fully annotated methods by 0.72%,1.51%,and 2.04%.●CONCLUSION:An innovative automatic segmentation model is developed for MGs in infrared eyelid images,using scribble annotation for training.This model maintains an exceptionally high level of segmentation accuracy while substantially reducing training costs.It holds substantial utility for calculating clinical parameters,thereby greatly enhancing the diagnostic efficiency of ophthalmologists in evaluating meibomian gland dysfunction.展开更多
Objective:To investigate the effect of Guangdong Shenqu(GSQ)on intestinal flora structure in mice with food stagnation through 16S rDNA sequencing.Methods: Mice were randomly assigned to control,model,GSQ low-dose(GSQ...Objective:To investigate the effect of Guangdong Shenqu(GSQ)on intestinal flora structure in mice with food stagnation through 16S rDNA sequencing.Methods: Mice were randomly assigned to control,model,GSQ low-dose(GSQL),GSQ medium-dose(GSQM),GSQ high-dose(GSQH),and lacidophilin tablets(LAB)groups,with each group containing 10 mice.A food stagnation and internal heat mouse model was established through intragastric administration of a mixture of beeswax and olive oil(1:15).The control group was administered normal saline,and the model group was administered beeswax and olive oil to maintain a state.The GSQL(2 g/kg),GSQM(4 g/kg),GSQH(8 g/kg),and LAB groups(0.625 g/kg)were administered corresponding drugs for 5 d.After administration,16S rDNA sequencing was performed to assess gut microbiota in mouse fecal samples.Results: The model group exhibited significant intestinal flora changes.Following GSQ administration,the abundance and diversity index of the intestinal flora increased significantly,the number of bacterial species was regulated,andαandβdiversity were improved.GSQ administration increased the abundance of probiotics,including Clostridia,Lachnospirales,and Lactobacillus,whereas the abundance of conditional pathogenic bacteria,such as Allobaculum,Erysipelotrichaceae,and Bacteroides decreased.Functional prediction analysis indicated that the pathogenesis of food stagnation and GSQ intervention were primarily associated with carbohydrate,lipid,and amino acid metabolism,among other metabolic pathways.Conclusion: The digestive mechanism of GSQ may be attributed to its role in restoring diversity and abundance within the intestinal flora,thereby improving the composition and structure of the intestinal flora in mice and subsequently influencing the regulation of metabolic pathways.展开更多
As Natural Language Processing(NLP)continues to advance,driven by the emergence of sophisticated large language models such as ChatGPT,there has been a notable growth in research activity.This rapid uptake reflects in...As Natural Language Processing(NLP)continues to advance,driven by the emergence of sophisticated large language models such as ChatGPT,there has been a notable growth in research activity.This rapid uptake reflects increasing interest in the field and induces critical inquiries into ChatGPT’s applicability in the NLP domain.This review paper systematically investigates the role of ChatGPT in diverse NLP tasks,including information extraction,Name Entity Recognition(NER),event extraction,relation extraction,Part of Speech(PoS)tagging,text classification,sentiment analysis,emotion recognition and text annotation.The novelty of this work lies in its comprehensive analysis of the existing literature,addressing a critical gap in understanding ChatGPT’s adaptability,limitations,and optimal application.In this paper,we employed a systematic stepwise approach following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses(PRISMA)framework to direct our search process and seek relevant studies.Our review reveals ChatGPT’s significant potential in enhancing various NLP tasks.Its adaptability in information extraction tasks,sentiment analysis,and text classification showcases its ability to comprehend diverse contexts and extract meaningful details.Additionally,ChatGPT’s flexibility in annotation tasks reducesmanual efforts and accelerates the annotation process,making it a valuable asset in NLP development and research.Furthermore,GPT-4 and prompt engineering emerge as a complementary mechanism,empowering users to guide the model and enhance overall accuracy.Despite its promising potential,challenges persist.The performance of ChatGP Tneeds tobe testedusingmore extensivedatasets anddiversedata structures.Subsequently,its limitations in handling domain-specific language and the need for fine-tuning in specific applications highlight the importance of further investigations to address these issues.展开更多
Dealing with issues such as too simple image features and word noise inference in product image sentence anmotation, a product image sentence annotation model focusing on image feature learning and key words summariza...Dealing with issues such as too simple image features and word noise inference in product image sentence anmotation, a product image sentence annotation model focusing on image feature learning and key words summarization is described. Three kernel descriptors such as gradient, shape, and color are extracted, respectively. Feature late-fusion is executed in turn by the multiple kernel learning model to obtain more discriminant image features. Absolute rank and relative rank of the tag-rank model are used to boost the key words' weights. A new word integration algorithm named word sequence blocks building (WSBB) is designed to create N-gram word sequences. Sentences are generated according to the N-gram word sequences and predefined templates. Experimental results show that both the BLEU-1 scores and BLEU-2 scores of the sentences are superior to those of the state-of-art baselines.展开更多
Du Yu’s Annotation of Classics and Elucidations in "Spring and Autumn Annals" is considered as a milestone for the studies of Spring and Autumn Annals, In Du’s book, the Preface to the Spring and Autumn An...Du Yu’s Annotation of Classics and Elucidations in "Spring and Autumn Annals" is considered as a milestone for the studies of Spring and Autumn Annals, In Du’s book, the Preface to the Spring and Autumn Annals is a theoretical summary and revision for the previous studies on the Spring and Autumn Annals and the Tso Chuan. Its combination of classics and elucidations makes Tso Chuan a real elucidating study on Spring and Autumn Annals. Du expounds his philosophy and political ideas in this book.展开更多
In order to implement the real-time detection of abnormality of elder and devices in an empty nest home,multi-modal joint sensors are used to collect discrete action sequences of behavior,and the improved hierarchical...In order to implement the real-time detection of abnormality of elder and devices in an empty nest home,multi-modal joint sensors are used to collect discrete action sequences of behavior,and the improved hierarchical hidden Markov model is adopted to Abstract these discrete action sequences captured by multi-modal joint sensors into an occupant’s high-level behavior—event,then structure representation models of occupant normality are modeled from large amounts of spatio-temporal data. These models are used as classifiers of normality to detect an occupant’s abnormal behavior.In order to express context information needed by reasoning and detection,multi-media ontology (MMO) is designed to annotate and reason about the media information in the smart monitoring system.A pessimistic emotion model (PEM) is improved to analyze multi-interleaving events of multi-active devices in the home.Experiments demonstrate that the PEM can enhance the accuracy and reliability for detecting active devices when these devices are in blind regions or are occlusive. The above approach has good performance in detecting abnormalities involving occupants and devices in a real-time way.展开更多
As a medium between two different languages, between peoples of two countries, translation is not only a matter of language, but also of cross-cultural transference. According to Professor He Ziran, socio-pragmatic tr...As a medium between two different languages, between peoples of two countries, translation is not only a matter of language, but also of cross-cultural transference. According to Professor He Ziran, socio-pragmatic translation is the kind of translation which examines the conditions on language use that stem from the social and cultural situations to serve cross-cultural communication. This paper focuses on the ways used to achieve socio-pragmatic equivalence in translation practice.展开更多
It is believed that in ancient China,there were only works on graphology,phonology and critical interpretation,but no works on grammar.In works of grammar history published in modern times,they just dwell briefly on a...It is believed that in ancient China,there were only works on graphology,phonology and critical interpretation,but no works on grammar.In works of grammar history published in modern times,they just dwell briefly on ancient grammar,and they consider that there were only some explanations of form words,a few books of form words,but no syntactic analyses.However,in fact,in ancient classical works there were abundant materials of grammatical analyses.Ancient scholars worked out quite a few grammar rules,some of which were profound and coincidental with modern grammar theories.In the present research of ancient Chinese grammar,grammatical analyses made by ancient scholars should be summed up and inherited.展开更多
基金supported by the National Key Basic Research and Development Program of China(Grant No.2005CB120900)the National Natural Science Foundation of China(Grant No.30500106)the Scientific Research Foundation for Returned Overseas Chinese Scholars,Ministry of Education,and the Department of Science and Technology of Zhejiang Province,China(Grant No.2007C22025).
文摘RiceDB, a web-based integrated database to annotate rice microarray in various biological contexts was developed. It is composed of eight modules. RiceMap module archives the process of Affymetrix probe sets mapping to different databases about rice, and aims to the genes represented by a microarray set by retrieving annotation information via the identifier or accession number of every database; RiceGO module indicates the association between a microarray set and gene ontology (GO) categories; RiceKO module is used to annotate a microarray set based on the KEGG biochemical pathways; RiceDO module indicates the information of domain associated with a microarray set; RiceUP module is used to obtain promoter sequences for all genes represented by a microarray set; RiceMR module lists potential microRNA which regulated the genes represented by a microarray set; RiceCD and RiceGF are used to annotate the genes represented by a microarray set in the context of chromosome distribution and rice paralogous family distribution. The results of automatic annotation are mostly consistent with manual annotation. Biological interpretation of the microarray data is quickened by the help of RiceDB.
基金supported by Start-up funds for Zhou and Chen labs from Reno School of Medicine,University of Nevada and from the National Institutes of Health,United States(Grant Nos.R01DK091336 and P01DK041315 to KMSGrant Nos.R01HD092431 and P30GM110767-03 to QC)
文摘High-throughput RNAoseq has revolutionized the process of small RNA (sRNA) discovery, leading to a rapid expansion of sRNA categories. In addition to the previously wellcharacterized sRNAs such as microRNAs (miRNAs), piwi-interacting RNAs (piRNAs), and small nucleolar RNA (snoRNAs), recent emerging studies have spotlighted on tRNA-derived sRNAs (tsRNAs) and rRNA-derived sRNAs (rsRNAs) as new categories of sRNAs that bear versatile functions. Since existing software and pipelines for sRNA annotation are mostly focused on analyzing miRNAs or piRNAs, here we developed the sRNA annotation pipeline _optimized for rRNA- and tRNA-derived s_RNAs (SPORTS 1 .0). SPORTS1.0 is optimized for analyzing tsRNAs and rsRNAs from sRNA-seq data, in addition to its capacity to annotate canonical sRNAs such as miRNAs and piRNAs. Moreover, SPORTS1.0 can predict potential RNA modification sites based on nucleotide mismatches within sRNAs. SPORTS1.0 is precompiled to annotate sRNAs for a wide range of 68 species across bacteria, yeast, plant, and animal kingdoms, while additional species for analyses could be readily expanded upon end users' input. For demonstration, by analyzing sRNA datasets using SPORTS1.0, we reveal that distinct signatures are present in tsRNAs and rsRNAs from different mouse cell types. We also find that compared to other sRNA species, tsRNAs bear the highest mismatch rate, which is consistent with their highly modified nature. SPORTS1.0 is an opensource software and can be publically accessed at https://github.com/junchaoshi/sports1.0.
基金Supported by the National Natural Science Foundation of China(61202304,61173095,61173062,61202193)
文摘Corpus is a kind of important resource for knowledge acquisition in the natural language processing (NLP). However, up to now, in the biomedical domain comparatively fewer corpus focus on semantic association among all tokens in a sentence. We proposed an annotation scheme based on feature structure theory for enriching biomedical domain corpora with token semantic association (TSA). There are 227 documents of the BioNLP GE ST training data annotated to form TSA corpus in which each annotated item shows a token semantic association that appears as a triple. The annotation of token semantic association has the potential to significantly advance biomedical text mining by providing rich token semantic information for NLP systems especially for the sophisticated IE systems, such as bio-event extraction.
基金the National Key R&D Program of China(2022YFB3402100)the National Science Fund for Distinguished Young Scholars of China(52025056)+4 种基金the National Natural Science Foundation of China(52305129)the China Postdoctoral Science Foundation(2023M732789)the China Postdoctoral Innovative Talents Support Program(BX20230290)the Open Foundation of Hunan Provincial Key Laboratory of Health Maintenance for Mechanical Equipment(2022JXKF JJ01)the Fundamental Research Funds for Central Universities。
文摘The success of deep transfer learning in fault diagnosis is attributed to the collection of high-quality labeled data from the source domain.However,in engineering scenarios,achieving such high-quality label annotation is difficult and expensive.The incorrect label annotation produces two negative effects:1)the complex decision boundary of diagnosis models lowers the generalization performance on the target domain,and2)the distribution of target domain samples becomes misaligned with the false-labeled samples.To overcome these negative effects,this article proposes a solution called the label recovery and trajectory designable network(LRTDN).LRTDN consists of three parts.First,a residual network with dual classifiers is to learn features from cross-domain samples.Second,an annotation check module is constructed to generate a label anomaly indicator that could modify the abnormal labels of false-labeled samples in the source domain.With the training of relabeled samples,the complexity of diagnosis model is reduced via semi-supervised learning.Third,the adaptation trajectories are designed for sample distributions across domains.This ensures that the target domain samples are only adapted with the pure-labeled samples.The LRTDN is verified by two case studies,in which the diagnosis knowledge of bearings is transferred across different working conditions as well as different yet related machines.The results show that LRTDN offers a high diagnosis accuracy even in the presence of incorrect annotation.
基金supported by the National Natural Science Foundation of China (Grant Nos.31972411,31722048,and 31630068)the Central Public-interest Scientific Institution Basal Research Fund (Grant No.Y2022PT23)+1 种基金the Innovation Program of the Chinese Academy of Agricultural Sciences,and the Key Laboratory of Biology and Genetic Improvement of Horticultural Crops,Ministry of Agriculture and Rural Affairs,P.R.Chinasupported by NIFA,the Department of Agriculture,via UC-Berkeley,USA。
文摘Brassica oleracea has been developed into many important crops,including cabbage,kale,cauliflower,broccoli and so on.The genome and gene annotation of cabbage(cultivar JZS),a representative morphotype of B.oleracea,has been widely used as a common reference in biological research.Although its genome assembly has been updated twice,the current gene annotation still lacks information on untranslated regions(UTRs)and alternative splicing(AS).Here,we constructed a high-quality gene annotation(JZSv3)using a full-length transcriptome acquired by nanopore sequencing,yielding a total of 59452 genes and 75684 transcripts.Additionally,we re-analyzed the previously reported transcriptome data related to the development of different tissues and cold response using JZSv3 as a reference,and found that 3843 out of 11908 differentially expressed genes(DEGs)underwent AS during the development of different tissues and 309 out of 903 cold-related genes underwent AS in response to cold stress.Meanwhile,we also identified many AS genes,including BolLHCB5 and BolHSP70,that displayed distinct expression patterns within variant transcripts of the same gene,highlighting the importance of JZSv3 as a pivotal reference for AS analysis.Overall,JZSv3 provides a valuable resource for exploring gene function,especially for obtaining a deeper understanding of AS regulation mechanisms.
基金Supported by Natural Science Foundation of Fujian Province(No.2020J011084)Fujian Province Technology and Economy Integration Service Platform(No.2023XRH001)Fuzhou-Xiamen-Quanzhou National Independent Innovation Demonstration Zone Collaborative Innovation Platform(No.2022FX5)。
文摘●AIM:To investigate a pioneering framework for the segmentation of meibomian glands(MGs),using limited annotations to reduce the workload on ophthalmologists and enhance the efficiency of clinical diagnosis.●METHODS:Totally 203 infrared meibomian images from 138 patients with dry eye disease,accompanied by corresponding annotations,were gathered for the study.A rectified scribble-supervised gland segmentation(RSSGS)model,incorporating temporal ensemble prediction,uncertainty estimation,and a transformation equivariance constraint,was introduced to address constraints imposed by limited supervision information inherent in scribble annotations.The viability and efficacy of the proposed model were assessed based on accuracy,intersection over union(IoU),and dice coefficient.●RESULTS:Using manual labels as the gold standard,RSSGS demonstrated outcomes with an accuracy of 93.54%,a dice coefficient of 78.02%,and an IoU of 64.18%.Notably,these performance metrics exceed the current weakly supervised state-of-the-art methods by 0.76%,2.06%,and 2.69%,respectively.Furthermore,despite achieving a substantial 80%reduction in annotation costs,it only lags behind fully annotated methods by 0.72%,1.51%,and 2.04%.●CONCLUSION:An innovative automatic segmentation model is developed for MGs in infrared eyelid images,using scribble annotation for training.This model maintains an exceptionally high level of segmentation accuracy while substantially reducing training costs.It holds substantial utility for calculating clinical parameters,thereby greatly enhancing the diagnostic efficiency of ophthalmologists in evaluating meibomian gland dysfunction.
基金supported by the National Natural Science Foundation of China(81872995).
文摘Objective:To investigate the effect of Guangdong Shenqu(GSQ)on intestinal flora structure in mice with food stagnation through 16S rDNA sequencing.Methods: Mice were randomly assigned to control,model,GSQ low-dose(GSQL),GSQ medium-dose(GSQM),GSQ high-dose(GSQH),and lacidophilin tablets(LAB)groups,with each group containing 10 mice.A food stagnation and internal heat mouse model was established through intragastric administration of a mixture of beeswax and olive oil(1:15).The control group was administered normal saline,and the model group was administered beeswax and olive oil to maintain a state.The GSQL(2 g/kg),GSQM(4 g/kg),GSQH(8 g/kg),and LAB groups(0.625 g/kg)were administered corresponding drugs for 5 d.After administration,16S rDNA sequencing was performed to assess gut microbiota in mouse fecal samples.Results: The model group exhibited significant intestinal flora changes.Following GSQ administration,the abundance and diversity index of the intestinal flora increased significantly,the number of bacterial species was regulated,andαandβdiversity were improved.GSQ administration increased the abundance of probiotics,including Clostridia,Lachnospirales,and Lactobacillus,whereas the abundance of conditional pathogenic bacteria,such as Allobaculum,Erysipelotrichaceae,and Bacteroides decreased.Functional prediction analysis indicated that the pathogenesis of food stagnation and GSQ intervention were primarily associated with carbohydrate,lipid,and amino acid metabolism,among other metabolic pathways.Conclusion: The digestive mechanism of GSQ may be attributed to its role in restoring diversity and abundance within the intestinal flora,thereby improving the composition and structure of the intestinal flora in mice and subsequently influencing the regulation of metabolic pathways.
文摘As Natural Language Processing(NLP)continues to advance,driven by the emergence of sophisticated large language models such as ChatGPT,there has been a notable growth in research activity.This rapid uptake reflects increasing interest in the field and induces critical inquiries into ChatGPT’s applicability in the NLP domain.This review paper systematically investigates the role of ChatGPT in diverse NLP tasks,including information extraction,Name Entity Recognition(NER),event extraction,relation extraction,Part of Speech(PoS)tagging,text classification,sentiment analysis,emotion recognition and text annotation.The novelty of this work lies in its comprehensive analysis of the existing literature,addressing a critical gap in understanding ChatGPT’s adaptability,limitations,and optimal application.In this paper,we employed a systematic stepwise approach following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses(PRISMA)framework to direct our search process and seek relevant studies.Our review reveals ChatGPT’s significant potential in enhancing various NLP tasks.Its adaptability in information extraction tasks,sentiment analysis,and text classification showcases its ability to comprehend diverse contexts and extract meaningful details.Additionally,ChatGPT’s flexibility in annotation tasks reducesmanual efforts and accelerates the annotation process,making it a valuable asset in NLP development and research.Furthermore,GPT-4 and prompt engineering emerge as a complementary mechanism,empowering users to guide the model and enhance overall accuracy.Despite its promising potential,challenges persist.The performance of ChatGP Tneeds tobe testedusingmore extensivedatasets anddiversedata structures.Subsequently,its limitations in handling domain-specific language and the need for fine-tuning in specific applications highlight the importance of further investigations to address these issues.
基金The National Natural Science Foundation of China(No.61133012)the Humanity and Social Science Foundation of the Ministry of Education(No.12YJCZH274)+1 种基金the Humanity and Social Science Foundation of Jiangxi Province(No.XW1502,TQ1503)the Science and Technology Project of Jiangxi Science and Technology Department(No.20121BBG70050,20142BBG70011)
文摘Dealing with issues such as too simple image features and word noise inference in product image sentence anmotation, a product image sentence annotation model focusing on image feature learning and key words summarization is described. Three kernel descriptors such as gradient, shape, and color are extracted, respectively. Feature late-fusion is executed in turn by the multiple kernel learning model to obtain more discriminant image features. Absolute rank and relative rank of the tag-rank model are used to boost the key words' weights. A new word integration algorithm named word sequence blocks building (WSBB) is designed to create N-gram word sequences. Sentences are generated according to the N-gram word sequences and predefined templates. Experimental results show that both the BLEU-1 scores and BLEU-2 scores of the sentences are superior to those of the state-of-art baselines.
文摘Du Yu’s Annotation of Classics and Elucidations in "Spring and Autumn Annals" is considered as a milestone for the studies of Spring and Autumn Annals, In Du’s book, the Preface to the Spring and Autumn Annals is a theoretical summary and revision for the previous studies on the Spring and Autumn Annals and the Tso Chuan. Its combination of classics and elucidations makes Tso Chuan a real elucidating study on Spring and Autumn Annals. Du expounds his philosophy and political ideas in this book.
基金The National Natural Science Foundation of China(No.60773110)the Youth Education Fund of Hunan Province(No.07B014)
文摘In order to implement the real-time detection of abnormality of elder and devices in an empty nest home,multi-modal joint sensors are used to collect discrete action sequences of behavior,and the improved hierarchical hidden Markov model is adopted to Abstract these discrete action sequences captured by multi-modal joint sensors into an occupant’s high-level behavior—event,then structure representation models of occupant normality are modeled from large amounts of spatio-temporal data. These models are used as classifiers of normality to detect an occupant’s abnormal behavior.In order to express context information needed by reasoning and detection,multi-media ontology (MMO) is designed to annotate and reason about the media information in the smart monitoring system.A pessimistic emotion model (PEM) is improved to analyze multi-interleaving events of multi-active devices in the home.Experiments demonstrate that the PEM can enhance the accuracy and reliability for detecting active devices when these devices are in blind regions or are occlusive. The above approach has good performance in detecting abnormalities involving occupants and devices in a real-time way.
文摘As a medium between two different languages, between peoples of two countries, translation is not only a matter of language, but also of cross-cultural transference. According to Professor He Ziran, socio-pragmatic translation is the kind of translation which examines the conditions on language use that stem from the social and cultural situations to serve cross-cultural communication. This paper focuses on the ways used to achieve socio-pragmatic equivalence in translation practice.
文摘It is believed that in ancient China,there were only works on graphology,phonology and critical interpretation,but no works on grammar.In works of grammar history published in modern times,they just dwell briefly on ancient grammar,and they consider that there were only some explanations of form words,a few books of form words,but no syntactic analyses.However,in fact,in ancient classical works there were abundant materials of grammatical analyses.Ancient scholars worked out quite a few grammar rules,some of which were profound and coincidental with modern grammar theories.In the present research of ancient Chinese grammar,grammatical analyses made by ancient scholars should be summed up and inherited.