期刊文献+
共找到334,566篇文章
< 1 2 250 >
每页显示 20 50 100
Deciphering gastric inflammation-induced tumorigenesis through multi-omics data and AI methods
1
作者 Qian Zhang Mingran Yang +3 位作者 Peng Zhang Bowen Wu Xiaosen Wei Shao Li 《Cancer Biology & Medicine》 SCIE CAS CSCD 2024年第4期312-330,共19页
Gastric cancer(GC), the fifth most common cancer globally, remains the leading cause of cancer deaths worldwide. Inflammation-induced tumorigenesis is the predominant process in GC development;therefore, systematic re... Gastric cancer(GC), the fifth most common cancer globally, remains the leading cause of cancer deaths worldwide. Inflammation-induced tumorigenesis is the predominant process in GC development;therefore, systematic research in this area should improve understanding of the biological mechanisms that initiate GC development and promote cancer hallmarks. Here, we summarize biological knowledge regarding gastric inflammation-induced tumorigenesis, and characterize the multi-omics data and systems biology methods for investigating GC development. Of note, we highlight pioneering studies in multi-omics data and state-of-the-art network-based algorithms used for dissecting the features of gastric inflammation-induced tumorigenesis, and we propose translational applications in early GC warning biomarkers and precise treatment strategies. This review offers integrative insights for GC research, with the goal of paving the way to novel paradigms for GC precision oncology and prevention. 展开更多
关键词 Gastric cancer inflammation-induced tumorigenesis multi-omics artificial intelligence network-based methods
下载PDF
A feature extraction framework for discovering pan-cancer driver genes based on multi-omics data
2
作者 Xiaomeng Xue Feng Li +3 位作者 Junliang Shang Lingyun Dai Daohui Ge Qianqian Ren 《Quantitative Biology》 CAS CSCD 2024年第2期173-181,共9页
The identification of tumor driver genes facilitates accurate cancer diagnosis and treatment,playing a key role in precision oncology,along with gene signaling,regulation,and their interaction with protein complexes.T... The identification of tumor driver genes facilitates accurate cancer diagnosis and treatment,playing a key role in precision oncology,along with gene signaling,regulation,and their interaction with protein complexes.To tackle the challenge of distinguishing driver genes from a large number of genomic data,we construct a feature extraction framework for discovering pan-cancer driver genes based on multi-omics data(mutations,gene expression,copy number variants,and DNA methylation)combined with protein–protein interaction(PPI)networks.Using a network propagation algorithm,we mine functional information among nodes in the PPI network,focusing on genes with weak node information to represent specific cancer information.From these functional features,we extract distribution features of pan-cancer data,pan-cancer TOPSIS features of functional features using the ideal solution method,and SetExpan features of pan-cancer data from the gene functional features,a method to rank pan-cancer data based on the average inverse rank.These features represent the common message of pan-cancer.Finally,we use the lightGBM classification algorithm for gene prediction.Experimental results show that our method outperforms existing methods in terms of the area under the check precision-recall curve(AUPRC)and demonstrates better performance across different PPI networks.This indicates our framework’s effectiveness in predicting potential cancer genes,offering valuable insights for the diagnosis and treatment of tumors. 展开更多
关键词 cancer driver genes feature extraction multi-omics data network propagation pan-cancer
原文传递
Prioritization of risk genes in colorectal cancer by integrative analysis of multi-omics data and gene networks 被引量:3
3
作者 Ming Zhang Xiaoyang Wang +10 位作者 Nan Yang Xu Zhu Zequn Lu Yimin Cai Bin Li Ying Zhu Xiangpan Li Yongchang Wei Shaokai Zhang Jianbo Tian Xiaoping Miao 《Science China(Life Sciences)》 SCIE CAS CSCD 2024年第1期132-148,共17页
Genome-wide association studies(GWASs)have identified over 140 colorectal cancer(CRC)-associated loci;however,target genes at the majority of loci and underlying molecular mechanisms are poorly understood.Here,we util... Genome-wide association studies(GWASs)have identified over 140 colorectal cancer(CRC)-associated loci;however,target genes at the majority of loci and underlying molecular mechanisms are poorly understood.Here,we utilized a Bayesian approach,integrative risk gene selector(iRIGS),to prioritize risk genes at CRC GWAS loci by integrating multi-omics data.As a result,a total of 105 high-confidence risk genes(HRGs)were identified,which exhibited strong gene dependencies for CRC and enrichment in the biological processes implicated in CRC.Among the 105 HRGs,CEBPB,located at the 20q13.13 locus,acted as a transcription factor playing critical roles in cancer.Our subsequent assays indicated the tumor promoter function of CEBPB that facilitated CRC cell proliferation by regulating multiple oncogenic pathways such as MAPK,PI3K-Akt,and Ras signaling.Next,by integrating a fine-mapping analysis and three independent case-control studies in Chinese populations consisting of 8,039 cases and 12,775 controls,we elucidated that rs1810503,a putative functional variant regulating CEBPB,was associated with CRC risk(OR=0.90,95%CI=0.86–0.93,P=1.07×10^(−7)).The association between rs1810503 and CRC risk was further validated in three additional multi-ancestry populations consisting of 24,254 cases and 58,741 controls.Mechanistically,the rs1810503 A to T allele change weakened the enhancer activity in an allele-specific manner to decrease CEBPB expression via longrange promoter-enhancer interactions,mediated by the transcription factor,REST,and thus decreased CRC risk.In summary,our study provides a genetic resource and a generalizable strategy for CRC etiology investigation,and highlights the biological implications of CEBPB in CRC tumorigenesis,shedding new light on the etiology of CRC. 展开更多
关键词 susceptibility genes gene screening models multi-omics GWAS CEBPB long-range promoter-enhancer interactions
原文传递
Constructing the metabolic network of wheat kernels based on structure-guided chemical modification and multi-omics data 被引量:1
4
作者 Zhitao Tian Jingqi Jia +1 位作者 Bo Yin Wei Chen 《Journal of Genetics and Genomics》 SCIE CAS CSCD 2024年第7期714-722,共9页
Metabolic network construction plays a pivotal role in unraveling the regulatory mechanism of biological activities,although it often proves to be challenging and labor-intensive,particularly with non-model organisms.... Metabolic network construction plays a pivotal role in unraveling the regulatory mechanism of biological activities,although it often proves to be challenging and labor-intensive,particularly with non-model organisms.In this study,we develop a computational approach that employs reaction models based on the structure-guided chemical modification and related compounds to construct a metabolic network in wheat.This construction results in a comprehensive structure-guided network,including 625 identified metabolites and additional 333 putative reactions compared with the Kyoto Encyclopedia of Genes and Genomes database.Using a combination of gene annotation,reaction classification,structure similarity,and correlations from transcriptome and metabolome analysis,a total of 229 potential genes related to these reactions are identified within this network.To validate the network,the functionality of a hydroxycinnamoyltransferase(TraesCS3D01G314900)for the synthesis of polyphenols and a rhamnosyltransferase(TraesCS2D01G078700)for the modification of flavonoids are verified through in vitro enzymatic studies and wheat mutant tests,respectively.Our research thus supports the utility of structure-guided chemical modification as an effective tool in identifying causal candidate genes for constructing metabolic networks and further in metabolomic genetic studies. 展开更多
关键词 Metabolic network Chemical modification Genetic study Wheat kernel multi-omics
原文传递
Integrating multi-omics data of childhood asthma using a deep association model 被引量:1
5
作者 Kai Wei Fang Qian +2 位作者 Yixue Lia Tao Zeng Tao Huang 《Fundamental Research》 CAS CSCD 2024年第4期738-751,共14页
Childhood asthma is one of the most common respiratory diseases with rising mortality and morbidity.The multi-omics data is providing a new chance to explore collaborative biomarkers and corresponding diagnostic model... Childhood asthma is one of the most common respiratory diseases with rising mortality and morbidity.The multi-omics data is providing a new chance to explore collaborative biomarkers and corresponding diagnostic models of childhood asthma.To capture the nonlinear association of multi-omics data and improve interpretability of diagnostic model,we proposed a novel deep association model(DAM)and corresponding efficient analysis framework.First,the Deep Subspace Reconstruction was used to fuse the omics data and diagnostic information,thereby correcting the distribution of the original omics data and reducing the influence of unnecessary data noises.Second,the Joint Deep Semi-Negative Matrix Factorization was applied to identify different latent sample patterns and extract biomarkers from different omics data levels.Third,our newly proposed Deep Orthogonal Canonical Correlation Analysis can rank features in the collaborative module,which are able to construct the diagnostic model considering nonlinear correlation between different omics data levels.Using DAM,we deeply analyzed the transcriptome and methylation data of childhood asthma.The effectiveness of DAM is verified from the perspectives of algorithm performance and biological significance on the independent test dataset,by ablation experiment and comparison with many baseline methods from clinical and biological studies.The DAM-induced diagnostic model can achieve a prediction AUC of o.912,which is higher than that of many other alternative methods.Meanwhile,relevant pathways and biomarkers of childhood asthma are also recognized to be collectively altered on the gene expression and methylation levels.As an interpretable machine learning approach,DAM simultaneously considers the non-linear associations among samples and those among biological features,which should help explore interpretative biomarker candidates and efficient diagnostic models from multi-omics data analysis for human complexdiseases. 展开更多
关键词 Deepsub space reconstruction Deepnon-negative matrix factorization Deepcanonical correlationanalysis multi-omics Interpretable machine learning Childhood asthma
原文传递
SMARTdb:An Integrated Database for Exploring Single-cell Multi-omics Data of Reproductive Medicine
6
作者 Zekai Liu Zhen Yuan +7 位作者 Yunlei Guo Ruilin Wang Yusheng Guan Zhanglian Wang Yunan Chen Tianlu Wang Meining Jiang Shuhui Bian 《Genomics, Proteomics & Bioinformatics》 SCIE CAS CSCD 2024年第3期113-123,共11页
Single-cell multi-omics sequencing has greatly accelerated reproductive research in recent years,and the data are continually growing.However,utilizing these data resources is challenging for wet-lab researchers.A com... Single-cell multi-omics sequencing has greatly accelerated reproductive research in recent years,and the data are continually growing.However,utilizing these data resources is challenging for wet-lab researchers.A comprehensive platform for exploring single-cell multi-omics data related to reproduction is urgently needed.Here,we introduce the single-cell multi-omics atlas of reproduction(SMARTdb),an integrative and user-friendly platform for exploring molecular dynamics of reproductive development,aging,and disease,which covers multi-omics,multi-species,and multi-stage data.We curated and analyzed single-cell transcriptomic and epigenomic data of over 2.0 million cells from 6 species across the entire lifespan.A series of powerful functionalities are provided,such as“Query gene expression”,“DIY expression plot”,“DNA methylation plot”,and“Epigenome browser”.With SMARTdb,we found that the male germ cell-specific expression pattern of RPL39L and RPL10L is conserved between human and other model animals.Moreover,DNA hypomethylation and open chromatin may collectively regulate the specific expression pattern of RPL39L in both male and female germ cells.In summary,SMARTdb is a powerful platform for convenient data mining and gaining novel insights into reproductive development,aging,and disease.SMARTdb is publicly available at https://smart-db.cn. 展开更多
关键词 database Single-cell multi-omics EMBRYO Germ cell GONAD
原文传递
DNNGP, a deep neural network-based method for genomic prediction using multi-omics data in plants 被引量:17
7
作者 Kelin Wang Muhammad Ali Abid +3 位作者 Awais Rasheed Jose Crossa Sarah Hearne Huihui Li 《Molecular Plant》 SCIE CAS CSCD 2023年第1期279-293,共15页
Genomic prediction is an effective way to accelerate the rate of agronomic trait improvement in plants.Traditional methods typically use linear regression models with clear assumptions;such methods are unable to captu... Genomic prediction is an effective way to accelerate the rate of agronomic trait improvement in plants.Traditional methods typically use linear regression models with clear assumptions;such methods are unable to capture the complex relationships between genotypes and phenotypes.Non-linear models(e.g.,deep neural networks)have been proposed as a superior alternative to linear models because they can capture complex non-additive effects.Here we introduce a deep learning(DL)method,deep neural network genomic prediction(DNNGP),for integration of multi-omics data in plants.We trained DNNGP on four datasets and compared its performance with methods built with five classic models:genomic best linear unbiased prediction(GBLUP);two methods based on a machine learning(ML)framework,light gradient boosting machine(LightGBM)and support vector regression(SVR);and two methods based on a DL framework,deep learning genomic selection(DeepGS)and deep learning genome-wide association study(DLGWAS).DNNGP is novel in five ways.First,it can be applied to a variety of omics data to predict phenotypes.Second,the multilayered hierarchical structure of DNNGP dynamically learns features from raw data,avoiding overfitting and improving the convergence rate using a batch normalization layer and early stopping and rectified linear activation(rectified linear unit)functions.Third,when small datasets were used,DNNGP produced results that are competitive with results from the other five methods,showing greater prediction accuracy than the other methods when large-scale breeding data were used.Fourth,the computation time required by DNNGP was comparable with that of commonly used methods,up to 10 times faster than DeepGS.Fifth,hyperparameters can easily be batch tuned on a local machine.Compared with GBLUP,LightGBM,SVR,DeepGS and DLGWAS,DNNGP is superior to these existing widely used genomic selection(GS)methods.Moreover,DNNGP can generate robust assessments from diverse datasets,including omics data,and quickly incorporate complex and large datasets into usable models,making it a promising and practical approach for straightforward integration into existing GS platforms. 展开更多
关键词 deep learning genomic selection multi-omics data prediction method
原文传递
DeepRCI:predicting RNA-chromatin interactions via deep learning with multi-omics data
8
作者 Yuanpeng Xiong Xuan He +2 位作者 Dan Zhao Tao Jiang Jianyang Zeng 《Quantitative Biology》 CSCD 2023年第3期275-286,共12页
Background:Chromatin-associated RNA(caRNA)acts as a ubiquitous epigenetic layer in eukaryotes,and has been reported to be essential in various biological processes,including gene transcription,chromatin remodeling and... Background:Chromatin-associated RNA(caRNA)acts as a ubiquitous epigenetic layer in eukaryotes,and has been reported to be essential in various biological processes,including gene transcription,chromatin remodeling and cellular differentiation.Recently,numerous experimental techniques have been developed to characterize genome-wide RNA-chromatin interactions to understand their underlying biological functions.However,these experimental methods are generally expensive,time-consuming,and limited in identifying all potential sites,while most of the existing computational methods are restricted to detecting only specific types of RNAs interacting with chromatin.Methods:Here,we propose a highly interpretable computational framework,named DeepRCI,to identify the interactions between various types of RNAs and chromatin.In this framework,we introduce a novel deep learning component called variformer and integrate multi-omics data to capture intrinsic genomic features at both RNA and DNA levels.Results:Extensive experiments demonstrate that DeepRCI can detect RNA-chromatin interactions more accurately when compared to the state-of-the-art baseline prediction methods.Furthermore,the sequence features extracted by DeepRCI can be well matched to known critical gene regulatory components,indicating that our model can provide useful biological insights into understanding the underlying mechanisms of RNA-chromatin interactions.In addition,based on the prediction results,we further delineate the relationships between RNA-chromatin interactions and cellular functions,including gene expression and the modulation of cell states.Conclusions:In summary,DeepRCI can serve as a useful tool for characterizing RNA-chromatin interactions and studying the underlying gene regulatory code. 展开更多
关键词 deep learning multi-omics data RNA-chromatin
原文传递
Integrative multi-omics and systems bioinformatics in translational neuroscience:A data mining perspective 被引量:4
9
作者 Lance M.O'Connor Blake A.O'Connor +2 位作者 Su Bin Lim Jialiu Zeng Chih Hung Lo 《Journal of Pharmaceutical Analysis》 SCIE CAS CSCD 2023年第8期836-850,共15页
Bioinformatic analysis of large and complex omics datasets has become increasingly useful in modern day biology by providing a great depth of information,with its application to neuroscience termed neuroinformatics.Da... Bioinformatic analysis of large and complex omics datasets has become increasingly useful in modern day biology by providing a great depth of information,with its application to neuroscience termed neuroinformatics.Data mining of omics datasets has enabled the generation of new hypotheses based on differentially regulated biological molecules associated with disease mechanisms,which can be tested experimentally for improved diagnostic and therapeutic targeting of neurodegenerative diseases.Importantly,integrating multi-omics data using a systems bioinformatics approach will advance the understanding of the layered and interactive network of biological regulation that exchanges systemic knowledge to facilitate the development of a comprehensive human brain profile.In this review,we first summarize data mining studies utilizing datasets from the individual type of omics analysis,including epigenetics/epigenomics,transcriptomics,proteomics,metabolomics,lipidomics,and spatial omics,pertaining to Alzheimer's disease,Parkinson's disease,and multiple sclerosis.We then discuss multi-omics integration approaches,including independent biological integration and unsupervised integration methods,for more intuitive and informative interpretation of the biological data obtained across different omics layers.We further assess studies that integrate multi-omics in data mining which provide convoluted biological insights and offer proof-of-concept proposition towards systems bioinformatics in the reconstruction of brain networks.Finally,we recommend a combination of high dimensional bioinformatics analysis with experimental validation to achieve translational neuroscience applications including biomarker discovery,therapeutic development,and elucidation of disease mechanisms.We conclude by providing future perspectives and opportunities in applying integrative multi-omics and systems bioinformatics to achieve precision phenotyping of neurodegenerative diseases and towards personalized medicine. 展开更多
关键词 multi-omics integration Systems bioinformatics data mining Human brain profile reconstruction Translational neuroscience
下载PDF
BnlR:A multi-omics database with various tools for Brassica napus research and breeding 被引量:6
10
作者 Zhiquan Yang Shengbo Wang +11 位作者 Lulu Wei Yiming Huang Dongxu Liu Yupeng Jia Chengfang Luo Yuchen Lin Congyuan Liang Yue Hu Cheng Dai Liang Guo Yongming Zhou Qing-Yong Yang 《Molecular Plant》 SCIE CSCD 2023年第4期775-789,共15页
In the post-genome-wide association study era,multi-omics techniques have shown great power and poten-tial for candidate gene mining and functional genomics research.However,due to the lack of effective data integrati... In the post-genome-wide association study era,multi-omics techniques have shown great power and poten-tial for candidate gene mining and functional genomics research.However,due to the lack of effective data integration and multi-omics analysis platforms,such techniques have not still been applied widely in rape-seed,an important oil crop worldwide.Here,we report a rapeseed multi-omics database(BnlR;http:/l yanglab.hzau.edu.cn/BnlR),which provides datasets of six omics including genomics,transcriptomics,variomics,epigenetics,phenomics,and metabolomics,as well as numerous"variation-gene expression-phenotype"associations by using multiple statistical methods.In addition,a series of multi-omics search and analysis tools are integrated to facilitate the browsing and application of these datasets.BnlR is the most comprehensive multi-omics database for rapeseed so far,and two case studies demonstrated its power to mine candidate genes associated with specific traits and analyze their potential regulatory mechanisms. 展开更多
关键词 Brassica napus multi-omics dataBASE candidate gene mining functional genomics
原文传递
Elucidating regulatory processes of intense physical activity by multi-omics analysis
11
作者 Ernesto S.Nakayasu Marina A.Gritsenko +17 位作者 Young-Mo Kim Jennifer E.Kyle Kelly G.Stratton Carrie D.Nicora Nathalie Munoz Kathleen M.Navarro Daniel Claborne Yuqian Gao Karl K.Weitz Vanessa L.Paurus Kent J.Bloodsworth Kelsey A.Allen Lisa M.Bramer Fernando Montes Kathleen A.Clark Grant Tietje Justin Teeguarden Kristin E.Burnum-Johnson 《Military Medical Research》 SCIE CAS CSCD 2024年第4期479-499,共21页
Background:Physiological and biochemical processes across tissues of the body are regulated in response to the high demands of intense physical activity in several occupations,such as firefighting,law enforcement,mili... Background:Physiological and biochemical processes across tissues of the body are regulated in response to the high demands of intense physical activity in several occupations,such as firefighting,law enforcement,military,and sports.A better understanding of such processes can ultimately help improve human performance and prevent illnesses in the work environment.Methods:To study regulatory processes in intense physical activity simulating real-life conditions,we performed a multi-omics analysis of 3 biofluids(blood plasma,urine,and saliva)collected from 11 wildland firefighters before and after a 45 min,intense exercise regimen.Omics profiles post-vs.pre-exercise were compared by Student’s t-test followed by pathway analysis and comparison between the different omics modalities.Results:Our multi-omics analysis identified and quantified 3835 proteins,730 lipids and 182 metabolites combining the 3 different types of samples.The blood plasma analysis revealed signatures of tissue damage and acute repair response accompanied by enhanced carbon metabolism to meet energy demands.The urine analysis showed a strong,concomitant regulation of 6 out of 8 identified proteins from the renin-angiotensin system supporting increased excretion of catabolites,reabsorption of nutrients and maintenance of fluid balance.In saliva,we observed a decrease in 3 pro-inflammatory cytokines and an increase in 8 antimicrobial peptides.A systematic literature review identified 6 papers that support an altered susceptibility to respiratory infection.Conclusions:This study shows simultaneous regulatory signatures in biofluids indicative of homeostatic maintenance during intense physical activity with possible effects on increased infection susceptibility,suggesting that caution against respiratory diseases could benefit workers on highly physical demanding jobs. 展开更多
关键词 multi-omics analysis Intense exercise Human performance BIOFLUIDS Metabolism Immunity
下载PDF
Gapless Genome Assembly of ZH8015 and Preliminary Multi-Omics Analysis to Investigate ZH8015's Responses Against Brown Planthopper Infestation
12
作者 LI Dian DUAN Wenjing +5 位作者 LIU Qun’en WU Weixun ZHAN Xiaodeng SUN Lianping ZHANG Yingxin CHENG Shihua 《Rice science》 SCIE CSCD 2024年第3期317-327,I0042-I0045,共15页
Accurate genomic information is essential for advancing genetic breeding research in specific rice varieties.This study presented a gapless genome assembly of the indica rice cultivar Zhonghui 8015(ZH8015)using Pac Bi... Accurate genomic information is essential for advancing genetic breeding research in specific rice varieties.This study presented a gapless genome assembly of the indica rice cultivar Zhonghui 8015(ZH8015)using Pac Bio HiFi,Hi-C,and ONT(Oxford Nanopore Technologies)ultra-long sequencing technologies,annotating 43037 gene structures.Subsequently,utilizing this genome along with transcriptomic and metabolomic techniques,we explored ZH8015's response to brown planthopper(BPH)infestation.Continuous transcriptomic sampling indicated significant changes in gene expression levels around 48 h after BPH feeding.Enrichment analysis revealed particularly significant alterations in genes related to reactive oxygen species scavenging and cell wall formation.Metabolomic results demonstrated marked increases in levels of several monosaccharides,which are components of the cell wall and dramatic changes in flavonoid contents.Omics association analysis identified differentially expressed genes associated with key metabolites,shedding light on ZH8015's response to BPH infestation.In summary,this study constructed a reliable genome sequence resource for ZH8015,and the preliminary multi-omics results will guide future insect-resistant breeding research. 展开更多
关键词 brown planthopper gapless genome genome assembly multi-omics Nilaparvata lugens rice
下载PDF
LIBRA:an adaptative integrative tool for paired single-cell multi-omics data
13
作者 Xabier Martinez-de-Morentin Sumeer AKhan +6 位作者 Robert Lehmann Sisi Qu Alberto Maillo Narsis AKiani Felipe Prosper Jesper Tegner David Gomez-Cabrero 《Quantitative Biology》 CSCD 2023年第3期246-259,共14页
Background:Single-cell multi-omics technologies allow a profound system-level biology understanding of cells and tissues.However,an integrative and possibly systems-based analysis capturing the different modalities is... Background:Single-cell multi-omics technologies allow a profound system-level biology understanding of cells and tissues.However,an integrative and possibly systems-based analysis capturing the different modalities is challenging.In response,bioinformatics and machine learning methodologies are being developed for multi-omics single-cell analysis.It is unclear whether current tools can address the dual aspect of modality integration and prediction across modalities without requiring extensive parameter fine-tuning.Methods:We designed LIBRA,a neural network based framework,to learn translation between paired multi-omics profiles so that a shared latent space is constructed.Additionally,we implemented a variation,aLIBRA,that allows automatic fine-tuning by identifying parameter combinations that optimize both the integrative and predictive tasks.All model parameters and evaluation metrics are made available to users with minimal user iteration.Furthermore,aLIBRA allows experienced users to implement custom configurations.The LIBRA toolbox is freely available as R and Python libraries at GitHub(TranslationalBioinformaticsUnit/LIBRA).Results:LIBRA was evaluated in eight multi-omic single-cell data-sets,including three combinations of omics.We observed that LIBRA is a state-of-the-art tool when evaluating the ability to increase cell-type(clustering)resolution in the integrated latent space.Furthermore,when assessing the predictive power across data modalities,such as predictive chromatin accessibility from gene expression,LIBRA outperforms existing tools.As expected,adaptive parameter optimization(aLIBRA)significantly boosted the performance of learning predictive models from paired data-sets.Conclusion:LIBRA is a versatile tool that performs competitively in both“integration”and“prediction”tasks based on single-cell multi-omics data.LIBRA is a data-driven robust platform that includes an adaptive learning scheme. 展开更多
关键词 SINGLE-CELL multi-omic Autoencoder auto-finetuning
原文传递
Multi-omics-data-assisted genomic feature markers preselection improves the accuracy of genomic prediction 被引量:2
14
作者 Shaopan Ye Jiaqi Li Zhe Zhang 《Journal of Animal Science and Biotechnology》 SCIE CAS CSCD 2021年第2期508-519,共12页
Background:Presently,multi-omics data(e.g.,genomics,transcriptomics,proteomics,and metabolomics)are available to improve genomic predictors.Omics data not only offers new data layers for genomic prediction but also pr... Background:Presently,multi-omics data(e.g.,genomics,transcriptomics,proteomics,and metabolomics)are available to improve genomic predictors.Omics data not only offers new data layers for genomic prediction but also provides a bridge between organismal phenotypes and genome variation that cannot be readily captured at the genome sequence level.Therefore,using multi-omics data to select feature markers is a feasible strategy to improve the accuracy of genomic prediction.In this study,simultaneously using whole-genome sequencing(WGS)and gene expression level data,four strategies for single-nucleotide polymorphism(SNP)preselection were investigated for genomic predictions in the Drosophila Genetic Reference Panel.Results:Using genomic best linear unbiased prediction(GBLUP)with complete WGS data,the prediction accuracies were 0.208±0.020(0.181±0.022)for the startle response and 0.272±0.017(0.307±0.015)for starvation resistance in the female(male)lines.Compared with GBLUP using complete WGS data,both GBLUP and the genomic feature BLUP(GFBLUP)did not improve the prediction accuracy using SNPs preselected from complete WGS data based on the results of genome-wide association studies(GWASs)or transcriptome-wide association studies(TWASs).Furthermore,by using SNPs preselected from the WGS data based on the results of the expression quantitative trait locus(eQTL)mapping of all genes,only the startle response had greater accuracy than GBLUP with the complete WGS data.The best accuracy values in the female and male lines were 0.243±0.020 and 0.220±0.022,respectively.Importantly,by using SNPs preselected based on the results of the eQTL mapping of significant genes from TWAS,both GBLUP and GFBLUP resulted in great accuracy and small bias of genomic prediction.Compared with the GBLUP using complete WGS data,the best accuracy values represented increases of 60.66%and 39.09%for the starvation resistance and 27.40%and 35.36%for startle response in the female and male lines,respectively.Conclusions:Overall,multi-omics data can assist genomic feature preselection and improve the performance of genomic prediction.The new knowledge gained from this study will enrich the use of multi-omics in genomic prediction. 展开更多
关键词 ACCURACY Drosophila melanogaster Genomic prediction multi-omics data SNP preselection
下载PDF
Multi-omics-driven development of alternative crops for natural rubber production 被引量:2
15
作者 YANG Ning YANG Dan-dan +1 位作者 YU Xu-chen XU Cao 《Journal of Integrative Agriculture》 SCIE CAS CSCD 2023年第4期959-971,共13页
Natural rubber(NR)is an irreplaceable biopolymer of economic and strategic importance owing to its unique physical and chemical properties.The Parárubber tree(Hevea brasiliensis(Willd.ex A.Juss.)Müll.Arg.)is... Natural rubber(NR)is an irreplaceable biopolymer of economic and strategic importance owing to its unique physical and chemical properties.The Parárubber tree(Hevea brasiliensis(Willd.ex A.Juss.)Müll.Arg.)is currently the exclusive commercial source of NR,and it is primarily grown in plantations restricted to the tropical and subtropical areas of Southeast Asia.However,current Parárubber production barely meets the sharply increasing global industrial demand for rubber.Petroleum-based synthetic rubber(SR)has been used to supplement the shortage of NR but its industrial performance is not comparable to that of NR.Thus,there is an urgent need to develop new productive rubber crops with broader environmental adaptability.This review summarizes the current research progress on alternative rubberproducing plants,including horticultural plants(Taraxacum kok-saghyz Rodin and Lactuca L.species),woody plants(Parthenium argentatum A.Gray and Eucommia ulmoides Oliv.),and other plant species with potential for NR production.With an emphasis on the molecular basis of NR biosynthesis revealed by a multi-omics approach,we highlight new integrative strategies and biotechnologies for exploring the mechanism of NR biosynthesis with a broader scope,which may accelerate the breeding and improvement of new rubber crops. 展开更多
关键词 natural rubber multi-omics GENOMICS TRANSCRIPTOMICS PROTEOMICS new crops
下载PDF
基于re3data的中英科学数据仓储平台对比研究 被引量:1
16
作者 袁烨 陈媛媛 《数字图书馆论坛》 CSSCI 2024年第2期13-23,共11页
以re3data为数据获取源,选取中英两国406个科学数据仓储为研究对象,从分布特征、责任类型、仓储许可、技术标准及质量标准等5个方面、11个指标对两国科学数据仓储的建设情况进行对比分析,试图为我国数据仓储的可持续发展提出建议:广泛... 以re3data为数据获取源,选取中英两国406个科学数据仓储为研究对象,从分布特征、责任类型、仓储许可、技术标准及质量标准等5个方面、11个指标对两国科学数据仓储的建设情况进行对比分析,试图为我国数据仓储的可持续发展提出建议:广泛联结国内外异质机构,推进多学科领域的交流与合作,有效扩充仓储许可权限与类型,优化技术标准的应用现况,提高元数据使用的灵活性。 展开更多
关键词 科学数据 数据仓储平台 re3data 中国 英国
下载PDF
Dissecting the brain with spatially resolved multi-omics 被引量:1
17
作者 Yijia Fangma Mengting Liu +2 位作者 Jie Liao Zhong Chen Yanrong Zheng 《Journal of Pharmaceutical Analysis》 SCIE CAS CSCD 2023年第7期694-710,共17页
Recent studies have highlighted spatially resolved multi-omics technologies,including spatial genomics,transcriptomics,proteomics,and metabolomics,as powerful tools to decipher the spatial heterogeneity of the brain.H... Recent studies have highlighted spatially resolved multi-omics technologies,including spatial genomics,transcriptomics,proteomics,and metabolomics,as powerful tools to decipher the spatial heterogeneity of the brain.Here,we focus on two major approaches in spatial transcriptomics(next-generation sequencing-based technologies and image-based technologies),and mass spectrometry imaging technologies used in spatial proteomics and spatial metabolomics.Furthermore,we discuss their applications in neuroscience,including building the brain atlas,uncovering gene expression patterns of neurons for special behaviors,deciphering the molecular basis of neuronal communication,and providing a more comprehensive explanation of the molecular mechanisms underlying central nervous system disorders.However,further efforts are still needed toward the integrative application of multi-omics technologies,including the real-time spatial multi-omics analysis in living cells,the detailed gene profile in a whole-brain view,and the combination of functional verification. 展开更多
关键词 Central nervous system disorders multi-omics Spatial transcriptomics Spatial proteomics Spatial metabolomics
下载PDF
Data Secure Storage Mechanism for IIoT Based on Blockchain 被引量:2
18
作者 Jin Wang Guoshu Huang +2 位作者 R.Simon Sherratt Ding Huang Jia Ni 《Computers, Materials & Continua》 SCIE EI 2024年第3期4029-4048,共20页
With the development of Industry 4.0 and big data technology,the Industrial Internet of Things(IIoT)is hampered by inherent issues such as privacy,security,and fault tolerance,which pose certain challenges to the rapi... With the development of Industry 4.0 and big data technology,the Industrial Internet of Things(IIoT)is hampered by inherent issues such as privacy,security,and fault tolerance,which pose certain challenges to the rapid development of IIoT.Blockchain technology has immutability,decentralization,and autonomy,which can greatly improve the inherent defects of the IIoT.In the traditional blockchain,data is stored in a Merkle tree.As data continues to grow,the scale of proofs used to validate it grows,threatening the efficiency,security,and reliability of blockchain-based IIoT.Accordingly,this paper first analyzes the inefficiency of the traditional blockchain structure in verifying the integrity and correctness of data.To solve this problem,a new Vector Commitment(VC)structure,Partition Vector Commitment(PVC),is proposed by improving the traditional VC structure.Secondly,this paper uses PVC instead of the Merkle tree to store big data generated by IIoT.PVC can improve the efficiency of traditional VC in the process of commitment and opening.Finally,this paper uses PVC to build a blockchain-based IIoT data security storage mechanism and carries out a comparative analysis of experiments.This mechanism can greatly reduce communication loss and maximize the rational use of storage space,which is of great significance for maintaining the security and stability of blockchain-based IIoT. 展开更多
关键词 Blockchain IIoT data storage cryptographic commitment
下载PDF
Hadoop-based secure storage solution for big data in cloud computing environment 被引量:1
19
作者 Shaopeng Guan Conghui Zhang +1 位作者 Yilin Wang Wenqing Liu 《Digital Communications and Networks》 SCIE CSCD 2024年第1期227-236,共10页
In order to address the problems of the single encryption algorithm,such as low encryption efficiency and unreliable metadata for static data storage of big data platforms in the cloud computing environment,we propose... In order to address the problems of the single encryption algorithm,such as low encryption efficiency and unreliable metadata for static data storage of big data platforms in the cloud computing environment,we propose a Hadoop based big data secure storage scheme.Firstly,in order to disperse the NameNode service from a single server to multiple servers,we combine HDFS federation and HDFS high-availability mechanisms,and use the Zookeeper distributed coordination mechanism to coordinate each node to achieve dual-channel storage.Then,we improve the ECC encryption algorithm for the encryption of ordinary data,and adopt a homomorphic encryption algorithm to encrypt data that needs to be calculated.To accelerate the encryption,we adopt the dualthread encryption mode.Finally,the HDFS control module is designed to combine the encryption algorithm with the storage model.Experimental results show that the proposed solution solves the problem of a single point of failure of metadata,performs well in terms of metadata reliability,and can realize the fault tolerance of the server.The improved encryption algorithm integrates the dual-channel storage mode,and the encryption storage efficiency improves by 27.6% on average. 展开更多
关键词 Big data security data encryption HADOOP Parallel encrypted storage Zookeeper
下载PDF
Defect Detection Model Using Time Series Data Augmentation and Transformation 被引量:1
20
作者 Gyu-Il Kim Hyun Yoo +1 位作者 Han-Jin Cho Kyungyong Chung 《Computers, Materials & Continua》 SCIE EI 2024年第2期1713-1730,共18页
Time-series data provide important information in many fields,and their processing and analysis have been the focus of much research.However,detecting anomalies is very difficult due to data imbalance,temporal depende... Time-series data provide important information in many fields,and their processing and analysis have been the focus of much research.However,detecting anomalies is very difficult due to data imbalance,temporal dependence,and noise.Therefore,methodologies for data augmentation and conversion of time series data into images for analysis have been studied.This paper proposes a fault detection model that uses time series data augmentation and transformation to address the problems of data imbalance,temporal dependence,and robustness to noise.The method of data augmentation is set as the addition of noise.It involves adding Gaussian noise,with the noise level set to 0.002,to maximize the generalization performance of the model.In addition,we use the Markov Transition Field(MTF)method to effectively visualize the dynamic transitions of the data while converting the time series data into images.It enables the identification of patterns in time series data and assists in capturing the sequential dependencies of the data.For anomaly detection,the PatchCore model is applied to show excellent performance,and the detected anomaly areas are represented as heat maps.It allows for the detection of anomalies,and by applying an anomaly map to the original image,it is possible to capture the areas where anomalies occur.The performance evaluation shows that both F1-score and Accuracy are high when time series data is converted to images.Additionally,when processed as images rather than as time series data,there was a significant reduction in both the size of the data and the training time.The proposed method can provide an important springboard for research in the field of anomaly detection using time series data.Besides,it helps solve problems such as analyzing complex patterns in data lightweight. 展开更多
关键词 Defect detection time series deep learning data augmentation data transformation
下载PDF
上一页 1 2 250 下一页 到第
使用帮助 返回顶部