Although genome-wide association studies have identified more than eighty genetic variants associated with non-small cell lung cancer(NSCLC)risk,biological mechanisms of these variants remain largely unknown.By integr...Although genome-wide association studies have identified more than eighty genetic variants associated with non-small cell lung cancer(NSCLC)risk,biological mechanisms of these variants remain largely unknown.By integrating a large-scale genotype data of 15581 lung adenocarcinoma(AD)cases,8350 squamous cell carcinoma(SqCC)cases,and 27355 controls,as well as multiple transcriptome and epigenomic databases,we conducted histology-specific meta-analyses and functional annotations of both reported and novel susceptibility variants.We identified 3064 credible risk variants for NSCLC,which were overrepresented in enhancer-like and promoter-like histone modification peaks as well as DNase I hypersensitive sites.Transcription factor enrichment analysis revealed that USF1 was AD-specific while CREB1 was SqCC-specific.Functional annotation and genebased analysis implicated 894 target genes,including 274 specifics for AD and 123 for SqCC,which were overrepresented in somatic driver genes(ER=1.95,P=0.005).Pathway enrichment analysis and Gene-Set Enrichment Analysis revealed that AD genes were primarily involved in immune-related pathways,while SqCC genes were homologous recombination deficiency related.Our results illustrate the molecular basis of both wellstudied and new susceptibility loci of NSCLC,providing not only novel insights into the genetic heterogeneity between AD and SqCC but also a set of plausible gene targets for post-GWAS functional experiments.展开更多
基金the Key International(Regional)Cooperative Research Project(No.81820108028)the National Natural Science Foundation of China(Nos.81521004,81922061,81973123,and 81803306)+2 种基金the Science Foundation for Distinguished Young Scholars of Jiangsu(No.BK20160046)the Priority Academic Program for the Development of Jiangsu Higher Education Institutions(Public Health and Preventive Medicine).the National Cancer Institute,National Institutes of Health of USA through grants U01-CA063673,UM1-CA167462,and U01-CA167462.
文摘Although genome-wide association studies have identified more than eighty genetic variants associated with non-small cell lung cancer(NSCLC)risk,biological mechanisms of these variants remain largely unknown.By integrating a large-scale genotype data of 15581 lung adenocarcinoma(AD)cases,8350 squamous cell carcinoma(SqCC)cases,and 27355 controls,as well as multiple transcriptome and epigenomic databases,we conducted histology-specific meta-analyses and functional annotations of both reported and novel susceptibility variants.We identified 3064 credible risk variants for NSCLC,which were overrepresented in enhancer-like and promoter-like histone modification peaks as well as DNase I hypersensitive sites.Transcription factor enrichment analysis revealed that USF1 was AD-specific while CREB1 was SqCC-specific.Functional annotation and genebased analysis implicated 894 target genes,including 274 specifics for AD and 123 for SqCC,which were overrepresented in somatic driver genes(ER=1.95,P=0.005).Pathway enrichment analysis and Gene-Set Enrichment Analysis revealed that AD genes were primarily involved in immune-related pathways,while SqCC genes were homologous recombination deficiency related.Our results illustrate the molecular basis of both wellstudied and new susceptibility loci of NSCLC,providing not only novel insights into the genetic heterogeneity between AD and SqCC but also a set of plausible gene targets for post-GWAS functional experiments.