Cotton is a major crop that provides the most important renewable textile fibers in the world.Studies of the taxonomy and evolution of cotton species have received wide attentions,not only due to cotton’s economic va...Cotton is a major crop that provides the most important renewable textile fibers in the world.Studies of the taxonomy and evolution of cotton species have received wide attentions,not only due to cotton’s economic value but also due to the fact that Gossypium is an ideal model system to study the origin,evolution,and cultivation of polyploid species.Previous studies suggested the involvement of mitochondrial genome editing sites and copy number as well as mitochondrial functions in cotton fiber elongation.Whereas,with only a few mitogenomes assembled in the cotton genus Gossypium,our knowledge about their roles in cotton evolution and speciation is still scarce.To close this gap,here we assembled 20 mitogenomes from 15 cotton species spanning all the cotton clades(A–G,K,and AD genomes)and 5 cotton relatives using short and long sequencing reads.Systematic analyses uncovered a high level of mitochondrial gene sequence conservation,abundant sequence repeats and many insertions of foreign sequences,as well as extensive structural variations in cotton mitogenomes.The sequence repeats and foreign sequences caused significant mitogenome size inflation in Gossypium and its close relative Kokia in general,while there is no significant difference between the lint and fuzz cotton mitogenomes in terms of gene content,RNA editing,and gene expression level.Interestingly,we further revealed the specific presence and expression of two novel mitochondrial open reading frames(ORFs)in lint-fiber cotton species.Finally,these structural features and novel ORFs help us gain valuable insights into the history of cotton evolution and polyploidization and the origin of species producing long lint fibers from a mitogenomic perspective.展开更多
Due to the economic value of natural textile fiber, cotton has attracted much research attention, which has led to the publication of two diploid genomes and two tetraploid genomes. These big data facilitate functiona...Due to the economic value of natural textile fiber, cotton has attracted much research attention, which has led to the publication of two diploid genomes and two tetraploid genomes. These big data facilitate functional genomic study in cotton, and allow researchers to investigate cotton genome structure, gene expression, and protein function on the global scale using high-throughput methods. In this review, we summarized recent studies of cotton genomes. Population genomic analyses revealed the domestication history of cultivated upland cotton and the roles of transposable elements in cotton genome evolution.Alternative splicing of cotton transcriptomes was evaluated genome-widely. Several important gene families like MYC, NAC, Sus and GhPLDal were systematically identified and classified based on genetic structure and biological function. High-throughput proteomics also unraveled the key functional proteins correlated with fiber development. Functional genomic studies have provided unprecedented insights into global-scale methods for cotton research.展开更多
Transposable elements(TEs)usually occupy largest fractions of plant genome and are also the most variable part of the structure.Although traditionally it is hallmarked as"junk and selfish DNA",today more and...Transposable elements(TEs)usually occupy largest fractions of plant genome and are also the most variable part of the structure.Although traditionally it is hallmarked as"junk and selfish DNA",today more and more evidence points out TE’s participation in gene regulations including gene mutation,duplication,movement and novel gene creation via genetic and epigenetic mechanisms.The recently sequenced genomes of diploid cottons Gossypium arboreum(AA)and Gossypium raimondii(DD)together with their allotetraploid progeny Gossypium hirsutum(At At Dt Dt)provides a unique opportunity to compare genome variations in the Gossypium genus and to analyze the functions of TEs during its evolution.TEs accounted for 57%,68.5%and67.2%,respectively in DD,AA and At At Dt Dt genomes.The 1,694 Mb A-genome was found to harbor more LTR(long terminal repeat)-type retrotransposons that made cardinal contributions to the twofold increase in its genome size after evolution from the 775.2 Mb D-genome.Although the 2,173 Mb At At Dt Dt genome showed similar TE content to the A-genome,the total numbers of LTR-gypsy and LTR-copia type TEs varied significantly between these two genomes.Considering their roles on rewiring gene regulatory networks,we believe that TEs may somehow be involved in cotton fiber cell development.Indeed,the insertion or deletion of different TEs in the upstream region of two important transcription factor genes in At or Dt subgenomes resulted in qualitative differences in target gene expression.We suggest that our findings may open a window for improving cotton agronomic traits by editing TE activities.展开更多
Cotton is an irreplaceable economic crop currently domesticated in the human world for its extremely elongated fiber cells specialized in seed epidermis,which makes it of high research and application value.To date,nu...Cotton is an irreplaceable economic crop currently domesticated in the human world for its extremely elongated fiber cells specialized in seed epidermis,which makes it of high research and application value.To date,numerous research on cotton has navigated various aspects,from multi-genome assembly,genome editing,mechanism of fiber development,metabolite biosynthesis,and analysis to genetic breeding.Genomic and 3D genomic studies reveal the origin of cotton species and the spatiotemporal asymmetric chromatin structure in fibers.Mature multiple genome editing systems,such as CRISPR/Cas9,Cas12(Cpf1)and cytidine base editing(CBE),have been widely used in the study of candidate genes affecting fiber development.Based on this,the cotton fiber cell development network has been preliminarily drawn.Among them,the MYB-b HLH-WDR(MBW)transcription factor complex and IAA and BR signaling pathway regulate the initiation;various plant hormones,including ethylene,mediated regulatory network and membrane protein overlap fine-regulate elongation.Multistage transcription factors targeting Ces A 4,7,and 8 specifically dominate the whole process of secondary cell wall thickening.And fluorescently labeled cytoskeletal proteins can observe real-time dynamic changes in fiber development.Furthermore,research on the synthesis of cotton secondary metabolite gossypol,resistance to diseases and insect pests,plant architecture regulation,and seed oil utilization are all conducive to finding more high-quality breeding-related genes and subsequently facilitating the cultivation of better cotton varieties.This review summarizes the paramount research achievements in cotton molecular biology over the last few decades from the above aspects,thereby enabling us to conduct a status review on the current studies of cotton and provide strong theoretical support for the future direction.展开更多
As one of the longest single-celled seed trichomes, fibers provide an excellent model for studying fundamental biological processes such as cell differentiation, cell expansion, and cell wall biosynthesis. In this rev...As one of the longest single-celled seed trichomes, fibers provide an excellent model for studying fundamental biological processes such as cell differentiation, cell expansion, and cell wall biosynthesis. In this review, we summarize recent progress in cotton functional genomic studies that characterize the dynamic changes in the transcriptomes of fiber cells. Extensive expression profilings of cotton fiber transcriptomes have provided comprehensive information, as quite a number of transcription factors and enzyme-coding genes have been shown to express preferentially during the fiber elongation period. Biosynthesis of the plant hormone ethylene is found significantly upregulated during the fiber growth period as revealed by both microarray analysis and by biochemical and physiological studies. It is suggested that genetic engineering of the ethylene pathway may improve the quality and the productivity of cotton lint. Many metabolic pathways, such as biosynthesis of cellulose and matrix polysaccharides are preferentially expressed in actively growing fiber cells. Five gene families, including proline-rich proteins (PRP), arabinogalactan proteins (AGP), expansins, tubulins and lipid transfer proteins (LTP) are activated during early fiber development, indicating that they may also be needed for cell elongation. In conclusion, we identify a few areas of future research for cotton functional genomic studies.展开更多
Cotton fibers, commonly known as cotton lint, are single-celled trichomes derived from epidermal lay- ers of cotton ovules. Despite of its importance in word trade, the molecular mechanisms of cotton fiber production ...Cotton fibers, commonly known as cotton lint, are single-celled trichomes derived from epidermal lay- ers of cotton ovules. Despite of its importance in word trade, the molecular mechanisms of cotton fiber production is still poorly understood. Through transcriptome profiling, functional genomics, pro- teomics, metabolomics approaches as well as marker-assisted molecular breeding, scientists in China have made significant contributions in cotton research. Here, we briefly summarize major progresses made in Chinese laboratories, and discuss future directions and perspectives relative to the develop- ment of this unique crop plant.展开更多
We assembled a total of 297,239 Gossypium hirsutum (Gh, a tetraploid cotton, AADD) expressed sequence tag (EST) sequences that were available in the National Center for Biotechnology Information database, with ref...We assembled a total of 297,239 Gossypium hirsutum (Gh, a tetraploid cotton, AADD) expressed sequence tag (EST) sequences that were available in the National Center for Biotechnology Information database, with reference to the recently published G. raimondii (Gr, a diploid cotton, DD) genome, and obtained 49,125 UniGenes. The average lengths of the U niGenes were increased from 804 and 791 bp in two previous EST assemblies to 1,019 bp in the current analysis. The number of putative cotton UniGenes with lengths of 3 kb or more increased from 25 or 34 to 1,223. As a result, thousands of originally independent G. hirsutum ESTs were aligned to produce large contigs encoding transcripts with very long open reading frames, indicating that the G. raimondii genome sequence provided remarkable advantages to assemble the tetraploid cotton transcriptome. Significant different distribution patterns within several GO terms, including transcription factor activity, were observed between D- and A-derived assemblies. Tran- scriptome analysis showed that, in a tetraploid cotton cell, 29,547 UniGenes were possibly derived from the D subgenome while another 19,578 may come from the A subgenome. Finally, some of the in silico data were confirmed by reverse transcription polymerase chain reaction experiments to show the changes in transcript levels for several gene families known to play key role in cotton fiber development. We believe that our work provides a useful platform for functional and evolutionary genomic studies in cotton.展开更多
基金the Zhejiang Natural Science Foundation Outstanding Youth Grant(LR20C020002)the Zhejiang Provincial Natural Science Foundation of China(LZ23C020002)+4 种基金the National Natural Science Foundation of China(32200231)the National Key Research and Development Program of China(2022YFD1401600)the Leading Innovative and Entrepreneur Team Introduction Program of Zhejiang(2019R01002)Key Research Project of Zhejiang Lab(2021PE0AC04)the U.S.National Science Foundation(MCB 2148206).
文摘Cotton is a major crop that provides the most important renewable textile fibers in the world.Studies of the taxonomy and evolution of cotton species have received wide attentions,not only due to cotton’s economic value but also due to the fact that Gossypium is an ideal model system to study the origin,evolution,and cultivation of polyploid species.Previous studies suggested the involvement of mitochondrial genome editing sites and copy number as well as mitochondrial functions in cotton fiber elongation.Whereas,with only a few mitogenomes assembled in the cotton genus Gossypium,our knowledge about their roles in cotton evolution and speciation is still scarce.To close this gap,here we assembled 20 mitogenomes from 15 cotton species spanning all the cotton clades(A–G,K,and AD genomes)and 5 cotton relatives using short and long sequencing reads.Systematic analyses uncovered a high level of mitochondrial gene sequence conservation,abundant sequence repeats and many insertions of foreign sequences,as well as extensive structural variations in cotton mitogenomes.The sequence repeats and foreign sequences caused significant mitogenome size inflation in Gossypium and its close relative Kokia in general,while there is no significant difference between the lint and fuzz cotton mitogenomes in terms of gene content,RNA editing,and gene expression level.Interestingly,we further revealed the specific presence and expression of two novel mitochondrial open reading frames(ORFs)in lint-fiber cotton species.Finally,these structural features and novel ORFs help us gain valuable insights into the history of cotton evolution and polyploidization and the origin of species producing long lint fibers from a mitogenomic perspective.
基金supported by the Natural Science Foundation of China(Nos.21602162 and 31690090)the National Science and Technology Major Project(No.2016ZX08005003-001)the Fundamental Research Funds for the Central Universities(No.104862016)
文摘Due to the economic value of natural textile fiber, cotton has attracted much research attention, which has led to the publication of two diploid genomes and two tetraploid genomes. These big data facilitate functional genomic study in cotton, and allow researchers to investigate cotton genome structure, gene expression, and protein function on the global scale using high-throughput methods. In this review, we summarized recent studies of cotton genomes. Population genomic analyses revealed the domestication history of cultivated upland cotton and the roles of transposable elements in cotton genome evolution.Alternative splicing of cotton transcriptomes was evaluated genome-widely. Several important gene families like MYC, NAC, Sus and GhPLDal were systematically identified and classified based on genetic structure and biological function. High-throughput proteomics also unraveled the key functional proteins correlated with fiber development. Functional genomic studies have provided unprecedented insights into global-scale methods for cotton research.
基金the National Natural Science Foundation of China(90717009)the Chinese National Basic Research Program of the Ministry of Science and Technology of China(2010CB126000)
文摘Transposable elements(TEs)usually occupy largest fractions of plant genome and are also the most variable part of the structure.Although traditionally it is hallmarked as"junk and selfish DNA",today more and more evidence points out TE’s participation in gene regulations including gene mutation,duplication,movement and novel gene creation via genetic and epigenetic mechanisms.The recently sequenced genomes of diploid cottons Gossypium arboreum(AA)and Gossypium raimondii(DD)together with their allotetraploid progeny Gossypium hirsutum(At At Dt Dt)provides a unique opportunity to compare genome variations in the Gossypium genus and to analyze the functions of TEs during its evolution.TEs accounted for 57%,68.5%and67.2%,respectively in DD,AA and At At Dt Dt genomes.The 1,694 Mb A-genome was found to harbor more LTR(long terminal repeat)-type retrotransposons that made cardinal contributions to the twofold increase in its genome size after evolution from the 775.2 Mb D-genome.Although the 2,173 Mb At At Dt Dt genome showed similar TE content to the A-genome,the total numbers of LTR-gypsy and LTR-copia type TEs varied significantly between these two genomes.Considering their roles on rewiring gene regulatory networks,we believe that TEs may somehow be involved in cotton fiber cell development.Indeed,the insertion or deletion of different TEs in the upstream region of two important transcription factor genes in At or Dt subgenomes resulted in qualitative differences in target gene expression.We suggest that our findings may open a window for improving cotton agronomic traits by editing TE activities.
基金the National Natural Science Foundation of China(32200286)the China Postdoctoral Science Foundation(2022TQ0240,2022M722470)。
文摘Cotton is an irreplaceable economic crop currently domesticated in the human world for its extremely elongated fiber cells specialized in seed epidermis,which makes it of high research and application value.To date,numerous research on cotton has navigated various aspects,from multi-genome assembly,genome editing,mechanism of fiber development,metabolite biosynthesis,and analysis to genetic breeding.Genomic and 3D genomic studies reveal the origin of cotton species and the spatiotemporal asymmetric chromatin structure in fibers.Mature multiple genome editing systems,such as CRISPR/Cas9,Cas12(Cpf1)and cytidine base editing(CBE),have been widely used in the study of candidate genes affecting fiber development.Based on this,the cotton fiber cell development network has been preliminarily drawn.Among them,the MYB-b HLH-WDR(MBW)transcription factor complex and IAA and BR signaling pathway regulate the initiation;various plant hormones,including ethylene,mediated regulatory network and membrane protein overlap fine-regulate elongation.Multistage transcription factors targeting Ces A 4,7,and 8 specifically dominate the whole process of secondary cell wall thickening.And fluorescently labeled cytoskeletal proteins can observe real-time dynamic changes in fiber development.Furthermore,research on the synthesis of cotton secondary metabolite gossypol,resistance to diseases and insect pests,plant architecture regulation,and seed oil utilization are all conducive to finding more high-quality breeding-related genes and subsequently facilitating the cultivation of better cotton varieties.This review summarizes the paramount research achievements in cotton molecular biology over the last few decades from the above aspects,thereby enabling us to conduct a status review on the current studies of cotton and provide strong theoretical support for the future direction.
基金Publication of this paper is supported by the National Natural Science Foundation of China (30624808) and Science Publication Foundation of the Chinese Academy of Sciences.
文摘As one of the longest single-celled seed trichomes, fibers provide an excellent model for studying fundamental biological processes such as cell differentiation, cell expansion, and cell wall biosynthesis. In this review, we summarize recent progress in cotton functional genomic studies that characterize the dynamic changes in the transcriptomes of fiber cells. Extensive expression profilings of cotton fiber transcriptomes have provided comprehensive information, as quite a number of transcription factors and enzyme-coding genes have been shown to express preferentially during the fiber elongation period. Biosynthesis of the plant hormone ethylene is found significantly upregulated during the fiber growth period as revealed by both microarray analysis and by biochemical and physiological studies. It is suggested that genetic engineering of the ethylene pathway may improve the quality and the productivity of cotton lint. Many metabolic pathways, such as biosynthesis of cellulose and matrix polysaccharides are preferentially expressed in actively growing fiber cells. Five gene families, including proline-rich proteins (PRP), arabinogalactan proteins (AGP), expansins, tubulins and lipid transfer proteins (LTP) are activated during early fiber development, indicating that they may also be needed for cell elongation. In conclusion, we identify a few areas of future research for cotton functional genomic studies.
基金Supported by the National Basic Research Program of China (Grant No. 2004CB117302) Program for Changjiang Scholars and Innovative Research Team in University (Grant No. IRT0432)
文摘Cotton fibers, commonly known as cotton lint, are single-celled trichomes derived from epidermal lay- ers of cotton ovules. Despite of its importance in word trade, the molecular mechanisms of cotton fiber production is still poorly understood. Through transcriptome profiling, functional genomics, pro- teomics, metabolomics approaches as well as marker-assisted molecular breeding, scientists in China have made significant contributions in cotton research. Here, we briefly summarize major progresses made in Chinese laboratories, and discuss future directions and perspectives relative to the develop- ment of this unique crop plant.
基金supported by grants from the China National Basic Research Program(2010CB126002)the National Natural Science Foundation of China(90717009)the 111 Project funded by the Chinese Ministry of Education
文摘We assembled a total of 297,239 Gossypium hirsutum (Gh, a tetraploid cotton, AADD) expressed sequence tag (EST) sequences that were available in the National Center for Biotechnology Information database, with reference to the recently published G. raimondii (Gr, a diploid cotton, DD) genome, and obtained 49,125 UniGenes. The average lengths of the U niGenes were increased from 804 and 791 bp in two previous EST assemblies to 1,019 bp in the current analysis. The number of putative cotton UniGenes with lengths of 3 kb or more increased from 25 or 34 to 1,223. As a result, thousands of originally independent G. hirsutum ESTs were aligned to produce large contigs encoding transcripts with very long open reading frames, indicating that the G. raimondii genome sequence provided remarkable advantages to assemble the tetraploid cotton transcriptome. Significant different distribution patterns within several GO terms, including transcription factor activity, were observed between D- and A-derived assemblies. Tran- scriptome analysis showed that, in a tetraploid cotton cell, 29,547 UniGenes were possibly derived from the D subgenome while another 19,578 may come from the A subgenome. Finally, some of the in silico data were confirmed by reverse transcription polymerase chain reaction experiments to show the changes in transcript levels for several gene families known to play key role in cotton fiber development. We believe that our work provides a useful platform for functional and evolutionary genomic studies in cotton.