]*>","")" /> Using Genome-Referenced Expressed Sequence Tag Assembly to Analyze the Origin and Expression Patterns of <i>Gossypium hirsutum</i> Transcripts

J Integr Plant Biol ›› 2013, Vol. 55 ›› Issue (7): 576-585.DOI: 10.1111/jipb.12066

• • 上一篇    下一篇

Using Genome-Referenced Expressed Sequence Tag Assembly to Analyze the Origin and Expression Patterns of Gossypium hirsutum Transcripts

Xiang Jin1, Qin Li1, Guanghui Xiao1 and Yuxian Zhu1,2*   

  • 收稿日期:2013-03-29 接受日期:2013-05-12 出版日期:2013-07-01 发布日期:2013-07-18

Using Genome-Referenced Expressed Sequence Tag Assembly to Analyze the Origin and Expression Patterns of Gossypium hirsutum Transcripts

Xiang Jin1, Qin Li1, Guanghui Xiao1 and Yuxian Zhu1,2*   

  1. 1State Key Laboratory of Protein and Plant Gene Research, College of Life Sciences, Peking University, Beijing, China
    2National Center for Plant Gene Research (Beijing), Beijing, China
  • Received:2013-03-29 Accepted:2013-05-12 Online:2013-07-01 Published:2013-07-18
  • About author:*Corresponding author: Email: zhuyx2@pku.edu.cn

Abstract:

We assembled a total of 297,239 Gossypium hirsutum (Gh, a tetraploid cotton, AADD) expressed sequence tag (EST) sequences that were available in the National Center for Biotechnology Information database, with reference to the recently published G. raimondii (Gr, a diploid cotton, DD) genome, and obtained 49,125 UniGenes. The average lengths of the UniGenes were increased from 804 and 791 bp in two previous EST assemblies to 1,019 bp in the current analysis. The number of putative cotton UniGenes with lengths of 3 kb or more increased from 25 or 34 to 1,223. As a result, thousands of originally independent G. hirsutum ESTs were aligned to produce large contigs encoding transcripts with very long open reading frames, indicating that the G. raimondii genome sequence provided remarkable advantages to assemble the tetraploid cotton transcriptome. Significant different distribution patterns within several GO terms, including transcription factor activity, were observed between D- and A-derived assemblies. Transcriptome analysis showed that, in a tetraploid cotton cell, 29,547 UniGenes were possibly derived from the D subgenome while another 19,578 may come from the A subgenome. Finally, some of the in silico data were confirmed by reverse transcription polymerase chain reaction experiments to show the changes in transcript levels for several gene families known to play key role in cotton fiber development. We believe that our work provides a useful platform for functional and evolutionary genomic studies in cotton.

Jin X, Li Q, Xiao G, Zhu Y (2013) Using genome‐referenced expressed sequence tag assembly to analyze the origin and expression patterns of Gossypium hirsutum transcripts. J. Integr. Plant Biol. 55(7), 576–585.

Key words: Gossypium, cotton fiber, EST assembly, functional genomics, deep sequencing

[an error occurred while processing this directive]