J Integr Plant Biol. ›› 2013, Vol. 55 ›› Issue (7): 576-585.DOI: 10.1111/jipb.12066

• Research Articles • Previous Articles     Next Articles

Using Genome-Referenced Expressed Sequence Tag Assembly to Analyze the Origin and Expression Patterns of Gossypium hirsutum Transcripts

Xiang Jin1, Qin Li1, Guanghui Xiao1 and Yuxian Zhu1,2*   

  1. 1State Key Laboratory of Protein and Plant Gene Research, College of Life Sciences, Peking University, Beijing, China
    2National Center for Plant Gene Research (Beijing), Beijing, China
  • Received:2013-03-29 Accepted:2013-05-12 Published:2013-07-01
  • About author:*Corresponding author: Email: zhuyx2@pku.edu.cn


We assembled a total of 297,239 Gossypium hirsutum (Gh, a tetraploid cotton, AADD) expressed sequence tag (EST) sequences that were available in the National Center for Biotechnology Information database, with reference to the recently published G. raimondii (Gr, a diploid cotton, DD) genome, and obtained 49,125 UniGenes. The average lengths of the UniGenes were increased from 804 and 791 bp in two previous EST assemblies to 1,019 bp in the current analysis. The number of putative cotton UniGenes with lengths of 3 kb or more increased from 25 or 34 to 1,223. As a result, thousands of originally independent G. hirsutum ESTs were aligned to produce large contigs encoding transcripts with very long open reading frames, indicating that the G. raimondii genome sequence provided remarkable advantages to assemble the tetraploid cotton transcriptome. Significant different distribution patterns within several GO terms, including transcription factor activity, were observed between D- and A-derived assemblies. Transcriptome analysis showed that, in a tetraploid cotton cell, 29,547 UniGenes were possibly derived from the D subgenome while another 19,578 may come from the A subgenome. Finally, some of the in silico data were confirmed by reverse transcription polymerase chain reaction experiments to show the changes in transcript levels for several gene families known to play key role in cotton fiber development. We believe that our work provides a useful platform for functional and evolutionary genomic studies in cotton.

Jin X, Li Q, Xiao G, Zhu Y (2013) Using genome‐referenced expressed sequence tag assembly to analyze the origin and expression patterns of Gossypium hirsutum transcripts. J. Integr. Plant Biol. 55(7), 576–585.

Key words: Gossypium, cotton fiber, EST assembly, functional genomics, deep sequencing

Editorial Office, Journal of Integrative Plant Biology, Institute of Botany, CAS
No. 20 Nanxincun, Xiangshan, Beijing 100093, China
Tel: +86 10 6283 6133 Fax: +86 10 8259 2636 E-mail: jipb@ibcas.ac.cn
Copyright © 2022 by the Institute of Botany, the Chinese Academy of Sciences
Online ISSN: 1744-7909 Print ISSN: 1672-9072 CN: 11-5067/Q