the first illumina-based de novo transcriptome sequencing and analysis of safflower flowers第一个illumina-based新创转录组测序和分析红花的花.pdf
文本预览下载声明
The First Illumina-Based De Novo Transcriptome
Sequencing and Analysis of Safflower Flowers
Huang Lulin, Yang Xiao, Sun Pei, Tong Wen, Hu Shangqin*
Department of Traditional Chinese Medicine Study, Industrial Crop Institute, Sichuan Academy of Agricultural Sciences, Jianyang, Sichuan, China
Abstract
Background: The safflower, Carthamus tinctorius L., is a worldwide oil crop, and its flowers, which have a high flavonoid
content, are an important medicinal resource against cardiovascular disease in traditional medicine. Because the safflower
has a large and complex genome, the development of its genomic resources has been delayed. Second-generation Illumina
sequencing is now an efficient route for generating an enormous volume of sequences that can represent a large number of
genes and their expression levels.
Methodology/Principal Findings: To investigate the genes and pathways that might control flavonoids and other
secondary metabolites in the safflower, we used Illumina sequencing to perform a de novo assembly of the safflower tubular
flower tissue transcriptome. We obtained a total of 4.69 Gb in clean nucleotides comprising 52,119,104 clean sequencing
reads, 195,320 contigs, and 120,778 unigenes. Based on similarity searches with known proteins, we annotated 70,342 of the
unigenes (about 58% of the identified unigenes) with cut-off E-values of 1025. In total, 21,943 of the safflower unigenes
were found to have COG classifications, and BLAST2GO assigned 26,332 of the unigenes to 1,754 GO term annotations. In
addition, we assigned 30,203 of the unigenes to 121 KEGG pathways. When we focused on genes identified as contributing
to flavonoid biosynthesis and the biosynthesis of unsaturated fatty acids, which are important pathways that control flower
and seed quality, respectively, we found that these genes were fairly well conserv
显示全部