转录组测序数据中cSNP和表达差异基因的分析方法.docx
文本预览下载声明
转录组测序数据中 cSNP 和表达差异基因的分析方法李少波, 傅国辉( 上海交通大学 基础医学院病理学教研室,上海 200025)[摘要] 目的 确立本次转录组测序数据中编码区单核苷酸多态性( cSNP) 和表达差异基因的分析方法,筛选出可能导致蛋白质 功能改变的单核苷酸多态性( SNP) 位点和不同表型细胞中存在的表达差异基因。方法 对正常培养的胃癌细胞系 MKN28 和 SGC7901 进行 RNA 测序( RNA-Seq) ,将测序数据与参考基因组进行比对,对测序的 reads 数、测得的基因数、MKN28 和 SGC7901 中各自表达上调的基因数、SNP 数及可变剪接形式进行统计学分析。运用在线的软件和数据库并结合计算机编程,对 2 株胃癌 细胞系转录组测序数据中的 SNP 进行筛选和功能预测; 对 2 株细胞中表达差异基因 GO 聚类结果进行分析比较。结果 筛选并 预测了 8 种类别 709 种基因的 SNP,分析出了 6 个经预测能够导致蛋白功能改变的 SNP 位点。对表达差异基因的分析得到了丝 氨酸 / 苏氨酸蛋白激酶在 2 株细胞中的表达情况; 经 Western blotting 和 PCR 验证了部分分析结果。结论 确立了 1 种转录组测 序后 cSNP 数据的分析方法,该方法能够对大量 SNP 数据进行高效筛选和分析; 通过聚类分析后再比较得到了一组在 MKN28 中 高表达而在 SGC7901 中低表达的蛋白激酶基因; 这些结果为后续实验提供了依据。[关键词] 编码区单核苷酸多态性; 转录组; RNA 测序; 表达差异基因; 胃癌[DOI] 10. 3969 / j. issn. 1674-8115. 2014. 02. 001 [中图分类号] Q75 [文献标志码] ARNA-Seq based analysis on cSNP and gene expression levelLI Shao-bo, FU Guo-hui( Department of Pathology, Basic Medical College, Shanghai Jiao Tong University, Shanghai 200025, China)[Abstract] Objective To establish the analytical method for cSNP and gene expression difference based on transcriptome RNA-Seq data, and to screen SNP loci that may alter protein functions and gene expression difference among different cell phenotypes. Methods RNA-Seq was performed for normal cultured gastric cancer cell lines MKN28 and SGC7901. The sequencing data was then compared with the reference genome and the statistic analysis was conducted for the number of reads, sequenced genes, upregulated genes of MKN28 and SGC7901, and SNP and variable splicing patterns. Online software, database and computer programming were combined to screen and predict functions of SNP in transcriptome sequencing data of two gastric cancer cell lines, and to perform analysis and comparison for the GO clustering results of differentially expressed genes. Results The SNP of 709 genes belonging to 8 different gene terms were screened and predicted and 6 cS
显示全部