文档详情

customisation of the exome data analysis pipeline using a combinatorial approach定制的外显子组数据分析管道使用组合方法.pdf

发布:2017-09-12约4.82万字共9页下载文档
文本预览下载声明
Customisation of the Exome Data Analysis Pipeline Using a Combinatorial Approach 1,2 1 1 1 1,2 Swetansu Pattnaik , Srividya Vaidyanathan , Durgad G. Pooja , Sa Deepak , Binay Panda * 1 Ganit Labs, Bio-IT Centre, Institute of Bioinformatics and Applied Biotechnology, Bangalore, India, 2 Strand Life Sciences, Bangalore, India Abstract The advent of next generation sequencing (NGS) technologies have revolutionised the way biologists produce, analyse and interpret data. Although NGS platforms provide a cost-effective way to discover genome-wide variants from a single experiment, variants discovered by NGS need follow up validation due to the high error rates associated with various sequencing chemistries. Recently, whole exome sequencing has been proposed as an affordable option compared to whole genome runs but it still requires follow up validation of all the novel exomic variants. Customarily, a consensus approach is used to overcome the systematic errors inherent to the sequencing technology, alignment and post alignment variant detection algorithms. However, the aforementioned approach warrants the use of multiple sequencing chemistry, multiple alignment tools, multiple variant callers which may not be viable in terms of time and money for individual investigators with limited informatics know-how. Biologists often lack the requisite training to deal with the huge amount of data produced by NGS runs and face difficulty in choosing from the list of freely available analytical tools for NGS data analysis. Hence, there is a need to customise the NGS data analysis pipeline to preferentially retain true variants by minimisin
显示全部
相似文档