摘要
当前二代测序数据的处理广泛使用基于标准版本的Linux操作系统分析方法。这一系统专业性强,成本较高,操作界面不够友好,严重限制了大多数科研人员对数据的自主分析。本文创建了一个基于微软Windows操作系统的全功能二代测序数据的生物信息学分析系统,利用该系统经优选实现当前多种高通量测序数据的主流标准化分析流程。通过RNA-Seq的代表性案例,演算实测数据与传统Linux系统驱动的数据分析结果相比较,结果显示,本系统的组件和流程在常用的数据分析过程中,可以基本取代目前主流的Linux服务器或云计算平台,在运行效率相近的情况下,其操作极为简便且成本大大降低。本系统与所配附的编译软件及流程脚本,不仅为测序数据的生物信息学分析实操演练提供全面的解决方案,而且可以直接应用于专业的测序数据分析中。
Current computation platforms for data processing and informatic analyses of next generation sequencing(NGS)are mainly based on popular versions of the Linux operating system.This system is highly professional,expensive,and difficult to operate,which seriously limits the majority of researchers to independent analysis of the data.In this paper,a full-functional bioinformatic analysis platform was established for NGS data running on the Microsoft Windows operating system.The procedures were optimized and tested for a series of standardized analyzing routines with high-throughput sequencing data.The performance results were compared with Linux-based protocols.With the appealing ease to operate and for management,the test outcome indicated that the protocols or components from the system could replace the currently dominant computation server/cloud solutions in most of the case applications,and yet the cost was greatly reduced.Our system can be utilized for professional bioinformatics operations under a variety of circumstances,and above all it can also be readily employed for practical training in high-throughput sequencing analyses.
作者
黄文秋
刘旭
王俐勇
程杉
叶海虹
丁卫
HUANG Wen-Qiu;LIU Xu;WANG Li-Yong;CHENG Shan;YE Hai-Hong;DING Wei(School of Basic Medical Sciences,Capital Medical University,Beijing100069,China;Central Laboratory,Capital Medical University,Beijing100069,China)
出处
《中国生物化学与分子生物学报》
CAS
CSCD
北大核心
2019年第6期687-692,共6页
Chinese Journal of Biochemistry and Molecular Biology
基金
国家自然科学基金(No.81572704)资助~~