Funding: supported by the "100-Talent Program" of the Chinese Academy of Sciences, the Strategic Priority Research Program of the Chinese Academy of Sciences (Grant No. XDB13040500), and the National High-tech R&D Program (863 Program; Grant No. 2012AA020409) by the Ministry of Science and Technology of China, awarded to ZZ.
Abstract: The completion of the Human Genome Project lays a foundation for systematically studying the human genome, from evolutionary history to precision medicine against diseases. With the explosive growth of biological data, a growing number of biological databases have been developed in aid of human-related research. Here we present a collection of human-related biological databases and provide a mini-review by classifying them into different categories according to their data types. As human-related databases continue to grow not only in count but also in volume, challenges lie ahead in big data storage, processing, exchange and curation.
Funding: supported by the National High-Tech Research and Development (863) Program of China (No. 2012AA012609).
Abstract: The rapid growth of structured data has presented new technological challenges in the research fields of big data and relational databases. In this paper, we present Banian, an efficient system for managing and analyzing petabyte-level (PB-level) structured data. Banian overcomes the storage structure limitations of relational databases and effectively integrates interactive querying with large-scale storage management. It provides a uniform query interface for cross-platform datasets and thus shows favorable compatibility and scalability. Banian's system architecture mainly includes three layers: (1) a storage layer using HDFS for the distributed storage of massive data; (2) a scheduling and execution layer employing the splitting and scheduling technology of parallel databases; and (3) an application layer providing a cross-platform query interface and supporting standard SQL. We evaluate Banian using PB-level Internet data and the TPC-H benchmark. The results show that, compared with Hive, Banian improves query performance by up to 30 times and achieves better scalability and concurrency.
Abstract: Innovation springs from practice, and its soul lies in practical thinking. All human wisdom is a product of practice and needs to be tested in practice. With the rapid development of medicine, a clinician has to keep pace with the new era, grasp the pulse of the times and innovate. Only in this way can he or she lead the trend of the new era. Clinical medicine is a practical science implemented mainly by clinicians, which requires them to explore the truth and pursue technological innovation at all times. At the same time, as an academic leader, a clinician is encouraged to practice actively, to take risks in order to innovate, and to pursue and test truth in practice, discarding old ideas and correcting wrong theories and technologies. To summarize, a clinician has to push forward practice-based innovation of theory and technology to keep up with the pace of the times.