Continual reduction in sequencing cost is expanding the accessibility of genome sequencing data for routine clinical applications.However,the lack of methods to construct machine learning-based predictive models using...Continual reduction in sequencing cost is expanding the accessibility of genome sequencing data for routine clinical applications.However,the lack of methods to construct machine learning-based predictive models using these datasets has become a crucial bottleneck for the application of sequencing technology in clinics.Here,we develop a new algorithm,eTumorMetastasis,which transforms tumor functional mutations into network-based profiles and identifies network operational gene(NOG)signatures.NOG signatures model the tipping point at which a tumor cell shifts from a state that doesn’t favor recurrence to one that does.We show that NOG signatures derived from genomic mutations of tumor founding clones(i.e.,the‘most recent common ancestor’of the cells within a tumor)significantly distinguish the recurred and non-recurred breast tumors as well as outperform the most popular genomic test(i.e.,Oncotype DX).These results imply that mutations of the tumor founding clones are associated with tumor recurrence and can be used to predict clinical outcomes.As such,predictive tools could be used in clinics to guide treatment routes.Finally,the concepts underlying the eTumorMetastasis pave the way for the application of genome sequencing in predictions for other complex genetic diseases.eTumorMetastasis pseudocode and related data used in this study are available at https://github.com/WangEdwinLab/eTumorMetastasis.展开更多
基金supported under the IDEATION program of the National Research Council of Canada,the Alberta In-novates Translational Chair Program in Cancer Genomics,the Natural Sciences and Engineering Research Council of Canada(Grant No.RGPIN-2017-04885)the Canada Foundation of Innovation(Grant No.36655).
文摘Continual reduction in sequencing cost is expanding the accessibility of genome sequencing data for routine clinical applications.However,the lack of methods to construct machine learning-based predictive models using these datasets has become a crucial bottleneck for the application of sequencing technology in clinics.Here,we develop a new algorithm,eTumorMetastasis,which transforms tumor functional mutations into network-based profiles and identifies network operational gene(NOG)signatures.NOG signatures model the tipping point at which a tumor cell shifts from a state that doesn’t favor recurrence to one that does.We show that NOG signatures derived from genomic mutations of tumor founding clones(i.e.,the‘most recent common ancestor’of the cells within a tumor)significantly distinguish the recurred and non-recurred breast tumors as well as outperform the most popular genomic test(i.e.,Oncotype DX).These results imply that mutations of the tumor founding clones are associated with tumor recurrence and can be used to predict clinical outcomes.As such,predictive tools could be used in clinics to guide treatment routes.Finally,the concepts underlying the eTumorMetastasis pave the way for the application of genome sequencing in predictions for other complex genetic diseases.eTumorMetastasis pseudocode and related data used in this study are available at https://github.com/WangEdwinLab/eTumorMetastasis.