A statistically significant research finding should not be defined as a P-value of 0.05 or less, because this definition does not take into account study power. Statistical significance was originally defined by Fishe...A statistically significant research finding should not be defined as a P-value of 0.05 or less, because this definition does not take into account study power. Statistical significance was originally defined by Fisher RA as a P-value of 0.05 or less. According to Fisher, any finding that is likely to occur by random variation no more than 1 in 20 times is considered significant. Neyman J and Pearson ES subsequently argued that Fisher's definition was incomplete. They proposed that statistical significance could only be determined by analyzing the chance of incorrectly considering a study finding was significant(a Type Ⅰ?error) or incorrectly considering a study finding was insignificant(a Type Ⅱ error). Their definition of statistical significance is also incomplete because the error rates are considered separately, not together. A better definition of statistical significance is the positive predictive value of a P-value, which is equal to the power divided by the sum of power and the P-value. This definition is more complete and relevant than Fisher's or Neyman-Peason's definitions, because it takes into account both concepts of statistical significance. Using this definition, a statistically significant finding requires a P-value of 0.05 or less when the power is at least 95%, and a P-value of 0.032 or less when the power is 60%. To achieve statistical significance, P-values must be adjusted downward as the study power decreases.展开更多
On February 10,2022,the U.S.Food and Drug Administration(FDA)’s Oncologic Drugs Advisory Committee voted 14-1 against using data from the ORIENT-11 trial to support a biologics license application for sintilimab inje...On February 10,2022,the U.S.Food and Drug Administration(FDA)’s Oncologic Drugs Advisory Committee voted 14-1 against using data from the ORIENT-11 trial to support a biologics license application for sintilimab injection plus pemetrexed and platinum-based chemotherapy as first-line treatment for patients with nonsquamous non-small cell lung cancer(NSCLC)1.One major reason was that the FDA claimed overall survival(OS)was not statistically tested in ORIENT-11,while previous regular approvals were granted on the basis of statistically significant improvements in OS2.This may be a surprise to some physicians,as the ORIENT-11 trial previously reported improved OS with a hazard ratio(HR)of 0.60(95%confidence interval[CI]:0.45–0.79)and a P value of 0.00033,which is far below the commonly accepted threshold of 0.05 for declaring statistical significance.Some may argue that the OS results of ORIENT-11 were not considered statistically significant by the FDA because OS was only a secondary endpoint.However,in the KEYNOTE-024 trial4,which compared pembrolizumab with chemotherapy in patients with previously untreated advanced NSCLC with PD-L1 expression on at least 50%of tumor cells,OS was also a secondary endpoint.Nevertheless,the OS benefit of the KEYNOTE-024 trial was acknowledged by the FDA and included in the drug label5.The fundamental reason why OS results of ORIENT-11 were not considered statistically significant by the FDA is that the OS endpoint was not included in the multiplicity control strategy.展开更多
文摘A statistically significant research finding should not be defined as a P-value of 0.05 or less, because this definition does not take into account study power. Statistical significance was originally defined by Fisher RA as a P-value of 0.05 or less. According to Fisher, any finding that is likely to occur by random variation no more than 1 in 20 times is considered significant. Neyman J and Pearson ES subsequently argued that Fisher's definition was incomplete. They proposed that statistical significance could only be determined by analyzing the chance of incorrectly considering a study finding was significant(a Type Ⅰ?error) or incorrectly considering a study finding was insignificant(a Type Ⅱ error). Their definition of statistical significance is also incomplete because the error rates are considered separately, not together. A better definition of statistical significance is the positive predictive value of a P-value, which is equal to the power divided by the sum of power and the P-value. This definition is more complete and relevant than Fisher's or Neyman-Peason's definitions, because it takes into account both concepts of statistical significance. Using this definition, a statistically significant finding requires a P-value of 0.05 or less when the power is at least 95%, and a P-value of 0.032 or less when the power is 60%. To achieve statistical significance, P-values must be adjusted downward as the study power decreases.
文摘On February 10,2022,the U.S.Food and Drug Administration(FDA)’s Oncologic Drugs Advisory Committee voted 14-1 against using data from the ORIENT-11 trial to support a biologics license application for sintilimab injection plus pemetrexed and platinum-based chemotherapy as first-line treatment for patients with nonsquamous non-small cell lung cancer(NSCLC)1.One major reason was that the FDA claimed overall survival(OS)was not statistically tested in ORIENT-11,while previous regular approvals were granted on the basis of statistically significant improvements in OS2.This may be a surprise to some physicians,as the ORIENT-11 trial previously reported improved OS with a hazard ratio(HR)of 0.60(95%confidence interval[CI]:0.45–0.79)and a P value of 0.00033,which is far below the commonly accepted threshold of 0.05 for declaring statistical significance.Some may argue that the OS results of ORIENT-11 were not considered statistically significant by the FDA because OS was only a secondary endpoint.However,in the KEYNOTE-024 trial4,which compared pembrolizumab with chemotherapy in patients with previously untreated advanced NSCLC with PD-L1 expression on at least 50%of tumor cells,OS was also a secondary endpoint.Nevertheless,the OS benefit of the KEYNOTE-024 trial was acknowledged by the FDA and included in the drug label5.The fundamental reason why OS results of ORIENT-11 were not considered statistically significant by the FDA is that the OS endpoint was not included in the multiplicity control strategy.