1
|
Altan S, Amaratunga D, Cabrera J, Garren J, Geys H, Kolassa J, LeBlond D, Li D, Liao J, Liu J, Lubomirski M, Miro-Quesada G, Novick S, Otava M, Peterson J, Reckermann K, Schofield T, Tan C, Tatikola K, Tekle F, Thomas J, Vukovinsky K. Survey and Recommendations on the Use of P-Values Driving Decisions in Nonclinical Pharmaceutical Applications. Stat Biopharm Res 2022. [DOI: 10.1080/19466315.2022.2038258] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Affiliation(s)
| | | | | | | | | | | | | | | | | | - Jia Liu
- Pfizer Inc, Andover, Ma, 01810
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
2
|
Kluxen FM, Weber K, Strupp C, Jensen SM, Hothorn LA, Garcin JC, Hofmann T. Using historical control data in bioassays for regulatory toxicology. Regul Toxicol Pharmacol 2021; 125:105024. [PMID: 34364928 DOI: 10.1016/j.yrtph.2021.105024] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2021] [Revised: 06/21/2021] [Accepted: 07/30/2021] [Indexed: 12/12/2022]
Abstract
Historical control data (HCD) consist of pooled control group responses from bioassays. These data must be collected and are often used or reported in regulatory toxicology studies for multiple purposes: as quality assurance for the test system, to help identify toxicological effects and their effect-size relevance and to address the statistical multiple comparison problem. The current manuscript reviews the various classical and potential new approaches for using HCD. Issues in current practice are identified and recommendations for improved use and discussion are provided. Furthermore, stakeholders are invited to discuss whether it is necessary to consider uncertainty when using HCD formally and statistically in toxicological discussions and whether binary inclusion/exclusion criteria for HCD should be revised to a tiered information contribution to assessments. Overall, the critical value of HCD in toxicological bioassays is highlighted when used in a weight-of-evidence assessment.
Collapse
Affiliation(s)
| | | | | | - Signe M Jensen
- Department of Plant and Efoldnvironmental Sciences, University of Copenhagen, Copenhagen, Denmark
| | | | | | | |
Collapse
|
3
|
Kluxen FM, Jensen SM. Expanding the toxicologist's statistical toolbox: Using effect size estimation and dose-response modelling for holistic assessments instead of generic testing. Regul Toxicol Pharmacol 2021; 121:104871. [PMID: 33485925 DOI: 10.1016/j.yrtph.2021.104871] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2020] [Revised: 12/21/2020] [Accepted: 01/19/2021] [Indexed: 12/25/2022]
Abstract
It is tempting to base (eco-)toxicological assay evaluation solely on statistical significance tests. The approach is stringent, objective and facilitates binary decisions. However, tests according to null hypothesis statistical testing (NHST) are thought experiments that rely heavily on assumptions. The generic and unreflected application of statistical tests has been called "mindless" by Gigerenzer. While statistical tests have an appropriate application domain, the present work investigates how unreflected testing may affect toxicological assessments. Dunnett multiple-comparison and Williams trend testing and their compatibility intervals are compared with dose-response-modelling in case studies, where data do not follow textbook behavior, nor behave as expected from a toxicological point of view. In such cases, toxicological assessments based only on p-values may be biased and biological evaluations based on plausibility may be prioritized. If confidence in a negative assay outcome cannot be established, further data may be needed for a robust toxicological assessment.
Collapse
Affiliation(s)
| | - Signe M Jensen
- Department of Plant and Environmental Sciences, University of Copenhagen, Copenhagen, Denmark
| |
Collapse
|
4
|
Hothorn LA, Ralph Pirow. Use compatibility intervals in regulatory toxicology. Regul Toxicol Pharmacol 2020; 116:104720. [DOI: 10.1016/j.yrtph.2020.104720] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2019] [Revised: 06/02/2020] [Accepted: 06/16/2020] [Indexed: 01/25/2023]
|
5
|
Alternatives to statistical decision trees in regulatory (eco-)toxicological bioassays. Arch Toxicol 2020; 94:1135-1149. [PMID: 32193567 DOI: 10.1007/s00204-020-02690-w] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2019] [Accepted: 02/25/2020] [Indexed: 12/24/2022]
Abstract
The goal of (eco-) toxicological testing is to experimentally establish a dose or concentration-response and to identify a threshold with a biologically relevant and probably non-random deviation from "normal". Statistical tests aid this process. Most statistical tests have distributional assumptions that need to be satisfied for reliable performance. Therefore, most statistical analyses used in (eco-)toxicological bioassays use subsequent pre- or assumption-tests to identify the most appropriate main test, so-called statistical decision trees. There are however several deficiencies with the approach, based on study design, type of tests used and subsequent statistical testing in general. When multiple comparisons are used to identify a non-random change against negative control, we propose to use robust testing, which can be generically applied without the need of decision trees. Visualization techniques and reference ranges also offer advantages over the current pre-testing approaches. We aim to promulgate the concepts in the (eco-) toxicological community and initiate a discussion for regulatory acceptance.
Collapse
|
6
|
Fox JF, Denton DL, Diamond J, Stuber R. Comparison of false-positive rates of 2 hypothesis-test approaches in relation to laboratory toxicity test performance. ENVIRONMENTAL TOXICOLOGY AND CHEMISTRY 2019; 38:511-523. [PMID: 30776146 PMCID: PMC6869336 DOI: 10.1002/etc.4347] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/10/2018] [Revised: 08/18/2018] [Accepted: 12/18/2018] [Indexed: 06/09/2023]
Abstract
We compared 2 statistical hypothesis-test approaches (no-observed-effect concentration [NOEC] and test of significant toxicity [TST]) to determine the influence of laboratory test performance on the false-positive error rate using the US Environmental Protection Agency's Ceriodaphnia dubia reproduction whole-effluent toxicity (WET) test endpoint. Simulation and power calculations were used to determine error rates based on observed control coefficients of variation (CV) for 8 laboratories over a range of effect levels. Average C. dubia control reproduction among laboratories was 20 to 40 offspring per female, and the 75th percentile CV was 0.10 to 0.31, reflecting a range in laboratory performance. The 2 approaches behave similarly for CVs of 0.2 to 0.3. At effects <10%, as CV decreases, TST is less likely to declare toxicity and NOEC is more likely to do so. Laboratory performance affects whether a sample is declared toxic and influences the probability of false-positive (and -negative) error rates using either approach. At the 75th percentile control CV observed for each laboratory, 4 laboratories would achieve approximately a 5% false-positive rate using 13 or fewer replicates for this test method. For the remaining 4 laboratories, more replicates would be needed to achieve a 5% false-positive rate. The present analyses demonstrate how false-positive rates are influenced by laboratory performance and WET test design. Environ Toxicol Chem 2019;38:511-523. Published 2019 Wiley Periodicals Inc. on behalf of SETAC. This article is a US government work and, as such, is in the public domain in the United States of America.
Collapse
Affiliation(s)
- John F. Fox
- Office of Research and Development, US Environmental
Protection Agency, Washington, DC
| | - Debra L. Denton
- US Environmental Protection Agency, Region 9, Sacramento,
California
| | | | - Robyn Stuber
- US Environmental Protection Agency, Region 9, Sacramento,
California
| |
Collapse
|
7
|
Hothorn LA. The two-step approach—a significant ANOVA F-test before Dunnett's comparisons against a control—is not recommended. COMMUN STAT-THEOR M 2016. [DOI: 10.1080/03610926.2014.902225] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
8
|
Kim DE, Yang H, Jang WH, Jung KM, Park M, Choi JK, Jung MS, Jeon EY, Heo Y, Yeo KW, Jo JH, Park JE, Sohn SJ, Kim TS, Ahn IY, Jeong TC, Lim KM, Bae S. Predictive capacity of a non-radioisotopic local lymph node assay using flow cytometry, LLNA:BrdU–FCM: Comparison of a cutoff approach and inferential statistics. J Pharmacol Toxicol Methods 2016; 78:76-84. [DOI: 10.1016/j.vascn.2015.12.001] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2015] [Revised: 10/20/2015] [Accepted: 12/02/2015] [Indexed: 10/22/2022]
|
9
|
Yang H, Na J, Jang WH, Jung MS, Jeon JY, Heo Y, Yeo KW, Jo JH, Lim KM, Bae S. Appraisal of within- and between-laboratory reproducibility of non-radioisotopic local lymph node assay using flow cytometry, LLNA:BrdU-FCM: Comparison of OECD TG429 performance standard and statistical evaluation. Toxicol Lett 2015; 234:172-9. [DOI: 10.1016/j.toxlet.2015.02.014] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2014] [Revised: 02/15/2015] [Accepted: 02/18/2015] [Indexed: 10/23/2022]
|
10
|
De Jonghe S, Proctor J, Vinken P, Feyen B, Wynant I, Marien D, Geys H, Mamidi RNVS, Johnson MD. Carcinogenicity in rats of the SGLT2 inhibitor canagliflozin. Chem Biol Interact 2014; 224:1-12. [PMID: 25289773 DOI: 10.1016/j.cbi.2014.09.018] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2014] [Revised: 08/29/2014] [Accepted: 09/23/2014] [Indexed: 12/20/2022]
Abstract
The carcinogenicity potential of canagliflozin, an inhibitor of SGLT2, was evaluated in a 2-year rat study (10, 30, and 100 mg/kg). Rats showed an increase in pheochromocytomas, renal tubular tumors, and testicular Leydig cell tumors. Systemic exposure multiples at the highest dose relative to the maximum clinical dose were 12- to 21-fold. Pheochromocytomas and renal tubular tumors were noted in both sexes at 100 mg/kg. Leydig cell tumors were observed in males in all dose groups and were associated with increased luteinizing hormone levels. Hyperplasia was increased in the adrenal medulla at 100 mg/kg, but only a limited increase in simple tubular hyperplasia was observed in the kidney of males at 100 mg/kg. Hyperostosis occurred and was accompanied by substantial effects on calcium metabolism, including increased urinary calcium excretion and decreased levels of calcium regulating hormones (1,25-dihydroxyvitamin D and parathyroid hormone). A separate study with radiolabeled calcium confirmed that increased urinary calcium excretion was mediated via increased calcium absorption from the gastrointestinal tract. It was hypothesized that, at high doses, canagliflozin might have inhibited glucose absorption in the intestine via SGLT1 inhibition that resulted in glucose malabsorption, which increased calcium absorption by stimulating colonic glucose fermentation and reducing intestinal pH. Pheochromocytomas and adrenal medullary hyperplasia were attributed to altered calcium homeostasis, which have a known relationship in the rat. In conclusion, Leydig cell tumors were associated with increased luteinizing hormone levels and pheochromocytomas were most likely related to glucose malabsorption and altered calcium homeostasis. Renal tubular tumors may also have been linked to glucose malabsorption.
Collapse
Affiliation(s)
- Sandra De Jonghe
- Janssen Research & Development, A Division of Janssen Pharmaceutica NV, Turnhoutseweg 30, B-2340 Beerse, Belgium
| | - Jim Proctor
- Janssen Research & Development, LLC, 1000 Route 202 South, Raritan, NJ 08869, United States
| | - Petra Vinken
- Janssen Research & Development, A Division of Janssen Pharmaceutica NV, Turnhoutseweg 30, B-2340 Beerse, Belgium
| | - Bianca Feyen
- Janssen Research & Development, A Division of Janssen Pharmaceutica NV, Turnhoutseweg 30, B-2340 Beerse, Belgium
| | - Inneke Wynant
- Janssen Research & Development, A Division of Janssen Pharmaceutica NV, Turnhoutseweg 30, B-2340 Beerse, Belgium
| | - Dirk Marien
- Janssen Research & Development, A Division of Janssen Pharmaceutica NV, Turnhoutseweg 30, B-2340 Beerse, Belgium
| | - Helena Geys
- Janssen Research & Development, A Division of Janssen Pharmaceutica NV, Turnhoutseweg 30, B-2340 Beerse, Belgium
| | - Rao N V S Mamidi
- Janssen Research & Development, LLC, 1000 Route 202 South, Raritan, NJ 08869, United States
| | - Mark D Johnson
- Janssen Research & Development, LLC, 1000 Route 202 South, Raritan, NJ 08869, United States.
| |
Collapse
|
11
|
|
12
|
Kuiper RM, Gerhard D, Hothorn LA. Identification of the Minimum Effective Dose for Normally Distributed Endpoints Using a Model Selection Approach. Stat Biopharm Res 2014. [DOI: 10.1080/19466315.2013.847384] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
13
|
Hothorn LA, Reisinger K, Wolf T, Poth A, Fieblinger D, Liebsch M, Pirow R. Statistical analysis of the hen's egg test for micronucleus induction (HET-MN assay). MUTATION RESEARCH-GENETIC TOXICOLOGY AND ENVIRONMENTAL MUTAGENESIS 2013; 757:68-78. [DOI: 10.1016/j.mrgentox.2013.04.023] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/16/2013] [Revised: 04/24/2013] [Accepted: 04/28/2013] [Indexed: 10/26/2022]
|
14
|
Jaki T, Hothorn LA. Statistical evaluation of toxicological assays: Dunnett or Williams test-take both. Arch Toxicol 2013; 87:1901-1910. [PMID: 23652543 DOI: 10.1007/s00204-013-1065-x] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2013] [Accepted: 04/23/2013] [Indexed: 12/20/2022]
Abstract
The US National Toxicology Program recommends the use of the parametric multiple comparison procedures of Dunnett and Williams for the evaluation of repeated toxicity studies. For endpoints where either increasing or decreasing effects are of toxicological relevance, we recommend the use of the two-sided Dunnett test exclusively. For the many other endpoints, where a priori only one direction is of toxicological relevance, however, we recommend the combination of Dunnett and Williams test. In particular, we recommend the so-called Umbrella-protected Williams test which offers insights for all interesting monotone and non-monotone alternatives while only suffering a marginal loss in power compared to the Dunnett test. We illustrate the power difference analytically and compare the approach for different endpoint types using three real data examples to alternative tests available. Nonparametric tests, which are suitable for the evaluation of skewed distributed or scores data, are also considered. Particular attention is given to the different interpretations of the findings revealed by the different test. R programs used for the analyses are provided.
Collapse
Affiliation(s)
- Thomas Jaki
- Medical and Pharmaceutical Statistics Research Unit, Lancaster University, Lancaster, LA1 4YF, UK
| | - Ludwig A Hothorn
- Institut für Biostatistik, Leibniz Universität Hannover, Herrenhäuser Str. 2, 30419, Hannover, Germany.
| |
Collapse
|
15
|
Herberich E, Hothorn LA. Statistical evaluation of mortality in long-term carcinogenicity bioassays using a Williams-type procedure. Regul Toxicol Pharmacol 2012; 64:26-34. [DOI: 10.1016/j.yrtph.2012.06.014] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2011] [Revised: 06/18/2012] [Accepted: 06/18/2012] [Indexed: 11/26/2022]
|
16
|
Hasler M, Hothorn LA. Simultaneous confidence intervals on multivariate non-inferiority. Stat Med 2012; 32:1720-9. [PMID: 22996204 DOI: 10.1002/sim.5633] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2011] [Accepted: 09/04/2012] [Indexed: 11/07/2022]
Abstract
We discuss the analysis of non-inferiority trials based on a multi-armed design with multiple correlated endpoints. We consider five different scenarios: (1) global non-inferiority; (2) non-inferiority for subsets of comparisons; (3) global non-inferiority for a treatment group; (4) global non-inferiority for an endpoint; and (5) local non-inferiority. We describe and compare related approaches, which are based on the union-intersection and intersection-union test principles - alone and in combination. Because non-inferiority thresholds for many differently scaled endpoints are rarely available in practice, the approaches described focus on the estimation of simultaneous confidence limits and their post hoc interpretation on non-inferiority. We discuss and demonstrate pros and cons by means of a real data example.
Collapse
Affiliation(s)
- M Hasler
- Lehrfach Variationsstatistik, Christian-Albrechts-University of Kiel, Germany.
| | | |
Collapse
|
17
|
Konietschke F, Hothorn LA. Evaluation of Toxicological Studies Using a Nonparametric Shirley-Type Trend Test for Comparing Several Dose Levels with a Control Group. Stat Biopharm Res 2012. [DOI: 10.1080/19466315.2011.633861] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
|
18
|
Röhmel J. On familywise type I error control for multiplicity in equivalence trials with three or more treatments. Biom J 2011; 53:914-26. [DOI: 10.1002/bimj.201100073] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Affiliation(s)
- Joachim Röhmel
- Bremen Institute for Prevention Research and Social Medicine (BIPS), Achterstrasse 30, 28359 Bremen, Germany
| |
Collapse
|
19
|
|
20
|
Tao J, Tang ML, Shi NZ. A STEPWISE CONFIDENCE INTERVAL PROCEDURE BASED ON AN ASYMMETRIC LOSS FUNCTION WITH APPLICATIONS TO TOXICOLOGICAL EVALUATION. AUST NZ J STAT 2010. [DOI: 10.1111/j.1467-842x.2010.00568.x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
21
|
Schaarschmidt F, Sill M, Hothorn LA. Poly-k-Trend Tests for Survival Adjusted Analysis of Tumor Rates Formulated as Approximate Multiple Contrast Test. J Biopharm Stat 2008; 18:934-48. [DOI: 10.1080/10543400802294285] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Affiliation(s)
- Frank Schaarschmidt
- a Leibniz University Hannover, Institute of Biostatistics , Hannover, Germany
| | - Martin Sill
- a Leibniz University Hannover, Institute of Biostatistics , Hannover, Germany
| | - Ludwig A. Hothorn
- a Leibniz University Hannover, Institute of Biostatistics , Hannover, Germany
| |
Collapse
|