1
|
Stander S, Kim BS, Guillemin I, Rhoten S, Wratten S, Brookes E, O'Malley JT, Bansal A, Msihid J, Thomas R, Bahloul D. Estimating meaningful change thresholds for Skin Pain-Numeric Rating Scale, Sleep-Numeric Rating Scale and Dermatology Life Quality Index in patients with prurigo nodularis. J Eur Acad Dermatol Venereol 2024; 38:1401-1409. [PMID: 38329222 DOI: 10.1111/jdv.19800] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2023] [Accepted: 11/21/2023] [Indexed: 02/09/2024]
Abstract
BACKGROUND Prurigo nodularis (PN) is characterized by intensely itchy nodules/lesions and skin pain, which can have a substantial impact on health-related quality of life (HRQoL). Treatment benefits on such symptoms and impacts are best assessed in trials using patient-reported outcome (PROs) instruments such as Skin Pain Numerical Rating Scale (NRS), Sleep-NRS and Dermatology Life Quality Index (DLQI). However, no guidance exists for interpreting meaningful changes in scores using these PROs in patients with PN. OBJECTIVES The main objective was to derive within-patient (responder definition) and between-group improvement thresholds for interpreting Skin Pain-NRS, Sleep-NRS and DLQI total scores in patients with PN. The measurement properties of the three PROs were also evaluated. METHODS Intention-to-treat (ITT), blinded and pooled data were used from the Phase 3 PRIME (NCT04183335) and PRIME2 (NCT04202679) studies evaluating the efficacy of dupilumab in adult patients with PN. Anchor- and distribution-based methods were applied to derive responder definition and between-group thresholds for Skin Pain-NRS, Sleep-NRS and DLQI. Data were additionally used to examine the instrument measurement properties, including reliability, validity and responsiveness. RESULTS A total of 311 patients (mean age 49.5 years, 65.3% female) were included in the pooled ITT population. The within-patient improvement threshold for Skin Pain-NRS was estimated as 4.0 points, 2.0 points for Sleep-NRS and 9.0 points for DLQI total score. A 1.5-point improvement in Skin Pain-NRS scores, 1.0-point in Sleep-NRS and 4.0-point in DLQI indicated a between-group meaningful change. Adequate to good psychometric properties were demonstrated for all three instruments. CONCLUSIONS The results of this study can aid interpretation of Skin Pain-NRS, Sleep-NRS and DLQI scores in patients with PN in both clinical trials and clinical practice to better understand and treat PN-related skin pain and the impact of PN on sleep quality and HRQoL.
Collapse
Affiliation(s)
- S Stander
- University Clinic Münster, Münster, Germany
| | - B S Kim
- Icahn School of Medicine at Mount Sinai, New York, New York, USA
| | | | - S Rhoten
- IQVIA, San Francisco, California, USA
| | | | | | | | - A Bansal
- Regeneron, Tarrytown, New York, USA
| | | | - R Thomas
- Regeneron, Tarrytown, New York, USA
| | | |
Collapse
|
2
|
Swigris JJ, Aronson K, Fernández Pérez ER. A first look at the reliability, validity and responsiveness of L-PF-35 dyspnea domain scores in fibrotic hypersensitivity pneumonitis. BMC Pulm Med 2024; 24:188. [PMID: 38641768 PMCID: PMC11031991 DOI: 10.1186/s12890-024-02991-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2023] [Accepted: 04/02/2024] [Indexed: 04/21/2024] Open
Abstract
BACKGROUND Dyspnea impairs quality of life (QOL) in patients with fibrotic hypersensitivity pneumonitis (FHP). The Living with Pulmonary Fibrosis questionnaire (L-PF) assesses symptoms, their impacts and PF-related QOL in patients with any form of PF. Its scores have not undergone validation analyses in an FHP cohort. METHODS We used data from the Pirfenidone in FHP trial to examine reliability, validity and responsiveness of the L-PF-35 Dyspnea domain score (Dyspnea) and to estimate its meaningful within-patient change (MWPC) threshold for worsening. Lack of suitable anchors precluded conducting analyses for other L-PF-35 scores. RESULTS At baseline, Dyspnea's internal consistency (Cronbach's coefficient alpha) was 0.85; there were significant correlations with all four anchors (University of California San Diego Shortness of Breath Questionnaire scores r = 0.81, St. George's Activity domain score r = 0.82, percent predicted forced vital capacity r = 0.37, and percent predicted diffusing capacity of the lung for carbon monoxide r = 0.37). Dyspnea was significantly different between anchor subgroups (e.g., lowest percent predicted forced vital capacity (FVC%) vs. highest, 33.5 ± 18.5 vs. 11.1 ± 9.8, p = 0.01). There were significant correlations between changes in Dyspnea and changes in anchor scores at all trial time points. Longitudinal models further confirmed responsiveness. The MWPC threshold estimate for worsening was 6.6 points (range 5-8). CONCLUSION The L-PF-35 Dyspnea domain appears to possess acceptable psychometric properties for assessing dyspnea in patients with FHP. Because instrument validation is never accomplished with one study, additional research is needed to build on the foundation these analyses provide. TRIAL REGISTRATION The data for the analyses presented in this manuscript were generated in a trial registered on ClinicalTrials.gov; the identifier was NCT02958917.
Collapse
Affiliation(s)
- Jeffrey J Swigris
- Center for Interstitial Lung Disease, National Jewish Health, 1400 Jackson Street, G07, 80206, Denver, CO, USA.
| | - Kerri Aronson
- Division of Pulmonary and Critical Care Medicine, Weill Cornell College of Medicine, New York, NY, USA
| | - Evans R Fernández Pérez
- Center for Interstitial Lung Disease, National Jewish Health, 1400 Jackson Street, G07, 80206, Denver, CO, USA
| |
Collapse
|
3
|
Sims J, Sloesen B, Bentley S, Naujoks C, Arbuckle R, Chiva-Razavi S, Pascoe B, Stochl J, Findley A, O'Brien P, Wolffsohn JS. Psychometric evaluation of the near activity visual questionnaire presbyopia (NAVQ-P) and additional patient-reported outcome items. J Patient Rep Outcomes 2024; 8:41. [PMID: 38592339 PMCID: PMC11004101 DOI: 10.1186/s41687-024-00717-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2023] [Accepted: 03/18/2024] [Indexed: 04/10/2024] Open
Abstract
BACKGROUND The Near Visual Acuity Questionnaire Presbyopia (NAVQ-P) is a patient-reported outcome (PRO) measure that was developed in a phakic presbyopia population to assess near vision function impacts. The study refined and explored the psychometric properties and score interpretability of the NAVQ-P and additional PRO items assessing near vision correction independence (NVCI), near vision satisfaction (NVS), and near vision correction preference (NVCP). METHODS This was a psychometric validation study conducted using PRO data collected as part of a Phase IIb clinical trial (CUN8R44 A2202) consisting of 235 randomized adults with presbyopia from the US, Japan, Australia, and Canada. Data collected at baseline, week 2, and months 1, 2, and 3 during the 3-month trial treatment period were included in the analyses to assess item (question) properties, NAVQ-P dimensionality and scoring, reliability, validity, and score interpretation. RESULTS Item responses were distributed across the full response scale for most NAVQ-P and additional PRO items. Confirmatory factor analysis supported the pre-defined unidimensional structure and calculation of a NAVQ-P total score as a measure of near vision function. Item deletion informed by item response distributions, dimensionality analyses, item response theory, and previous qualitative findings, including clinical input, supported retention of 14 NAVQ-P items. The 14-item NAVQ-P total score had excellent internal consistency (α = 0.979) and high test-retest reliability (Intraclass Correlation Coefficients > = 0.898). There was good evidence of construct-related validity for all PROs supported by strong correlations with concurrent measures. Excellent results for known-groups validity and ability to detect change analyses were also demonstrated. Anchor-based and distribution-based methods supported interpretation of scores through generation of group-level and within-individual estimates of meaningful change thresholds. A meaningful within-patient change in the range of 8-15-point improvement on the NAVQ-P total score (score range 0-42) was recommended, including a more specific responder definition of 10-point improvement. CONCLUSIONS The NAVQ-P, NVCI, and NVS are valid and reliable instruments which have the ability to detect change over time. Findings strongly support the use of these measures as outcome assessments in clinical/research studies and in clinical practice in the presbyopia population.
Collapse
Affiliation(s)
- Joel Sims
- Adelphi Values, Patient-Centered Outcomes, Bollington, UK.
| | | | - Sarah Bentley
- Adelphi Values, Patient-Centered Outcomes, Bollington, UK
| | | | - Rob Arbuckle
- Adelphi Values, Patient-Centered Outcomes, Bollington, UK
| | | | - Ben Pascoe
- Adelphi Values, Patient-Centered Outcomes, Bollington, UK
| | - Jan Stochl
- Adelphi Values, Patient-Centered Outcomes, Bollington, UK
- Charles University, Prague, Czechia
| | - Amy Findley
- Adelphi Values, Patient-Centered Outcomes, Bollington, UK
| | | | - James S Wolffsohn
- School of Optometry, College of Health and Life Sciences, Aston University, Birmingham, UK
| |
Collapse
|
4
|
Zeidler C, Stander S, Rhoten S, Wratten S, Zhang D, Msihid J, Brookes E, Thomas R, Bahloul D. Validation of a scoring algorithm for the clinician-reported outcome tool 'prurigo activity and severity (PAS)' based on clinical studies of dupilumab in adults with prurigo Nodularis. J Eur Acad Dermatol Venereol 2024. [PMID: 38563058 DOI: 10.1111/jdv.19961] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Accepted: 02/16/2024] [Indexed: 04/04/2024]
Abstract
BACKGROUND Prurigo nodularis (PN) also known as chronic prurigo, is a chronic inflammatory skin disease characterized by intensely itchy nodules/lesions which occur due to intensive scratching. PN management is, in part, based on clinician evaluations of PN lesions, which can be supported by clinician-reported outcomes (ClinRO) such as the Prurigo Activity and Severity (PAS) instrument. A 5-item version of PAS was included in recent phase-3 dupilumab PN trials (PRIME [NCT04183335]/PRIME2 [NCT04202679]). The PAS score was derived using the unweighted sum of 3-items of the 5-item PAS (range, 0-11; higher score indicates worse activity and severity): Item 2 (number of lesions), Item 5a (percentage of lesions with excoriations/crusts) and Item 5b (percentage of healed lesions) for use in clinical practice and for communication of treatment benefit to physicians. OBJECTIVES To evaluate the measurement properties of PAS score and derive within-patient (responder definition) and between-group improvement thresholds for interpreting changes in PAS score in patients with PN. METHODS The data source was the pooled treatment group, intention-to-treat (ITT) data from the phase-3 PRIME (NCT04183335) and PRIME2 (NCT04202679) studies evaluating the efficacy of dupilumab in adult patients with PN with ≥20 nodules and severe itch uncontrolled with topical therapies. PAS score reliability, validity and sensitivity to change were evaluated, and anchor- and distribution-based methods were applied to derive meaningful change thresholds. RESULTS The pooled ITT population included 311 patients (mean age 49.5 years, 65.3% female). Adequate to good psychometric properties were demonstrated for PAS score. The within-patient meaningful improvement threshold was estimated as 3.0 points (absolute change) and 37% (per cent change). A 1.7-point (absolute change) and 20% (per cent change) improvement were estimated to reflect a between-group meaningful change in PAS score. CONCLUSIONS PAS score is a simple, clinically relevant indicator of PN lesion activity and severity supported by suitable psychometric performance.
Collapse
Affiliation(s)
- Claudia Zeidler
- Center for Chronic Pruritus, University Hospital Münster, Münster, Germany
| | - Sonja Stander
- Center for Chronic Pruritus, University Hospital Münster, Münster, Germany
| | | | | | | | | | | | | | | |
Collapse
|
5
|
Clarke NA, Braverman J, Worthy G, Shaw JW, Bennett B, Dhanda D, Cocks K. A Review of Meaningful Change Thresholds for EORTC QLQ-C30 and FACT-G Within Oncology. VALUE IN HEALTH : THE JOURNAL OF THE INTERNATIONAL SOCIETY FOR PHARMACOECONOMICS AND OUTCOMES RESEARCH 2024; 27:458-468. [PMID: 38191023 DOI: 10.1016/j.jval.2023.12.012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/13/2023] [Revised: 12/11/2023] [Accepted: 12/22/2023] [Indexed: 01/10/2024]
Abstract
OBJECTIVES This literature review provides an overview of meaningful change thresholds for the European Organisation for Research and Treatment of Cancer Quality of Life Questionnaire Core 30 (QLQ-C30) and the Functional Assessment of Cancer Therapy - General (FACT-G) used across hematological cancers and solid tumors (melanoma, lung, bladder, and prostate). METHODS Embase, MEDLINE, and PubMed were searched to identify relevant oncology publications from 2016 to 2021. Label claims from the US Food and Drug Administration and the European Medicines Agency for 7 recently approved drugs (pembrolizumab, atezolizumab, glasdegib, gilteritinib, tisagenlecleucel, axicabtagene ciloleucel, and daratumumab plus hyaluronidase-fihj) were reviewed. RESULTS Publications providing guidance on meaningful change thresholds for the QLQ-C30 displayed a growing trend away from broad "legacy" thresholds of 10 points for all QLQ-C30 scales), toward deriving "contemporary" thresholds (eg, subscale specific, population specific). Contemporary publications generally provide guidance on selecting thresholds for specific scales that account for improved or worsening thresholds (eg, QLQ-C30 subscales). This trend was not clear for FACT-G, with less new guidance available. Most clinical trials used in regulatory label submissions have used thresholds of 10 points for the QLQ-C30 subscales and 3 to 7 points for the FACT-G total score. Despite the availability of more recent guidelines, contemporary meaningful change thresholds seem slow to emerge in the published literature and regulatory labels. CONCLUSIONS Trialists should consider using contemporary thresholds, rather than legacy thresholds, for QLQ-C30 endpoints. Thresholds derived for a similar patient-population should be used where available. Further work is required to provide these across a broader range of cancer sites.
Collapse
Affiliation(s)
- Nathan A Clarke
- Statistics and Programming, Adelphi Values, Bollington, Cheshire, England, UK.
| | - Julia Braverman
- Worldwide Health and Economic Outcomes Research, Bristol Myers Squib, Princeton, NJ, USA
| | - Gill Worthy
- Statistics and Programming, Adelphi Values, Bollington, Cheshire, England, UK
| | - James W Shaw
- Worldwide Health and Economic Outcomes Research, Bristol Myers Squib, Princeton, NJ, USA
| | - Bryan Bennett
- Worldwide Health and Economic Outcomes Research, Bristol Myers Squib, Uxbridge, England, UK
| | - Devender Dhanda
- Worldwide Health and Economic Outcomes Research, Bristol Myers Squib, Princeton, NJ, USA
| | - Kim Cocks
- Statistics and Programming, Adelphi Values, Bollington, Cheshire, England, UK
| |
Collapse
|
6
|
Silverberg JI, Agner T, Baranowski K, Plohberger U, Thoning H, Arbuckle R, Grant L, Skingley G, Bissonnette R. Validation of the Investigator Global Assessment of Chronic Hand Eczema (IGA-CHE): a new clinician reported outcome measure of CHE severity. Arch Dermatol Res 2024; 316:110. [PMID: 38507100 PMCID: PMC10955004 DOI: 10.1007/s00403-024-02818-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2023] [Revised: 12/20/2023] [Accepted: 01/11/2024] [Indexed: 03/22/2024]
Abstract
The Investigator Global Assessment of Chronic Hand Eczema (IGA-CHE) is a novel Clinician-Reported Outcome measure that allows investigators to assess cross-sectional CHE global disease severity using clinical characteristics of erythema, scaling, lichenification/hyperkeratosis, vesiculation, oedema, and fissures as guidelines for overall severity assessment. This study aimed to evaluate the psychometric properties of the IGA-CHE for use as an outcome measure in CHE clinical trials and clinical practice. Psychometric analyses were performed using data from a sample of 280 patients with moderate to severe CHE from a phase 3 trial of delgocitinib cream, pooled across treatment groups. Test-retest reliability results were moderate to strong with kappa coefficients ranging from 0.63 to 0.76. Correlations with measures assessing related concepts were moderate or strong (range 0.65-0.72) and exceeded a priori hypotheses, providing evidence of convergent validity. Known-groups validity was supported by statistically significant differences between severity groups (< 0.001). Within-group effect sizes were consistently larger for improved groups compared to stable groups, providing evidence of ability to detect change. Anchor-based analyses generated within-subject meaningful change estimates ranging from - 0.8 to - 2.3. A correlation weighted average suggested a single value of - 1.7 in change from baseline. These findings provide evidence the IGA-CHE scale has strong reliability, construct validity, and ability to detect change, supporting its use as an endpoint in CHE clinical trials and clinical practice. Based on the evidence, 2-level changes in IGA-CHE score are considered a conservative meaningful change threshold; however, findings also indicate 1-level change in IGA-CHE scores reflects a clinically meaningful improvement for patients.Clinical trial registration: NCT04871711.
Collapse
Affiliation(s)
- Jonathan I Silverberg
- School of Medicine and Health Sciences, The George Washington University, Washington, DC, USA.
| | - Tove Agner
- Department of Dermatology, Bispebjerg Hospital, University of Copenhagen, Copenhagen, Denmark
| | | | | | | | - Rob Arbuckle
- Patient-Centered Outcomes, Adelphi Values Ltd, Bollington, Cheshire, UK
| | - Laura Grant
- Patient-Centered Outcomes, Adelphi Values Ltd, Bollington, Cheshire, UK
| | - George Skingley
- Patient-Centered Outcomes, Adelphi Values Ltd, Bollington, Cheshire, UK
| | | |
Collapse
|
7
|
Barbosa-Silva J, Calixtre LB, Von Piekartz D, Driusso P, Armijo-Olivo S. The minimal important difference of patient-reported outcome measures related to female urinary incontinence: a systematic review. BMC Med Res Methodol 2024; 24:60. [PMID: 38459428 PMCID: PMC10921720 DOI: 10.1186/s12874-024-02188-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2023] [Accepted: 02/22/2024] [Indexed: 03/10/2024] Open
Abstract
BACKGROUND The minimal important difference is a valuable metric in ascertaining the clinical relevance of a treatment, offering valuable guidance in patient management. There is a lack of available evidence concerning this metric in the context of outcomes related to female urinary incontinence, which might negatively impact clinical decision-making. OBJECTIVES To summarize the minimal important difference of patient-reported outcome measures associated with urinary incontinence, calculated according to both distribution- and anchor-based methods. METHODS This is a systematic review conducted according to the PRISMA guidelines. The search strategy including the main terms for urinary incontinence and minimal important difference were used in five different databases (Medline, Embase, CINAHL, Web of Science, and Scopus) in 09 June 2021 and were updated in January 09, 2024 with no limits for date, language or publication status. Studies that provided minimal important difference (distribution- or anchor-based methods) for patient-reported outcome measures related to female urinary incontinence outcomes were included. The study selection and data extraction were performed independently by two different researchers. Only studies that reported the minimal important difference according to anchor-based methods were assessed by credibility and certainty of the evidence. When possible, absolute minimal important differences were calculated for each study separately according to the mean change of the group of participants that slightly improved. RESULTS Twelve studies were included. Thirteen questionnaires with their respective minimal important differences reported according to distribution (effect size, standard error of measurement, standardized response mean) and anchor-based methods were found. Most of the measures for anchor methods did not consider the smallest difference identified by the participants to calculate the minimal important difference. All reports related to anchor-based methods presented low credibility and very low certainty of the evidence. We pooled 20 different estimates of minimal important differences using data from primary studies, considering different anchors and questionnaires. CONCLUSIONS There is a high variability around the minimal important difference related to patient-reported outcome measures for urinary incontinence outcomes according to the method of analysis, questionnaires, and anchors used, however, the credibility and certainty of the evidence to support these is still limited.
Collapse
Affiliation(s)
- Jordana Barbosa-Silva
- Women's Health Research Laboratory (LAMU), Physical Therapy Department, Federal University of São Carlos, Rodovia Washington Luís, km 235, Monjolinho, São Carlos, SP, 13565-905, Brazil.
- Faculty of Business and Social Sciences, University of Applied Sciences - Hochschule Osnabrück, Osnabrück, Germany.
| | | | - Daniela Von Piekartz
- Faculty of Business and Social Sciences, University of Applied Sciences - Hochschule Osnabrück, Osnabrück, Germany
| | - Patricia Driusso
- Women's Health Research Laboratory (LAMU), Physical Therapy Department, Federal University of São Carlos, Rodovia Washington Luís, km 235, Monjolinho, São Carlos, SP, 13565-905, Brazil
| | - Susan Armijo-Olivo
- Faculty of Business and Social Sciences, University of Applied Sciences - Hochschule Osnabrück, Osnabrück, Germany
- Faculty of Rehabilitation Medicine/Faculty of Medicine and Dentistry, University of Alberta, Edmonton, Canada
| |
Collapse
|
8
|
Molin S, Larsen LS, Joensson P, Oesterdal ML, Arbuckle R, Grant L, Skingley G, Schuttelaar MLA. Development and Psychometric Validation of a Patient-Reported Outcome Measure to Assess the Signs and Symptoms of Chronic Hand Eczema: The Hand Eczema Symptom Diary (HESD). Dermatol Ther (Heidelb) 2024; 14:643-669. [PMID: 38485862 PMCID: PMC10965865 DOI: 10.1007/s13555-024-01114-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2023] [Accepted: 02/09/2024] [Indexed: 03/27/2024] Open
Abstract
INTRODUCTION Chronic Hand Eczema (CHE) is an inflammatory skin disease of the hands. The Hand Eczema Symptom Diary (HESD) is a new patient-reported outcome measure of worst severity of core CHE signs/symptoms. This study aimed to evaluate content and psychometric validity of the HESD. METHODS The HESD was developed based on the literature and concept elicitation interviews. Qualitative cognitive debriefing interviews were conducted with CHE patients to assess relevance and understanding of items, response options and recall period. Psychometric properties of the HESD (item performance, dimensionality, reliability, validity, responsiveness and estimation of meaningful change thresholds) were then assessed, first using data from a phase 2b trial (NCT03683719), and confirmed using data from the first 280 participants completing the 16-week treatment phase of a phase 3 trial (NCT04871711). RESULTS Cognitive debriefing supported item refinement and removal of items and confirmed all items were well understood and relevant to patients. Item properties and dimensionality analyses in the phase 2b data supported removal of additional items, resulting in the 6-item HESD included in the phase 3 trial. Unidimensionality was supported by inter-item correlations (all > 0.70) and Rasch analysis. Internal consistency (Cronbach's alpha = 0.96) and test-retest reliability (Intraclass Correlation Coefficient > 0.89) results were very strong. Construct validity was supported by moderate correlations with concurrent measures (0.53-0.64) and significant differences between severity groups (p < 0.001). Large effect sizes for mean change scores in participants that improved and significant differences between change groups indicated the ability to detect change. Anchor-based analyses supported within-individual responder definitions of ≥ 4-points for improvements in 7-day average HESD scores. CONCLUSION The HESD is the first CHE-specific, patient-reported outcome measure of CHE signs/symptoms developed and validated in line with regulatory guidance. This article provides evidence of strong content validity and psychometric validity and shows improvements of ≥ 4 points on 7-day average HESD scores represent clinically meaningful, important changes. TRIAL REGISTRATION NCT03683719, NCT04871711.
Collapse
Affiliation(s)
- Sonja Molin
- Division of Dermatology, Queen's University, Kingston, ON, Canada.
| | | | | | | | - Rob Arbuckle
- Adelphi Values Ltd, Patient-Centered Outcomes, Bollington, Cheshire, UK
| | - Laura Grant
- Adelphi Values Ltd, Patient-Centered Outcomes, Bollington, Cheshire, UK
| | - George Skingley
- Adelphi Values Ltd, Patient-Centered Outcomes, Bollington, Cheshire, UK
| | | |
Collapse
|
9
|
Wells JR, Hillier A, Holland R, Mwacalimba K, Noli C, Panter C, Tatlock S, Wright A. Development and validation of a questionnaire to assess owner and canine quality-of-life and treatment satisfaction in canine allergic dermatitis. Vet Dermatol 2024. [PMID: 38361109 DOI: 10.1111/vde.13242] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2023] [Revised: 12/06/2023] [Accepted: 01/28/2024] [Indexed: 02/17/2024]
Abstract
BACKGROUND Animal and owner quality-of-life (QoL) is pivotal in treatment decisions. Accurate measurement of owner-reported QoL and treatment satisfaction (TS) supports disease burden and treatment benefit evaluation. OBJECTIVES Develop and evaluate an owner-completed canine dermatitis QoL and TS questionnaire (CDQoL-TSQ) in allergic dogs. MATERIALS AND METHODS The CDQoL-TSQ was drafted following review of existing measures and expert input. Content validity was assessed through interviews with owners of allergic dogs. Psychometric properties of the QoL domains (Canine QoL, Owner QoL) were evaluated. Score interpretation was derived. RESULTS Twenty dog owners were interviewed. Item wording was amended following the first 10 interviews. Data from 211 owners were used in the psychometric evaluation. The Canine QoL domain demonstrated strong internal consistency (α = 0.89), test-retest reliability (ICC2,1 = 0.844), moderate convergent validity (r = 0.41) and moderate-high known-groups validity (effect size 0.37-0.64). The Owner QoL domain demonstrated strong internal consistency (α = 0.73), high convergent validity (r = 0.63) and moderate-high known-groups validity (0.43-0.63). Test-retest reliability approached moderate strength (ICC2,1 = 0.490). Group-level interpretation analysis showed minimal important difference of 7.0-13.6 points for dogs and 13.0-13.6 for owners. For individual dogs a change of 6.3 or 12.5 points for dogs, and 12.5 or 18.8 for owners indicates a response. CONCLUSIONS AND CLINICAL RELEVANCE The CDQOL-TSQ is a two-part assessment to evaluate QoL and TS in canine allergic dermatitis. The QoL questionnaire demonstrated validity and reliability, and interpretation of scores was derived, making it suitable for use in research and practice. The TS module is suitable for clinical setting use to improve owner-veterinarian communication.
Collapse
Affiliation(s)
- J R Wells
- Patient-Centered Outcomes, Adelphi Values Ltd, Bollington, UK
- Clinical Outcomes Assessment Department, Sanofi, UK
| | | | | | | | - C Noli
- Servizi Dermatologici Veterinari, Peveragno, Italy
| | - C Panter
- Patient-Centered Outcomes, Adelphi Values Ltd, Bollington, UK
| | - S Tatlock
- Patient-Centered Outcomes, Adelphi Values Ltd, Bollington, UK
| | - A Wright
- Zoetis, Parsippany, New Jersey, USA
| |
Collapse
|
10
|
Mitchell SA, Hunter R, Fry A, Pavletic SZ, Widemann BC, Wiener L. Development and psychometric testing of a pediatric chronic graft-versus-host disease symptom scale: protocol for a two-phase, mixed methods study. Front Psychol 2024; 14:1243005. [PMID: 38259542 PMCID: PMC10800914 DOI: 10.3389/fpsyg.2023.1243005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2023] [Accepted: 11/23/2023] [Indexed: 01/24/2024] Open
Abstract
Background Chronic graft-versus-host disease (cGVHD) is a debilitating late complication of hematopoietic stem cell transplantation. It is often accompanied by extensive symptom burden. No validated cGVHD patient-reported outcome (PRO) measure exists to evaluate cGVHD symptom bother in children and adolescents younger than 18 years. This paper presents the study protocol for a multi-center, two-phase protocol to develop a psychometrically valid pediatric cGVHD Symptom Scale (PCSS) and a companion caregiver-proxy measure to capture the symptom burden experienced by children with cGVHD. In the first phase of the study, our aim is to evaluate the comprehension, clarity and ease of response of the PCSS through cognitive interviewing and to iteratively refine the measure to optimize content validity. In the second phase of the study, we will quantitatively examine the measurement properties of the PCSS in children and their caregiver-proxies. Methods and analysis Eligible participants are children/adolescents ages 5-17 with cGVHD who are receiving systemic immunosuppressive treatment or have recently tapered to discontinuation. In the first phase, we are enrolling 60 child and caregiver-proxy dyads in three child age strata (5-7, 8-12, and 13-17 years old). Semi-scripted cognitive debriefing interviews are conducted to assess comprehension, clarity, and ease of response of each PCSS item with the child alone, and then jointly with the caregiver-proxy to explore discordant ratings. In phase two, an age-stratified cohort of 120 child-caregiver dyads will be enrolled to evaluate test-retest reliability, construct validity, and responsiveness. Anchors for known-groups validity include the PedsQL module and clinical variables, including cGVHD clinician-rated severity scores. In participants ages 13-17, we will also compare responses on the PCSS with those from the Lee cGVHD Symptom Scale, to gauge the youngest age at which adolescent respondents can comprehend this adult measure. Discussion This study will yield a well-validated, counterpart measure to the Lee cGVHD Symptom Scale for use in children with cGVHD and their caregiver-proxies. This new patient-reported outcome measure can be integrated into clinical trials and care delivery for pediatric transplant survivors to improve the precision and accuracy with which their cGVHD symptom experience is captured. Clinical trial registration www.ClinicalTrials.gov, NCT04044365.
Collapse
Affiliation(s)
- Sandra A. Mitchell
- Outcomes Research Branch, Healthcare Delivery Research Program, National Cancer Institute, Bethesda, MD, United States
| | - Rachael Hunter
- Pediatric Oncology Branch, Center for Cancer Research, National Cancer Institute, Bethesda, MD, United States
| | - Abigail Fry
- Pediatric Oncology Branch, Center for Cancer Research, National Cancer Institute, Bethesda, MD, United States
| | - Steven Z. Pavletic
- Immune Deficiency Cellular Therapy Program, Center for Cancer Research, National Cancer Institute, Bethesda, MD, United States
| | - Brigitte C. Widemann
- Pediatric Oncology Branch, Center for Cancer Research, National Cancer Institute, Bethesda, MD, United States
| | - Lori Wiener
- Pediatric Oncology Branch, Center for Cancer Research, National Cancer Institute, Bethesda, MD, United States
| |
Collapse
|
11
|
Swigris JJ, Danoff S, Dellaripa PF, Doyle TJ, Solomon JJ. Reliability and responsiveness of the D12 and validity of its scores as a measure of dyspnoea severity in patients with rheumatoid arthritis-related interstitial lung disease. BMJ Open Respir Res 2023; 10:e001872. [PMID: 37748807 PMCID: PMC10533789 DOI: 10.1136/bmjresp-2023-001872] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2023] [Accepted: 09/01/2023] [Indexed: 09/27/2023] Open
Abstract
BACKGROUND Interstitial lung disease due to rheumatoid arthritis (RA-ILD) affects a substantial minority of patients with RA, inducing life-altering symptoms, impairing quality of life (QOL) and forcing patients to confront the potential for shortened survival. Dyspnoea is the predominant respiratory symptom of RA-ILD and a strong driver of QOL impairment in patients with it. The D12 is a 12-item questionnaire that assesses the physical and affective components of dyspnoea. It was one of a battery of patient-reported outcomes used in the double-blind, placebo-controlled TRAIL 1 trial of pirfenidone for RA-ILD. There is little information on the reliability, validity or responsiveness of the D12 in RA-ILD. METHODS In accordance with COSMIN (COnsensus-based Standards for the selection of health Measurement INstruments) methodology, we conducted analyses on data from the TRAIL 1 trial to assess the measurement properties of the D12. RESULTS Internal consistency (α=0.95, 0.95, 0.95, 0.95 and 0.96 at baseline, 13, 26, 39 and 52 weeks) and test-retest reliability 0.85 (0.71 to 0.92) exceeded acceptability criteria. Well over the 75% benchmark of hypotheses (43/46=93%) around D12 measurement properties were confirmed. Known-groups validity was supported by significant differences between subgroups of patients with differing levels of dyspnoea (eg, St. George's Respiratory Questionnaire (SGRQ) Activity score ≥50 vs <50, 9.36 (1.27) points, p<0.0001, with a large effect size=1.7) and physiological impairment at baseline. Longitudinal validity was supported by significant associations between D12 and anchor scores over time (eg, at 52 weeks, correlation between D12 change and SGRQ Activity change was 0.54, p<0.0001; between D12 change and Routine Assessment of Patient Index Data (RAPID) Functioning Component was 0.41, p<0.0001). A battery of analyses confirmed the responsiveness of D12 scores for capturing change in dyspnoea over time. We estimated the minimal within-patient change threshold for worsening as 3 points. CONCLUSIONS D12 scores possess acceptable measurement properties in RA-ILD, such that it can be used with confidence in this population to assess dyspnoea severity defined by its physical and affective components. As validation is an ongoing process, and never accomplished in a single study, additional research on the psychometric properties of the D12 in RA-ILD is encouraged.
Collapse
Affiliation(s)
- Jeffrey J Swigris
- National Jewish Health Center for Interstitial Lung Disease, Denver, Colorado, USA
| | - Sonye Danoff
- Division of Pulmonary Critical Care Medicine, Johns Hopkins University School of Medicine, Baltimore, Maryland, USA
| | - Paul F Dellaripa
- Division of Rheumatology, Immunology and Allergy, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, USA
| | - Tracy J Doyle
- Division of Pulmonary and Critical Care Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, USA
| | - Joshua J Solomon
- National Jewish Health Center for Interstitial Lung Disease, Denver, Colorado, USA
| |
Collapse
|
12
|
Mc Carthy M, Burrows K, Griffiths P, Black PM, Demanuele C, Karlsson N, Buenconsejo J, Patel N, Chen WH, Cappelleri JC. From Meaningful Outcomes to Meaningful Change Thresholds: A Path to Progress for Establishing Digital Endpoints. Ther Innov Regul Sci 2023; 57:629-645. [PMID: 37020160 DOI: 10.1007/s43441-023-00502-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2022] [Accepted: 02/24/2023] [Indexed: 04/07/2023]
Abstract
This paper examines the use of digital endpoints (DEs) derived from digital health technologies (DHTs), focusing primarily on the specific considerations regarding the determination of meaningful change thresholds (MCT). Using DHTs in drug development is becoming more commonplace. There is general acceptance of the value of DHTs supporting patient-centric trial design, capturing data outside the traditional clinical trial setting, and generating DEs with the potential to be more sensitive to change than conventional assessments. However, the transition from exploratory endpoints to primary and secondary endpoints capable of supporting labeling claims requires these endpoints to be substantive with reproducible population-specific values. Meaningful change represents the amount of change in an endpoint measure perceived as important to patients and should be determined for each digital endpoint and given population under consideration. This paper examines existing approaches to determine meaningful change thresholds and explores examples of these methodologies and their use as part of DE development: emphasizing the importance of determining what aspects of health are important to patients and ensuring the DE captures these concepts of interest and aligns with the overarching endpoint strategy. Examples are drawn from published DE qualification documentation and responses to qualification submissions under review by the various regulatory authorities. It is the hope that these insights will inform and strengthen the development and validation of DEs as drug development tools, particularly for those new to the approaches to determine MCTs.
Collapse
|
13
|
Nercessian TR, Banbury S, Chandler C. A Systematic Review Looking at Anodyspareunia Among Cisgender Men and Women. JOURNAL OF SEX & MARITAL THERAPY 2023; 49:829-841. [PMID: 37089031 DOI: 10.1080/0092623x.2023.2196265] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
This systematic review aims to provide an up-to-date evaluation of the available literature on anodyspareunia and treatment interventions. It aims to identify research gaps and to evaluate treatment methods in psychosexual healthcare services. Electronic searches including PubMed, PsycINFO, Web of Science, and registered clinical trials yielded 8 studies. Research centered on men who have sex with men (MSM) and cisgender women with a 3:1 ratio. In terms of causes of anodyspareunia, correlates among both MSM and cisgender women included generalized anxiety, performance anxiety, compromised well-being, lack of stimulation, lack of lubrication, and lack of sexual arousal. While research has focused on anodyspareunia among cisgender gay men, limited research has targeted other genders or sexualities including heterosexual men and lesbian and bisexual women. No research was found on intersex, transgender, and gender nonconforming people. Similarly, there were no articles that discussed or evaluated treatment strategies. The findings of this review demonstrate the paucity of research on both the condition of anodyspareunia and its treatment across genders and sexually diverse groups. Both quantitative and qualitative research would not only develop our understanding of anodyspareunia but also contribute to public health through the development of suitable treatment interventions.
Collapse
Affiliation(s)
| | - Samantha Banbury
- Department of Psychology, School of Social Sciences and Professions, London Metropolitan University, London, UK
| | | |
Collapse
|
14
|
Trigg A, Lenderking WR, Boehnke JR. Introduction to the special section: "Methodologies and considerations for meaningful change". Qual Life Res 2023; 32:1223-1230. [PMID: 37027088 DOI: 10.1007/s11136-023-03413-1] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/08/2023]
Affiliation(s)
- Andrew Trigg
- Medical Affairs Statistics, Bayer plc, Reading, UK
| | | | - Jan R Boehnke
- School of Health Sciences, University of Dundee, 11 Airlie Place, Dundee, DD1 4HJ, UK.
| |
Collapse
|
15
|
Wratten S, Abetz-Webb L, Arenson E, Griffiths P, Bowman S, Hueber W, Ndife B, Kuessner D, Goswami P. Development and testing of an alternative responder definition for EULAR Sjögren's Syndrome Patient Reported Index (ESSPRI). RMD Open 2023; 9:rmdopen-2022-002721. [PMID: 36931685 PMCID: PMC10030922 DOI: 10.1136/rmdopen-2022-002721] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2022] [Accepted: 02/27/2023] [Indexed: 03/19/2023] Open
Abstract
OBJECTIVES Dryness, fatigue and joint/muscle pain are typically assessed in Sjögren's trials using European Alliance of Associations for Rheumatology Sjögren's Syndrome Patient Reported Index (ESSPRI). A Patient Acceptable Symptom State of <5 and a Minimal Clinically Important Improvement (MCII)/responder definition (RD) of ≥1 point or 15% on ESSPRI have previously been defined. This study explored alternative RDs to better discriminate between active treatment and placebo in trials. METHODS Anchor-based and distribution-based methods were used to derive RD thresholds in blinded phase IIb trial data (N=190) and confirm these in blinded data pooled from three early phase II trials (N=126). The populations consisted of individuals with moderate-to-severe systemic primary Sjögren's. Anchors were prioritised by ESSPRI correlations and used in similar conditions. Triangulated estimates were discussed with experts (N=3). The revised RD was compared with the original using unblinded data to assess placebo and treatment responder rates. RESULTS Patients were predominantly female (>90%), white (90%), with mean age of 50 years. Receiver operating characteristic estimates supported an MCII threshold of 1.5-1.6 in the phase II data, whereas correlation-weighted mean change estimates supported a low/minimal symptom severity threshold of ≥2. A low/minimal symptom severity of ≤3 showed the greatest sensitivity/specificity balance. Analyses in the pooled data supported these thresholds (MCII: 1.5-2.1; low/minimal symptom severity: 2.7-3.7). Unblinded analyses confirmed the revised RD reduced placebo rates. CONCLUSIONS Completing a trial with an improvement of ≥1.5 points compared with baseline and an ESSPRI score of ≤3 points is a relevant RD for moderate-to-severe systemic Sjögren's and reduces placebo rates.
Collapse
Affiliation(s)
| | | | - Ethan Arenson
- Patient-Centered Outcomes, Adelphi Values, Bollington, UK
| | - Pip Griffiths
- Patient-Centered Outcomes, Adelphi Values, Bollington, UK
| | - Simon Bowman
- Rheumatology, University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK
| | - Wolfgang Hueber
- Department of Immunology, Novartis Pharma, Basel, Switzerland
| | - Briana Ndife
- Department of Oncology, Novartis Pharmaceuticals Corporation, East Hanover, New Jersey, USA
| | - Daniel Kuessner
- Department of Immunology, Novartis Pharma, Basel, Switzerland
| | | |
Collapse
|
16
|
Transitions in depression: if, how, and when depressive symptoms return during and after discontinuing antidepressants. Qual Life Res 2022; 32:1295-1306. [PMID: 36418524 PMCID: PMC10123048 DOI: 10.1007/s11136-022-03301-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/12/2022] [Indexed: 11/25/2022]
Abstract
Abstract
Purpose
The aim of the current study is to provide insight into if, how, and when meaningful changes occur in individual patients who discontinue antidepressant medication. Agreement between macro-level quantitative symptom data, qualitative ratings, and micro-level Ecological Momentary Assessments is examined.
Methods
During and shortly after antidepressant discontinuation, depressive symptoms and ‘feeling down’ were measured in 56 participants, using the SCL-90 depression subscale weekly (macro-level) for 6 months, and 5 Ecological Momentary Assessments daily (micro-level) for 4 months (30.404 quantitative measurements in total). Qualitative information was also obtained, providing additional information to verify that changes were clinically meaningful.
Results
At the macro-level, an increase in depressive symptoms was found in 58.9% of participants that (a) was statistically reliable, (b) persisted for 3 weeks and/or required intervention, and (c) was clinically meaningful to patients. Of these increases, 30.3% happened suddenly, 42.4% gradually, and for 27.3% criteria were inconclusive. Quantitative and qualitative criteria showed a very high agreement (Cohen’s κ = 0.85) regarding if a participant experienced a recurrence of depression, but a moderate agreement (Cohen’s κ = 0.49) regarding how that change occurred. At the micro-level, 41.1% of participants experienced only sudden increases in depressed mood, 12.5% only gradual, 30.4% experienced both types of increase, and 16.1% neither.
Conclusion
Meaningful change is common in patients discontinuing antidepressants, and there is substantial heterogeneity in how and when these changes occur. Depressive symptom change at the macro-level is not the same as depressive symptom change at the micro-level.
Collapse
|