1
|
Development and validation of a novel patient-reported outcome measure in people with episodic migraine and chronic migraine: The Activity Impairment in Migraine Diary. Headache 2021; 62:89-105. [PMID: 34962305 PMCID: PMC9306594 DOI: 10.1111/head.14229] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2021] [Revised: 09/04/2021] [Accepted: 09/13/2021] [Indexed: 12/14/2022]
Abstract
Objective To evaluate the content validity and psychometric properties of the Activity Impairment in Migraine Diary (AIM‐D). Background Measuring treatment effects on migraine impairment requires a psychometrically sound patient‐reported outcome (PRO) measure developed consistent with U.S. Food and Drug Administration guidance. Methods The AIM‐D was created from concepts that emerged during qualitative interviews with five clinicians experienced in treating migraine and concept elicitation (CE) interviews with 40 adults with episodic migraine (EM) or chronic migraine (CM). The initial version was refined based on three waves of cognitive interviews with 38 adults with EM or CM and input from a panel of clinical and measurement experts. The AIM‐D was psychometrically evaluated using data from 316 adults with EM or CM who participated in a 13‐week prospective observational study. Study participants completed PRO assessments including the AIM‐D and a daily headache diary. Exploratory and confirmatory factor analysis were used to determine the factor structure. The reliability, validity, and responsiveness of the AIM‐D were assessed. Additional PRO measures including the Patient Global Impression – Severity (PGI‐S), Migraine Specific Quality of Life Questionnaire, Version 2.1 Role Function‐Restrictive domain, and Headache Impact Test were used for psychometric evaluation of the AIM‐D. Results Based on CE interviews with adults with migraine and input from an expert panel, activity impairment was identified as the target in the preliminary conceptual framework, which had two domains: performance of daily activities (PDAs) and physical impairment (PI). Revision of the draft AIM‐D through multiple rounds of cognitive interviews and expert panel meetings resulted in a content valid 11‐item version. Exploratory factor analysis supported both one‐ and two‐domain structures for the AIM‐D, which were further supported by confirmatory factor analysis (factor loadings all >0.90). The AIM‐D domains (PDA and PI) and total score showed high internal consistency reliability (Cronbach's alpha 0.95–0.97), acceptable test–retest reliability for weekly average scores (intraclass correlation coefficient >0.60 for participants with no change in PGI‐S between baseline and week 2), and good convergent and known‐groups validity. There was evidence of responsiveness based on changes in PGI‐S score and monthly migraine days. Conclusion The AIM‐D is a content valid and psychometrically sound measure designed to evaluate activity impairment and is suitable for use in clinical trials of preventive treatments for EM or CM.
Collapse
|
2
|
Clinician- and Patient-reported Endpoints in CNS Orphan Drug Clinical Trials: ISCTM Position Paper on Best Practices for Endpoint Selection, Validation, Training, and Standardization. INNOVATIONS IN CLINICAL NEUROSCIENCE 2021; 18:15-22. [PMID: 35096477 PMCID: PMC8794479] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
OBJECTIVE The International Society of CNS Clinical Trials Methodology (ISCTM) Working Group on Rare Disease/Orphan Drug Development is dedicated to improving and streamlining trials to best develop new treatments for rare diseases. The rarity of these disorders requires a drug development strategy that differs from those of nonrare conditions. Rare disease drug development programs are challenged with small sample sizes, heterogeneous clinical presentations, and few, if any, off-the-shelf endpoints. When disease-specific clinical endpoints exist, they might not be validated and are typically not well known or broadly used in clinical practice. This paper aims to provide an overview of the special issues surrounding endpoints in rare disease drug development, with guidance, practical applications, and discussion. DISCUSSION The paper covers regulatory considerations in endpoint selection; identification of relevant measurement domains; methods of quantifying clinical meaningfulness; incorporation of patient- and clinician-reported outcomes; considerations for global clinician- and patient-rated clinical assessments; cognition assessment challenges in rare diseases; translation considerations; training, standardization, and calibration of assessors; and endpoint quality assurance. Additionally, it provides guidance and resources for those involved in drug development for rare diseases. CONCLUSION In keeping with the mission of ISCTM and the rare disease/orphan drug development working group, this article is designed to encourage thoughtful consideration and provide insight and guidance to promote and further efforts in in central nervous system (CNS) rare disease drug development efforts.
Collapse
|
3
|
SPIRIT-PRO Extension explanation and elaboration: guidelines for inclusion of patient-reported outcomes in protocols of clinical trials. BMJ Open 2021; 11:e045105. [PMID: 34193486 PMCID: PMC8246371 DOI: 10.1136/bmjopen-2020-045105] [Citation(s) in RCA: 60] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/24/2020] [Revised: 12/21/2020] [Accepted: 01/08/2021] [Indexed: 01/21/2023] Open
Abstract
Patient-reported outcomes (PROs) are used in clinical trials to provide valuable evidence on the impact of disease and treatment on patients' symptoms, function and quality of life. High-quality PRO data from trials can inform shared decision-making, regulatory and economic analyses and health policy. Recent evidence suggests the PRO content of past trial protocols was often incomplete or unclear, leading to research waste. To address this issue, international, consensus-based, PRO-specific guidelines were developed: the Standard Protocol Items: Recommendations for Interventional Trials (SPIRIT)-PRO Extension. The SPIRIT-PRO Extension is a 16-item checklist which aims to improve the content and quality of aspects of clinical trial protocols relating to PRO data collection to minimise research waste, and ultimately better inform patient-centred care. This SPIRIT-PRO explanation and elaboration (E&E) paper provides information to promote understanding and facilitate uptake of the recommended checklist items, including a comprehensive protocol template. For each SPIRIT-PRO item, we provide a detailed description, one or more examples from existing trial protocols and supporting empirical evidence of the item's importance. We recommend this paper and protocol template be used alongside the SPIRIT 2013 and SPIRIT-PRO Extension paper to optimise the transparent development and review of trial protocols with PROs.
Collapse
|
4
|
Validation of Psychometric Properties of the Itch Numeric Rating Scale for Pruritus Associated With Prurigo Nodularis: A Secondary Analysis of a Randomized Clinical Trial. JAMA Dermatol 2021; 156:1354-1358. [PMID: 32936233 DOI: 10.1001/jamadermatol.2020.3071] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
Importance There is an unmet need for psychometrically sound instruments to measure pruritus associated with prurigo nodularis (PN). Objective To evaluate the psychometric properties of the itch numeric rating scale (itch NRS), both the Worst Itch Numeric Rating Scale (WI-NRS) and the Average Itch Numeric Rating Scale (AI-NRS). Design, Setting, and Participants This secondary analysis is based on a secondary end point of a phase 2 randomized clinical trial of serlopitant for treatment of pruritus associated with PN. This randomized, double-blind, placebo-controlled study was conducted at 15 sites in Germany. Eligible patients were aged 18 to 80 years and had generalized PN for more than 6 weeks that was refractory to previous antipruritic therapies. Patients were required to have a visual analog scale itch score of 7 or higher at screening. Data were collected from July 2014 to June 2016 and analyzed from June 2016 to January 2017. Main Outcomes and Measures The itch NRS (AI-NRS and WI-NRS) was correlated together with the following measures: the electronic verbal rating scale (eVRS) for itch self-categorization, average itch visual analog scale (AI-VAS), worst itch visual analog scale (WI-VAS), the pruritus-specific quality-of-life rating instrument ItchyQoL, Dermatology Life Quality Index (DLQI), and Prurigo Activity and Severity Score (items 7b and 7a: percentage healed prurigo lesions and percentage of prurigo lesions with excoriations). Results There were 123 participants in this study; the mean (SD) age of participants was 57.3 (11.58) years, and 58 (47.2%) were male. Strong associations (r ≥ 0.5) were observed between itch NRS items (WI-NRS and AI-NRS) and AI-VAS (24 hours) at weeks 2, 4, and 8 (r = 0.72-0.90; P < .001). Similar strong associations were also observed between itch NRS items and WI-VAS (24 hours) and eVRS for itch severity across weeks 2, 4, and 8 (r = 0.65-0.92; all P < .001). Strong correlations were seen between change scores for WI-NRS and WI-VAS and AI-VAS (r = 0.76 and 0.70, respectively; both P < .001). Similar findings were seen for AI-NRS, where correlations between change scores for WI-VAS and AI-VAS were 0.71 and 0.72, respectively (both P < .001). Analyses for the itch NRS items also showed that test-retest reliability was acceptable and provided evidence of acceptable convergent validity based on the eVRS and visit verbal rating score for itch self-categorization, ItchyQoL, and DLQI. Conclusions and Relevance Results from this secondary analysis show that the itch NRS items WI-NRS and AI-NRS have good psychometric properties for pruritus associated with PN and should be considered acceptable tools for assessing pruritus in future clinical trials of PN. Trial Registration ClinicalTrials.gov Identifier: NCT02196324.
Collapse
|
5
|
The Patient Experience of Premenopausal Women Treated with Bremelanotide for Hypoactive Sexual Desire Disorder: RECONNECT Exit Study Results. J Womens Health (Larchmt) 2021; 30:587-595. [PMID: 33538638 DOI: 10.1089/jwh.2020.8460] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Background: Hypoactive sexual desire disorder (HSDD) has a significant negative impact on women's overall health and relationships with their partners. Primary analyses from the RECONNECT clinical trials demonstrated statistically significant and clinically meaningful improvements in sexual desire and related distress with bremelanotide relative to placebo in premenopausal women with HSDD. Exit surveys and patient interviews were conducted to evaluate the impact of HSDD and bremelanotide treatment from the patient's perspective. Materials and Methods: Upon completion of the double-blind study but before participation in the open-label extension, up to 250 participants were recruited to complete the quantitative exit survey (17 questions). A subset of up to 90 patients was invited to participate in the telephone interview (17 questions). Patients who volunteered to participate remained blinded to study drug until the survey and interviews were completed. Results: Quantitative exit surveys were completed by 242 RECONNECT participants; 80 of these women also completed qualitative telephone exit interviews. Participants who received bremelanotide described increased feelings of sexual desire, physical arousal, and improvements in overall quality of their sexual activities in their partner relationship. In comparison, women taking placebo reported benefits that did not include the physiological responses described by women receiving bremelanotide, such as positive experiences of seeking HSDD treatment and improved communication with their partner. Conclusions: Exit surveys and patient interviews support the primary findings from RECONNECT and provide quantitative and qualitative assessments of the impact of HSDD on patients' quality of life and the patients' perspectives on the impact of bremelanotide. Clinical trial numbers NCT02333071, NCT02338960.
Collapse
|
6
|
Suicide Ideation and Behavior Assessment Tool (SIBAT): Evaluation of Intra- and Inter-Rater Reliability, Validity, and Mapping to Columbia Classification Algorithm of Suicide Assessment. Psychiatry Res 2020; 294:113495. [PMID: 33068913 DOI: 10.1016/j.psychres.2020.113495] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/30/2019] [Accepted: 09/30/2020] [Indexed: 12/13/2022]
Abstract
The psychometric properties of the Suicide Ideation and Behavior Assessment Tool (SIBAT) were evaluated in 130 participants with varying levels of suicidality. Inter- and intra-rater reliability were assessed for clinician-rated outcomes, including the revised Clinical Global Impressions (CGI) of severity of suicidality (CGI-SS-r). Concurrent validity of patient-reported modules with Patient-reported Outcomes Measurement Information System (PROMIS) depression scale and Sheehan-Suicidality Tracking Scale Clinically Meaningful Change Measure (S-STS CMCM), and concordance between Columbia Classification Algorithm of Suicide Assessment (C-CASA) mappings for SIBAT, S-STS CMCM and Columbia-Suicide Severity Rating Scale (C-SSRS) were assessed. 52/130 participants (mean [SD] age: 38.3 [17.77] years) consented for multiple interviews (C-CASA mappings: n=52; rater-reliability: n=25/52). SIBAT demonstrated good intra-rater reliability (weighted-kappa range:0.64-0.76; CGI-SS-r, 0.75) and adequate inter-rater reliability (ICC range:0.68-0.82; CGI-SS-r, 0.81). There were strong correlations between PROMIS depression scores and SIBAT Module 5 ratings (Spearman correlations, r=0.64-0.74) and moderate correlations (r=0.29-0.72) between S-STS CMCM and SIBAT Modules 2, 3 and 5 ratings. Moderate agreement was noted between SIBAT C-CASA mappings and corresponding mappings from S-STS CMCM (weighted kappa: 0.54) and C-SSRS (weighted kappa: 0.56). Thus, the SIBAT provided valid assessment of suicidal ideation and behavior that could be reliably rated and adequately mapped to the C-CASA.
Collapse
|
7
|
Patient-Reported Outcomes and Economic Burden of Adults with Sickle Cell Disease in the United States: A Systematic Review. Int J Gen Med 2020; 13:361-377. [PMID: 32753936 PMCID: PMC7354084 DOI: 10.2147/ijgm.s257340] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2020] [Accepted: 06/17/2020] [Indexed: 12/12/2022] Open
Abstract
Purpose To systematically estimate the patient-reported outcomes (PROs) and economic burden of sickle cell disease (SCD) among adults in the United States (US). Patients and Methods Two systematic literature reviews (SLRs), one each for the PROs and economic topics, were performed using MEDLINE and Embase to identify observational studies of adults with SCD. Included studies were published between 2007 and 2018 and evaluated health-related quality of life (HRQL), function, healthcare resource utilization (HCRU), or costs. Given the high degree of clinical and methodological heterogeneity, findings were summarized qualitatively. Results The SLRs identified 7 studies evaluating the PROs and 15 studies evaluating the economic burden meeting the pre-specified selection criteria. The PRO evidence showed the prevalence of depression and anxiety to be 21–33% and 7–36%, respectively, in adults with SCD. The mean SF-36 physical summary scores ranged from 33.6 to 59.0 and from 46.3 to 61.5 for the mental summary scores. Overall HRQL for adults with SCD was poor and significantly worse in those with opioid use. Adult SCD patients were found to have varying rates of emergency department (ED) utilization (0.3–3.5 annual ED visits), hospitalizations (0.5–27.9 per patient per year), and/or readmission (12–41%). Key factors associated with significant HCRU were age, dental infection, and SCD-related complications. SCD specialized care settings and SCD intensive management strategy were reported to significantly decrease the number of hospitalizations. Conclusion This systematic evidence synthesis found that disease burden measured by PROs and economic burden of SCD on adults in the US are substantial despite the availability of approved SCD treatments during 2007–2018. The use of hydroxyurea, optimal management with opioids, and employing intensive treatment strategies may help decrease the overall burden to patients and healthcare systems. Published data on costs associated with SCD are limited and highlight the need for more economic studies to characterize the full burden of the disease.
Collapse
|
8
|
011 Correlation Between Validated Instruments Used in the RECONNECT Studies. J Sex Med 2020. [DOI: 10.1016/j.jsxm.2020.04.247] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
9
|
012 Bremelanotide Treatment Provided Clinically Meaningful Benefits in Premenopausal Women With Hypoactive Sexual Desire Disorder. J Sex Med 2020. [DOI: 10.1016/j.jsxm.2020.04.248] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
10
|
P02.317 Olanzapine versus risperidone: A prospective comparison of clinical and economic outcomes in schizophrenia. Eur Psychiatry 2020. [DOI: 10.1016/s0924-9338(00)94724-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/28/2022] Open
|
11
|
U.K. utility weights for the EORTC QLU-C10D. HEALTH ECONOMICS 2019; 28:1385-1401. [PMID: 31482619 DOI: 10.1002/hec.3950] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/23/2018] [Revised: 05/06/2019] [Accepted: 06/22/2019] [Indexed: 05/13/2023]
Abstract
The EORTC QLU-C10D is a new multi-attribute utility instrument derived from the widely used cancer-specific quality of life questionnaire, EORTC QLQ-C30. It contains 10 dimensions (physical functioning, role functioning, social functioning, emotional functioning, pain, fatigue, sleep, appetite, nausea, bowel problems), each with four levels. The aim of this study was to provide U.K. general population utility weights for the QLU-C10D. A U.K. online panel was quota-sampled to align the sample to the general population proportions of sex and age (≥18 years). The online valuation survey included a discrete choice experiment (DCE). Each participant was asked to complete 16 choice-pairs, each comprising two QLU-C10D health states plus duration. DCE data were analysed using conditional logistic regression to generate utility weights. Data from 2,187 respondents who completed at least one choice set were included in the DCE analysis. The final U.K. QLU-C10D utility weights comprised decrements for each level of each health dimension. For nine of the 10 dimensions (all except appetite), the expected monotonic pattern was observed across levels: Utility decreased as severity increased. For the final model, consistent monotonicity was achieved by merging inconsistent adjacent levels for appetite. The largest utility decrements were associated with physical functioning and pain. The worst possible health state (the worst level of each dimension) is -0.083, which is considered slightly worse than being dead. The U.K.-specific utility weights will enable cost-utility analysis (CUA) for the economic evaluation of new oncology therapies and technologies in the United Kingdom, where CUA is commonly used to inform resource allocation.
Collapse
|
12
|
Measurement properties of the minimal disease activity criteria for psoriatic arthritis. RMD Open 2019; 5:e001002. [PMID: 31565243 PMCID: PMC6744081 DOI: 10.1136/rmdopen-2019-001002] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2019] [Revised: 07/24/2019] [Accepted: 08/11/2019] [Indexed: 11/19/2022] Open
Abstract
Objective To comprehensively assess evidence on the measurement properties of the minimal disease activity (MDA) criteria, a composite measure of the state of disease activity in psoriatic arthritis (PsA). Methods A targeted literature review was conducted to identify studies that informed the validity and/or ability of the MDA to detect change among patients known to have experienced a change in clinical status. The search was conducted using MEDLINE and Embase databases (published as of October 2017). Pertinent articles provided by investigators and identified from select conference proceedings were also evaluated. Results A total of 20 publications met the inclusion criteria. The MDA criteria were consistently associated with other indicators of disease activity/severity. The ability of the MDA criteria to detect change was supported in randomised controlled trials (n=10), with a greater percentage of patients randomised to active treatments achieving MDA relative to patients in comparator arms. Long-term observational studies (n=2) provided additional support for the ability of the MDA to detect within-subject change in the real-world settings. Conclusion Evidence supports the MDA as a valid measure of disease activity in PsA that can detect between-group and within-subject change. The MDA is a comprehensive measure and clinically meaningful endpoint to assess the impact of interventions on PsA disease activity.
Collapse
|
13
|
080 Women’s Experiences With Bremelanotide Administered, On Demand, for the Treatment of Hypoactive Sexual Desire Disorder. J Sex Med 2019. [DOI: 10.1016/j.jsxm.2019.03.521] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
14
|
Content validity of the National Comprehensive Cancer Network - Functional Assessment of Cancer Therapy - Breast Cancer Symptom Index (NFBSI-16) and Patient-Reported Outcomes Measurement Information System (PROMIS) Physical Function Short Form with advanced breast cancer patients. Health Qual Life Outcomes 2019; 17:92. [PMID: 31142325 PMCID: PMC6542025 DOI: 10.1186/s12955-019-1162-5] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2018] [Accepted: 05/20/2019] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The purpose of this study is to evaluate the content validity of the National Comprehensive Cancer Network - Functional Assessment of Cancer Therapy - Breast Cancer Symptom Index (NFBSI-16) and the Patient-Reported Outcomes Measurement Information System (PROMIS) Physical Function Short Form 10b among patients with hormone receptor positive (HR+)/human epidermal growth factor receptor 2 negative (HER2-) advanced breast cancer. METHODS Cognitive debriefing interviews sought to evaluate patients' ability to read, understand, and meaningfully respond to the questionnaires, as well as to evaluate the questionnaires' relevance in the target patient population. Interviews were conducted by telephone and lasted approximately 90 min. Audio recordings were transcribed, anonymized, and analyzed using qualitative data analysis software. RESULTS Fifteen cognitive debriefing interviews were conducted with women (mean age 66.0 years [standard deviation = 12.4]). Patients reported metastases in the bone (86.7%), liver (20.0%), lung (13.3%), skin (6.7%), and lymph node (6.7%) (not mutually exclusive). All patients for whom data were available demonstrated understanding of the instructions and the recall period of the NFBSI-16 (n = 14/14, 100.0%) and the PROMIS (n = 14/14, 100.0%). Greater than 90% of patients demonstrated understanding of each of the items in the NFBSI-16 and the PROMIS. Greater than 70% of patients demonstrated understanding of the response options of the NFBSI-16, > 90% understood response options of PROMIS Items 1-6, and ≥ 50% understood response options of PROMIS Items 7-10. Conceptual relevance was supported for most items in both questionnaires based on patients' reports of experiencing the concepts as part of their breast cancer experience. CONCLUSIONS The results of the cognitive debriefing interviews provide evidence that the NFBSI-16 and PROMIS Physical Function Short Form 10b have content validity in the HR+/HER2- advanced breast cancer patient population. Patients may benefit from additional instructions at the point the response options reverse direction in the PROMIS.
Collapse
|
15
|
Clostridium difficile Infection-Daily Symptoms (CDI-DaySyms™) questionnaire: psychometric characteristics and responder thresholds. Health Qual Life Outcomes 2019; 17:77. [PMID: 31053093 PMCID: PMC6499966 DOI: 10.1186/s12955-019-1142-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2018] [Accepted: 04/10/2019] [Indexed: 01/03/2023] Open
Abstract
Background The purpose of the current study was to determine the final content validation, psychometric characteristics, clinically meaningful improvement, and responder thresholds of the Clostridium difficile infection (CDI)–Daily Symptoms (CDI-DaySyms™) patient-reported outcome (PRO) questionnaire. Methods This validation study was part of two phase III studies (NCT01987895 and NCT01983683) conducted in patients with mild-to-moderate or severe CDI who completed the CDI-DaySyms™ daily throughout the treatment period. The questionnaire was evaluated in three stages: final PRO item content validation (Stage I); psychometric evaluation of reliability and construct validity (Stage II); and determination of clinically meaningful improvement and responder thresholds using distribution-based methods (Stage III). Results The analysis included 168 patients. Most patients were female and Caucasian with mild-to-moderate CDI. The mean age was 57.1 years. Initial item analysis supported by confirmatory factor analysis demonstrated the relevance of 10 items grouped into three distinct domains (Diarrhea Symptoms, Abdominal Symptoms, and Systemic/Other Symptoms). Domain scores demonstrated acceptable internal consistency and test-retest reliability, were sensitive to change, and correlated in expected directions with other relevant symptom and disease-severity measures. Responder thresholds were defined as score changes of − 1.00, − 0.80, and − 0.70 in the Diarrhea Symptoms, Abdominal Symptoms, and Systemic/Other Symptoms domains, respectively. Conclusions The CDI-DaySyms™ is a valid measure of patient-reported CDI symptoms, with good measurement properties, which supports its utility as an endpoint in clinical studies. Further studies confirming responder thresholds based on anchor-based methods are required. Trial registration NCT01987895, registered November 20, 2013; NCT01983683, registered November 14, 2013. Electronic supplementary material The online version of this article (10.1186/s12955-019-1142-9) contains supplementary material, which is available to authorized users.
Collapse
|
16
|
Abstract
OBJECTIVES The PROMIS-Preference (PROPr) score is a recently developed summary score for the Patient-Reported Outcomes Measurement Information System (PROMIS). PROPr is a preference-based scoring system for seven PROMIS domains created using multiplicative multi-attribute utility theory. It serves as a generic, societal, preference-based summary scoring system of health-related quality of life. This manuscript evaluates construct validity of PROPr in two large samples from the US general population. METHODS We utilized 2 online panel surveys, the PROPr Estimation Survey and the Profiles-Health Utilities Index (HUI) Survey. Both included the PROPr measure, patient demographic information, self-reported chronic conditions, and other preference-based summary scores: the EuroQol-5D (EQ-5D-5L) and HUI in the PROPr Estimation Survey and the HUI in the Profiles-HUI Survey. The HUI was scored as both the Mark 2 and the Mark 3. Known-groups validity was evaluated using age- and gender-stratified mean scores and health condition impact estimates. Condition impact estimates were created using ordinary least squares regression in which a summary score was regressed on age, gender, and a single health condition. The coefficient for the health condition is the estimated effect on the preference score of having a condition vs. not having it. Convergent validity was evaluated using Pearson correlations between PROPr and other summary scores. RESULTS The sample consisted of 983 respondents from the PROPr Estimation Survey and 3,000 from the Profiles-HUI survey. Age- and gender-stratified mean PROPr scores were lower than EQ-5D and HUI scores, with fewer subjects having scores corresponding to perfect health on the PROPr. In the PROPr Estimation survey, all 11 condition impact estimates were statistically significant using PROPr, 8 were statistically significant by the EQ-5D, 7 were statistically significant by HUI Mark 2, and 9 were statistically significant by HUI Mark 3. In the Profiles-HUI survey, all 21 condition impact estimates were statistically significant using summary scores from all three scoring systems. In these samples, the correlations between PROPr and the other summary measures ranged from 0.67 to 0.70. CONCLUSIONS These results provide evidence of construct validity for PROPr using samples from the US general population.
Collapse
|
17
|
002 Elements of Desire Questionnaire Assessment of Bremelanotide Efficacy for Hypoactive Sexual Desire Disorder in the RECONNECT Study. J Sex Med 2018. [DOI: 10.1016/j.jsxm.2018.03.013] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
18
|
Abstract
Importance Patient-reported outcome (PRO) data from clinical trials can provide valuable evidence to inform shared decision making, labeling claims, clinical guidelines, and health policy; however, the PRO content of clinical trial protocols is often suboptimal. The SPIRIT (Standard Protocol Items: Recommendations for Interventional Trials) statement was published in 2013 and aims to improve the completeness of trial protocols by providing evidence-based recommendations for the minimum set of items to be addressed, but it does not provide PRO-specific guidance. Objective To develop international, consensus-based, PRO-specific protocol guidance (the SPIRIT-PRO Extension). Design, Setting, and Participants The SPIRIT-PRO Extension was developed following the Enhancing Quality and Transparency of Health Research (EQUATOR) Network's methodological framework for guideline development. This included (1) a systematic review of existing PRO-specific protocol guidance to generate a list of potential PRO-specific protocol items (published in 2014); (2) refinements to the list and removal of duplicate items by the International Society for Quality of Life Research (ISOQOL) Protocol Checklist Taskforce; (3) an international stakeholder survey of clinical trial research personnel, PRO methodologists, health economists, psychometricians, patient advocates, funders, industry representatives, journal editors, policy makers, ethicists, and researchers responsible for evidence synthesis (distributed by 38 international partner organizations in October 2016); (4) an international Delphi exercise (n = 137 invited; October 2016 to February 2017); and (5) consensus meeting (n = 30 invited; May 2017). Prior to voting, consensus meeting participants were informed of the results of the Delphi exercise and given data from structured reviews evaluating the PRO protocol content of 3 defined samples of trial protocols. Results The systematic review identified 162 PRO-specific protocol recommendations from 54 sources. The ISOQOL Taskforce (n = 21) reduced this to 56 items, which were considered by 138 international stakeholder survey participants and 99 Delphi panelists. The final wording of the SPIRIT-PRO Extension was agreed on at a consensus meeting (n = 29 participants) and reviewed by external group of experts during a consultation period. Eleven extensions and 5 elaborations to the SPIRIT 2013 checklist were recommended for inclusion in clinical trial protocols in which PROs are a primary or key secondary outcome. Extension items focused on PRO-specific issues relating to the trial rationale, objectives, eligibility criteria, concepts used to evaluate the intervention, time points for assessment, PRO instrument selection and measurement properties, data collection plan, translation to other languages, proxy completion, strategies to minimize missing data, and whether PRO data will be monitored during the study to inform clinical care. Conclusions and Relevance The SPIRIT-PRO guidelines provide recommendations for items that should be addressed and included in clinical trial protocols in which PROs are a primary or key secondary outcome. Improved design of clinical trials including PROs could help ensure high-quality data that may inform patient-centered care.
Collapse
|
19
|
Selection of key health domains from PROMIS ® for a generic preference-based scoring system. Qual Life Res 2017; 26:3377-3385. [PMID: 28825177 PMCID: PMC5682190 DOI: 10.1007/s11136-017-1686-2] [Citation(s) in RCA: 41] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/11/2017] [Indexed: 01/06/2023]
Abstract
PURPOSE We sought to select a parsimonious subset of domains from the patient-reported outcomes measurement information system (PROMIS®) that could be used for preference-based valuation. Domain selection criteria included face validity, comprehensiveness, and structural independence. METHODS First, 9 health outcomes measurement experts selected domains appropriate for a general health measure using a modified Delphi procedure. Second, 50 adult community members assessed structural independence of domain pairs. For each pair, the participant was asked if it were possible to have simultaneously good functioning in domain 1 but poor functioning in domain 2, and vice versa. The community members also rated the relative importance of the domains. Finally, the experts selected domains, guided by community members' judgments of structural independence and importance. RESULTS After 3 rounds of surveys, the experts agreed on 10 potential domains. The percent of pairs deemed structurally independent by community members ranged from 50 to 95 (mean = 78). Physical Function, Pain Interference, and Depression were retained because of their inclusion in existing preference-based measures and their importance to community members. Four other domains were added because they were important to community members and judged to be independent by at least 67% of respondents: Cognitive Function-Abilities; Fatigue; Ability to Participate in Social Roles and Activities; and Sleep Disturbance. CONCLUSION With input from measurement experts and community members, we selected 7 PROMIS domains that can be used to create a preference-based score.
Collapse
|
20
|
Journal of Patient-Reported Outcomes: aims and scope. J Patient Rep Outcomes 2017; 1:6. [PMID: 29757316 PMCID: PMC5934939 DOI: 10.1186/s41687-017-0009-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2017] [Accepted: 08/16/2017] [Indexed: 11/10/2022] Open
|
21
|
Comorbidity indices for clinical trials: adaptation of two existing indices for use with the FREEDOM trial in women with postmenopausal osteoporosis. Osteoporos Int 2016; 27:75-80. [PMID: 26174879 PMCID: PMC4715856 DOI: 10.1007/s00198-015-3215-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/26/2015] [Accepted: 06/15/2015] [Indexed: 12/15/2022]
Abstract
UNLABELLED Two comorbidity indices were adapted for use in the FREEDOM trial and significantly correlated with the number of medications and impaired health status at baseline. The indices have applications for the analysis of clinical trial data and would allow for the appropriate adjustment of comorbidities when evaluating clinical trial outcomes. INTRODUCTION The purpose of this study is to adapt two published comorbidity indices for use with the FREEDOM clinical trial evaluating postmenopausal women with osteoporosis. METHODS FREEDOM enrolled women aged 60-90 years with a bone mineral density T-score <-2.5 at the lumbar spine or total hip and ≥-4.0 at both sites. Comorbidity indices were calculated using methods described by Sangha (Arthritis Rheum 49:156-163, 2003) and Wolfe (J Rheumatol 37:305-315, 2010) following modification. The adapted Sangha index included 12 conditions with a summary score of 0-12; the adapted Wolfe index included 7 conditions with a weighted summary score of 0-8. Higher scores indicated greater comorbidity. A panel of clinicians independently reviewed subjects' medical histories using a systematic process based on Medical Dictionary for Regulatory Activities (MedDRA) preferred terms to map specified comorbid conditions. Spearman correlations between the adapted indices and baseline subject characteristics expected to be associated with comorbidities were examined. RESULTS Of the 7808 subjects in this study, 74 % had ≥1 comorbidities based on the adapted Sangha or Wolfe comorbidity indices. The mean (SD) adapted Sangha and Wolfe comorbidity indices were 1.4 (1.2) and 1.4 (1.3), respectively. Both indices correlated positively with age, body mass index, and the number of medications (r = 0.54 to 0.55) at baseline and inversely correlated with health-related quality of life (r = -0.22 to -0.30) (all P < 0.0001). Further, when either the adapted Sangha or Wolfe index was included as a covariate for assessing mortality over 36 months in the FREEDOM population, the hazard ratio of the comorbidity index indicated that the mortality risk increased by 27 or 28 %, respectively, for each unit increase in the adapted index (both P < 0.0001). CONCLUSIONS Our work suggests these comorbidity indices may be adapted for use with clinical trial data, thereby allowing for the appropriate adjustment and reporting of covariates in the evaluation of clinical trial outcomes in an osteoporotic population.
Collapse
|
22
|
Determination of minimal important difference thresholds for Parkinson’s disease Questionnaire-39 in advanced Parkinson’s disease patients. Parkinsonism Relat Disord 2016. [DOI: 10.1016/j.parkreldis.2015.10.539] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
23
|
Determination of minimal important difference thresholds for Parkinson’s disease questionnaire-39 in advanced Parkinson’s disease patients. Parkinsonism Relat Disord 2016. [DOI: 10.1016/j.parkreldis.2015.10.206] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
24
|
Reliability and Validity of the Work Instability Scale for Rheumatoid Arthritis. VALUE IN HEALTH : THE JOURNAL OF THE INTERNATIONAL SOCIETY FOR PHARMACOECONOMICS AND OUTCOMES RESEARCH 2015; 18:1008-1015. [PMID: 26686785 DOI: 10.1016/j.jval.2015.09.2941] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/18/2014] [Revised: 08/13/2015] [Accepted: 09/18/2015] [Indexed: 06/05/2023]
Abstract
OBJECTIVE The objective was to evaluate the psychometric properties of the Rheumatoid Arthritis-Work Instability Scale (RA-WIS) in a clinical trial setting. METHODS Secondary analyses were conducted using data from a 56-week, randomized controlled trial of patients with early rheumatoid arthritis (RA). Patient-reported outcome measures included the RA-WIS, the Health Assessment Questionnaire (HAQ), the Rheumatoid Arthritis Quality of Life Questionnaire, and the Global Assessment of Disease Activity and Pain, data for which were collected at baseline and at weeks 12, 16, 24, and 56. Data were analyzed for reliability, validity, and responsiveness. RESULTS Among 148 patients whose data were analyzed, more than half were women (56.1%) with a mean age of 46.8 years. On average, patients experienced RA symptoms for 8.7 months; the mean 28-Joint Disease Activity Score (DAS28) was 5.9, and the mean HAQ - Disability Index was 1.3. The RA-WIS demonstrated excellent internal consistency and test-retest reliability (α = 0.89 and intraclass correlation coefficient = 0.91, respectively). At baseline and week 24, moderate to strong correlations were seen between RA-WIS total scores and the HAQ, the Global Assessment of Disease Activity, and the Pain Rheumatoid Arthritis Quality of Life Questionnaire, ranging from 0.47 to 0.81 (all P < 0.0001). Mean RA-WIS total scores and work disability risk levels discriminated between clinical severity scores on the DAS28, the HAQ - Disability Index, and the Physician Global Assessment of Disease Activity (all P < 0.05). Mean baseline to week 24 RA-WIS total change scores were significantly different among American College of Rheumatology responder groups (P ≤ 0.0001) and between DAS28 remission status groups (P < 0.001). CONCLUSIONS These findings provide evidence supporting the reliability, validity, and responsiveness of the RA-WIS for evaluating work disability in patients with RA in a clinical trial setting.
Collapse
|
25
|
A Checklist for Reporting Valuation Studies of Multi-Attribute Utility-Based Instruments (CREATE). PHARMACOECONOMICS 2015; 33:867-77. [PMID: 26026667 PMCID: PMC4519579 DOI: 10.1007/s40273-015-0292-9] [Citation(s) in RCA: 45] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/06/2023]
Abstract
Multi-attribute utility-based instruments (MAUIs) assess health status and provide an index score on the full health-dead scale, and are widely used to support reimbursement decisions for new healthcare interventions worldwide. A valuation study is a key part of the development of MAUIs, with the primary goal of developing a scoring algorithm through eliciting societal preferences. We developed the 21-item Checklist for REporting VAluaTion StudiEs (CREATE) by following a modified two-round Delphi panel approach plus an email survey. CREATE is intended to promote good reporting practice as well as guiding developers to thoroughly and carefully think through key methodological elements in designing valuation studies.
Collapse
|
26
|
Abstract
The Dutch-Flemish PROMIS Group translated the adult PROMIS Pain Interference item bank into Dutch-Flemish. The aims of the current study were to calibrate the parameters of these items using an item response theory (IRT) model, to evaluate the cross-cultural validity of the Dutch-Flemish translations compared to the original English items, and to evaluate their reliability and construct validity. The 40 items in the bank were completed by 1085 Dutch chronic pain patients. Before calibrating the items, IRT model assumptions were evaluated using confirmatory factor analysis (CFA). Items were calibrated using the graded response model (GRM), an IRT model appropriate for items with more than two response options. To evaluate cross-cultural validity, differential item functioning (DIF) for language (Dutch vs. English) was examined. Reliability was evaluated based on standard errors and Cronbach’s alpha. To evaluate construct validity correlations with scores on legacy instruments (e.g., the Disabilities of the Arm, Shoulder and Hand Questionnaire) were calculated. Unidimensionality of the Dutch-Flemish PROMIS Pain Interference item bank was supported by CFA tests of model fit (CFI = 0.986, TLI = 0.986). Furthermore, the data fit the GRM and showed good coverage across the pain interference continuum (threshold-parameters range: -3.04 to 3.44). The Dutch-Flemish PROMIS Pain Interference item bank has good cross-cultural validity (only two out of 40 items showing DIF), good reliability (Cronbach’s alpha = 0.98), and good construct validity (Pearson correlations between 0.62 and 0.75). A computer adaptive test (CAT) and Dutch-Flemish PROMIS short forms of the Dutch-Flemish PROMIS Pain Interference item bank can now be developed.
Collapse
|
27
|
Calibration of the Dutch-Flemish PROMIS Pain Behavior item bank in patients with chronic pain. Eur J Pain 2015; 20:284-96. [DOI: 10.1002/ejp.727] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/12/2015] [Indexed: 11/08/2022]
|
28
|
Cdad-Daysyms™: A New Patient-Reported Outcome Tool For Clostridium Difficile-Associated Diarrhoea. VALUE IN HEALTH : THE JOURNAL OF THE INTERNATIONAL SOCIETY FOR PHARMACOECONOMICS AND OUTCOMES RESEARCH 2014; 17:A569. [PMID: 27201895 DOI: 10.1016/j.jval.2014.08.1897] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
|
29
|
THU0564 Validation of the Dutch-Flemish PROMIS Physical Functioning Item Bank in Patients with Chronic Pain. Ann Rheum Dis 2014. [DOI: 10.1136/annrheumdis-2014-eular.1721] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]
|
30
|
THU0563 Validation of the Dutch-Flemish PROMIS Pain Behavior and Pain Interference Item Banks in Patients with Chronic Pain. Ann Rheum Dis 2014. [DOI: 10.1136/annrheumdis-2014-eular.1723] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]
|
31
|
A164: Development of Pediatric Item Banks to Measure Pain Behavior in the Patient Reported Outcomes Measurement Information System. Arthritis Rheumatol 2014. [DOI: 10.1002/art.38590] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
32
|
AB0266 Responsiveness and minimal important difference of the rheumatoid arthritis-work instability scale (ra-wis). Ann Rheum Dis 2014. [DOI: 10.1136/annrheumdis-2013-eular.2588] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]
|
33
|
The case for an international patient-reported outcomes measurement information system (PROMIS®) initiative. Health Qual Life Outcomes 2013; 11:210. [PMID: 24359143 PMCID: PMC3879205 DOI: 10.1186/1477-7525-11-210] [Citation(s) in RCA: 106] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2013] [Accepted: 12/03/2013] [Indexed: 01/24/2023] Open
Abstract
Patient-reported outcomes (PROs) play an increasingly important role in clinical practice and research. Modern psychometric methods such as item response theory (IRT) enable the creation of item banks that support fixed-length forms as well as computerized adaptive testing (CAT), often resulting in improved measurement precision and responsiveness. Here we describe and discuss the case for developing an international core set of PROs building from the US PROMIS® network. PROMIS is a U.S.-based cooperative group of research sites and centers of excellence convened to develop and standardize PRO measures across studies and settings. If extended to a global collaboration, PROMIS has the potential to transform PRO measurement by creating a shared, unifying terminology and metric for reporting of common symptoms and functional life domains. Extending a common set of standardized PRO measures to the international community offers great potential for improving patient-centered research, clinical trials reporting, population monitoring, and health care worldwide. Benefits of such standardization include the possibility of: international syntheses (such as meta-analyses) of research findings; international population monitoring and policy development; health services administrators and planners access to relevant information on the populations they serve; better assessment and monitoring of patients by providers; and improved shared decision making. The goal of the current PROMIS International initiative is to ensure that item banks are translated and culturally adapted for use in adults and children in as many countries as possible. The process includes 3 key steps: translation/cultural adaptation, calibration, and validation. A universal translation, an approach focusing on commonalities, rather than differences across versions developed in regions or countries speaking the same language, is proposed to ensure conceptual equivalence for all items. International item calibration using nationally representative samples of adults and children within countries is essential to demonstrate that all items possess expected strong measurement properties. Finally, it is important to demonstrate that the PROMIS measures are valid, reliable and responsive to change when used in an international context. IRT item banking will allow for tailoring within countries and facilitate growth and evolution of PROs through contributions from the international measurement community. A number of opportunities and challenges of international development of PROs item banks are discussed.
Collapse
|
34
|
Development and validation of a new self-report measure of pain behaviors. Pain 2013; 154:2867-2876. [PMID: 23994451 DOI: 10.1016/j.pain.2013.08.024] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2012] [Revised: 08/21/2013] [Accepted: 08/22/2013] [Indexed: 10/26/2022]
Abstract
Pain behaviors that are maintained beyond the acute stage after injury can contribute to subsequent psychosocial and physical disability. Critical to the study of pain behaviors is the availability of psychometrically sound pain behavior measures. In this study we developed a self-report measure of pain behaviors, the Pain Behaviors Self Report (PaB-SR). PaB-SR scores were developed using item response theory and evaluated using a rigorous, multiple-witness approach to validity testing. Participants included 661 survey participants with chronic pain and with multiple sclerosis, back pain, or arthritis; 618 survey participants who were significant others of a chronic pain participant; and 86 participants in a videotaped pain behavior observation protocol. Scores on the PaB-SR were found to be measurement invariant with respect to clinical condition. PaB-SR scores, observer reports, and the videotaped protocol yielded distinct, but convergent views of pain behavior, supporting the validity of the new measure. The PaB-SR is expected to be of substantial utility to researchers wishing to explore the relationship between pain behaviors and constructs such as pain intensity, pain interference, and disability.
Collapse
|
35
|
Patient-reported outcomes in randomized clinical trials: development of ISOQOL reporting standards. Qual Life Res 2013; 22:1161-75. [PMID: 22987144 PMCID: PMC3731511 DOI: 10.1007/s11136-012-0252-1] [Citation(s) in RCA: 144] [Impact Index Per Article: 13.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/06/2012] [Indexed: 02/07/2023]
Abstract
PURPOSE To develop expert consensus on a suite of reporting standards for HRQL outcomes of RCTs. METHODS A Task Force of The International Society of Quality of Life Research (ISOQOL) undertook a systematic review of the literature to identify candidate reporting standards for HRQL in RCTs. Subsequently, a web-based survey was circulated to the ISOQOL membership. Respondents were asked to rate candidate standards on a 4-point Likert scale based on their perceived value in reporting studies in which HRQL was a study outcome (primary or secondary). Results were synthesized into draft reporting guidelines, which were further reviewed by the membership to inform the final guidance. RESULTS Forty-six existing candidate standards for reporting HRQL results in RCTs were synthesized to produce a 40 item survey that was completed electronically by 161 respondents. The majority of respondents rated all 40 items to be either 'essential' or 'desirable' when HRQL was a primary RCT outcome. Ratings changed when HRQL was a secondary study outcome. Feedback on the survey findings resulted in the Task Force generalizing the guidance to include patient-reported outcomes (PROs). The final guidance, which recommends standards for use in reporting PROs generally, and more specifically, for PROs identified as primary study outcomes, was approved by the ISOQOL Board of Directors. CONCLUSIONS ISOQOL has developed a suite of recommended standards for reporting PRO results of RCTs. Improved reporting of PROs will enable accurate interpretation of evidence to inform patient choice, aid clinical decision making, and inform health policy.
Collapse
|
36
|
Assessing health-related quality of life in patients with heart failure: a systematic, standardized comparison of available measures. Heart Fail Rev 2013; 19:359-67. [DOI: 10.1007/s10741-013-9394-7] [Citation(s) in RCA: 100] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
37
|
The relationships between asthma control, daytime sleepiness, and quality of life among children with asthma: a path analysis. Sleep Med 2013; 14:641-7. [PMID: 23684939 DOI: 10.1016/j.sleep.2013.04.002] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/26/2012] [Revised: 04/04/2013] [Accepted: 04/06/2013] [Indexed: 10/26/2022]
Abstract
OBJECTIVES We aimed to examine the relationships between asthma control, daytime sleepiness, and asthma-specific health-related quality of life (HRQOL) among children with asthma. Path analyses were conducted to test if daytime sleepiness can mediate the effect of asthma control status on asthma-specific HRQOL. METHODS 160 dyads (pairs) of asthmatic children and their parents were collected for analyses. The Asthma Control and Communication Instrument (ACCI) was used to categorize adequate and poor asthma control status. The Cleveland Adolescent Sleepiness Questionnaire (CASQ) was used to measure children's daytime sleepiness, including sleep in school, awake in school, sleep in evening, and sleep during transport. The Patient-Reported Outcomes Measurement Information System (PROMIS) Asthma Impact Scale was used to measure asthma-specific HRQOL. RESULTS Poorly controlled asthma was associated with daytime sleepiness and impaired asthma-specific HRQOL. Asthma control status was directly associated with asthma-specific HRQOL (P<.05), whereas sleep in school and sleep in evening domains of daytime sleepiness significantly mediated the relationship between poor asthma control and impaired HRQOL (P<.01). CONCLUSIONS Asthma control status was associated with pediatric asthma-specific HRQOL, and the association was significantly mediated by daytime sleepiness. Healthcare providers need to address pediatric sleep needs related to poor asthma control to reduce the negative impact on HRQOL.
Collapse
|
38
|
Review of patient-reported outcome measures in chronic hepatitis C. Health Qual Life Outcomes 2012; 10:92. [PMID: 22871087 PMCID: PMC3547737 DOI: 10.1186/1477-7525-10-92] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2011] [Accepted: 07/13/2012] [Indexed: 12/17/2022] Open
Abstract
BACKGROUND Chronic hepatitis C (CHC) and its treatment are associated with a variety of patient-reported symptoms and impacts. Some CHC symptoms and impacts may be difficult to evaluate through objective clinical testing, and more easily measured through patient self-report. This literature review identified concepts raised by CHC patients related to symptoms, impacts, and treatment effects, and evaluated integration of these concepts within patient-reported outcome (PRO) measures. The goal of this work was to provide recommendations for incorporation of PRO measurement of concepts that are relevant to the CHC experience into CHC clinical trial design. METHODS A three-tiered literature search was conducted. This included searches on concepts of importance, PRO measures used in clinical trials, and existing PRO measures. The PRO Concept Search focused on reviewing issues raised by CHC patients about CHC symptoms, disease impact, and treatment effects. The CHC Trials with PRO Endpoints Search reviewed clinical trials with PRO endpoints to assess differences between treatments over time. The PRO Measure Search reviewed existing PRO measures associated with the concepts of interest. RESULTS This multi-tiered approach identified five key concepts of interest: depression/anxiety, fatigue, flu-like symptoms, cognitive function, insomnia. Comparing these five concepts of interest to the PRO measures in published CHC clinical trials showed that, while treatment of CHC may decrease health-related quality of life in a number of mental and physical domains, the PRO measures that were utilized in published clinical trials inadequately covered the concepts of interest. Further review of 18 existing PRO measures of the concepts of interest showed only four of the 18 were validated in CHC populations. CONCLUSIONS This review identified several gaps in the literature regarding assessment of symptoms and outcomes reported as important by CHC patients. Further research is needed to ensure that CHC clinical trials evaluate concepts that are important to patients and include measures that have evidence supporting content validity, reliability, construct validity, and responsiveness.
Collapse
|
39
|
Health status and health-related quality of life in a pediatric palliative care program. J Palliat Med 2012; 15:790-7. [PMID: 22686119 DOI: 10.1089/jpm.2011.0504] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
Abstract
BACKGROUND Children with life-threatening illnesses have unique physical and psychosocial needs that pediatric palliative care programs can address. Integrated programs strive to address these needs from the point of diagnosis through death, if needed, at the same time that curative care is provided. To better understand the variation in these needs, we assessed the health status and health-related quality of life (HRQOL) of children enrolled in an integrated pediatric palliative care program. METHODS A telephone survey was conducted with 98 parents whose children were enrolled in an integrated pediatric palliative care program in Florida. The Health Utilities Index (HUI) system was used to assess health status and HRQOL. RESULTS HUI2 attribute levels show that children have the greatest impairment with moderate-to-severe burdens related to self-care, mobility, and sensation, and the least impairment with emotion. HUI3 attribute levels show that children have the greatest impairment with moderate-to-severe burdens related to ambulation and cognition and the least impairment with hearing and emotional functioning. Mean overall HUI2 and HUI3 utility scores are 0.37 and 0.15, respectively. CONCLUSION Children with life-threatening illnesses in our sample had a high level of morbidity compared with those found in other HUI studies of children with acute or chronic health conditions. Not only do our results highlight severely impaired HRQOL, they also demonstrate the wide variety of health states and needs for children in integrated palliative care programs. This information can help develop strategies to encourage more providers to participate in integrated pediatric palliative care programs.
Collapse
|
40
|
Impact of clinical fractures on health-related quality of life is dependent on time of assessment since fracture: results from the FREEDOM trial. Osteoporos Int 2012; 23:1361-9. [PMID: 21769664 DOI: 10.1007/s00198-011-1720-0] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/07/2011] [Accepted: 06/07/2011] [Indexed: 10/18/2022]
Abstract
UNLABELLED In the Fracture Reduction Evaluation of Denosumab in Osteoporosis Every 6 Months (FREEDOM) study, women with incident clinical fractures reported significant declines in health-related quality of life (HRQoL). The largest declines were observed when the assessment was <3 months post fracture. The largest impact of incident clinical fractures was on physical function, and that of incident clinical vertebral fractures was on back pain. INTRODUCTION In the FREEDOM trial, denosumab significantly reduced the risk of new vertebral, hip, and nonvertebral fractures. We evaluated the effect of denosumab on HRQoL and the association between incident clinical fractures and HRQoL. METHODS The FREEDOM trial enrolled 7,868 women aged 60-90 years with a total hip and/or lumbar spine BMD T-score <-2.5 and not <-4.0 at either site. Women were randomized to receive denosumab 60 mg or placebo every 6 months, in addition to daily calcium and vitamin D. HRQoL was assessed with the Osteoporosis Assessment Questionnaire-Short Version (OPAQ-SV) at baseline and every 6 months for 36 months. The OPAQ-SV assesses physical function, emotional status, and back pain. Higher scores indicate better health status. RESULTS No statistically significant differences in mean change in HRQoL from baseline to end of study were found when comparing treatment groups. Compared with women without any incident fractures during the study, women with incident clinical fractures reported significant declines in physical function (-4.0 vs. -0.5) and emotional status (-5.0 vs. -0.8) at month 36 (P < 0.001 for both). Importantly, time-dependent covariate analyses demonstrated that the largest declines were observed when the assessment was <3 months post fracture. The largest impact of incident clinical fractures was on physical function, and that of incident clinical vertebral fractures was on back pain. CONCLUSIONS These findings not only demonstrate that incident clinical fractures impact HRQoL but also contribute new information regarding the impact of these fracture events on HRQoL over time.
Collapse
|
41
|
|
42
|
Norms-based assessment of patient-reported outcomes associated with adalimumab monotherapy in patients with ankylosing spondylitis. Clin Exp Rheumatol 2011; 29:624-632. [PMID: 21813060] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2010] [Accepted: 02/14/2011] [Indexed: 05/31/2023]
Abstract
OBJECTIVES To compare the impact of ankylosing spondylitis (AS) on health-related quality of life (HRQL) and of adalimumab on initial and sustained improvement in HRQL for patients with active AS versus the general US population. METHODS Data from the 5-year ATLAS trial were analysed. HRQL burden of AS and treatment impact on HRQL were assessed by comparing health status and utility scores from ATLAS (Short Form 36 Health Survey [SF-36] and Health Utilities Index Mark 3 [HUI3]) with population norms. RESULTS Baseline scores for all measures were comparable between adalimumab and placebo. All scores for both groups were significantly worse than general population norms (all p<0.0001). Within- and between-group improvements in SF-36 Physical Component Summary and SF-6D scores from baseline to Weeks 12 and 24 were clinically relevant for patients receiving adalimumab. For patients initially randomised to adalimumab, HRQL scores improved from Weeks 25 to 52 and remained relatively stable through 3 years but remained lower than for the general US population at all time points. CONCLUSIONS Findings demonstrate a significant burden of AS on HRQL. Treatment with adalimumab significantly improved physical functioning and other measures of HRQL compared with placebo. Clinically relevant improvements in HRQL outcomes over 3 years represent a significant benefit of adalimumab. Because of the advanced AS disease, patient health status remained below that of the general population. Treatment earlier in the course of AS may be needed to restore HRQL to the level of the general population.
Collapse
|
43
|
Cost-effectiveness of combination versus sequential docetaxel and carboplatin for the treatment of platinum-sensitive, recurrent ovarian cancer. Cancer 2011; 118:386-91. [PMID: 21598242 DOI: 10.1002/cncr.26199] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2011] [Revised: 03/22/2011] [Accepted: 03/29/2011] [Indexed: 11/09/2022]
Abstract
BACKGROUND In a randomized controlled trial (RCT) of patients with recurrent, platinum-sensitive ovarian cancer, the combination weekly docetaxel and carboplatin was associated a with progression-free survival (PFS) of 13.7 months compared with 8.4 months for sequential, single-agent docetaxel followed by carboplatin. The objective of the current study was to construct a cost-utility model to compare these 2 regimens with the incorporation of prospectively collected quality-of-life (QoL) data. METHODS An RCT of concurrent docetaxel and carboplatin (cDC) versus docetaxel followed by carboplatin (sequential docetaxel and carboplatin [sDC]) was the basis for a Markov decision model, and the primary effectiveness outcome was PFS. Costs were estimated using US dollars based on Medicare reimbursements for chemotherapy regimens, bone marrow support, and management of adverse events. QoL data obtained using the Functional Assessment of Cancer Therapy-General questionnaire were converted to utilities. Costs and incremental cost-effectiveness ratios (ICERs) were reported in US dollars per quality-adjusted life year (QALY). Extensive 1-way sensitivity analyses and a Monte Carlo probabilistic sensitivity analysis were performed. RESULTS The least expensive strategy was sDC, which cost an average of $20,381, compared with cDC, which cost an average of $25,122. cDC had an ICER of $25,239 per QALY compared with sDC. cDC remained cost-effective, with an ICER <$50,000 per QALY, over a range of costs and estimates. In Monte Carlo sensitivity analysis using a $50,000 per QALY willingness-to-pay threshold, cDC was either dominant or cost-effective with an ICER <$50,000 per QALY in 83% of simulations. CONCLUSIONS Combined weekly cDC appeared to be cost-effective compared with sDC as treatment strategy for patients with platinum-sensitive ovarian cancer, even when accounting for slightly lower QoL during treatment.
Collapse
|
44
|
The Patient-Reported Outcomes Measurement Information System (PROMIS) developed and tested its first wave of adult self-reported health outcome item banks: 2005-2008. J Clin Epidemiol 2010; 63:1179-94. [PMID: 20685078 DOI: 10.1016/j.jclinepi.2010.04.011] [Citation(s) in RCA: 3146] [Impact Index Per Article: 224.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2009] [Revised: 03/31/2010] [Accepted: 04/08/2010] [Indexed: 12/12/2022]
Abstract
OBJECTIVES Patient-reported outcomes (PROs) are essential when evaluating many new treatments in health care; yet, current measures have been limited by a lack of precision, standardization, and comparability of scores across studies and diseases. The Patient-Reported Outcomes Measurement Information System (PROMIS) provides item banks that offer the potential for efficient (minimizes item number without compromising reliability), flexible (enables optional use of interchangeable items), and precise (has minimal error in estimate) measurement of commonly studied PROs. We report results from the first large-scale testing of PROMIS items. STUDY DESIGN AND SETTING Fourteen item pools were tested in the U.S. general population and clinical groups using an online panel and clinic recruitment. A scale-setting subsample was created reflecting demographics proportional to the 2000 U.S. census. RESULTS Using item-response theory (graded response model), 11 item banks were calibrated on a sample of 21,133, measuring components of self-reported physical, mental, and social health, along with a 10-item Global Health Scale. Short forms from each bank were developed and compared with the overall bank and with other well-validated and widely accepted ("legacy") measures. All item banks demonstrated good reliability across most of the score distributions. Construct validity was supported by moderate to strong correlations with legacy measures. CONCLUSION PROMIS item banks and their short forms provide evidence that they are reliable and precise measures of generic symptoms and functional reports comparable to legacy instruments. Further testing will continue to validate and test PROMIS items and banks in diverse clinical populations.
Collapse
|
45
|
Development of a PROMIS item bank to measure pain interference. Pain 2010; 150:173-182. [PMID: 20554116 PMCID: PMC2916053 DOI: 10.1016/j.pain.2010.04.025] [Citation(s) in RCA: 721] [Impact Index Per Article: 51.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2009] [Revised: 02/27/2010] [Accepted: 04/20/2010] [Indexed: 01/22/2023]
Abstract
This paper describes the psychometric properties of the PROMIS-pain interference (PROMIS-PI) bank. An initial candidate item pool (n=644) was developed and evaluated based on the review of existing instruments, interviews with patients, and consultation with pain experts. From this pool, a candidate item bank of 56 items was selected and responses to the items were collected from large community and clinical samples. A total of 14,848 participants responded to all or a subset of candidate items. The responses were calibrated using an item response theory (IRT) model. A final 41-item bank was evaluated with respect to IRT assumptions, model fit, differential item function (DIF), precision, and construct and concurrent validity. Items of the revised bank had good fit to the IRT model (CFI and NNFI/TLI ranged from 0.974 to 0.997), and the data were strongly unidimensional (e.g., ratio of first and second eigenvalue=35). Nine items exhibited statistically significant DIF. However, adjusting for DIF had little practical impact on score estimates and the items were retained without modifying scoring. Scores provided substantial information across levels of pain; for scores in the T-score range 50-80, the reliability was equivalent to 0.96-0.99. Patterns of correlations with other health outcomes supported the construct validity of the item bank. The scores discriminated among persons with different numbers of chronic conditions, disabling conditions, levels of self-reported health, and pain intensity (p<0.0001). The results indicated that the PROMIS-PI items constitute a psychometrically sound bank. Computerized adaptive testing and short forms are available.
Collapse
|
46
|
Abstract
This paper describes the psychometric properties of the PROMIS-pain interference (PROMIS-PI) bank. An initial candidate item pool (n=644) was developed and evaluated based on the review of existing instruments, interviews with patients, and consultation with pain experts. From this pool, a candidate item bank of 56 items was selected and responses to the items were collected from large community and clinical samples. A total of 14,848 participants responded to all or a subset of candidate items. The responses were calibrated using an item response theory (IRT) model. A final 41-item bank was evaluated with respect to IRT assumptions, model fit, differential item function (DIF), precision, and construct and concurrent validity. Items of the revised bank had good fit to the IRT model (CFI and NNFI/TLI ranged from 0.974 to 0.997), and the data were strongly unidimensional (e.g., ratio of first and second eigenvalue=35). Nine items exhibited statistically significant DIF. However, adjusting for DIF had little practical impact on score estimates and the items were retained without modifying scoring. Scores provided substantial information across levels of pain; for scores in the T-score range 50-80, the reliability was equivalent to 0.96-0.99. Patterns of correlations with other health outcomes supported the construct validity of the item bank. The scores discriminated among persons with different numbers of chronic conditions, disabling conditions, levels of self-reported health, and pain intensity (p<0.0001). The results indicated that the PROMIS-PI items constitute a psychometrically sound bank. Computerized adaptive testing and short forms are available.
Collapse
|
47
|
Measurement of non-restorative sleep in insomnia: A review of the literature. Sleep Med Rev 2010; 14:205-12. [DOI: 10.1016/j.smrv.2009.10.002] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2009] [Revised: 10/05/2009] [Accepted: 10/05/2009] [Indexed: 10/20/2022]
|
48
|
Impact of systemic lupus erythematosus on health, family, and work: The patient perspective. Arthritis Care Res (Hoboken) 2010; 62:266-73. [DOI: 10.1002/acr.20077] [Citation(s) in RCA: 56] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
49
|
9314 Impact of ipilimumab on the health-related quality of life (HRQL) of patients with previously treated unresectable stage III or IV melanoma. EJC Suppl 2009. [DOI: 10.1016/s1359-6349(09)71958-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022] Open
|
50
|
Health perceptions and clinical characteristics of relapsing-remitting multiple sclerosis patients: baseline data from an international clinical trial. Curr Med Res Opin 2009; 25:1121-30. [PMID: 19317608 DOI: 10.1185/03007990902797675] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
OBJECTIVE Baseline clinical and health-related quality of life (HRQoL) data from a phase 2, multi-site, international, randomized, controlled trial were analyzed to: (1) characterize the health status of patients with relapsing-remitting multiple sclerosis (RRMS), (2) explore cross-sectional relationships between HRQoL and clinical measures, and (3) evaluate differences in HRQoL scores for subsequent validation as minimally important differences (MID). CLINICAL TRIAL REGISTRATION www.clinicaltrials.gov, NCT00207727. RESEARCH DESIGN AND METHODS Baseline clinical and HRQoL data were selected and analyzed. HRQoL questionnaires included the Short Form-36 (SF-36), Fatigue Severity Scale (FSS), a Patient Assessment of multiple sclerosis (MS) Impact (PAMSI), and MS-specific symptom scales for Bladder and Bowel Control, Cognition, and Sexual Satisfaction. Standard summary statistics described the population while Pearson and Spearman correlations evaluated the baseline association between HRQoL and clinical measures. Cross-sectional estimates of MID in HRQoL scores were derived using several clinical anchors, the PAMSI, and two tests: Tukey multiple comparisons and adjacent mean difference. RESULTS Patients (n = 249) had a mean age of 39.0 (Standard deviation, SD = 10.5), 70% were female, 63% resided in Europe, and 96% were Caucasian. Baseline median Expanded Disability Severity Scale (EDSS) was 2.5 (range = 0.0-6.5); median disease duration was 1.9 years (range = 0.1-33.6). The worst baseline mean (normalized) SF-36 scores were for General Health (39.9), Role Physical (40.4), Physical Functioning (41.0), and Vitality (42.7). The worst MS symptom mean scores were for Cognition (6.3) and FSS (4.4). Fatigue scores indicated substantial burden and were consistent with SF-36 Vitality results. Baseline HRQoL scores (SF-36, FSS, MS symptom scales) correlated most with EDSS, Multiple Sclerosis Functional Composite (MSFC), age and disease duration. Lesion count and pre-baseline relapse rate had no meaningful association with HRQoL or other clinical measures. The MID for several HRQoL measures are proposed for confirmation in longitudinal patient datasets. CONCLUSION Clinical and HRQoL assessments documented health impairments in physical functioning, fatigue, and cognition among these RRMS patients with relatively short disease duration. HRQoL data varied with clinical measures and contributed new information regarding disease burden. The association between clinical and HRQoL measures was limited to cross-sectional analysis and requires confirmation in longitudinal datasets. These findings reflect an ambulatory, early-stage RRMS population that was mostly European in location or descent. The PAMSI also requires further validation as a measure of patient health status.
Collapse
|