1
|
Mondal D, Vanbelle S, Cassese A, Candel MJJM. Review of sample size determination methods for the intraclass correlation coefficient in the one-way analysis of variance model. Stat Methods Med Res 2024; 33:532-553. [PMID: 38320802 PMCID: PMC10981208 DOI: 10.1177/09622802231224657] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/28/2024]
Abstract
Reliability of measurement instruments providing quantitative outcomes is usually assessed by an intraclass correlation coefficient. When participants are repeatedly measured by a single rater or device, or, are each rated by a different group of raters, the intraclass correlation coefficient is based on a one-way analysis of variance model. When planning a reliability study, it is essential to determine the number of participants and measurements per participant (i.e. number of raters or number of repeated measurements). Three different sample size determination approaches under the one-way analysis of variance model were identified in the literature, all based on a confidence interval for the intraclass correlation coefficient. Although eight different confidence interval methods can be identified, Wald confidence interval with Fisher's large sample variance approximation remains most commonly used despite its well-known poor statistical properties. Therefore, a first objective of this work is comparing the statistical properties of all identified confidence interval methods-including those overlooked in previous studies. A second objective is developing a general procedure to determine the sample size using all approaches since a closed-form formula is not always available. This procedure is implemented in an R Shiny app. Finally, we provide advice for choosing an appropriate sample size determination method when planning a reliability study.
Collapse
Affiliation(s)
- Dipro Mondal
- Faculty of Health Medicine and Life Sciences, Department of Methodology and Statistics, Care and Public Health Research Institute (CAPHRI), Maastricht University, Limburg, The Netherlands
| | - Sophie Vanbelle
- Faculty of Health Medicine and Life Sciences, Department of Methodology and Statistics, Care and Public Health Research Institute (CAPHRI), Maastricht University, Limburg, The Netherlands
| | - Alberto Cassese
- Department of Statistics, Computer Science, Applications “Giuseppe Parenti”, The University of Florence, Italy
| | - Math JJM Candel
- Faculty of Health Medicine and Life Sciences, Department of Methodology and Statistics, Care and Public Health Research Institute (CAPHRI), Maastricht University, Limburg, The Netherlands
| |
Collapse
|
2
|
Middleton H, Grimes L, Willis SC, Steinke D, Shaw M. Reliability and validity testing of the medicines related - consultation assessment tool for assessing pharmacists' consultations. Int J Clin Pharm 2023; 45:201-9. [PMID: 36394786 DOI: 10.1007/s11096-022-01489-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2022] [Accepted: 09/14/2022] [Indexed: 11/18/2022]
Abstract
BACKGROUND Demonstrating a person-centred approach in a consultation is a key component of delivering high-quality healthcare. To support development of such an approach requires training underpinned by valid assessment tools. Given the lack of a suitable pharmacy-specific tool, a new global consultation skills assessment tool: the medicines related-consultation assessment tool (MR-CAT) was designed and tested. AIM This study aimed to test the validity and reliability of the MR-CAT using psychometric methods. METHOD Psychometric testing involved analysis of participants' (n = 13) assessment of fifteen pre-recorded simulated consultations using the MR-CAT. Analysis included discriminant validity testing, intrarater and interrater reliability testing for each of the five sections of the MR-CAT and for the overall global assessment of the consultation. Analysis also included internal consistency testing for the whole tool. RESULTS Internal consistency for the overall global assessment of the consultation was good (Cronbach's alpha = 0.97). The MR-CAT discriminated well for the overall global assessment of the consultation (p < 0.001). Moderate to high intrarater reliability was observed for the overall global assessment of the consultation and for all five sections of the MR-CAT (rho = 0.64-0.84) in the test-retest analysis. Moderate to good interrater reliability (Kendall's W = 0.68-0.90) was observed for the overall global assessment of the consultation and for all five sections of the MR-CAT. CONCLUSION The MR-CAT is a valid and reliable tool for assessing person-centred pharmacist's consultations. Moreover, its unique design means that the MR-CAT can be used in both formative and summative assessment.
Collapse
|
3
|
Runer A, Dammerer D, Kranewitter C, Giesinger JM, Henninger B, Hirschmann MT, Liebensteiner MC. Injuries to the anterolateral ligament are observed more frequently compared to lesions to the deep iliotibial tract (Kaplan fibers) in anterior cruciate ligamant deficient knees using magnetic resonance imaging. Knee Surg Sports Traumatol Arthrosc 2022; 30:309-318. [PMID: 33770221 PMCID: PMC8800892 DOI: 10.1007/s00167-021-06535-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/08/2020] [Accepted: 03/09/2021] [Indexed: 01/26/2023]
Abstract
PURPOSE To determine the accuracy of detection, injury rate and inter- and intrarater reproducibility in visualizing lesions to the anterolateral ligament (ALL) and the deep portion of the iliotibial tract (dITT) in anterior cruciate ligament (ACL) deficient knees. METHODS Ninety-one consecutive patients, out of those 25 children (age 14.3 ± 3.5 years), with diagnosed ACL tears were included. Two musculoskeletal radiologists retrospectively reviewed MRI data focusing on accuracy of detection and potential injuries to the ALL or dITT. Lesion were diagnosed in case of discontinued fibers in combination with intra- or peri-ligamentous edema and graded as intact, partial or complete tears. Cohen's Kappa and 95% confidence intervals (95% CI) were determined for inter- and intrarater reliability measures. RESULTS The ALL and dITT were visible in 52 (78.8%) and 56 (84.8%) of adult-and 25 (100%) and 19 (76.0%) of pediatric patients, respectively. The ALL was injured in 45 (58.5%; partial: 36.4%, compleate: 22.1%) patients. Partial and comleate tears, where visualized in 21 (40.4%) and 16 (30.8%) adult- and seven (28.0%) and one (4%) peditric patients. A total of 16 (21.3%; partial: 13.3%, compleate: 8.0%) dITT injuries were identified. Partal and complete lesions were seen in seven (12.5%) and five (8.9%) adult- and three (15.8%) and one (5.3%) pediatric patients. Combined injuries were visualized in nine (12.7%) patients. Inter-observer (0.91-0.95) and intra-observer (0.93-0.95) reproducibility was high. CONCLUSION In ACL injured knees, tears of the ALL are observed more frequently compared to lesions to the deep iliotibial tract. Combined injuries of both structures are rare. Clinically, the preoperative visualization of potentially injured structures of the anterolateral knee is crucial and is important for a more personalized preoperative planning and tailored anatomical reconstruction. The clinical implication of injuries to the anterolateral complex of the knee needs further investigation. LEVEL OF EVIDENCE II.
Collapse
Affiliation(s)
- Armin Runer
- Department of Orthopaedics and Traumatology, Medical University of Innsbruck, Anichstrasse 35, 6020 Innsbruck, Austria
| | - Dietmar Dammerer
- Department of Orthopaedics and Traumatology, Medical University of Innsbruck, Anichstrasse 35, 6020 Innsbruck, Austria
| | | | - Johannes M. Giesinger
- Innsbruck Institute of Patient-Centered Outcome Research (IIPCOR), Innsbruck, Austria
| | - Benjamin Henninger
- Department of Orthopaedic Surgery, Medical University Innsbruck, Innsbruck, Austria
| | - Michael T. Hirschmann
- Department of Orthopaedic Surgery and Traumatology, Kantonsspital Baselland (Bruderholz, Liestal, Laufen), Bruderholz, Switzerland
| | - Michael C. Liebensteiner
- Department of Orthopaedics and Traumatology, Medical University of Innsbruck, Anichstrasse 35, 6020 Innsbruck, Austria
| |
Collapse
|
4
|
Alford EN, Rotman LE, Lepard JR, Agee BS, Markert JM. Interrater and Intrarater Reliability of the Colloid Cyst Risk Score. Neurosurgery 2020; 86:E47-E53. [PMID: 31552408 DOI: 10.1093/neuros/nyz399] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2018] [Accepted: 07/05/2019] [Indexed: 11/12/2022] Open
Abstract
BACKGROUND The Colloid Cyst Risk Score (CCRS) was developed to identify symptomatic patients and stratify risk of hydrocephalus among patients with colloid cysts. Its components consider patient age, cyst diameter, presence/absence of headache, fluid-attenuated inversion recovery (FLAIR) hyperintensity, and location within the third ventricle. OBJECTIVE To independently evaluate the inter- and intrarater reliability of the CCRS. METHODS Patients with a colloid cyst were identified from billing records and radiology archives. Three independent raters reviewed electronic medical records to determine age, presence/absence of headache, cyst diameter (mm), FLAIR hyperintensity, and risk zone location. Raters made 53 observations, including 5 repeat observations.Fleiss' generalized kappa (κ) was calculated for all of the nominal criteria, whereas Kendall's coefficient of concordance (W) and the intraclass correlation coefficient (ICC) were calculated for the overall score. RESULTS Total CCRS score demonstrated extremely strong agreement (W = 0.83) using Kendall's W coefficient and good agreement (ICC = 0.74) using the ICC (P < .001). For interrater reliability of individual criteria, age (κ = 1.00) and FLAIR hyperintensity (κ = 0.89) demonstrated near perfect agreement. Axial diameter (κ = 0.63) demonstrated substantial agreement, whereas agreement was moderate for risk zone (κ = 0.51) and fair for headache (κ = 0.26). Intrarater reliability for total CCRS score was extremely strong using Kendall's W, good to excellent using ICC, and fair to substantial using weighted kappa. CONCLUSION The CCRS has good inter- and intrarater reliability when tested in an independent sample of patients, though strength of agreement varies among individual criteria. The validity of the CCRS requires independent evaluation.
Collapse
Affiliation(s)
- Elizabeth N Alford
- Department of Neurosurgery, University of Alabama at Birmingham, Birmingham, Alabama
| | - Lauren E Rotman
- Department of Neurosurgery, University of Alabama at Birmingham, Birmingham, Alabama
| | - Jacob R Lepard
- Department of Neurosurgery, University of Alabama at Birmingham, Birmingham, Alabama
| | - Bonita S Agee
- Department of Neurosurgery, University of Alabama at Birmingham, Birmingham, Alabama
| | - James M Markert
- Department of Neurosurgery, University of Alabama at Birmingham, Birmingham, Alabama
| |
Collapse
|
5
|
Parker LA, Kunduk M, Fink DS, McWhorter A. Reliability of High-speed Videoendoscopic Ratings of Essential Voice Tremor and Adductor Spasmodic Dysphonia. J Voice 2019; 33:16-26. [PMID: 29246397 DOI: 10.1016/j.jvoice.2017.10.009] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2017] [Revised: 10/06/2017] [Accepted: 10/11/2017] [Indexed: 11/23/2022]
Abstract
OBJECTIVES The main objectives of this study were to (1) examine intrarater and inter-rater reliabilities in perceptual ratings of vocal fold vibratory patterns and supraglottic characteristics for essential vocal tremor and adductor spasmodic dysphonia (AdSD) using high-speed videoendoscopy (HSV), and (2) to investigate the specificity of the parameters in differentiating these two voice disorders. METHODS HSV recordings of 34 cases diagnosed with essential vocal tremor, AdSD, or AdSD with vocal tremor were evaluated blindly by two voice speech pathologists. The two raters examined all HSV video segments twice across nine supraglottic and vocal fold vibratory characteristics for inter-rater and intrarater reliabilities. A separate consensus rating was then developed, with the results analyzed to explore differentiation. RESULTS Raters demonstrated moderate intrarater reliability with mean Spearman's rho correlation coefficients of 0.68 (rater 1) and 0.73 (rater 2). Moderate inter-rater reliability for the two raters was seen across all parameters with a mean Cohen's kappa coefficient of 0.51. Raters showed higher intrarater and inter-rater reliabilities for supraglottic parameters. Only the presence of tremor differentiated between the two voice disorders in cases with a consistent diagnosis. CONCLUSIONS The high level of concomitance between vocal tremor and AdSD may affect subjective perceptual analysis of supraglottic and vocal fold vibratory patterns. Results indicate similar global involvement of supraglottic laryngeal structures for both vocal tremor and AdSD.
Collapse
|
6
|
Jackson SM, Cheng MS, Smith AR Jr, Kolber MJ. Intrarater reliability of hand held dynamometry in measuring lower extremity isometric strength using a portable stabilization device. Musculoskelet Sci Pract 2017; 27:137-41. [PMID: 27476066 DOI: 10.1016/j.math.2016.07.010] [Citation(s) in RCA: 47] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/26/2016] [Revised: 06/25/2016] [Accepted: 07/15/2016] [Indexed: 11/20/2022]
Abstract
Hand held dynamometry (HHD) is a more objective way to quantify muscle force production (MP) compared to traditional manual muscle testing. HHD reliability can be negatively impacted by both the strength of the tester and the subject particularly in the lower extremities due to larger muscle groups. The primary aim of this investigation was to assess intrarater reliability of HHD with use of a portable stabilization device for lower extremity MP in an athletic population. Isometric lower extremity strength was measured for bilateral lower extremities including hip abductors, external rotators, adductors, knee extensors, and ankle plantar flexors was measured in a sample of healthy recreational runners (8 male, 7 females, = 30 limbs) training for a marathon. These measurements were assessed using an intrasession intrarater reliability design. Intraclass correlation coefficients (ICC) were calculated using 3,1 model based on the single rater design. The standard error of measurement (SEM) for each muscle group was also calculated. ICC were excellent ranging from ICC (3,1) = 0.93-0.98 with standard error of measurements ranging from 0.58 to 17.2 N. This study establishes the use of a HHD with a portable stabilization device as demonstrating good reliability within testers for measuring lower extremity muscle performance in an active healthy population.
Collapse
|
7
|
Hellweg S, Schuster-Amft C. German version, inter- and intrarater reliability and internal consistency of the "Agitated Behavior Scale" (ABS-G) in patients with moderate to severe traumatic brain injury. Health Qual Life Outcomes 2016; 14:106. [PMID: 27431448 PMCID: PMC4950165 DOI: 10.1186/s12955-016-0511-x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2015] [Accepted: 07/13/2016] [Indexed: 11/17/2022] Open
Abstract
Background Agitation is frequently observed during early recovery after traumatic brain injury (TBI). Agitated behaviour often interferes with a goal-orientated rehabilitation and can be a substantial hindrance to therapy. Despite the relatively high occurance of agitation in TBI population there is no objective assessement in German (G) available. An existing scale with excellent psychometric properties is the “Agitated Behavior Scale (ABS)” developed by Corrigan in 1989. The aim of the study was to translate the Agitated Behavior Scale (ABS) into German (ABS-G) and investigate the inter- and intrarater reliability and internal consistency in patients with moderate to severe TBI. Methods A formal nine-step translation and cross-cultural adaptation procedure (TCCA) was applied. Subsequently a prospective observational patient study was conducted. To examine the interrater reliability and internal consistency, two therapists rated 20 patients independently after a therapy session. This procedure was repeated twice on a weekly basis. The intrarater reliability was assessed through video recordings from three patients. Nine raters scored the demonstrated behaviour on the videotape with the ABS-G independently twice within one month. The inter- and intrarater reliability were evaluated with the Spearman rank correlation coefficient and the quadratic weighted kappa. The internal consistency was tested with Cronbach’s alpha. Results Behaviour of 20 patients (18 males; mean age 41 ± 20.7; mean Functional Independence Measure (FIM) cognitive score on admission 7.1 ± 4.04; mean ABS-G score at first observation 17.3 ± 2.83) was assessed threefold. Interrater reliability yielded a correlation coefficient for ABS-G total score of all 60 paired observations of rs 0.845 and a weighted Kappa of 0.738. Intrarater reliability for ABS-G total score ranged between rs 0.719 and 0.953 and showed a weighted Kappa between 0.871 and 0.953. Cronbach’s alpha indicated moderate internal consistency with 0.661. Conclusion This study demonstrates that the ABS-G is a reliable instrument for evaluating agitation in patients with moderate to severe TBI. Hereby it would be possible to monitor agitation objectively and optimise the management of agitated patients according to international recommendations.
Collapse
Affiliation(s)
- Stephanie Hellweg
- Department of Neurological Rehabilitation, Rehaklinik Bellikon, Bellikon, 5454, Switzerland. .,Institute of Physiotherapy, School of Health Professions, Zurich University of Applied Sciences, Winterthur, 8400, Switzerland.
| | - Corina Schuster-Amft
- Research Department, Reha Rheinfelden, Salinenstrasse 98, Rheinfelden, 4310, Switzerland.,Institute for Rehabilitation and Performance Technology, Bern University of Applied Sciences, Pestalozzistrasse 20, Burgdorf, 3400, Switzerland
| |
Collapse
|
8
|
Hirano M, Katoh M, Kawaguchi S, Uemura T. Intrarater reliabilities of shoulder joint horizontal adductor muscle strength measurements using a handheld dynamometer for geriatric and stroke patients. J Phys Ther Sci 2016; 28:51-5. [PMID: 26957727 PMCID: PMC4755973 DOI: 10.1589/jpts.28.51] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2015] [Accepted: 10/05/2015] [Indexed: 11/24/2022] Open
Abstract
[Purpose] This study aimed to verify the appropriate number of measurements and the
intrarater reliabilities of shoulder joint horizontal adductor muscle strength
measurements using a handheld dynamometer (HHD) for geriatric and stroke patients.
[Subjects and Methods] The subjects were 40 inpatients, who were divided into two groups:
20 stroke patients in the stroke group (SG), and 20 geriatric patients in the no-stroke
group (N-SG). Measurements were performed three times using an HHD with a belt. The
reliability was verified using Bland-Altman analysis and the intraclass correlation
coefficient (ICC). [Results] ICC (1, 1) was >0.9. A systematic bias was not observed
between the first and second measurement values except for the right side in N-SG. A
systematic bias between the maximum value obtained during the first and second
measurements and third measurement value was observed on the left side in N-SG, and on the
non-paralyzed side in SG: the third measurement values were small in both cases.
[Conclusion] Intrarater reliabilities were high for shoulder horizontal adductor strength
measurements using an HHD with a belt for geriatric and stroke patients. Taking the
systematic bias into consideration, these findings suggest that the required number of
measurements is two.
Collapse
Affiliation(s)
- Masahiro Hirano
- Department of Physical Therapy, Faculty of Health Sciences, Ryotokuji University, Japan
| | - Munenori Katoh
- Department of Physical Therapy, Faculty of Health Sciences, Ryotokuji University, Japan
| | - Saori Kawaguchi
- Department of Rehabilitation, Higashi Funabashi Hospital, Japan
| | - Tomomi Uemura
- Department of Rehabilitation, Higashi Funabashi Hospital, Japan
| |
Collapse
|
9
|
Thelle A, Gjerdevik M, Grydeland T, Skorge TD, Wentzel-Larsen T, Bakke PS. Pneumothorax size measurements on digital chest radiographs: Intra- and inter- rater reliability. Eur J Radiol 2015. [PMID: 26205973 DOI: 10.1016/j.ejrad.2015.07.003] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
PURPOSE Detailed and reliable methods may be important for discussions on the importance of pneumothorax size in clinical decision-making. Rhea's method is widely used to estimate pneumothorax size in percent based on chest X-rays (CXRs) from three measure points. Choi's addendum is used for anterioposterior projections. The aim of this study was to examine the intrarater and interrater reliability of the Rhea and Choi method using digital CXR in the ward based PACS monitors. MATERIALS AND METHODS Three physicians examined a retrospective series of 80 digital CXRs showing pneumothorax, using Rhea and Choi's method, then repeated in a random order two weeks later. We used the analysis of variance technique by Eliasziw et al. to assess the intrarater and interrater reliability in altogether 480 estimations of pneumothorax size. RESULTS Estimated pneumothorax sizes ranged between 5% and 100%. The intrarater reliability coefficient was 0.98 (95% one-sided lower-limit confidence interval C 0.96), and the interrater reliability coefficient was 0.95 (95% one-sided lower-limit confidence interval 0.93). CONCLUSION This study has shown that the Rhea and Choi method for calculating pneumothorax size has high intrarater and interrater reliability. These results are valid across gender, side of pneumothorax and whether the patient is diagnosed with primary or secondary pneumothorax.
Collapse
Affiliation(s)
- Andreas Thelle
- Department of Thoracic Medicine, Haukeland University Hospital, Jonas Lies vei 65, NO-5021 Bergen, Norway.
| | - Miriam Gjerdevik
- Norwegian Registry for Long-Term Mechanical Ventilation, Department of Thoracic Medicine, Haukeland University Hospital, Jonas Lies vei 65, NO-5021 Bergen, Norway; Norwegian Registry for Chronic Obstructive Pulmonary Disease, Department of Thoracic Medicine, Haukeland University Hospital, Jonas Lies vei 65, NO-5021 Bergen, Norway.
| | - Thomas Grydeland
- Department of Thoracic Medicine, Haukeland University Hospital, Jonas Lies vei 65, NO-5021 Bergen, Norway.
| | - Trude D Skorge
- Department of Occupational Medicine, Haukeland University Hospital, Jonas Lies vei 65, NO-5021 Bergen, Norway.
| | - Tore Wentzel-Larsen
- Norwegian Centre for Violence and Traumatic Stress Studies, Oslo, Norway; Centre for Child and Adolescent Mental Health, Eastern and Southern Norway, P.O.Box 4623 Nydalen, NO-0405 Oslo, Norway; Centre for Clinical Research, Haukeland University Hospital,Jonas Lies vei 65, NO-5021 Bergen, Norway.
| | - Per S Bakke
- Department of Clinical Science, University of Bergen, Jonas Lies vei 65, NO-5021, Norway.
| |
Collapse
|
10
|
Jalan NS, Daftari SS, Retharekar SS, Rairikar SA, Shyam AM, Sancheti PK. Intra- and inter-rater reliability of maximum inspiratory pressure measured using a portable capsule-sensing pressure gauge device in healthy adults. Can J Respir Ther 2015; 51:39-42. [PMID: 26089737 PMCID: PMC4467477] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
BACKGROUND Measurement of maximum inspiratory pressure is the most prevalent method used in clinical practice to assess the strength of the inspiratory muscles. Although there are many devices available for the assessment of inspiratory muscle strength, there is a dearth of literature describing the reliability of devices that can be used in clinical patient assessment. The capsule-sensing pressure gauge (CSPG-V) is a new tool that measures the strength of inspiratory muscles; it is easy to use, noninvasive, inexpensive and lightweight. OBJECTIVE To test the intra- and inter-rater reliability of a CSPG-V device in healthy adults. METHODS A cross-sectional study involving 80 adult subjects with a mean (± SD) age of 22±3 years was performed. Using simple randomization, 40 individuals (20 male, 20 female) were used for intrarater and 40 (20 male, 20 female) were used for inter-rater reliability testing of the CSPG-V device. The subjects performed three inspiratory efforts, which were sustained for at least 3 s; the best of the three readings was used for intra- and inter-rater comparison. The intra- and inter-rater reliability were calculated using intraclass correlation coefficients (ICCs). RESULTS The intrarater reliability ICC was 0.962 and the inter-rater reliability ICC was 0.922. CONCLUSION Results of the present study suggest that maximum inspiratory pressure measured using a CSPG-V device has excellent intraand inter-rater reliability, and can be used as a diagnostic and prognostic tool in patients with respiratory muscle impairment.
Collapse
Affiliation(s)
- Nikita S Jalan
- Sancheti Institute College of Physiotherapy, Maharashtra University of Health Sciences, Pune, Maharashtra, India
| | - Sonam S Daftari
- Sancheti Institute College of Physiotherapy, Maharashtra University of Health Sciences, Pune, Maharashtra, India
| | - Seemi S Retharekar
- Sancheti Institute College of Physiotherapy, Maharashtra University of Health Sciences, Pune, Maharashtra, India
| | - Savita A Rairikar
- Sancheti Institute College of Physiotherapy, Maharashtra University of Health Sciences, Pune, Maharashtra, India
| | - Ashok M Shyam
- Sancheti Institute for Orthopaedics & Rehabilitation, Maharashtra University of Health Sciences, Pune, Maharashtra, India
| | - Parag K Sancheti
- Sancheti Institute for Orthopaedics & Rehabilitation, Maharashtra University of Health Sciences, Pune, Maharashtra, India
| |
Collapse
|
11
|
Westergren A, Torfadóttir O, Hagell P. Inter- and intrarater reliability of Minimal Eating Observation and Nutrition Form - version II (MEONF-II) nurse assessments among hospital inpatients. BMC Nurs 2014; 13:18. [PMID: 25093011 PMCID: PMC4120003 DOI: 10.1186/1472-6955-13-18] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2014] [Accepted: 06/19/2014] [Indexed: 01/04/2023] Open
Abstract
Background The Minimal Eating Observation and Nutrition form – version II (MEONF – II) is a recently developed nursing nutritional screening tool. However, its inter- and intrarater reliability has not been assessed. Methods Inpatients (n = 24; median age, 69 years; 11 women) were assessed by eight nurses (interrater reliability, two nurses scored each patient independently) using the MEONF-II on two consecutive days (intrarater reliability, each patient was scored by the same nurse day 1 and day 2). Results Six patients were at moderate/high undernutrition risk. Inter- and intrarater reliabilities (Gwet’s agreement coefficient) for the MEONF-II 2-category classification (no/low risk versus moderate/high risk) were 0.93 and 0.81; for the 3-category classification (no/low – moderate – high risk) reliabilities (Gwet’s weighted agreement coefficient) were 0.98 and 0.88; and total score inter- and intrarater reliabilities (intraclass correlation) were 0.92 and 0.84. Conclusion Reliability of MEONF-II nurse assessments among adult hospital inpatients was supported and the tool can be used in research and clinical practice.
Collapse
Affiliation(s)
- Albert Westergren
- The PRO-CARE Group, School of Health and Society, Kristianstad University, Kristianstad SE-291 88, Sweden
| | | | - Peter Hagell
- The PRO-CARE Group, School of Health and Society, Kristianstad University, Kristianstad SE-291 88, Sweden
| |
Collapse
|
12
|
Grant AC, Abdel-Baki SG, Weedon J, Arnedo V, Chari G, Koziorynska E, Lushbough C, Maus D, McSween T, Mortati KA, Reznikov A, Omurtag A. EEG interpretation reliability and interpreter confidence: a large single-center study. Epilepsy Behav 2014; 32:102-7. [PMID: 24531133 PMCID: PMC3965251 DOI: 10.1016/j.yebeh.2014.01.011] [Citation(s) in RCA: 76] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/01/2013] [Revised: 01/16/2014] [Accepted: 01/20/2014] [Indexed: 10/25/2022]
Abstract
The intrarater and interrater reliability (I&IR) of EEG interpretation has significant implications for the value of EEG as a diagnostic tool. We measured both the intrarater reliability and the interrater reliability of EEG interpretation based on the interpretation of complete EEGs into standard diagnostic categories and rater confidence in their interpretations and investigated sources of variance in EEG interpretations. During two distinct time intervals, six board-certified clinical neurophysiologists classified 300 EEGs into one or more of seven diagnostic categories and assigned a subjective confidence to their interpretations. Each EEG was read by three readers. Each reader interpreted 150 unique studies, and 50 studies were re-interpreted to generate intrarater data. A generalizability study assessed the contribution of subjects, readers, and the interaction between subjects and readers to interpretation variance. Five of the six readers had a median confidence of ≥99%, and the upper quartile of confidence values was 100% for all six readers. Intrarater Cohen's kappa (κc) ranged from 0.33 to 0.73 with an aggregated value of 0.59. Cohen's kappa ranged from 0.29 to 0.62 for the 15 reader pairs, with an aggregated Fleiss kappa of 0.44 for interrater agreement. Cohen's kappa was not significantly different across rater pairs (chi-square=17.3, df=14, p=0.24). Variance due to subjects (i.e., EEGs) was 65.3%, due to readers was 3.9%, and due to the interaction between readers and subjects was 30.8%. Experienced epileptologists have very high confidence in their EEG interpretations and low to moderate I&IR, a common paradox in clinical medicine. A necessary, but insufficient, condition to improve EEG interpretation accuracy is to increase intrarater and interrater reliability. This goal could be accomplished, for instance, with an automated online application integrated into a continuing medical education module that measures and reports EEG I&IR to individual users.
Collapse
Affiliation(s)
- Arthur C. Grant
- Department of Neurology, SUNY Downstate Medical Center, Brooklyn, NY, USA,Department of Physiology & Pharmacology, SUNY Downstate Medical Center, Brooklyn, NY, USA,To whom correspondence should be addressed at: SUNY Downstate Medical Center, Comprehensive Epilepsy Center, 450 Clarkson Ave., Box 1275, Brooklyn, NY 11203, 718.270.2959 (tel), 718.270.4711 (fax),
| | | | - Jeremy Weedon
- The Scientific Computing Center, SUNY Downstate Medical Center, Brooklyn, NY, USA
| | - Vanessa Arnedo
- Department of Neurology, SUNY Downstate Medical Center, Brooklyn, NY, USA
| | - Geetha Chari
- Department of Neurology, SUNY Downstate Medical Center, Brooklyn, NY, USA
| | - Ewa Koziorynska
- Department of Neurology, SUNY Downstate Medical Center, Brooklyn, NY, USA
| | | | - Douglas Maus
- Department of Neurology, SUNY Downstate Medical Center, Brooklyn, NY, USA,Department of Physiology & Pharmacology, SUNY Downstate Medical Center, Brooklyn, NY, USA
| | - Tresa McSween
- Department of Neurology, SUNY Downstate Medical Center, Brooklyn, NY, USA
| | | | - Alexandra Reznikov
- Department of Neurology, SUNY Downstate Medical Center, Brooklyn, NY, USA
| | - Ahmet Omurtag
- BioSignal Group, Corp. Brooklyn, NY, USA,Department of Biomedical Engineering, University of Houston, Houston, TX, USA
| |
Collapse
|