Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Hofner B, Boccuto L, Göker M. Controlling false discoveries in high-dimensional situations: boosting with stability selection. BMC Bioinformatics 2015;16:144. [PMID: 25943565 PMCID: PMC4464883 DOI: 10.1186/s12859-015-0575-3] [Citation(s) in RCA: 64] [Impact Index Per Article: 7.1] [Received: 11/07/2014] [Accepted: 04/16/2015] [Indexed: 12/17/2022] Open

Cited by Other Article(s)

Dürauer A, Jungbauer A, Scharl T. Sensors and chemometrics in downstream processing. Biotechnol Bioeng 2024;121:2347-2364. [PMID: 37470278 DOI: 10.1002/bit.28499] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/15/2023] [Revised: 06/14/2023] [Accepted: 07/07/2023] [Indexed: 07/21/2023]

Potts S, Bergherr E, Reinke C, Griesbach C. Prediction-based variable selection for component-wise gradient boosting. Int J Biostat 2024;20:293-314. [PMID: 38000054 DOI: 10.1515/ijb-2023-0052] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/24/2023] [Accepted: 09/18/2023] [Indexed: 11/26/2023]

Abiose O, Rutledge J, Moran‐Losada P, Belloy ME, Wilson EN, He Z, Trelle AN, Channappa D, Romero A, Park J, Yutsis MV, Sha SJ, Andreasson KI, Poston KL, Henderson VW, Wagner AD, Wyss‐Coray T, Mormino EC. Post-translational modifications linked to preclinical Alzheimer's disease-related pathological and cognitive changes. Alzheimers Dement 2024;20:1851-1867. [PMID: 38146099 PMCID: PMC10984434 DOI: 10.1002/alz.13576] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/01/2023] [Revised: 11/08/2023] [Accepted: 11/13/2023] [Indexed: 12/27/2023]

Affiliation(s)

Olamide Abiose Department of Neurology and Neurological Sciences, Stanford University School of Medicine, Palo Alto, California, USA; Wu Tsai Neurosciences Institute, Stanford University School of Medicine, Stanford, California, USA
Jarod Rutledge The Phil and Penny Knight Initiative for Brain Resilience, Stanford University, Stanford, California, USA; Department of Genetics, Stanford University, Stanford, California, USA
Patricia Moran‐Losada Department of Neurology and Neurological Sciences, Stanford University School of Medicine, Palo Alto, California, USA; Wu Tsai Neurosciences Institute, Stanford University School of Medicine, Stanford, California, USA; The Phil and Penny Knight Initiative for Brain Resilience, Stanford University, Stanford, California, USA
Michael E. Belloy Department of Neurology and Neurological Sciences, Stanford University School of Medicine, Palo Alto, California, USA
Edward N. Wilson Department of Neurology and Neurological Sciences, Stanford University School of Medicine, Palo Alto, California, USA; Wu Tsai Neurosciences Institute, Stanford University School of Medicine, Stanford, California, USA
Zihuai He Department of Neurology and Neurological Sciences, Stanford University School of Medicine, Palo Alto, California, USA; Center for Biomedical Informatics Research, Stanford University School of Medicine, Stanford, California, USA
Alexandra N. Trelle Department of Neurology and Neurological Sciences, Stanford University School of Medicine, Palo Alto, California, USA
Divya Channappa Department of Neurology and Neurological Sciences, Stanford University School of Medicine, Palo Alto, California, USA
America Romero Department of Neurology and Neurological Sciences, Stanford University School of Medicine, Palo Alto, California, USA
Jennifer Park Department of Neurology and Neurological Sciences, Stanford University School of Medicine, Palo Alto, California, USA
Maya V. Yutsis Department of Neurology and Neurological Sciences, Stanford University School of Medicine, Palo Alto, California, USA
Sharon J. Sha Department of Neurology and Neurological Sciences, Stanford University School of Medicine, Palo Alto, California, USA
Katrin I. Andreasson Department of Neurology and Neurological Sciences, Stanford University School of Medicine, Palo Alto, California, USA; Wu Tsai Neurosciences Institute, Stanford University School of Medicine, Stanford, California, USA; Chan Zuckerberg Biohub, San Francisco, California, USA
Kathleen L. Poston Department of Neurology and Neurological Sciences, Stanford University School of Medicine, Palo Alto, California, USA; Wu Tsai Neurosciences Institute, Stanford University School of Medicine, Stanford, California, USA; The Phil and Penny Knight Initiative for Brain Resilience, Stanford University, Stanford, California, USA
Victor W. Henderson Department of Neurology and Neurological Sciences, Stanford University School of Medicine, Palo Alto, California, USA; Department of Epidemiology & Population Health, Stanford University School of Medicine, Stanford, California, USA
Anthony D. Wagner Wu Tsai Neurosciences Institute, Stanford University School of Medicine, Stanford, California, USA; Department of Psychology, Stanford University, Stanford, California, USA
Tony Wyss‐Coray Department of Neurology and Neurological Sciences, Stanford University School of Medicine, Palo Alto, California, USA; Wu Tsai Neurosciences Institute, Stanford University School of Medicine, Stanford, California, USA; The Phil and Penny Knight Initiative for Brain Resilience, Stanford University, Stanford, California, USA
Elizabeth C. Mormino Department of Neurology and Neurological Sciences, Stanford University School of Medicine, Palo Alto, California, USA; Wu Tsai Neurosciences Institute, Stanford University School of Medicine, Stanford, California, USA

Pedrero-Martin Y, Falla D, Rodriguez-Brazzarola P, Torrontegui-Duarte M, Fernandez-Sanchez M, Jerez-Aragones JM, Liew BXW, Luque-Suarez A. Prognostic Factors of Perceived Disability and Perceived Recovery After Whiplash: A Longitudinal, Prospective Study With One-year Follow-up. Clin J Pain 2024;40:165-173. [PMID: 38031848 DOI: 10.1097/ajp.0000000000001182] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/25/2023] [Accepted: 11/20/2023] [Indexed: 12/01/2023]

Battauz M, Vidoni P. A boosting method to select the random effects in linear mixed models. Biometrics 2024;80:ujae010. [PMID: 38465986 DOI: 10.1093/biomtc/ujae010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/02/2023] [Revised: 12/07/2023] [Accepted: 01/29/2024] [Indexed: 03/12/2024]

Cardner M, Marass F, Gedvilaite E, Yang JL, Tsui DWY, Beerenwinkel N. Predicting tumour content of liquid biopsies from cell-free DNA. BMC Bioinformatics 2023;24:368. [PMID: 37777714 PMCID: PMC10543881 DOI: 10.1186/s12859-023-05478-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/11/2022] [Accepted: 09/12/2023] [Indexed: 10/02/2023] Open

Liew BXW, Kovacs FM, Rügamer D, Royuela A. Automatic Variable Selection Algorithms in Prognostic Factor Research in Neck Pain. J Clin Med 2023;12:6232. [PMID: 37834877 PMCID: PMC10573798 DOI: 10.3390/jcm12196232] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/28/2023] [Revised: 09/21/2023] [Accepted: 09/26/2023] [Indexed: 10/15/2023] Open

Abstract

This study aims to compare the variable selection strategies of different machine learning (ML) and statistical algorithms in the prognosis of neck pain (NP) recovery. A total of 3001 participants with NP were included. Three dichotomous outcomes of an improvement in NP, arm pain (AP), and disability at 3-month follow-up were used. Twenty-five variables (twenty-eight parameters) were included as predictors. There were more parameters than variables, as some categorical variables had >2 levels. Eight modelling techniques were compared: stepwise regression based on unadjusted p values (stepP), on adjusted p values (stepPAdj), on Akaike information criterion (stepAIC), best subset regression (BestSubset), least absolute shrinkage and selection operator (LASSO), minimax concave penalty (MCP), model-based boosting (mboost), and multivariate adaptive regression splines (MuARS). The algorithm that selected the fewest predictors was stepPAdj (number of predictors, p = 4 to 8). MuARS was the algorithm with the second fewest predictors selected (p = 9 to 14). The predictor selected by all algorithms with the largest coefficient magnitude was "having undergone a neuroreflexotherapy intervention" for NP (β = 1.987 to 2.296) and AP (β = 2.639 to 3.554), and "Imaging findings: spinal stenosis" (β = -1.331 to -1.763) for disability. Stepwise regression based on adjusted p values resulted in the sparsest models, which enhanced clinical interpretability. MuARS appears to provide the best balance between model sparsity and high predictive performance across outcomes. Different algorithms produced similar performances but resulted in a different number of variables selected. Rather than relying on any single algorithm, confidence in the variable selection may be increased by using multiple algorithms.

Zanetti D, Stell L, Gustafsson S, Abbasi F, Tsao PS, Knowles JW, Zethelius B, Ärnlöv J, Balkau B, Walker M, Lazzeroni LC, Lind L, Petrie JR, Assimes TL. Plasma proteomic signatures of a direct measure of insulin sensitivity in two population cohorts. Diabetologia 2023;66:1643-1654. [PMID: 37329449 PMCID: PMC10390625 DOI: 10.1007/s00125-023-05946-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/20/2022] [Accepted: 04/12/2023] [Indexed: 06/19/2023]

Abstract

AIMS/HYPOTHESIS

The euglycaemic-hyperinsulinaemic clamp (EIC) is the reference standard for the measurement of whole-body insulin sensitivity but is laborious and expensive to perform. We aimed to assess the incremental value of high-throughput plasma proteomic profiling in developing signatures correlating with the M value derived from the EIC.

METHODS

We measured 828 proteins in the fasting plasma of 966 participants from the Relationship between Insulin Sensitivity and Cardiovascular disease (RISC) study and 745 participants from the Uppsala Longitudinal Study of Adult Men (ULSAM) using a high-throughput proximity extension assay. We used the least absolute shrinkage and selection operator (LASSO) approach using clinical variables and protein measures as features. Models were tested within and across cohorts. Our primary model performance metric was the proportion of the M value variance explained (R²).

RESULTS

A standard LASSO model incorporating 53 proteins in addition to routinely available clinical variables increased the M value R² from 0.237 (95% CI 0.178, 0.303) to 0.456 (0.372, 0.536) in RISC. A similar pattern was observed in ULSAM, in which the M value R² increased from 0.443 (0.360, 0.530) to 0.632 (0.569, 0.698) with the addition of 61 proteins. Models trained in one cohort and tested in the other also demonstrated significant improvements in R² despite differences in baseline cohort characteristics and clamp methodology (RISC to ULSAM: 0.491 [0.433, 0.539] for 51 proteins; ULSAM to RISC: 0.369 [0.331, 0.416] for 67 proteins). A randomised LASSO and stability selection algorithm selected only two proteins per cohort (three unique proteins), which improved R² but to a lesser degree than in standard LASSO models: 0.352 (0.266, 0.439) in RISC and 0.495 (0.404, 0.585) in ULSAM. Reductions in improvements of R² with randomised LASSO and stability selection were less marked in cross-cohort analyses (RISC to ULSAM R² 0.444 [0.391, 0.497]; ULSAM to RISC R² 0.348 [0.300, 0.396]). Models of proteins alone were as effective as models that included both clinical variables and proteins using either standard or randomised LASSO. The single most consistently selected protein across all analyses and models was IGF-binding protein 2.

CONCLUSIONS/INTERPRETATION

A plasma proteomic signature identified using a standard LASSO approach improves the cross-sectional estimation of the M value over routine clinical variables. However, a small subset of these proteins identified using a stability selection algorithm affords much of this improvement, especially when considering cross-cohort analyses. Our approach provides opportunities to improve the identification of insulin-resistant individuals at risk of insulin resistance-related adverse health consequences.

Affiliation(s)

Daniela Zanetti Department of Medicine, Division of Cardiovascular Medicine, Stanford University School of Medicine, Stanford, CA, USA VA Palo Alto Health Care System, Palo Alto, CA, USA
Laurel Stell VA Palo Alto Health Care System, Palo Alto, CA, USA Department of Biomedical Data Science, Stanford University School of Medicine, Stanford, CA, USA
Stefan Gustafsson Department of Medical Sciences, Uppsala University, Uppsala, Sweden
Fahim Abbasi Department of Medicine, Division of Cardiovascular Medicine, Stanford University School of Medicine, Stanford, CA, USA Stanford Diabetes Research Center, Stanford University School of Medicine, Stanford, CA, USA
Philip S Tsao Department of Medicine, Division of Cardiovascular Medicine, Stanford University School of Medicine, Stanford, CA, USA VA Palo Alto Health Care System, Palo Alto, CA, USA Stanford Cardiovascular Institute, Stanford University School of Medicine, Stanford, CA, USA
Joshua W Knowles Department of Medicine, Division of Cardiovascular Medicine, Stanford University School of Medicine, Stanford, CA, USA Stanford Diabetes Research Center, Stanford University School of Medicine, Stanford, CA, USA Stanford Cardiovascular Institute, Stanford University School of Medicine, Stanford, CA, USA Stanford Prevention Research Center, Stanford University School of Medicine, Stanford, CA, USA
Björn Zethelius Department of Public Health/Geriatrics, Uppsala University, Uppsala, Sweden
Johan Ärnlöv Division of Family Medicine and Primary Care, Department of Neurobiology, Care Sciences and Society, Karolinska Institute, Stockholm, Sweden Department of Health and Social Studies, Dalarna University, Falun, Sweden
Beverley Balkau Clinical Epidemiology, Centre for Research in Epidemiology and Population Health, Inserm U1018, Villejuif, France
Mark Walker Translational and Clinical Research Institute, Newcastle University, Newcastle upon Tyne, UK
Laura C Lazzeroni Department of Biomedical Data Science, Stanford University School of Medicine, Stanford, CA, USA Department of Psychiatry and Behavioral Sciences, Stanford University, Stanford, CA, USA
Lars Lind Department of Medical Sciences, Uppsala University, Uppsala, Sweden
John R Petrie School of Health and Wellbeing, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, UK
Themistocles L Assimes Department of Medicine, Division of Cardiovascular Medicine, Stanford University School of Medicine, Stanford, CA, USA VA Palo Alto Health Care System, Palo Alto, CA, USA Stanford Diabetes Research Center, Stanford University School of Medicine, Stanford, CA, USA Stanford Cardiovascular Institute, Stanford University School of Medicine, Stanford, CA, USA Department of Epidemiology and Population Health, Stanford University School of Medicine, Stanford, CA, USA

Cardner M, Tuckwell D, Kostikova A, Forrer P, Siegel RM, Marti A, Vandemeulebroecke M, Ferrero E. Analysis of serum proteomics data identifies a quantitative association between beta-defensin 2 at baseline and clinical response to IL-17 blockade in psoriatic arthritis. RMD Open 2023;9:e003042. [PMID: 37321668 DOI: 10.1136/rmdopen-2023-003042] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Received: 01/30/2023] [Accepted: 05/22/2023] [Indexed: 06/17/2023] Open

Simon T, Mayr GJ, Morgenstern D, Umlauf N, Zeileis A. Amplification of annual and diurnal cycles of alpine lightning. Clim Dyn 2023;61:4125-4137. [PMID: 37854482 PMCID: PMC10579137 DOI: 10.1007/s00382-023-06786-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/18/2022] [Accepted: 04/10/2023] [Indexed: 10/20/2023]

Lokmer A, Alladi CG, Troudet R, Bacq-Daian D, Boland-Auge A, Latapie V, Deleuze JF, RajKumar RP, Shewade DG, Bélivier F, Marie-Claire C, Jamain S. Risperidone response in patients with schizophrenia drives DNA methylation changes in immune and neuronal systems. Epigenomics 2023;15:21-38. [PMID: 36919681 DOI: 10.2217/epi-2023-0017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 03/16/2023] Open

Affiliation(s)

Ana Lokmer Univ Paris Est Créteil, INSERM, IMRB, Translational Neuropsychiatry, Créteil, F-94000, France; Fondation FondaMental, Créteil, F-94000, France
Charanraj Goud Alladi Université de Paris, INSERM UMRS 1144, Optimisation Thérapeutique en Neuropsychopharmacologie (OTeN), Paris, F-75006, France
Réjane Troudet Univ Paris Est Créteil, INSERM, IMRB, Translational Neuropsychiatry, Créteil, F-94000, France; Fondation FondaMental, Créteil, F-94000, France
Delphine Bacq-Daian Université Paris-Saclay, CEA, Centre National de Recherche en Génomique Humaine (CNRGH), Evry, F-91057, France
Anne Boland-Auge Université Paris-Saclay, CEA, Centre National de Recherche en Génomique Humaine (CNRGH), Evry, F-91057, France
Violaine Latapie Univ Paris Est Créteil, INSERM, IMRB, Translational Neuropsychiatry, Créteil, F-94000, France; Fondation FondaMental, Créteil, F-94000, France
Jean-François Deleuze Université Paris-Saclay, CEA, Centre National de Recherche en Génomique Humaine (CNRGH), Evry, F-91057, France
Ravi Philip RajKumar Department of Pharmacology, Jawaharlal Institute of Postgraduate Medical Education & Research, Puducherry, 605006, India
Deepak Gopal Shewade Department of Psychiatry, Jawaharlal Institute of Postgraduate Medical Education & Research, Puducherry, 605006, India; Centre National de Recherche en Génomique Humaine (CNRGH), Institut de Biologie François Jacob, CEA, Université Paris-Saclay, Evry, F-91000, France
Frank Bélivier Fondation FondaMental, Créteil, F-94000, France; Université de Paris, INSERM UMRS 1144, Optimisation Thérapeutique en Neuropsychopharmacologie (OTeN), Paris, F-75006, France; Hôpitaux Lariboisière-Fernand Widal, GHU APHP Nord, Département de Psychiatrie et de Médecine Addictologique, Paris, F-75010, France
Cynthia Marie-Claire Université de Paris, INSERM UMRS 1144, Optimisation Thérapeutique en Neuropsychopharmacologie (OTeN), Paris, F-75006, France
Stéphane Jamain Univ Paris Est Créteil, INSERM, IMRB, Translational Neuropsychiatry, Créteil, F-94000, France; Fondation FondaMental, Créteil, F-94000, France

Capanu M, Giurcanu M, Begg CB, Gönen M. Subsampling based variable selection for generalized linear models. Comput Stat Data Anal 2023;184. [PMID: 37090139 PMCID: PMC10118238 DOI: 10.1016/j.csda.2023.107740] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 03/13/2023]

Ha CSR, Müller-Nurasyid M, Petrera A, Hauck SM, Marini F, Bartsch DK, Slater EP, Strauch K. Proteomics biomarker discovery for individualized prevention of familial pancreatic cancer using statistical learning. PLoS One 2023;18:e0280399. [PMID: 36701413 PMCID: PMC9879447 DOI: 10.1371/journal.pone.0280399] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Received: 03/23/2022] [Accepted: 12/28/2022] [Indexed: 01/27/2023] Open

Abstract

BACKGROUND

The low five-year survival rate of pancreatic ductal adenocarcinoma (PDAC) and the low diagnostic rate of early-stage PDAC via imaging highlight the need to discover novel biomarkers and improve the current screening procedures for early diagnosis. Familial pancreatic cancer (FPC) describes the cases of PDAC that are present in two or more individuals within a circle of first-degree relatives. Using innovative high-throughput proteomics, we were able to quantify the protein profiles of individuals at risk from FPC families in different potential pre-cancer stages. However, the high-dimensional proteomics data structure challenges the use of traditional statistical analysis tools. Hence, we applied advanced statistical learning methods to enhance the analysis and improve the results' interpretability.

METHODS

We applied model-based gradient boosting and adaptive lasso to deal with the small, unbalanced study design via simultaneous variable selection and model fitting. In addition, we used stability selection to identify a stable subset of selected biomarkers and, as a result, obtain even more interpretable results. In each step, we compared the performance of the different analytical pipelines and validated our approaches via simulation scenarios.

RESULTS

In the simulation study, model-based gradient boosting showed a more accurate prediction performance in the small, unbalanced, and high-dimensional datasets than adaptive lasso and could identify more relevant variables. Furthermore, using model-based gradient boosting, we discovered a subset of promising serum biomarkers that may potentially improve the current screening procedure of FPC.

CONCLUSION

Advanced statistical learning methods helped us overcome the shortcomings of an unbalanced study design in a valuable clinical dataset. The discovered serum biomarkers provide us with a clear direction for further investigations and more precise clinical hypotheses regarding the development of FPC and optimal strategies for its early detection.

Affiliation(s)

Chung Shing Rex Ha Institute of Medical Biostatistics, Epidemiology and Informatics (IMBEI), University Medical Center, Johannes Gutenberg University, Mainz, Germany Institute of Genetic Epidemiology, Helmholtz Zentrum München—German Research Center for Environmental Health, Neuherberg, Germany Faculty of Medicine, Institute for Medical Information Processing, Chair of Genetic Epidemiology, Biometry, and Epidemiology (IBE), LMU Munich, Munich, Germany
Martina Müller-Nurasyid Institute of Medical Biostatistics, Epidemiology and Informatics (IMBEI), University Medical Center, Johannes Gutenberg University, Mainz, Germany Institute of Genetic Epidemiology, Helmholtz Zentrum München—German Research Center for Environmental Health, Neuherberg, Germany Faculty of Medicine, Institute for Medical Information Processing, Biometry, and Epidemiology (IBE), LMU Munich, Munich, Germany Faculty of Medicine, Institute for Medical Information Processing, Pettenkofer School of Public Health Munich, Biometry, and Epidemiology (IBE), LMU Munich, Munich, Germany
Agnese Petrera Research Unit Protein Science and Metabolomics and Proteomics Core Facility, Helmholtz Zentrum München—German Research Center for Environmental Health, Neuherberg, Germany
Stefanie M. Hauck Research Unit Protein Science and Metabolomics and Proteomics Core Facility, Helmholtz Zentrum München—German Research Center for Environmental Health, Neuherberg, Germany
Federico Marini Institute of Medical Biostatistics, Epidemiology and Informatics (IMBEI), University Medical Center, Johannes Gutenberg University, Mainz, Germany Research Center for Immunotherapy (FZI), University Medical Center, Johannes Gutenberg University, Mainz, Germany
Detlef K. Bartsch Department of Visceral-, Thoracic- and Vascular Surgery, Philipps University, Marburg, Germany
Emily P. Slater Department of Visceral-, Thoracic- and Vascular Surgery, Philipps University, Marburg, Germany
Konstantin Strauch Institute of Medical Biostatistics, Epidemiology and Informatics (IMBEI), University Medical Center, Johannes Gutenberg University, Mainz, Germany Institute of Genetic Epidemiology, Helmholtz Zentrum München—German Research Center for Environmental Health, Neuherberg, Germany Faculty of Medicine, Institute for Medical Information Processing, Chair of Genetic Epidemiology, Biometry, and Epidemiology (IBE), LMU Munich, Munich, Germany

Failmezger H, Hessel H, Kapil A, Schmidt G, Harder N. Spatial heterogeneity of cancer associated protein expression in immunohistochemically stained images as an improved prognostic biomarker. Front Oncol 2022;12:964716. [PMID: 36601480 PMCID: PMC9806230 DOI: 10.3389/fonc.2022.964716] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/08/2022] [Accepted: 11/23/2022] [Indexed: 12/23/2022] Open

Comparison between LASSO and RT methods for prediction of generic E. coli concentration in pasture poultry farms. Food Res Int 2022;161:111860. [DOI: 10.1016/j.foodres.2022.111860] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/17/2022] [Revised: 07/28/2022] [Accepted: 08/21/2022] [Indexed: 11/21/2022]

Huemer MT, Petrera A, Hauck SM, Drey M, Peters A, Thorand B. Proteomics of the phase angle: Results from the population-based KORA S4 study. Clin Nutr 2022;41:1818-1826. [PMID: 35834914 DOI: 10.1016/j.clnu.2022.06.038] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/25/2022] [Revised: 06/01/2022] [Accepted: 06/23/2022] [Indexed: 11/03/2022]

Abstract

BACKGROUND & AIMS

The phase angle (PhA) measured with bioelectrical impedance analysis is considered to reflect the interrelated components body cell mass and fluid distribution based on technical and physical aspects of the PhA measurement. However, the biomedical meaning of the PhA remains vague. Previous studies mainly assessed associations of the PhA with numerous diseases and health outcomes, but few connected protein markers to the PhA. To broaden our understanding of the biomedical background of the PhA, we aimed to explore a proteomics profile associated with the PhA and related biological factors.

METHODS

The study sample encompassed 1484 participants (725 women and 759 men) aged 55-74 years from the population-based Cooperative Health Research in the Region of Augsburg (KORA) S4 study. Proteomics measurements were performed with a proximity extension assay. We employed boosting with stability selection to establish a set of markers that was strongly associated with the PhA from a group of 233 plasma protein markers. We integrated the selected protein markers into a network and enrichment analysis to identify gene ontology (GO) terms significantly overrepresented for the selected PhA protein markers.

RESULTS

Boosting with stability selection identified seven protein markers that were strongly and independently associated with the PhA: N-terminal prohormone brain natriuretic peptide (NT-proBNP), insulin-like growth factor-binding protein 2 (IGFBP2), adrenomedullin (ADM), myoglobin (MB), matrix metalloproteinase-9 (MMP9), protein-glutamine gamma-glutamyltransferase 2 (TGM2), and fractalkine (CX3CL1) [beta coefficient per 1 standard deviation increase in normalized protein expression values on a log 2 scale (95% confidence interval): -0.12 (-0.15, -0.08), -0.13 (-0.17, -0.09), -0.14 (-0.18, -0.10), 0.10 (0.07, 0.14), 0.07 (0.04, 0.10), 0.08 (0.05, 0.11), -0.06 (-0.10, -0.03), respectively]. According to the enrichment analysis, this protein profile was significantly overrepresented in the following top five GO terms: positive regulation of cell population proliferation (p-value: 1.32E-04), extracellular space (p-value: 1.34E-04), anatomical structure formation involved in morphogenesis (p-value: 2.92E-04), regulation of multicellular organismal development (p-value: 5.72E-04), and metal ion homeostasis (p-value: 8.86E-04).

CONCLUSION

Implementing a proteomics approach, we identified six new protein markers strongly associated with the PhA and confirmed that NT-proBNP is a key PhA marker. The main biological processes that were related to this PhA's protein profile are involved in regulating the amount and growth of cells, reinforcing, from a biomedical perspective, the current technical-based consensus of the PhA to reflect body cell mass.

Priya S, Burns MB, Ward T, Mars RAT, Adamowicz B, Lock EF, Kashyap PC, Knights D, Blekhman R. Identification of shared and disease-specific host gene-microbiome associations across human diseases using multi-omic integration. Nat Microbiol 2022;7:780-795. [PMID: 35577971 PMCID: PMC9159953 DOI: 10.1038/s41564-022-01121-z] [Citation(s) in RCA: 54] [Impact Index Per Article: 27.0] [Received: 03/01/2021] [Accepted: 04/06/2022] [Indexed: 12/19/2022]

Teitsdottir UD, Darreh-Shori T, Lund SH, Jonsdottir MK, Snaedal J, Petersen PH. Phenotypic Displays of Cholinergic Enzymes Associate With Markers of Inflammation, Neurofibrillary Tangles, and Neurodegeneration in Pre- and Early Symptomatic Dementia Subjects. Front Aging Neurosci 2022;14:876019. [PMID: 35693340 PMCID: PMC9178195 DOI: 10.3389/fnagi.2022.876019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/14/2022] [Accepted: 05/02/2022] [Indexed: 11/13/2022] Open

Abstract

Background

Cholinergic drugs are the most commonly used drugs for the treatment of Alzheimer’s disease (AD). Therefore, a better understanding of the cholinergic system and its relation to both AD-related biomarkers and cognitive functions is of high importance.

Objectives

To evaluate the relationships of cerebrospinal fluid (CSF) cholinergic enzymes with markers of amyloidosis, neurodegeneration, neurofibrillary tangles, inflammation and performance on verbal episodic memory in a memory clinic cohort.

Methods

In this cross-sectional study, 46 cholinergic drug-free subjects (median age = 71, 54% female, median MMSE = 28) were recruited from an Icelandic memory clinic cohort targeting early stages of cognitive impairment. Enzyme activity of acetylcholinesterase (AChE) and butyrylcholinesterase (BuChE) was measured in CSF as well as levels of amyloid-β₁₋₄₂ (Aβ₄₂), phosphorylated tau (P-tau), total-tau (T-tau), neurofilament light (NFL), YKL-40, S100 calcium-binding protein B (S100B), and glial fibrillary acidic protein (GFAP). Verbal episodic memory was assessed with the Rey Auditory Verbal Learning Test (RAVLT) and Story tests.

Results

No significant relationships were found between CSF Aβ₄₂ levels and AChE or BuChE activity (p > 0.05). In contrast, T-tau (r = 0.46, p = 0.001) and P-tau (r = 0.45, p = 0.002) levels correlated significantly with AChE activity. Although neurodegeneration markers T-tau and NFL did correlate with each other (r = 0.59, p < 0.001), NFL did not correlate with AChE (r = 0.25, p = 0.09) or BuChE (r = 0.27, p = 0.06). Inflammation markers S100B and YKL-40 both correlated significantly with AChE (S100B: r = 0.43, p = 0.003; YKL-40: r = 0.32, p = 0.03) and BuChE (S100B: r = 0.47, p < 0.001; YKL-40: r = 0.38, p = 0.009) activity. A weak correlation was detected between AChE activity and the composite score reflecting verbal episodic memory (r = −0.34, p = 0.02). LASSO regression analyses with a stability approach were performed for the selection of a set of measures best predicting cholinergic activity and verbal episodic memory score. S100B was the predictor with the highest model selection frequency for both AChE (68%) and BuChE (73%) activity. Age (91%) was the most reliable predictor for verbal episodic memory, with selection frequency of both cholinergic enzymes below 10%.

Conclusions

Results indicate an association of higher activity of the ACh-degrading cholinergic enzymes with increased neurodegeneration, neurofibrillary tangles and inflammation in the stages of pre- and early symptomatic dementia, independent of CSF Aβ₄₂ levels.
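The "stability approach" mentioned in the Results can be illustrated with a minimal sketch: a selector is re-run on many random half-samples, and each predictor is scored by the fraction of runs in which it is chosen (its selection frequency). This is not the authors' code. As a loudly labeled simplification, the sketch swaps the LASSO for a stand-in selector that keeps the `q` predictors with the largest absolute marginal correlation, so that it runs with the standard library only. The names `abs_correlation` and `selection_frequencies` and all parameter defaults are hypothetical.

```python
import random


def abs_correlation(xs, ys):
    """Absolute Pearson correlation between two equal-length lists."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        return 0.0
    return abs(sxy) / (sxx * syy) ** 0.5


def selection_frequencies(X, y, q=1, n_subsamples=100, seed=0):
    """Fraction of random half-samples in which each predictor is selected.

    X is a list of rows (one list of predictor values per subject) and y the
    outcome. The selector is a stand-in for the LASSO (an assumption of this
    sketch): it keeps the q predictors most correlated with y.
    """
    rng = random.Random(seed)
    n, p = len(X), len(X[0])
    counts = [0] * p
    for _ in range(n_subsamples):
        rows = rng.sample(range(n), n // 2)  # half-sample without replacement
        y_sub = [y[i] for i in rows]
        scores = [abs_correlation([X[i][j] for i in rows], y_sub) for j in range(p)]
        selected = sorted(range(p), key=lambda j: scores[j], reverse=True)[:q]
        for j in selected:
            counts[j] += 1
    return [c / n_subsamples for c in counts]
```

A predictor would then be reported as stable when its frequency exceeds a chosen cut-off; the cut-off itself is a tuning choice and is not specified in this abstract.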

Huber KJ, Vieira S, Sikorski J, Wüst PK, Fösel BU, Gröngröft A, Overmann J. Differential Response of Acidobacteria to Water Content, Soil Type, and Land Use During an Extended Drought in African Savannah Soils. Front Microbiol 2022;13:750456. [PMID: 35222321 PMCID: PMC8874233 DOI: 10.3389/fmicb.2022.750456] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2021] [Accepted: 01/20/2022] [Indexed: 11/13/2022] Open

Abstract

Although climate change is expected to increase the extent of drylands worldwide, the effect of drought on the soil microbiome is still insufficiently understood, especially for dominant but little characterized phyla like the Acidobacteria. In the present study, the active acidobacterial communities of Namibian soils differing in type, physicochemical parameters, and land use were characterized by high-throughput sequencing. Water content, pH, major ions and nutrients were distinct for sandy soils, woodlands or dry agriculture on loamy sands. Soils were repeatedly sampled over a 2-year time period and covered consecutively a strong rainy, a dry, a normal rainy and a weak rainy season. The increasing drought had differential effects on different soils. Linear modeling of the soil water content across all sampling locations and sampling dates revealed that the accumulated precipitation of the preceding season had only a weak, but statistically significant effect, whereas woodland and irrigation exerted a strong positive effect on water content. The decrease in soil water content was accompanied by a pronounced decrease in the fraction of active Acidobacteria (from 7.9 to 0.7%) while overall bacterial community size/cell counts remained constant. Notably, the strongest decline in the relative fraction of Acidobacteria was observed after the first cycle of rainy and dry season, rather than after the weakest rainy season at the end of the observation period. Over the 2-year period, the β-diversity of soil Acidobacteria also changed. During the first year this change in composition was related to soil type (loamy sand) and land use (woodland) as explanatory variables. A total of 188 different acidobacterial sequence variants affiliated with the "Acidobacteriia," Blastocatellia, and Vicinamibacteria changed significantly in abundance, suggesting either drought sensitivity or formation of dormant cell forms. Comparative physiological testing of 15 Namibian isolates revealed species-specific and differential responses in viability during long-term continuous desiccation or drying-rewetting cycles. These different responses were not determined by phylogenetic affiliation and provide a first explanation for the effect of drought on soil Acidobacteria. In conclusion, the response of acidobacterial communities to water availability is non-linear, most likely caused by the different physiological adaptations of the different taxa present.

Zhang B, Hepp T, Greven S, Bergherr E. Adaptive step-length selection in gradient boosting for Gaussian location and scale models. Comput Stat 2022. [DOI: 10.1007/s00180-022-01199-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

Marchais A, Marques Da Costa ME, Job B, Abbas R, Drubay D, Piperno-Neumann S, Fromigué O, Gomez-Brouchet A, Françoise R, Droit R, Lervat C, Entz-Werle N, Pacquement H, Devoldere C, Cupissol D, Bodet D, Gandemer V, Berger MG, Bérard PM, Jimenez M, Vassal G, Geoerger B, Brugieres L, Gaspar N. Immune infiltrate and tumor microenvironment transcriptional programs stratify pediatric osteosarcoma into prognostic groups at diagnosis. Cancer Res 2022;82:974-985. [DOI: 10.1158/0008-5472.can-20-4189] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2020] [Revised: 07/26/2021] [Accepted: 01/18/2022] [Indexed: 11/16/2022]

Strömer A, Staerk C, Klein N, Weinhold L, Titze S, Mayr A. Deselection of base-learners for statistical boosting - with an application to distributional regression. Stat Methods Med Res 2021;31:207-224. [PMID: 34882438 DOI: 10.1177/09622802211051088] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]

Werner T. A review on instance ranking problems in statistical learning. Mach Learn 2021. [DOI: 10.1007/s10994-021-06122-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]

Predicting Physician Consultations for Low Back Pain Using Claims Data and Population-Based Cohort Data - An Interpretable Machine Learning Approach. Int J Environ Res Public Health 2021;18:ijerph182212013. [PMID: 34831773 PMCID: PMC8622753 DOI: 10.3390/ijerph182212013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/18/2021] [Revised: 10/24/2021] [Accepted: 11/12/2021] [Indexed: 11/17/2022]

Koutroulis G, Botler L, Mutlu B, Diwold K, Römer K, Kern R. KOMPOS: Connecting Causal Knots in Large Nonlinear Time Series with Non-Parametric Regression Splines. ACM Trans Intell Syst Technol 2021. [DOI: 10.1145/3480971] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]

Staerk C, Mayr A. Randomized boosting with multivariable base-learners for high-dimensional variable selection and prediction. BMC Bioinformatics 2021;22:441. [PMID: 34530737 PMCID: PMC8447543 DOI: 10.1186/s12859-021-04340-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2021] [Accepted: 08/24/2021] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

Statistical boosting is a computational approach to select and estimate interpretable prediction models for high-dimensional biomedical data, leading to implicit regularization and variable selection when combined with early stopping. Traditionally, the set of base-learners is fixed for all iterations and consists of simple regression learners including only one predictor variable at a time. Furthermore, the number of iterations is typically tuned by optimizing the predictive performance, leading to models which often include unnecessarily large numbers of noise variables.

RESULTS

We propose three consecutive extensions of classical component-wise gradient boosting. In the first extension, called Subspace Boosting (SubBoost), base-learners can consist of several variables, allowing for multivariable updates in a single iteration. To compensate for the larger flexibility, the ultimate selection of base-learners is based on information criteria leading to an automatic stopping of the algorithm. As the second extension, Random Subspace Boosting (RSubBoost) additionally includes a random preselection of base-learners in each iteration, enabling the scalability to high-dimensional data. In a third extension, called Adaptive Subspace Boosting (AdaSubBoost), an adaptive random preselection of base-learners is considered, focusing on base-learners which have proven to be predictive in previous iterations. Simulation results show that the multivariable updates in the three subspace algorithms are particularly beneficial in cases of high correlations among signal covariates. In several biomedical applications the proposed algorithms tend to yield sparser models than classical statistical boosting, while showing a very competitive predictive performance also compared to penalized regression approaches like the (relaxed) lasso and the elastic net.

CONCLUSIONS

The proposed randomized boosting approaches with multivariable base-learners are promising extensions of statistical boosting, particularly suited for highly-correlated and sparse high-dimensional settings. The incorporated selection of base-learners via information criteria induces automatic stopping of the algorithms, promoting sparser and more interpretable prediction models.

Freijeiro‐González L, Febrero‐Bande M, González‐Manteiga W. A Critical Review of LASSO and Its Derivatives for Variable Selection Under Dependence Among Covariates. Int Stat Rev 2021. [DOI: 10.1111/insr.12469] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]

Kidney Allograft Function Is a Confounder of Urine Metabolite Profiles in Kidney Allograft Recipients. Metabolites 2021;11:metabo11080533. [PMID: 34436474 PMCID: PMC8399888 DOI: 10.3390/metabo11080533] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2021] [Revised: 07/24/2021] [Accepted: 08/03/2021] [Indexed: 11/17/2022] Open

Huemer MT, Bauer A, Petrera A, Scholz M, Hauck SM, Drey M, Peters A, Thorand B. Proteomic profiling of low muscle and high fat mass: a machine learning approach in the KORA S4/FF4 study. J Cachexia Sarcopenia Muscle 2021;12:1011-1023. [PMID: 34151535 PMCID: PMC8350207 DOI: 10.1002/jcsm.12733] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/28/2021] [Revised: 05/12/2021] [Accepted: 05/21/2021] [Indexed: 12/14/2022] Open

Abstract

BACKGROUND

The coexistence of low muscle mass and high fat mass, two interrelated conditions strongly associated with declining health status, has been characterized by only a few protein biomarkers. High-throughput proteomics enable concurrent measurement of numerous proteins, facilitating the discovery of potentially new biomarkers.

METHODS

Data derived from the prospective population-based Cooperative Health Research in the Region of Augsburg S4/FF4 cohort study (median follow-up time: 13.5 years) included 1478 participants (756 men and 722 women) aged 55-74 years in the cross-sectional and 608 participants (315 men and 293 women) in the longitudinal analysis. Appendicular skeletal muscle mass (ASMM) and body fat mass index (BFMI) were determined through bioelectrical impedance analysis at baseline and follow-up. At baseline, 233 plasma proteins were measured using proximity extension assay. We implemented boosting with stability selection to enable false positives-controlled variable selection to identify new protein biomarkers of low muscle mass, high fat mass, and their combination. We evaluated prediction models developed based on group least absolute shrinkage and selection operator (lasso) with 100× bootstrapping by cross-validated area under the curve (AUC) to investigate if proteins increase the prediction accuracy on top of classical risk factors.

RESULTS

In the cross-sectional analysis, we identified kallikrein-6, C-C motif chemokine 28 (CCL28), and tissue factor pathway inhibitor as previously unknown biomarkers for muscle mass [association with low ASMM: odds ratio (OR) per 1-SD increase in log2 normalized protein expression values (95% confidence interval (CI)): 1.63 (1.37-1.95), 1.31 (1.14-1.51), 1.24 (1.06-1.45), respectively] and serine protease 27 for fat mass [association with high BFMI: OR (95% CI): 0.73 (0.61-0.86)]. CCL28 and metalloproteinase inhibitor 4 (TIMP4) constituted new biomarkers for the combination of low muscle and high fat mass [association with low ASMM combined with high BFMI: OR (95% CI): 1.32 (1.08-1.61), 1.28 (1.03-1.59), respectively]. Including protein biomarkers selected in ≥90% of group lasso bootstrap iterations on top of classical risk factors improved the performance of models predicting low ASMM, high BFMI, and their combination [delta AUC (95% CI): 0.16 (0.13-0.20), 0.22 (0.18-0.25), 0.12 (0.08-0.17), respectively]. In the longitudinal analysis, N-terminal prohormone brain natriuretic peptide (NT-proBNP) was the only protein selected for loss in ASMM and loss in ASMM combined with gain in BFMI over 14 years [OR (95% CI): 1.40 (1.10-1.77), 1.60 (1.15-2.24), respectively].

CONCLUSIONS

Proteomic profiling revealed CCL28 and TIMP4 as new biomarkers of low muscle mass combined with high fat mass and NT-proBNP as a key biomarker of loss in muscle mass combined with gain in fat mass. Proteomics enable us to accelerate biomarker discoveries in muscle research.

Soerensen M, Debrabant B, Halekoh U, Møller JE, Hassager C, Frydland M, Hjelmborg J, Beck HC, Rasmussen LM. Does diabetes modify the effect of heparin on plasma proteins? - A proteomic search for plasma protein biomarkers for diabetes-related endothelial dysfunction. J Diabetes Complications 2021;35:107906. [PMID: 33785251 DOI: 10.1016/j.jdiacomp.2021.107906] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/11/2020] [Revised: 02/11/2021] [Accepted: 03/07/2021] [Indexed: 11/23/2022]

Affiliation(s)

Mette Soerensen Epidemiology, Biostatistics and Biodemography, Department of Public Health, University of Southern Denmark, J.B. Winsløws Vej 9B, 5000 Odense C, Denmark; Center for Individualized Medicine in Arterial Diseases, Department of Clinical Biochemistry and Pharmacology, Odense University Hospital, J.B. Winsløws Vej 4, 5000 Odense C, Denmark; Department of Clinical Genetics, Odense University Hospital, J.B. Winsløws Vej 4, 5000 Odense C, Denmark.
Birgit Debrabant Epidemiology, Biostatistics and Biodemography, Department of Public Health, University of Southern Denmark, J.B. Winsløws Vej 9B, 5000 Odense C, Denmark.
Ulrich Halekoh Epidemiology, Biostatistics and Biodemography, Department of Public Health, University of Southern Denmark, J.B. Winsløws Vej 9B, 5000 Odense C, Denmark.
Jacob Eifer Møller Department of Clinical Cardiology, Odense University Hospital, J.B. Winsløws Vej 4, 5000 Odense C, Denmark; Department of Cardiology, Rigshospitalet, Blegdamsvej 9, 2100 Copenhagen Ø, Denmark; Department of Clinical Medicine, University of Copenhagen, Blegdamsvej 3B, 2200 Copenhagen N, Denmark.
Christian Hassager Department of Cardiology, Rigshospitalet, Blegdamsvej 9, 2100 Copenhagen Ø, Denmark; Department of Clinical Medicine, University of Copenhagen, Blegdamsvej 3B, 2200 Copenhagen N, Denmark.
Martin Frydland Department of Cardiology, Rigshospitalet, Blegdamsvej 9, 2100 Copenhagen Ø, Denmark; Department of Clinical Medicine, University of Copenhagen, Blegdamsvej 3B, 2200 Copenhagen N, Denmark.
Jacob Hjelmborg Epidemiology, Biostatistics and Biodemography, Department of Public Health, University of Southern Denmark, J.B. Winsløws Vej 9B, 5000 Odense C, Denmark.
Hans Christian Beck Center for Individualized Medicine in Arterial Diseases, Department of Clinical Biochemistry and Pharmacology, Odense University Hospital, J.B. Winsløws Vej 4, 5000 Odense C, Denmark.
Lars Melholt Rasmussen Center for Individualized Medicine in Arterial Diseases, Department of Clinical Biochemistry and Pharmacology, Odense University Hospital, J.B. Winsløws Vej 4, 5000 Odense C, Denmark.

Tozzo V, Azencott CA, Fiorini S, Fava E, Trucco A, Barla A. Where Do We Stand in Regularization for Life Science Studies? J Comput Biol 2021;29:213-232. [PMID: 33926217 PMCID: PMC8968832 DOI: 10.1089/cmb.2019.0371] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022] Open

Seifer DB, Petok WD, Agrawal A, Glenn TL, Bayer AH, Witt BR, Burgin BD, Lieman HJ. Psychological experience and coping strategies of patients in the Northeast US delaying care for infertility during the COVID-19 pandemic. Reprod Biol Endocrinol 2021;19:28. [PMID: 33618732 PMCID: PMC7899935 DOI: 10.1186/s12958-021-00721-4] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/14/2021] [Accepted: 02/17/2021] [Indexed: 12/21/2022] Open

Abstract

BACKGROUND

On March 17, 2020, an expert ASRM task force recommended the temporary suspension of new, non-urgent fertility treatments during an ongoing world-wide pandemic of COVID-19. At the time of resumption of fertility care, we surveyed the psychological experience and coping strategies of patients pausing their care due to COVID-19 and examined which factors were associated with and predictive of resilience, anxiety, stress and hopefulness.

METHODS

Cross-sectional cohort patient survey using an anonymous, self-reported, single time, web-based, HIPAA-compliant platform (REDCap). Survey sampled two Northeast academic fertility practices (Yale Medicine Fertility Center in CT and Montefiore's Institute for Reproductive Medicine and Health in NY). Data from multiple choice and open response questions collected demographic, reproductive history, experience and attitudes about COVID-19, prior infertility treatment, sense of hopefulness and stress, coping strategies for mitigating stress and two validated psychological surveys to assess anxiety (six-item short-form State Trait Anxiety Inventory (STAI-6)) and resilience (10-item Connor-Davidson Resilience Scale (CD-RISC-10)).

RESULTS

Seven hundred thirty-four patients were sent invitations to participate. Two hundred fourteen of 734 (29.2%) completed the survey. Patients reported their fertility journey had been delayed a mean of 10 weeks while 60% had been actively trying to conceive > 1.5 years. The top 5 ranked coping skills from a choice of 19 were establishing a daily routine, going outside regularly, exercising, maintaining social connection via phone, social media or Zoom and continuing to work. Having a history of anxiety (p < 0.0001) and having received oral medication as prior infertility treatment (p < 0.0001) were associated with lower resilience. Increased hopefulness about having a child at the time of completing the survey (p < 0.0001) and higher resilience scores (p < 0.0001) were associated with decreased anxiety. Higher reported stress scores (p < 0.0001) were associated with increased anxiety. Multiple multivariate regression showed being non-Hispanic black (p = 0.035) to be predictive of more resilience while variables predictive of less resilience were being a full-time homemaker (p = 0.03), having received oral medication as prior infertility treatment (p = 0.003) and having higher scores on the STAI-6 (p < 0.0001).

CONCLUSIONS

Prior to and in anticipation of further pauses in treatment, the clinical staff should consider pretreatment screening for psychological distress and provide referral sources. In addition, a patient-centered approach to care should be employed.

Halama A, Oliveira JM, Filho SA, Qasim M, Achkar IW, Johnson S, Suhre K, Vinardell T. Metabolic Predictors of Equine Performance in Endurance Racing. Metabolites 2021;11:metabo11020082. [PMID: 33572513 PMCID: PMC7912089 DOI: 10.3390/metabo11020082] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2020] [Revised: 01/23/2021] [Accepted: 01/26/2021] [Indexed: 11/16/2022] Open

Lima E, Hyde R, Green M. Model selection for inferential models with high dimensional data: synthesis and graphical representation of multiple techniques. Sci Rep 2021;11:412. [PMID: 33431921 PMCID: PMC7801732 DOI: 10.1038/s41598-020-79317-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2020] [Accepted: 12/07/2020] [Indexed: 12/18/2022] Open

Scelsi MA, Napolioni V, Greicius MD, Altmann A. Network propagation of rare variants in Alzheimer's disease reveals tissue-specific hub genes and communities. PLoS Comput Biol 2021;17:e1008517. [PMID: 33411734 PMCID: PMC7817020 DOI: 10.1371/journal.pcbi.1008517] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2020] [Revised: 01/20/2021] [Accepted: 11/10/2020] [Indexed: 11/18/2022] Open

Abstract

State-of-the-art rare variant association testing methods aggregate the contribution of rare variants in biologically relevant genomic regions to boost statistical power. However, testing single genes separately does not consider the complex interaction landscape of genes, nor the downstream effects of non-synonymous variants on protein structure and function. Here we present the NETwork Propagation-based Assessment of Genetic Events (NETPAGE), an integrative approach aimed at investigating the biological pathways through which rare variation results in complex disease phenotypes. We applied NETPAGE to sporadic, late-onset Alzheimer's disease (AD), using whole-genome sequencing from the AD Neuroimaging Initiative (ADNI) cohort, as well as whole-exome sequencing from the AD Sequencing Project (ADSP). NETPAGE is based on network propagation, a framework that models information flow on a graph and simulates the percolation of genetic variation through tissue-specific gene interaction networks. The result of network propagation is a set of smoothed gene scores that can be tested for association with disease status through sparse regression. The application of NETPAGE to AD enabled the identification of a set of connected genes whose smoothed variation profile was robustly associated with case-control status, based on gene interactions in the hippocampus. Additionally, smoothed scores significantly correlated with risk of conversion to AD in Mild Cognitive Impairment (MCI) subjects. Lastly, we investigated tissue-specific transcriptional dysregulation of the core genes in two independent RNA-seq datasets, as well as significant enrichments in terms of gene sets with known connections to AD. We present a framework that enables enhanced genetic association testing for a wide range of traits, diseases, and sample sizes.
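Network propagation, as described in this abstract, can be illustrated with a generic random walk with restart: each gene repeatedly passes its score to its neighbours while a fraction of the original per-gene score is re-injected at every step, which yields smoothed scores. The sketch below is only an illustration under assumptions. The abstract does not state NETPAGE's normalisation, restart weight, or stopping rule, so the function name `propagate`, the degree normalisation, and the defaults `alpha`, `tol` and `max_iter` are all hypothetical.

```python
def propagate(adjacency, seed_scores, alpha=0.5, tol=1e-9, max_iter=1000):
    """Smooth seed scores over an undirected graph (random walk with restart).

    adjacency maps each node to the set of its neighbours; seed_scores maps
    nodes to their initial score (missing nodes start at 0). alpha is the
    share of the score that flows along edges; 1 - alpha is re-injected
    from the seeds at every iteration.
    """
    nodes = sorted(adjacency)
    degree = {n: len(adjacency[n]) for n in nodes}
    initial = {n: float(seed_scores.get(n, 0.0)) for n in nodes}
    scores = dict(initial)
    for _ in range(max_iter):
        updated = {}
        for n in nodes:
            # Each neighbour m hands over an equal share of its current score.
            incoming = sum(scores[m] / degree[m] for m in adjacency[n])
            updated[n] = alpha * incoming + (1 - alpha) * initial[n]
        change = max(abs(updated[n] - scores[n]) for n in nodes)
        scores = updated
        if change < tol:
            break
    return scores
```

On a four-node path A-B-C-D with all of the initial score placed on A, the smoothed score decays with distance from A, which is the behaviour the abstract calls percolation through the network.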

Borchert C, Herman A, Roth M, Brooks AC, Friedenberg SG. RNA sequencing of whole blood in dogs with primary immune-mediated hemolytic anemia (IMHA) reveals novel insights into disease pathogenesis. PLoS One 2020;15:e0240975. [PMID: 33091028 PMCID: PMC7580939 DOI: 10.1371/journal.pone.0240975] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2020] [Accepted: 10/06/2020] [Indexed: 11/29/2022] Open

Adde A, Darveau M, Barker N, Cumming S. Predicting spatiotemporal abundance of breeding waterfowl across Canada: A Bayesian hierarchical modelling approach. Divers Distrib 2020. [DOI: 10.1111/ddi.13129] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022] Open

Cohen AS, Cox CR, Le TP, Cowan T, Masucci MD, Strauss GP, Kirkpatrick B. Using machine learning of computerized vocal expression to measure blunted vocal affect and alogia. NPJ Schizophr 2020;6:26. [PMID: 32978400 PMCID: PMC7519104 DOI: 10.1038/s41537-020-00115-2] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/06/2020] [Accepted: 08/06/2020] [Indexed: 11/16/2022]

Teitsdottir UD, Jonsdottir MK, Lund SH, Darreh-Shori T, Snaedal J, Petersen PH. Association of glial and neuronal degeneration markers with Alzheimer's disease cerebrospinal fluid profile and cognitive functions. Alzheimers Res Ther 2020;12:92. [PMID: 32753068 PMCID: PMC7404927 DOI: 10.1186/s13195-020-00657-8] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/26/2019] [Accepted: 07/21/2020] [Indexed: 01/15/2023]

Abstract

BACKGROUND

Neuroinflammation has gained increasing attention as a potential contributing factor in the onset and progression of Alzheimer's disease (AD). The objective of this study was to examine the association of selected cerebrospinal fluid (CSF) inflammatory and neuronal degeneration markers with signature CSF AD profile and cognitive functions among subjects at the symptomatic pre- and early dementia stages.

METHODS

In this cross-sectional study, 52 subjects were selected from an Icelandic memory clinic cohort. Subjects were classified as having AD (n = 28, age = 70, 39% female, Mini-Mental State Examination [MMSE] = 27) or non-AD (n = 24, age = 67, 33% female, MMSE = 28) profile based on the ratio between CSF total-tau (T-tau) and amyloid-β₁₋₄₂ (Aβ₄₂) values (cut-off point chosen as 0.52). Novel CSF biomarkers included neurofilament light (NFL), YKL-40, S100 calcium-binding protein B (S100B) and glial fibrillary acidic protein (GFAP), measured with enzyme-linked immunosorbent assays (ELISAs). Subjects underwent neuropsychological assessment for evaluation of different cognitive domains, including verbal episodic memory, non-verbal episodic memory, language, processing speed, and executive functions.

RESULTS

An accuracy coefficient for distinguishing between the two CSF profiles was calculated for each CSF marker and test. Novel CSF markers performed poorly (area under curve [AUC] coefficients ranging from 0.61 to 0.64) compared to tests reflecting verbal episodic memory, which all performed fairly (AUC > 0.70). LASSO regression with a stability approach was applied for the selection of CSF markers and demographic variables predicting performance on each cognitive domain, both among all subjects and only those with a CSF AD profile. Relationships between CSF markers and cognitive domains, where the CSF marker reached stability selection criteria of > 75%, were visualized with scatter plots. Before calculations of corresponding Pearson's correlation coefficients, composite scores for cognitive domains were adjusted for age and education. GFAP correlated with executive functions (r = −0.37, p = 0.01) overall, while GFAP correlated with processing speed (r = −0.68, p < 0.001) and NFL with verbal episodic memory (r = −0.43, p = 0.02) among subjects with a CSF AD profile.

CONCLUSIONS

The novel CSF markers NFL and GFAP show potential as markers for cognitive decline among individuals with core AD pathology at the symptomatic pre- and early stages of dementia.

Rotival M, Siddle KJ, Silvert M, Pothlichet J, Quach H, Quintana-Murci L. Population variation in miRNAs and isomiRs and their impact on human immunity to infection. Genome Biol 2020;21:187. [PMID: 32731901 PMCID: PMC7391576 DOI: 10.1186/s13059-020-02098-w] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2020] [Accepted: 07/08/2020] [Indexed: 12/19/2022] Open

Ploner T, Heß S, Grum M, Drewe-Boss P, Walker J. Using gradient boosting with stability selection on health insurance claims data to identify disease trajectories in chronic obstructive pulmonary disease. Stat Methods Med Res 2020;29:3684-3694. [PMID: 32646307 DOI: 10.1177/0962280220938088] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]

Guinot F, Szafranski M, Chiquet J, Zancarini A, Le Signor C, Mougel C, Ambroise C. Fast computation of genome-metagenome interaction effects. Algorithms Mol Biol 2020;15:13. [PMID: 32625242 PMCID: PMC7329492 DOI: 10.1186/s13015-020-00173-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2019] [Accepted: 06/17/2020] [Indexed: 01/01/2023] Open

Abstract

Motivation

Association studies have been widely used to search for associations between common genetic variants and a given phenotype. However, it is now generally accepted that genes and environment must be examined jointly when estimating phenotypic variance. In this work we consider two types of biological markers: genotypic markers, which characterize an observation in terms of inherited genetic information, and metagenomic markers, which are related to the environment. Both types of markers are available in their millions and can be used to characterize any observation uniquely.

Objective

Our focus is on detecting interactions between groups of genetic and metagenomic markers in order to gain a better understanding of the complex relationship between environment and genome in the expression of a given phenotype.

Contributions

We propose a novel approach for efficiently detecting interactions between complementary datasets in a high-dimensional setting with a reduced computational cost. The method, named SICOMORE, reduces the dimension of the search space by selecting a subset of supervariables in the two complementary datasets. These supervariables are given by a weighted group structure defined on sets of variables at different scales. A Lasso selection is then applied on each type of supervariable to obtain a subset of potential interactions that will be explored via linear model testing.

Results

We compare SICOMORE with other approaches in simulations with varying sample sizes, noise levels, and numbers of true interactions. SICOMORE exhibits convincing results in terms of recall, as well as competitive performance with respect to running time. The method is also used to detect interactions between genomic markers in Medicago truncatula and metagenomic markers in its rhizosphere bacterial community.

Software availability

An R package is available [4], along with its documentation and associated scripts, allowing the reader to reproduce the results presented in the paper.

Richter-Heitmann T, Hofner B, Krah FS, Sikorski J, Wüst PK, Bunk B, Huang S, Regan KM, Berner D, Boeddinghaus RS, Marhan S, Prati D, Kandeler E, Overmann J, Friedrich MW. Stochastic Dispersal Rather Than Deterministic Selection Explains the Spatio-Temporal Distribution of Soil Bacteria in a Temperate Grassland. Front Microbiol 2020;11:1391. [PMID: 32695081 PMCID: PMC7338559 DOI: 10.3389/fmicb.2020.01391] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Received: 02/03/2020] [Accepted: 05/29/2020] [Indexed: 01/15/2023] Open

Abstract

Spatial and temporal processes shaping microbial communities are inseparably linked but rarely studied together. By Illumina 16S rRNA sequencing, we monitored soil bacteria in 360 stations on a 100 square meter plot distributed across six intra-annual samplings in a rarely managed, temperate grassland. Using a multi-tiered approach, we tested the extent to which stochastic or deterministic processes influenced the composition of local communities. A combination of phylogenetic turnover analysis and null modeling demonstrated that either homogenization by unlimited stochastic dispersal or scenarios, in which neither stochastic processes nor deterministic forces dominated, explained local assembly processes. Thus, the majority of all sampled communities (82%) was rather homogeneous with no significant changes in abundance-weighted composition. However, we detected strong and uniform taxonomic shifts within just nine samples in early summer. Thus, community snapshots sampled from single points in time or space do not necessarily reflect a representative community state. The potential for change despite the overall homogeneity was further demonstrated when the focus shifted to the rare biosphere. Rare OTU turnover, rather than nestedness, characterized abundance-independent β-diversity. Accordingly, boosted generalized additive models encompassing spatial, temporal and environmental variables revealed strong and highly diverse effects of space on OTU abundance, even within the same genus. This pure spatial effect increased with decreasing OTU abundance and frequency, whereas soil moisture – the most important environmental variable – had an opposite effect by impacting abundant OTUs more than the rare ones. These results indicate that – despite considerable oscillation in space and time – the abundant and resident OTUs provide a community backbone that supports much higher β-diversity of a dynamic rare biosphere. 
Our findings reveal complex interactions among space, time, and environmental filters within bacterial communities in a long-established temperate grassland.

Affiliation(s)

Tim Richter-Heitmann Microbial Ecophysiology Group, Faculty of Biology/Chemistry, University of Bremen, Bremen, Germany; International Max Planck Research School of Marine Microbiology, Max Planck Institute for Marine Microbiology, Bremen, Germany
Benjamin Hofner Institut für Medizininformatik, Biometrie und Epidemiologie, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany
Franz-Sebastian Krah Biodiversity Conservation, Institute for Ecology, Evolution and Diversity, Biologicum, Goethe University Frankfurt, Frankfurt am Main, Germany
Johannes Sikorski Leibniz Institute DSMZ-German Collection of Microorganisms and Cell Cultures, Braunschweig, Germany
Pia K Wüst Leibniz Institute DSMZ-German Collection of Microorganisms and Cell Cultures, Braunschweig, Germany
Boyke Bunk Leibniz Institute DSMZ-German Collection of Microorganisms and Cell Cultures, Braunschweig, Germany
Sixing Huang Leibniz Institute DSMZ-German Collection of Microorganisms and Cell Cultures, Braunschweig, Germany
Kathleen M Regan Institute of Soil Science and Land Evaluation, Soil Biology Department, University of Hohenheim, Stuttgart, Germany
Doreen Berner Institute of Soil Science and Land Evaluation, Soil Biology Department, University of Hohenheim, Stuttgart, Germany
Runa S Boeddinghaus Institute of Soil Science and Land Evaluation, Soil Biology Department, University of Hohenheim, Stuttgart, Germany
Sven Marhan Institute of Soil Science and Land Evaluation, Soil Biology Department, University of Hohenheim, Stuttgart, Germany
Daniel Prati Institute of Plant Sciences, University of Bern, Bern, Switzerland
Ellen Kandeler Institute of Soil Science and Land Evaluation, Soil Biology Department, University of Hohenheim, Stuttgart, Germany
Jörg Overmann Leibniz Institute DSMZ-German Collection of Microorganisms and Cell Cultures, Braunschweig, Germany
Michael W Friedrich Microbial Ecophysiology Group, Faculty of Biology/Chemistry, University of Bremen, Bremen, Germany

Mirończuk MM, Protasiewicz J. Recognising innovative companies by using a diversified stacked generalisation method for website classification. Appl Intell 2020. [DOI: 10.1007/s10489-019-01509-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 10/26/2022]

Cascio L, Chen CF, Pauly R, Srikanth S, Jones K, Skinner CD, Stevenson RE, Schwartz CE, Boccuto L. Abnormalities in the genes that encode Large Amino Acid Transporters increase the risk of Autism Spectrum Disorder. Mol Genet Genomic Med 2019;8:e1036. [PMID: 31701662 PMCID: PMC6978257 DOI: 10.1002/mgg3.1036] [Citation(s) in RCA: 36] [Impact Index Per Article: 7.2] [Received: 07/16/2019] [Revised: 10/08/2019] [Accepted: 10/16/2019] [Indexed: 12/12/2022] Open

Abstract

Background

Autism spectrum disorder (ASD) is a common neurodevelopmental disorder whose molecular mechanisms are largely unknown. Several studies have shown an association between ASD and abnormalities in the metabolism of amino acids, specifically tryptophan and branched‐chain amino acids (BCAAs).

Methods

Ninety‐seven patients with ASD were screened by Sanger sequencing the genes encoding the heavy (SLC3A2) and light subunits (SLC7A5 and SLC7A8) of the large amino acid transporters (LAT) 1 and 2. LAT1 and 2 are responsible for the transportation of tryptophan and BCAA across the blood–brain barrier and are expressed both in blood and brain. Functional studies were performed employing the Biolog Phenotype Microarray Mammalian (PM‐M) technology to investigate the metabolic profiling in lymphoblastoid cell lines from 43 patients with ASD and 50 controls with particular focus on the amino acid substrates of LATs.

Results

We detected nine likely pathogenic variants in 11 of 97 patients (11.3%): three in SLC3A2, three in SLC7A5, and three in SLC7A8. Six variants of unknown significance were detected in eight patients, two of whom also carried a likely pathogenic variant.

The functional studies showed a consistently reduced utilization of tryptophan, accompanied by evidence of reduced utilization of other large aromatic amino acids (LAAs), either alone or as part of a dipeptide.

Conclusion

Coding variants in the LAT genes were detected in 17 of 97 patients with ASD (17.5%). Metabolic assays indicate that such abnormalities affect the utilization of certain amino acids, particularly tryptophan and other LAAs, with potential consequences for their transport across the blood–brain barrier and their availability during brain development. Therefore, abnormalities in the LAT1 and LAT2 transporters are likely associated with an increased risk of developing ASD.

Zwanenburg A. Radiomics in nuclear medicine: robustness, reproducibility, standardization, and how to avoid data analysis traps and replication crisis. Eur J Nucl Med Mol Imaging 2019;46:2638-2655. [PMID: 31240330 DOI: 10.1007/s00259-019-04391-8] [Citation(s) in RCA: 169] [Impact Index Per Article: 33.8] [Received: 05/31/2019] [Accepted: 06/04/2019] [Indexed: 12/16/2022]

Zhang C, Wu Y, Zhu M. Pruning variable selection ensembles. Stat Anal Data Min 2019. [DOI: 10.1002/sam.11410] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Indexed: 11/09/2022]

Sauer DG, Melcher M, Mosor M, Walch N, Berkemeyer M, Scharl-Hirsch T, Leisch F, Jungbauer A, Dürauer A. Real-time monitoring and model-based prediction of purity and quantity during a chromatographic capture of fibroblast growth factor 2. Biotechnol Bioeng 2019;116:1999-2009. [PMID: 30934111 PMCID: PMC6618329 DOI: 10.1002/bit.26984] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Received: 10/24/2018] [Revised: 03/15/2019] [Accepted: 03/28/2019] [Indexed: 12/14/2022]

Hepp T, Schmid M, Mayr A. Significance Tests for Boosted Location and Scale Models with Linear Base-Learners. Int J Biostat 2019;15. [PMID: 30990787 DOI: 10.1515/ijb-2018-0110] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Received: 10/25/2018] [Accepted: 03/21/2019] [Indexed: 11/15/2022]

Smith A, Hofner B, Lamb JS, Osenkowski J, Allison T, Sadoti G, McWilliams SR, Paton P. Modeling spatiotemporal abundance of mobile wildlife in highly variable environments using boosted GAMLSS hurdle models. Ecol Evol 2019;9:2346-2364. [PMID: 30891185 PMCID: PMC6405508 DOI: 10.1002/ece3.4738] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Received: 05/31/2018] [Revised: 10/11/2018] [Accepted: 10/30/2018] [Indexed: 11/07/2022] Open

Abstract

Modeling organism distributions from survey data involves numerous statistical challenges, including accounting for zero-inflation, overdispersion, and selection and incorporation of environmental covariates. In environments with high spatial and temporal variability, addressing these challenges often requires numerous assumptions regarding organism distributions and their relationships to biophysical features. These assumptions may limit the resolution or accuracy of predictions resulting from survey-based distribution models. We propose an iterative modeling approach that incorporates a negative binomial hurdle, followed by modeling of the relationship of organism distribution and abundance to environmental covariates using generalized additive models (GAM) and generalized additive models for location, scale, and shape (GAMLSS). Our approach accounts for key features of survey data by separating binary (presence-absence) from count (abundance) data, separately modeling the mean and dispersion of count data, and incorporating selection of appropriate covariates and response functions from a suite of potential covariates while avoiding overfitting. We apply our modeling approach to surveys of sea duck abundance and distribution in Nantucket Sound (Massachusetts, USA), which has been proposed as a location for offshore wind energy development. Our model results highlight the importance of spatiotemporal variation in this system, as well as identifying key habitat features including distance to shore, sediment grain size, and seafloor topographic variation. Our work provides a powerful, flexible, and highly repeatable modeling framework with minimal assumptions that can be broadly applied to the modeling of survey data with high spatiotemporal variability. Applying GAMLSS models to the count portion of survey data allows us to incorporate potential overdispersion, which can dramatically affect model results in highly dynamic systems. 
Our approach is particularly relevant to systems in which little a priori knowledge is available regarding relationships between organism distributions and biophysical features, since it incorporates simultaneous selection of covariates and their functional relationships with organism responses.
