Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Cremona MA, Xu H, Makova KD, Reimherr M, Chiaromonte F, Madrigal P. Functional data analysis for computational biology. Bioinformatics 2019;35:3211-3213. [PMID: 30668667 DOI: 10.1093/bioinformatics/btz045] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2018] [Revised: 01/01/2019] [Accepted: 01/17/2019] [Indexed: 12/25/2022] Open

For:	Cremona MA, Xu H, Makova KD, Reimherr M, Chiaromonte F, Madrigal P. Functional data analysis for computational biology. Bioinformatics 2019;35:3211-3213. [PMID: 30668667 DOI: 10.1093/bioinformatics/btz045] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2018] [Revised: 01/01/2019] [Accepted: 01/17/2019] [Indexed: 12/25/2022] Open

Number

Cited by Other Article(s)

Ribeiro M, Azevedo L, Santos AP, Pinto Leite P, Pereira MJ. Understanding spatiotemporal patterns of COVID-19 incidence in Portugal: A functional data analysis from August 2020 to March 2022. PLoS One 2024;19:e0297772. [PMID: 38300912 PMCID: PMC10833534 DOI: 10.1371/journal.pone.0297772] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2023] [Accepted: 01/12/2024] [Indexed: 02/03/2024] Open

Abstract

During the SARS-CoV-2 pandemic, governments and public health authorities collected massive amounts of data on daily confirmed positive cases and incidence rates. These data sets provide relevant information to develop a scientific understanding of the pandemic's spatiotemporal dynamics. At the same time, there is a lack of comprehensive approaches to describe and classify patterns underlying the dynamics of COVID-19 incidence across regions over time. This seriously constrains the potential benefits for public health authorities to understand spatiotemporal patterns of disease incidence that would allow for better risk communication strategies and improved assessment of mitigation policies efficacy. Within this context, we propose an exploratory statistical tool that combines functional data analysis with unsupervised learning algorithms to extract meaningful information about the main spatiotemporal patterns underlying COVID-19 incidence on mainland Portugal. We focus on the timeframe spanning from August 2020 to March 2022, considering data at the municipality level. First, we describe the temporal evolution of confirmed daily COVID-19 cases by municipality as a function of time, and outline the main temporal patterns of variability using a functional principal component analysis. Then, municipalities are classified according to their spatiotemporal similarities through hierarchical clustering adapted to spatially correlated functional data. Our findings reveal disparities in disease dynamics between northern and coastal municipalities versus those in the southern and hinterland. We also distinguish effects occurring during the 2020-2021 period from those in the 2021-2022 autumn-winter seasons. The results provide proof-of-concept that the proposed approach can be used to detect the main spatiotemporal patterns of disease incidence. The novel approach expands and enhances existing exploratory tools for spatiotemporal analysis of public health data.

Collapse

Cheng JH, Zheng C, Yamada R, Okada D. Visualization of the landscape of the read alignment shape of ATAC-seq data using Hellinger distance metric. Genes Cells 2024;29:5-16. [PMID: 37989133 DOI: 10.1111/gtc.13082] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2023] [Revised: 10/25/2023] [Accepted: 10/28/2023] [Indexed: 11/23/2023]

Madrigal P, Deng S, Feng Y, Militi S, Goh KJ, Nibhani R, Grandy R, Osnato A, Ortmann D, Brown S, Pauklin S. Epigenetic and transcriptional regulations prime cell fate before division during human pluripotent stem cell differentiation. Nat Commun 2023;14:405. [PMID: 36697417 PMCID: PMC9876972 DOI: 10.1038/s41467-023-36116-9] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2022] [Accepted: 01/17/2023] [Indexed: 01/26/2023] Open

Craig SJ, Kenney AM, Lin J, Paul IM, Birch LL, Savage JS, Marini ME, Chiaromonte F, Reimherr ML, Makova KD. Constructing a polygenic risk score for childhood obesity using functional data analysis. ECONOMETRICS AND STATISTICS 2023;25:66-86. [PMID: 36620476 PMCID: PMC9813976 DOI: 10.1016/j.ecosta.2021.10.014] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/17/2023]

Abstract

Obesity is a highly heritable condition that affects increasing numbers of adults and, concerningly, of children. However, only a small fraction of its heritability has been attributed to specific genetic variants. These variants are traditionally ascertained from genome-wide association studies (GWAS), which utilize samples with tens or hundreds of thousands of individuals for whom a single summary measurement (e.g., BMI) is collected. An alternative approach is to focus on a smaller, more deeply characterized sample in conjunction with advanced statistical models that leverage longitudinal phenotypes. Novel functional data analysis (FDA) techniques are used to capitalize on longitudinal growth information from a cohort of children between birth and three years of age. In an ultra-high dimensional setting, hundreds of thousands of single nucleotide polymorphisms (SNPs) are screened, and selected SNPs are used to construct two polygenic risk scores (PRS) for childhood obesity using a weighting approach that incorporates the dynamic and joint nature of SNP effects. These scores are significantly higher in children with (vs. without) rapid infant weight gain-a predictor of obesity later in life. Using two independent cohorts, it is shown that the genetic variants identified in very young children are also informative in older children and in adults, consistent with early childhood obesity being predictive of obesity later in life. In contrast, PRSs based on SNPs identified by adult obesity GWAS are not predictive of weight gain in the cohort of young children. This provides an example of a successful application of FDA to GWAS. This application is complemented with simulations establishing that a deeply characterized sample can be just as, if not more, effective than a comparable study with a cross-sectional response. Overall, it is demonstrated that a deep, statistically sophisticated characterization of a longitudinal phenotype can provide increased statistical power to studies with relatively small sample sizes; and shows how FDA approaches can be used as an alternative to the traditional GWAS.

Collapse

He N, Wang W, Fang C, Tan Y, Li L, Hou C. Integration of Count Difference and Curve Similarity in Negative Regulatory Element Detection. Front Genet 2022;13:818344. [PMID: 35251128 PMCID: PMC8896116 DOI: 10.3389/fgene.2022.818344] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2021] [Accepted: 01/20/2022] [Indexed: 12/05/2022] Open

Boschi T, Di Iorio J, Testa L, Cremona MA, Chiaromonte F. Functional data analysis characterizes the shapes of the first COVID-19 epidemic wave in Italy. Sci Rep 2021;11:17054. [PMID: 34462450 PMCID: PMC8405612 DOI: 10.1038/s41598-021-95866-y] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2020] [Accepted: 07/27/2021] [Indexed: 12/11/2022] Open

Chen D, Cremona MA, Qi Z, Mitra RD, Chiaromonte F, Makova KD. Human L1 Transposition Dynamics Unraveled with Functional Data Analysis. Mol Biol Evol 2021;37:3576-3600. [PMID: 32722770 DOI: 10.1093/molbev/msaa194] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open

Guiblet WM, Cremona MA, Harris RS, Chen D, Eckert KA, Chiaromonte F, Huang YF, Makova KD. Non-B DNA: a major contributor to small- and large-scale variation in nucleotide substitution frequencies across the genome. Nucleic Acids Res 2021;49:1497-1516. [PMID: 33450015 PMCID: PMC7897504 DOI: 10.1093/nar/gkaa1269] [Citation(s) in RCA: 58] [Impact Index Per Article: 19.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2020] [Revised: 12/14/2020] [Accepted: 01/11/2021] [Indexed: 12/12/2022] Open

Mughal MR, Koch H, Huang J, Chiaromonte F, DeGiorgio M. Learning the properties of adaptive regions with functional data analysis. PLoS Genet 2020;16:e1008896. [PMID: 32853200 PMCID: PMC7480868 DOI: 10.1371/journal.pgen.1008896] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2019] [Revised: 09/09/2020] [Accepted: 05/29/2020] [Indexed: 12/12/2022] Open