1
|
Libeskind-Hadas R. Pairwise Distances and the Problem of Multiple Optima. J Comput Biol 2024; 31:638-650. [PMID: 38985743 DOI: 10.1089/cmb.2023.0382] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/12/2024] Open
Abstract
Discrete optimization problems arise in many biological contexts and, in many cases, we seek to make inferences from the optimal solutions. However, the number of optimal solutions is frequently very large and making inferences from any single solution may result in conclusions that are not supported by other optimal solutions. We describe a general approach for efficiently (polynomial time) and exactly (without sampling) computing statistics on the space of optimal solutions. These statistics provide insights into the space of optimal solutions that can be used to support the use of a single optimum (e.g., when the optimal solutions are similar) or justify the need for selecting multiple optima (e.g., when the solution space is large and diverse) from which to make inferences. We demonstrate this approach on two well-known problems and identify the properties of these problems that make them amenable to this method.
Collapse
Affiliation(s)
- Ran Libeskind-Hadas
- Kravis Department of Integrated Sciences, Claremont McKenna College, Claremont, California, USA
| |
Collapse
|
2
|
Hurley F, Heitsch C. RNAprofiling 2.0: Enhanced Cluster Analysis of Structural Ensembles. J Mol Biol 2023; 435:168047. [PMID: 36933824 DOI: 10.1016/j.jmb.2023.168047] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2022] [Revised: 03/05/2023] [Accepted: 03/09/2023] [Indexed: 03/18/2023]
Abstract
Understanding the base pairing of an RNA sequence provides insight into its molecular structure. By mining suboptimal sampling data, RNAprofiling 1.0 identifies the dominant helices in low-energy secondary structures as features, organizes them into profiles which partition the Boltzmann sample, and highlights key similarities/differences among the most informative, i.e. selected, profiles in a graphical format. Version 2.0 enhances every step of this approach. First, the featured substructures are expanded from helices to stems. Second, profile selection includes low-frequency pairings similar to featured ones. In conjunction, these updates extend the utility of the method to sequences up to length 600, as evaluated over a sizable dataset. Third, relationships are visualized in a decision tree which highlights the most important structural differences. Finally, this cluster analysis is made accessible to experimental researchers in a portable format as an interactive webpage, permitting a much greater understanding of trade-offs among different possible base pairing combinations.
Collapse
Affiliation(s)
- Forrest Hurley
- University of North Carolina at Chapel Hill, United States
| | | |
Collapse
|
3
|
Tapp M, Dennis P, Naik RR, Milam VT. Competition-Enhanced Ligand Selection to Screen for DNA Aptamers for Spherical Gold Nanoparticles. LANGMUIR : THE ACS JOURNAL OF SURFACES AND COLLOIDS 2021; 37:9043-9052. [PMID: 34279112 DOI: 10.1021/acs.langmuir.1c01053] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
The Competition-Enhanced Ligand Selection (CompELS) approach was used to identify aptamer candidates for spherical gold nanoparticles (AuNPs). This approach differs from conventional Systematic Evolution of Ligands by EXponential enrichment (SELEX)-based aptamer screening by eliminating repeated elution and polymerase chain reaction (PCR) amplification steps of bound candidate sequences between each selection round to continually enrich the candidate aptamer pool with oligonucleotides remaining from an earlier SELEX selection round. Instead, a new pool of unenriched oligonucleotides is added during each CompELS selection round to compete with existing target-bound oligonucleotides species for target binding sites. In this study, 24 aptamer candidates for AuNPs were identified using the CompELS approach and then compared to reveal similarities in their primary structures and their predicted secondary structures. No strong patterns in individual base identities (position-dependent) nor in segments of consecutive bases (independent of position) prevailed among the identified sequences. Motifs in predicted secondary structures, on the other hand, were shared among otherwise unrelated aptamer sequences. These motifs were revealed using a systematic classification and enumeration of distinct secondary structure elements, namely, hairpins, duplexes, single-stranded segments, interior loops, bulges, and multibranched loops.
Collapse
Affiliation(s)
| | - Patrick Dennis
- Materials & Manufacturing Directorate, Soft Matter Materials Branch, Air Force Research Laboratory, Wright-Patterson AFB, Ohio 45433, United States
| | - Rajesh R Naik
- 711th Human Performance Wing, Air Force Research Laboratory, Wright-Patterson AFB, Ohio 45433, United States
| | | |
Collapse
|
4
|
Lyu X, Gong E. Intelligent clustering analysis model for mining area mineral resource prediction. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2019. [DOI: 10.3233/jifs-179110] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Affiliation(s)
- Xiaodong Lyu
- School of Resources and Civil Engineering, Northeastern University, Shenyang, China
| | - Enpu Gong
- School of Resources and Civil Engineering, Northeastern University, Shenyang, China
| |
Collapse
|
5
|
Wang F, Sun LZ, Sun T, Chang S, Xu X. Helix-Based RNA Landscape Partition and Alternative Secondary Structure Determination. ACS OMEGA 2019; 4:15407-15413. [PMID: 31572840 PMCID: PMC6761681 DOI: 10.1021/acsomega.9b01430] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/16/2019] [Accepted: 09/03/2019] [Indexed: 06/10/2023]
Abstract
RNA is a versatile macromolecule with the ability to fold into and interconvert between multiple functional conformations. The elucidation of the RNA folding landscape, especially the knowledge of alternative structures, is critical to uncover the physical mechanism of RNA functions. Here, we introduce a helix-based strategy for RNA folding landscape partition and alternative secondary structure determination. The benchmark test of 27 RNAs involving alternative stable structures shows that the model has the ability to divide the whole landscape into distinct partitions at the secondary structure level and predict the representative structures for each partition. Furthermore, the predicted structures and equilibrium populations of metastable conformations for the 2'dG-sensing riboswitch reveal the allosteric conformational switch on transcript length, which is consistent with the experimental study, indicating the importance of metastable states for RNA-based gene regulation. The model delivers a starting point for the landscape-based strategy toward the RNA folding mechanism and functions.
Collapse
Affiliation(s)
- Fengfei Wang
- Institute
of Bioinformatics and Medical Engineering, School of Mathematics and
Physics, Jiangsu University of Technology, Changzhou, Jiangsu 213001, China
| | - Li-Zhen Sun
- Department
of Applied Physics, Zhejiang University
of Technology, Hangzhou, Zhejiang 310023, China
| | - Tingting Sun
- Department
of Physics, Zhejiang University of Science
and Technology, Hangzhou, Zhejiang 310023, China
| | - Shan Chang
- Institute
of Bioinformatics and Medical Engineering, School of Mathematics and
Physics, Jiangsu University of Technology, Changzhou, Jiangsu 213001, China
| | - Xiaojun Xu
- Institute
of Bioinformatics and Medical Engineering, School of Mathematics and
Physics, Jiangsu University of Technology, Changzhou, Jiangsu 213001, China
| |
Collapse
|
6
|
Sullivan R, Adams MC, Naik RR, Milam VT. Analyzing Secondary Structure Patterns in DNA Aptamers Identified via CompELS. Molecules 2019; 24:molecules24081572. [PMID: 31010064 PMCID: PMC6515186 DOI: 10.3390/molecules24081572] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2019] [Revised: 04/09/2019] [Accepted: 04/15/2019] [Indexed: 12/12/2022] Open
Abstract
In contrast to sophisticated high-throughput sequencing tools for genomic DNA, analytical tools for comparing secondary structure features between multiple single-stranded DNA sequences are less developed. For single-stranded nucleic acid ligands called aptamers, secondary structure is widely thought to play a pivotal role in driving recognition-based binding activity between an aptamer sequence and its specific target. Here, we employ a competition-based aptamer screening platform called CompELS to identify DNA aptamers for a colloidal target. We then analyze predicted secondary structures of the aptamers and a large population of random sequences to identify sequence features and patterns. Our secondary structure analysis identifies patterns ranging from position-dependent score matrixes of individual structural elements to position-independent consensus domains resulting from global alignment.
Collapse
Affiliation(s)
- Richard Sullivan
- School of Materials Science and Engineering, Georgia Institute of Technology, 771 Ferst Dr. NW, Atlanta, GA 30332-0245, USA.
| | - Mary Catherine Adams
- School of Materials Science and Engineering, Georgia Institute of Technology, 771 Ferst Dr. NW, Atlanta, GA 30332-0245, USA.
| | - Rajesh R Naik
- 711 Human Performance Wing, Air Force Research Laboratory, Wright Patterson AFB, OH 45433, USA.
| | - Valeria T Milam
- School of Materials Science and Engineering, Georgia Institute of Technology, 771 Ferst Dr. NW, Atlanta, GA 30332-0245, USA.
- Wallace H. Coulter, Department of Biomedical Engineering, Georgia Institute of Technology, 313 Ferst Dr., Atlanta, GA 30332, USA.
- Petit Institute for Bioengineering and Bioscience, Georgia Institute of Technology, 315 Ferst Dr., Atlanta, GA 30332-0363, USA.
| |
Collapse
|
7
|
Ledda M, Aviran S. PATTERNA: transcriptome-wide search for functional RNA elements via structural data signatures. Genome Biol 2018; 19:28. [PMID: 29495968 PMCID: PMC5833111 DOI: 10.1186/s13059-018-1399-z] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2017] [Accepted: 01/30/2018] [Indexed: 02/08/2023] Open
Abstract
Establishing a link between RNA structure and function remains a great challenge in RNA biology. The emergence of high-throughput structure profiling experiments is revolutionizing our ability to decipher structure, yet principled approaches for extracting information on structural elements directly from these data sets are lacking. We present PATTERNA, an unsupervised pattern recognition algorithm that rapidly mines RNA structure motifs from profiling data. We demonstrate that PATTERNA detects motifs with an accuracy comparable to commonly used thermodynamic models and highlight its utility in automating data-directed structure modeling from large data sets. PATTERNA is versatile and compatible with diverse profiling techniques and experimental conditions.
Collapse
Affiliation(s)
- Mirko Ledda
- Department of Biomedical Engineering and Genome Center, UC Davis, 1 Shields Ave, Davis, 95616 USA
- Integrative Genetics and Genomics Graduate Group, UC Davis, 1 Shields Ave, Davis, 95616 USA
| | - Sharon Aviran
- Department of Biomedical Engineering and Genome Center, UC Davis, 1 Shields Ave, Davis, 95616 USA
| |
Collapse
|
8
|
Statistical modeling of RNA structure profiling experiments enables parsimonious reconstruction of structure landscapes. Nat Commun 2018; 9:606. [PMID: 29426922 PMCID: PMC5807309 DOI: 10.1038/s41467-018-02923-8] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2017] [Accepted: 01/09/2018] [Indexed: 11/23/2022] Open
Abstract
RNA plays key regulatory roles in diverse cellular processes, where its functionality often derives from folding into and converting between structures. Many RNAs further rely on co-existence of alternative structures, which govern their response to cellular signals. However, characterizing heterogeneous landscapes is difficult, both experimentally and computationally. Recently, structure profiling experiments have emerged as powerful and affordable structure characterization methods, which improve computational structure prediction. To date, efforts have centered on predicting one optimal structure, with much less progress made on multiple-structure prediction. Here, we report a probabilistic modeling approach that predicts a parsimonious set of co-existing structures and estimates their abundances from structure profiling data. We demonstrate robust landscape reconstruction and quantitative insights into structural dynamics by analyzing numerous data sets. This work establishes a framework for data-directed characterization of structure landscapes to aid experimentalists in performing structure-function studies. Different experimental and computational approaches can be used to study RNA structures. Here, the authors present a computational method for data-directed reconstruction of complex RNA structure landscapes, which predicts a parsimonious set of co-existing structures and estimates their abundances from structure profiling data.
Collapse
|
9
|
Schlick T, Pyle AM. Opportunities and Challenges in RNA Structural Modeling and Design. Biophys J 2017; 113:225-234. [PMID: 28162235 PMCID: PMC5529161 DOI: 10.1016/j.bpj.2016.12.037] [Citation(s) in RCA: 48] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2016] [Revised: 12/08/2016] [Accepted: 12/19/2016] [Indexed: 01/27/2023] Open
Abstract
We describe opportunities and challenges in RNA structural modeling and design, as recently discussed during the second Telluride Science Research Center workshop organized in June 2016. Topics include fundamental processes of RNA, such as structural assemblies (hierarchical folding, multiple conformational states and their clustering), RNA motifs, and chemical reactivity of RNA, as used for structural prediction and functional inference. We also highlight the software and database issues associated with RNA structures, such as the multiple approaches for motif annotation, the need for frequent database updating, and the importance of quality control of RNA structures. We discuss various modeling approaches for structure prediction, mechanistic analysis of RNA reactions, and RNA design, and the complementary roles that both atomistic and coarse-grained approaches play in such simulations. Collectively, as scientists from varied disciplines become familiar and drawn into these unique challenges, new approaches and collaborative efforts will undoubtedly be catalyzed.
Collapse
Affiliation(s)
- Tamar Schlick
- Department of Chemistry, New York University, New York, New York; Courant Institute of Mathematical Sciences, New York University, New York, New York.
| | - Anna Marie Pyle
- Department of Molecular and Cellular and Developmental Biology and Department of Chemistry, Yale University; Howard Hughes Medical Institute, New Haven, Connecticut.
| |
Collapse
|
10
|
Dawn of the in vivo RNA structurome and interactome. Biochem Soc Trans 2017; 44:1395-1410. [PMID: 27911722 DOI: 10.1042/bst20160075] [Citation(s) in RCA: 35] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2016] [Revised: 06/19/2016] [Accepted: 07/04/2016] [Indexed: 12/11/2022]
Abstract
RNA is one of the most fascinating biomolecules in living systems given its structural versatility to fold into elaborate architectures for important biological functions such as gene regulation, catalysis, and information storage. Knowledge of RNA structures and interactions can provide deep insights into their functional roles in vivo For decades, RNA structural studies have been conducted on a transcript-by-transcript basis. The advent of next-generation sequencing (NGS) has enabled the development of transcriptome-wide structural probing methods to profile the global landscape of RNA structures and interactions, also known as the RNA structurome and interactome, which transformed our understanding of the RNA structure-function relationship on a transcriptomic scale. In this review, molecular tools and NGS methods used for RNA structure probing are presented, novel insights uncovered by RNA structurome and interactome studies are highlighted, and perspectives on current challenges and potential future directions are discussed. A more complete understanding of the RNA structures and interactions in vivo will help illuminate the novel roles of RNA in gene regulation, development, and diseases.
Collapse
|
11
|
Ignatova Z, Narberhaus F. Systematic probing of the bacterial RNA structurome to reveal new functions. Curr Opin Microbiol 2017; 36:14-19. [DOI: 10.1016/j.mib.2017.01.003] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2016] [Revised: 01/06/2017] [Accepted: 01/11/2017] [Indexed: 12/15/2022]
|
12
|
Choudhary K, Deng F, Aviran S. Comparative and integrative analysis of RNA structural profiling data: current practices and emerging questions. QUANTITATIVE BIOLOGY 2017; 5:3-24. [PMID: 28717530 PMCID: PMC5510538 DOI: 10.1007/s40484-017-0093-6] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2016] [Revised: 12/08/2016] [Accepted: 12/15/2016] [Indexed: 12/30/2022]
Abstract
BACKGROUND Structure profiling experiments provide single-nucleotide information on RNA structure. Recent advances in chemistry combined with application of high-throughput sequencing have enabled structure profiling at transcriptome scale and in living cells, creating unprecedented opportunities for RNA biology. Propelled by these experimental advances, massive data with ever-increasing diversity and complexity have been generated, which give rise to new challenges in interpreting and analyzing these data. RESULTS We review current practices in analysis of structure profiling data with emphasis on comparative and integrative analysis as well as highlight emerging questions. Comparative analysis has revealed structural patterns across transcriptomes and has become an integral component of recent profiling studies. Additionally, profiling data can be integrated into traditional structure prediction algorithms to improve prediction accuracy. CONCLUSIONS To keep pace with experimental developments, methods to facilitate, enhance and refine such analyses are needed. Parallel advances in analysis methodology will complement profiling technologies and help them reach their full potential.
Collapse
Affiliation(s)
| | | | - Sharon Aviran
- Department of Biomedical Engineering and Genome Center, University of California at Davis, Davis, CA 95616, USA
| |
Collapse
|