1
|
Abondio P, Cilli E, Luiselli D. Human Pangenomics: Promises and Challenges of a Distributed Genomic Reference. Life (Basel) 2023; 13:1360. [PMID: 37374141 DOI: 10.3390/life13061360] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2023] [Revised: 06/02/2023] [Accepted: 06/08/2023] [Indexed: 06/29/2023] Open
Abstract
A pangenome is a collection of the common and unique genomes that are present in a given species. It combines the genetic information of all the genomes sampled, resulting in a large and diverse range of genetic material. Pangenomic analysis offers several advantages compared to traditional genomic research. For example, a pangenome is not bound by the physical constraints of a single genome, so it can capture more genetic variability. Thanks to the introduction of the concept of pangenome, it is possible to use exceedingly detailed sequence data to study the evolutionary history of two different species, or how populations within a species differ genetically. In the wake of the Human Pangenome Project, this review aims at discussing the advantages of the pangenome around human genetic variation, which are then framed around how pangenomic data can inform population genetics, phylogenetics, and public health policy by providing insights into the genetic basis of diseases or determining personalized treatments, targeting the specific genetic profile of an individual. Moreover, technical limitations, ethical concerns, and legal considerations are discussed.
Collapse
Affiliation(s)
- Paolo Abondio
- Laboratory of Ancient DNA, Department of Cultural Heritage, University of Bologna, Via degli Ariani 1, 48121 Ravenna, Italy
| | - Elisabetta Cilli
- Laboratory of Ancient DNA, Department of Cultural Heritage, University of Bologna, Via degli Ariani 1, 48121 Ravenna, Italy
| | - Donata Luiselli
- Laboratory of Ancient DNA, Department of Cultural Heritage, University of Bologna, Via degli Ariani 1, 48121 Ravenna, Italy
| |
Collapse
|
2
|
Khatoon H, Raza RZ, Saleem S, Batool F, Arshad S, Abrar M, Ali S, Hussain I, Shubin NH, Abbasi AA. Evolutionary relevance of single nucleotide variants within the forebrain exclusive human accelerated enhancer regions. BMC Mol Cell Biol 2023; 24:13. [PMID: 36991330 PMCID: PMC10053400 DOI: 10.1186/s12860-023-00474-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2022] [Accepted: 03/13/2023] [Indexed: 03/30/2023] Open
Abstract
Abstract
Background
Human accelerated regions (HARs) are short conserved genomic sequences that have acquired significantly more nucleotide substitutions than expected in the human lineage after divergence from chimpanzees. The fast evolution of HARs may reflect their roles in the origin of human-specific traits. A recent study has reported positively-selected single nucleotide variants (SNVs) within brain-exclusive human accelerated enhancers (BE-HAEs) hs1210 (forebrain), hs563 (hindbrain) and hs304 (midbrain/forebrain). By including data from archaic hominins, these SNVs were shown to be Homo sapiens-specific, residing within transcriptional factors binding sites (TFBSs) for SOX2 (hs1210), RUNX1/3 (hs563), and FOS/JUND (hs304). Although these findings suggest that the predicted modifications in TFBSs may have some role in present-day brain structure, work is required to verify the extent to which these changes translate into functional variation.
Results
To start to fill this gap, we investigate the SOX2 SNV, with both forebrain expression and strong signal of positive selection in humans. We demonstrate that the HMG box of SOX2 binds in vitro with Homo sapiens-specific derived A-allele and ancestral T-allele carrying DNA sites in BE-HAE hs1210. Molecular docking and simulation analysis indicated highly favourable binding of HMG box with derived A-allele containing DNA site when compared to site carrying ancestral T-allele.
Conclusion
These results suggest that adoptive changes in TF affinity within BE-HAE hs1210 and other HAR enhancers in the evolutionary history of Homo sapiens might.
have brought about changes in gene expression patterns and have functional consequences on forebrain formation and evolution.
Methods
The present study employ electrophoretic mobility shift assays (EMSA) and molecular docking and molecular dynamics simulations approaches.
Collapse
|
3
|
Stein WD, Hoshen MB. During evolution from the earliest tetrapoda, newly-recruited genes are increasingly paralogues of existing genes and distribute non-randomly among the chromosomes. BMC Genomics 2021; 22:794. [PMID: 34736418 PMCID: PMC8570013 DOI: 10.1186/s12864-021-08066-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2020] [Accepted: 09/28/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The present availability of full genome sequences of a broad range of animal species across the whole range of evolutionary history enables one to ask questions as to the distribution of genes across the chromosomes. Do newly recruited genes, as new clades emerge, distribute at random or at non-random locations? RESULTS We extracted values for the ages of the human genes and for their current chromosome locations, from published sources. A quantitative analysis showed that the distribution of newly-added genes among and within the chromosomes appears to be increasingly non-random if one observes animals along the evolutionary series from the precursors of the tetrapoda through to the great apes, whereas the oldest genes are randomly distributed. CONCLUSIONS Randomization will result from chromosome evolution, but less and less time is available for this process as evolution proceeds. Much of the bunching of recently-added genes arises from new gene formation as paralogues in gene families, near the location of genes that were recruited in the preceding phylostratum. As examples we cite the KRTAP, ZNF, OR and some minor gene families. We show that bunching can also result from the evolution of the chromosomes themselves when, as for the KRTAP genes, blocks of genes that had previously been on disparate chromosomes become linked together.
Collapse
Affiliation(s)
- Wilfred D Stein
- Silberman Institute of Life Sciences, Hebrew University, 91904, Jerusalem, Israel.
| | - Moshe B Hoshen
- Bioinformatics Department, Jerusalem College of Technology, Tal Campus, Beit HaDfus 7, 95483, Jerusalem, Israel
| |
Collapse
|
4
|
Auboeuf D. The Physics-Biology continuum challenges darwinism: Evolution is directed by the homeostasis-dependent bidirectional relation between genome and phenotype. PROGRESS IN BIOPHYSICS AND MOLECULAR BIOLOGY 2021; 167:121-139. [PMID: 34097984 DOI: 10.1016/j.pbiomolbio.2021.05.008] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/19/2021] [Revised: 05/19/2021] [Accepted: 05/31/2021] [Indexed: 10/21/2022]
Abstract
The physics-biology continuum relies on the fact that life emerged from prebiotic molecules. Here, I argue that life emerged from the coupling between nucleic acid and protein synthesis during which proteins (or proto-phenotypes) maintained the physicochemical parameter equilibria (or proto-homeostasis) in the proximity of their encoding nucleic acids (or proto-genomes). This protected the proto-genome physicochemical integrity (i.e., atomic composition) from environmental physicochemical constraints, and therefore increased the probability of reproducing the proto-genome without variation. From there, genomes evolved depending on the biological activities they generated in response to environmental fluctuations. Thus, a genome maintaining homeostasis (i.e., internal physicochemical parameter equilibria), despite and in response to environmental fluctuations, maintains its physicochemical integrity and has therefore a higher probability to be reproduced without variation. Consequently, descendants have a higher probability to share the same phenotype than their parents. Otherwise, the genome is modified during replication as a consequence of the imbalance of the internal physicochemical parameters it generates, until new mutation-deriving biological activities maintain homeostasis in offspring. In summary, evolution depends on feedforward and feedback loops between genome and phenotype, as the internal physicochemical conditions that a genome generates ─ through its derived phenotype in response to environmental fluctuations ─ in turn either guarantee its stability or direct its variation. Evolution may not be explained by the Darwinism-derived, unidirectional principle (random mutations-phenotypes-natural selection) but rather by the bidirectional relationship between genome and phenotype, in which the phenotype in interaction with the environment directs the evolution of the genome it derives from.
Collapse
Affiliation(s)
- Didier Auboeuf
- ENS de Lyon, Univ Claude Bernard, CNRS UMR 5239, INSERM U1210, Laboratory of Biology and Modelling of the Cell, 46 Allée D'Italie, Site Jacques Monod, F-69007, Lyon, France.
| |
Collapse
|
5
|
Lopes I, Altab G, Raina P, de Magalhães JP. Gene Size Matters: An Analysis of Gene Length in the Human Genome. Front Genet 2021; 12:559998. [PMID: 33643374 PMCID: PMC7905317 DOI: 10.3389/fgene.2021.559998] [Citation(s) in RCA: 68] [Impact Index Per Article: 22.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2020] [Accepted: 01/06/2021] [Indexed: 12/23/2022] Open
Abstract
While it is expected for gene length to be associated with factors such as intron number and evolutionary conservation, we are yet to understand the connections between gene length and function in the human genome. In this study, we show that, as expected, there is a strong positive correlation between gene length, transcript length, and protein size as well as a correlation with the number of genetic variants and introns. Among tissue-specific genes, we find that the longest transcripts tend to be expressed in the blood vessels, nerves, thyroid, cervix uteri, and the brain, while the smallest transcripts tend to be expressed in the pancreas, skin, stomach, vagina, and testis. We report, as shown previously, that natural selection suppresses changes for genes with longer transcripts and promotes changes for genes with smaller transcripts. We also observe that genes with longer transcripts tend to have a higher number of co-expressed genes and protein-protein interactions, as well as more associated publications. In the functional analysis, we show that bigger transcripts are often associated with neuronal development, while smaller transcripts tend to play roles in skin development and in the immune system. Furthermore, pathways related to cancer, neurons, and heart diseases tend to have genes with longer transcripts, with smaller transcripts being present in pathways related to immune responses and neurodegenerative diseases. Based on our results, we hypothesize that longer genes tend to be associated with functions that are important in the early development stages, while smaller genes tend to play a role in functions that are important throughout the whole life, like the immune system, which requires fast responses.
Collapse
Affiliation(s)
| | | | | | - João Pedro de Magalhães
- Integrative Genomics of Ageing Group, Institute of Ageing and Chronic Disease, University of Liverpool, Liverpool, United Kingdom
| |
Collapse
|
6
|
Combes F, Meyer E, Sanders NN. Immune cells as tumor drug delivery vehicles. J Control Release 2020; 327:70-87. [PMID: 32735878 DOI: 10.1016/j.jconrel.2020.07.043] [Citation(s) in RCA: 58] [Impact Index Per Article: 14.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2020] [Revised: 07/24/2020] [Accepted: 07/25/2020] [Indexed: 12/21/2022]
Abstract
This review article describes the use of immune cells as potential candidates to deliver anti-cancer drugs deep within the tumor microenvironment. First, the rationale of using drug carriers to target tumors and potentially decrease drug-related side effects is discussed. We further explain some of the current limitations when using nanoparticles for this purpose. Next, a comprehensive step-by-step description of the migration cascade of immune cells is provided as well as arguments on why immune cells can be used to address some of the limitations associated with nanoparticle-mediated drug delivery. We then describe the benefits and drawbacks of using red blood cells, platelets, granulocytes, monocytes, macrophages, myeloid-derived suppressor cells, T cells and NK cells for tumor-targeted drug delivery. An additional section discusses the versatility of nanoparticles to load anti-cancer drugs into immune cells. Lastly, we propose increasing the circulatory half-life and development of conditional release strategies as the two main future pillars to improve the efficacy of immune cell-mediated drug delivery to tumors.
Collapse
Affiliation(s)
- Francis Combes
- Laboratory of Gene Therapy, Department of Nutrition, Genetics and Ethology, Faculty of Veterinary Medicine, Ghent University, Heidestraat 19, 9820 Merelbeke, Belgium; Cancer Research Institute Ghent (CRIG), 9000 Ghent, Belgium
| | - Evelyne Meyer
- Cancer Research Institute Ghent (CRIG), 9000 Ghent, Belgium; Department of Pharmacology, Toxicology and Biochemistry, Faculty of Veterinary Medicine, Ghent University, Salisburylaan 133, 9820 Merelbeke, Belgium
| | - Niek N Sanders
- Laboratory of Gene Therapy, Department of Nutrition, Genetics and Ethology, Faculty of Veterinary Medicine, Ghent University, Heidestraat 19, 9820 Merelbeke, Belgium; Cancer Research Institute Ghent (CRIG), 9000 Ghent, Belgium.
| |
Collapse
|
7
|
de Los Reyes BG. Genomic and epigenomic bases of transgressive segregation - New breeding paradigm for novel plant phenotypes. PLANT SCIENCE : AN INTERNATIONAL JOURNAL OF EXPERIMENTAL PLANT BIOLOGY 2019; 288:110213. [PMID: 31521221 DOI: 10.1016/j.plantsci.2019.110213] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/19/2019] [Revised: 07/17/2019] [Accepted: 08/05/2019] [Indexed: 06/10/2023]
Abstract
For a holistic approach in developing the stress-resilient crops of the 21st century, modern genomic biology will need to re-envision the underappreciated phenomena in classical genetics, and incorporate them into the new plant breeding paradigm. Advances in evolutionary genomics support a theory that genetic recombination under genome shock during hybridization of widely divergent parents is an important driver of adaptive speciation, by virtue of the novelties of rare hybrids and recombinants. The enormous potential of genetic network rewiring to generate developmental or physiological novelties with adaptive advantage to special ecological niches has been appreciated. Developmental and physiological reconfiguration through network rewiring involves intricate molecular synergies controlled both at the genetic and epigenetic levels, as typified by the phenomenon of transgressive segregation, observed in both natural and breeding populations. This paper presents modern views on the possible molecular underpinnings of transgressive phenotypes as they are created in plant breeding, expanded from classical explanations through the Omnigenic Theory for quantitative traits and modern paradigms of epigenetics. Perspectives on how genomic biology can fully exploit this phenomenon to create novel phenotypes beyond what could be achieved through the more reductionist approach of functional genomics are presented in context of genomic modeling.
Collapse
Affiliation(s)
- Benildo G de Los Reyes
- Department of Plant and Soil Science Texas Tech University 215 Experimental Sciences Building, Lubbock, TX 806-834-6421, USA.
| |
Collapse
|
8
|
Goodarzi P, Alavi-Moghadam S, Payab M, Larijani B, Rahim F, Gilany K, Bana N, Tayanloo-Beik A, Foroughi Heravani N, Hadavandkhani M, Arjmand B. Metabolomics Analysis of Mesenchymal Stem Cells. INTERNATIONAL JOURNAL OF MOLECULAR AND CELLULAR MEDICINE 2019; 8:30-40. [PMID: 32351907 PMCID: PMC7175611 DOI: 10.22088/ijmcm.bums.8.2.30] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/13/2019] [Accepted: 05/20/2019] [Indexed: 12/12/2022]
Abstract
Various mesenchymal stem cells as easily accessible and multipotent cells can share different essential signaling pathways related to their stemness ability. Understanding the mechanism of stemness ability can be useful for controlling the stem cells for regenerative medicine targets. In this context, OMICs studies can analyze the mechanism of different stem cell properties or stemness ability via a broad range of current high-throughput techniques. This field is fundamentally directed toward the analysis of whole genome (genomics), mRNAs (transcriptomics), proteins (proteomics) and metabolites (metabolomics) in biological samples. According to several studies, metabolomics is more effective than other OMICs ّfor various system biology concerns. Metabolomics can elucidate the biological mechanisms of various mesenchymal stem cell function by measuring their metabolites such as their secretome components. Analyzing the metabolic alteration of mesenchymal stem cells can be useful to promote their regenerative medicine application.
Collapse
Affiliation(s)
- Parisa Goodarzi
- Brain and Spinal Cord Injury Research Center, Neuroscience Institute, Tehran University of Medical Sciences, Tehran, Iran
| | - Sepideh Alavi-Moghadam
- Cell Therapy and Regenerative Medicine Research Center, Endocrinology and Metabolism Molecular-Cellular Sciences Institute, Tehran University of Medical Sciences, Tehran, Iran
| | - Moloud Payab
- Obesity and Eating Habits Research Center, Endocrinology and Metabolism Molecular-Cellular Sciences Institute, Tehran University of Medical Sciences, Tehran, Iran
| | - Bagher Larijani
- Endocrinology and Metabolism Research Center, Endocrinology and Metabolism Clinical Sciences Institute, Tehran University of Medical sciences, Tehran, Iran
| | - Fakher Rahim
- Health Research Institute, Thalassemia and Hemoglobinopathies Research Center, Ahvaz Jundishapur University of Medical Sciences, Ahvaz, Iran
| | - Kambiz Gilany
- Integrative Oncology Department, Breast Cancer Research Center, Motamed Cancer Institute, ACECR, Tehran, Iran .,Department of Biomedical Sciences, University of Antwerp, Belgium
| | - Nikoo Bana
- Metabolomics and Genomics Research Center, Endocrinology and Metabolism Molecular- Cellular Sciences Institute, Tehran University of Medical Sciences, Tehran, Iran
| | - Akram Tayanloo-Beik
- Cell Therapy and Regenerative Medicine Research Center, Endocrinology and Metabolism Molecular-Cellular Sciences Institute, Tehran University of Medical Sciences, Tehran, Iran
| | - Najmeh Foroughi Heravani
- Cell Therapy and Regenerative Medicine Research Center, Endocrinology and Metabolism Molecular-Cellular Sciences Institute, Tehran University of Medical Sciences, Tehran, Iran
| | - Mahdieh Hadavandkhani
- Cell Therapy and Regenerative Medicine Research Center, Endocrinology and Metabolism Molecular-Cellular Sciences Institute, Tehran University of Medical Sciences, Tehran, Iran
| | - Babak Arjmand
- Cell Therapy and Regenerative Medicine Research Center, Endocrinology and Metabolism Molecular-Cellular Sciences Institute, Tehran University of Medical Sciences, Tehran, Iran .,Metabolomics and Genomics Research Center, Endocrinology and Metabolism Molecular- Cellular Sciences Institute, Tehran University of Medical Sciences, Tehran, Iran
| |
Collapse
|
9
|
Litman T, Stein WD. Obtaining estimates for the ages of all the protein-coding genes and most of the ontology-identified noncoding genes of the human genome, assigned to 19 phylostrata. Semin Oncol 2018; 46:3-9. [PMID: 30558821 DOI: 10.1053/j.seminoncol.2018.11.002] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/16/2018] [Accepted: 11/16/2018] [Indexed: 11/11/2022]
Abstract
Following Liebeskind et al [1], we have attempted to find consensus ages for the protein-coding and the noncoding genes of the human genome, using publicly-available ortholog databases. For each database separately, we determined its age estimate for the genes it listed, determining this by identifying the earliest ortholog for the gene in question. We assigned these ages to 1 of the 19 major phylostrata defined by Domazet-Loso and Tautz [2], 2 of which were further subdivided. From these various estimates, we found the modal value if 1 was present, defining this as the consensus age for the gene. For the genes where no consensus value could be found, we recorded the median value of the age estimates across the databases interrogated. We present a resource that lists the age, as so defined, of every one of the 19,660 protein-coding genes and of 5,981 of the 16,528 non-protein-coding genes of the human genome, the age being the time when the gene was accreted to the evolving human genome. We calculate the number of genes that accreted to the genome, epoch by epoch, and consider the rate at which they accreted.
Collapse
Affiliation(s)
| | - Wilfred D Stein
- Silberman Institute of Life Sciences, The Hebrew University of Jerusalem, Jerusalem, Israel.
| |
Collapse
|
10
|
Alu SINE analyses of 3,000-year-old human skeletal remains: a pilot study. Mob DNA 2016; 7:7. [PMID: 27096009 PMCID: PMC4836192 DOI: 10.1186/s13100-016-0063-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2015] [Accepted: 03/31/2016] [Indexed: 01/21/2023] Open
Abstract
Background As Short Interspersed Elements (SINEs), human-specific Alu elements can be used for population genetic studies. Very recent inserts are polymorphic within and between human populations. In a sample of 30 elements originating from three different Alu subfamilies, we investigated whether they are preserved in prehistorical skeletal human remains from the Bronze Age Lichtenstein cave in Lower Saxony, Germany. In the present study, we examined a prehistoric triad of father, mother and daughter. Results For 26 of the 30 Alu loci investigated, definite results were obtained. We were able to demonstrate that presence/absence analyses of Alu elements can be conducted on individuals who lived 3,000 years ago. The preservation of the ancient DNA (aDNA) is good enough in two out of three ancient individuals to routinely allow the amplification of 500 bp fragments. The third individual revealed less well-preserved DNA, which results in allelic dropout or complete amplification failures. We here present an alternative molecular approach to deal with these degradation phenomena by using internal Alu subfamily specific primers producing short fragments of approximately 150 bp. Conclusions Our data clearly show the possibility of presence/absence analyses of Alu elements in individuals from the Lichtenstein cave. Thus, we demonstrate that our method is reliably applicable for aDNA samples with good or moderate DNA preservation. This method will be very useful for further investigations with more Alu loci and larger datasets. Human population genetic studies and other large-scale investigations would provide insight into Alu SINE-based microevolutionary processes in humans during the last few thousand years and help us comprehend the evolutionary dynamics of our genome. Electronic supplementary material The online version of this article (doi:10.1186/s13100-016-0063-y) contains supplementary material, which is available to authorized users.
Collapse
|