Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Balasubramanian S, Harrison P, Hegyi H, Bertone P, Luscombe N, Echols N, McGarvey P, Zhang Z, Gerstein M. SNPs on human chromosomes 21 and 22 -- analysis in terms of protein features and pseudogenes. Pharmacogenomics 2002;3:393-402. [PMID: 12052146 DOI: 10.1517/14622416.3.3.393] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022] Open

For:	Balasubramanian S, Harrison P, Hegyi H, Bertone P, Luscombe N, Echols N, McGarvey P, Zhang Z, Gerstein M. SNPs on human chromosomes 21 and 22 -- analysis in terms of protein features and pseudogenes. Pharmacogenomics 2002;3:393-402. [PMID: 12052146 DOI: 10.1517/14622416.3.3.393] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022] Open

Number

Cited by Other Article(s)

Šimon M, Mikec Š, Atanur SS, Konc J, Morton NM, Horvat S, Kunej T. Whole genome sequencing of mouse lines divergently selected for fatness (FLI) and leanness (FHI) revealed several genetic variants as candidates for novel obesity genes. Genes Genomics 2024;46:557-575. [PMID: 38483771 PMCID: PMC11024027 DOI: 10.1007/s13258-024-01507-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2023] [Accepted: 02/25/2024] [Indexed: 04/18/2024]

Abstract

BACKGROUND

Analysing genomes of animal model organisms is widely used for understanding the genetic basis of complex traits and diseases, such as obesity, for which only a few mouse models exist, however, without their lean counterparts.

OBJECTIVE

To analyse genetic differences in the unique mouse models of polygenic obesity (Fat line) and leanness (Lean line) originating from the same base population and established by divergent selection over more than 60 generations.

METHODS

Genetic variability was analysed using WGS. Variants were identified with GATK and annotated with Ensembl VEP. g.Profiler, WebGestalt, and KEGG were used for GO and pathway enrichment analysis. miRNA seed regions were obtained with miRPathDB 2.0, LncRRIsearch was used to predict targets of identified lncRNAs, and genes influencing adipose tissue amount were searched using the IMPC database.

RESULTS

WGS analysis revealed 6.3 million SNPs, 1.3 million were new. Thousands of potentially impactful SNPs were identified, including within 24 genes related to adipose tissue amount. SNP density was highest in pseudogenes and regulatory RNAs. The Lean line carries SNP rs248726381 in the seed region of mmu-miR-3086-3p, which may affect fatty acid metabolism. KEGG analysis showed deleterious missense variants in immune response and diabetes genes, with food perception pathways being most enriched. Gene prioritisation considering SNP GERP scores, variant consequences, and allele comparison with other mouse lines identified seven novel obesity candidate genes: 4930441H08Rik, Aff3, Fam237b, Gm36633, Pced1a, Tecrl, and Zfp536.

CONCLUSION

WGS revealed many genetic differences between the lines that accumulated over the selection period, including variants with potential negative impacts on gene function. Given the increasing availability of mouse strains and genetic polymorphism catalogues, the study is a valuable resource for researchers to study obesity.

Collapse

Comparative analysis of pseudogenes across three phyla. Proc Natl Acad Sci U S A 2014;111:13361-6. [PMID: 25157146 DOI: 10.1073/pnas.1407293111] [Citation(s) in RCA: 53] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

Chen J, Zhang X, Jing R, Blair MW, Mao X, Wang S. Cloning and genetic diversity analysis of a new P5CS gene from common bean (Phaseolus vulgaris L.). TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2010;120:1393-404. [PMID: 20143043 DOI: 10.1007/s00122-010-1263-3] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/27/2009] [Accepted: 12/23/2009] [Indexed: 05/11/2023]

Chandrasekar A, Riju A, Sithara K, Anoop S, Eapen SJ. Identification of single nucleotide polymorphism in ginger using expressed sequence tags. Bioinformation 2009;4:119-22. [PMID: 20198184 PMCID: PMC2828891 DOI: 10.6026/97320630004119] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2009] [Revised: 05/10/2009] [Accepted: 06/08/2009] [Indexed: 12/05/2022] Open

Williams JL, Dunner S, Valentini A, Mazza R, Amarger V, Checa ML, Crisà A, Razzaq N, Delourme D, Grandjean F, Marchitelli C, García D, Pérez Gomez R, Negrini R, Ajmone Marsan P, Levéziel H. Discovery, characterization and validation of single nucleotide polymorphisms within 206 bovine genes that may be considered as candidate genes for beef production and quality. Anim Genet 2009;40:486-91. [DOI: 10.1111/j.1365-2052.2009.01874.x] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Morais DD, Harrison PM. Genomic evidence for non-random endemic populations of decaying exons from mammalian genes. BMC Genomics 2009;10:309. [PMID: 19594905 PMCID: PMC2718932 DOI: 10.1186/1471-2164-10-309] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2009] [Accepted: 07/13/2009] [Indexed: 11/13/2022] Open

Abstract

Background

Functional diversification of genes in mammalian genomes is engendered by a number of processes, e.g., gene duplication and alternative splicing. Gene duplication is classically discussed as leading to neofunctionalization (generation of new functions), subfunctionalization (generation of a varied function), or pseudogenization (loss of the gene and its function).

Results

Here, we focus on the process of pseudogenization, but specifically for individual exons from genes. It is at present unclear to what extent pseudogenization of individual exon duplications affects gene evolution, i.e., is it a random phenomenon, or is it associated with specific types of genes and encoded proteins, and positions in gene structures? We gathered genomic evidence for pseudogenic exons (ΨEs, i.e., exons disabled by frameshifts and premature stop codons), to examine for significant trends in their distribution across four mammalian genomes (specifically human, cow, mouse and rat). Across these four genomes, we observed a consistent population of ΨEs, associated with 0.4–1.0% of genes. These ΨE populations exhibit codon substitution patterns that are typical of an endemic population of decaying sequences. In human, ΨEs have significant over-representation for functional categories related to 'ion binding' and 'nucleic-acid binding', compared to duplicated exons in general. Also, ΨEs tend to be associated with some protein domains that are abundant generally, e.g., Zinc-finger and immunoglobulin protein domains, but not others, e.g., EGF-like domains. Positionally, ΨEs are also significantly associated with the 5' end of genes, but despite this, individual stop codons are positioned so that there is significant avoidance of potential targeting to nonsense-mediated decay. In human, ΨEs are often associated with alternative splicing (in 22 out of 284 genes with ΨEs in their milieu), and can have different parts of their sequence differentially spliced in alternative transcripts. Some unusual cases of ΨEs embedded within 5' and 3' non-coding exons are observed.

Conclusion

Our results indicate the types of genes that harbour ΨEs, and demonstrate that ΨEs have non-random distribution within gene structures. These ΨEs may function in gene regulation through generation of transcribed pseudogenes, or regulatory alternate transcripts.

Collapse

Mukherjee S, Sarkar-Roy N, Wagener DK, Majumder PP. Signatures of natural selection are not uniform across genes of innate immune system, but purifying selection is the dominant signature. Proc Natl Acad Sci U S A 2009;106:7073-8. [PMID: 19359493 PMCID: PMC2678448 DOI: 10.1073/pnas.0811357106] [Citation(s) in RCA: 85] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2008] [Indexed: 12/21/2022] Open

Guo Y, Jamison DC. The distribution of SNPs in human gene regulatory regions. BMC Genomics 2005;6:140. [PMID: 16209714 PMCID: PMC1260019 DOI: 10.1186/1471-2164-6-140] [Citation(s) in RCA: 60] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2005] [Accepted: 10/06/2005] [Indexed: 11/25/2022] Open

Zheng D, Zhang Z, Harrison PM, Karro J, Carriero N, Gerstein M. Integrated pseudogene annotation for human chromosome 22: evidence for transcription. J Mol Biol 2005;349:27-45. [PMID: 15876366 DOI: 10.1016/j.jmb.2005.02.072] [Citation(s) in RCA: 63] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2004] [Revised: 02/16/2005] [Accepted: 02/23/2005] [Indexed: 02/06/2023]

Abstract

Pseudogenes are inheritable genetic elements formally defined by two properties: their similarity to functioning genes and their presumed lack of activity. However, their precise characterization, particularly with respect to the latter quality, has proven elusive. An opportunity to explore this issue arises from the recent emergence of tiling-microarray data showing that intergenic regions (containing pseudogenes) are transcribed to a great degree. Here we focus on the transcriptional activity of pseudogenes on human chromosome 22. First, we integrated several sets of annotation to define a unified list of 525 pseudogenes on the chromosome. To characterize these further, we developed a comprehensive list of genomic features based on conservation in related organisms, expression evidence, and the presence of upstream regulatory sites. Of the 525 unified pseudogenes we could confidently classify 154 as processed and 49 as duplicated. Using data from tiling microarrays, especially from recent high-resolution oligonucleotide arrays, we found some evidence that up to a fifth of the 525 pseudogenes are potentially transcribed. Expressed sequence tags (EST) comparison further validated a number of these, and overall we found 17 pseudogenes with strong support for transcription. In particular, one of the pseudogenes with both EST and microarray evidence for transcription turned out to be a duplicated pseudogene in the cat eye syndrome critical region. Although we could not identify a meaningful number of transcription factor-binding sites (based on chromatin immunoprecipitation-chip data) near pseudogenes, we did find that approximately 12% of the pseudogenes had upstream CpG islands. Finally, analysis of corresponding syntenic regions in the mouse, rat and chimp genomes indicates, as previously suggested, that pseudogenes are less conserved than genes, but more preserved than the intergenic background (all notation is available from http://www.pseudogene.org).

Collapse

Zhang Z, Gerstein M. Large-scale analysis of pseudogenes in the human genome. Curr Opin Genet Dev 2005;14:328-35. [PMID: 15261647 DOI: 10.1016/j.gde.2004.06.003] [Citation(s) in RCA: 117] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

Basu A, Chaudhuri P, Majumder PP. Identification of polymorphic motifs using probabilistic search algorithms. Genome Res 2005;15:67-77. [PMID: 15632091 PMCID: PMC540278 DOI: 10.1101/gr.2358005] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2004] [Accepted: 10/21/2004] [Indexed: 01/12/2023]

Rinn JL, Euskirchen G, Bertone P, Martone R, Luscombe NM, Hartman S, Harrison PM, Nelson FK, Miller P, Gerstein M, Weissman S, Snyder M. The transcriptional activity of human Chromosome 22. Genes Dev 2003;17:529-40. [PMID: 12600945 PMCID: PMC195998 DOI: 10.1101/gad.1055203] [Citation(s) in RCA: 237] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2002] [Accepted: 12/24/2002] [Indexed: 01/09/2023]

Harrison PM, Gerstein M. Studying genomes through the aeons: protein families, pseudogenes and proteome evolution. J Mol Biol 2002;318:1155-74. [PMID: 12083509 DOI: 10.1016/s0022-2836(02)00109-2] [Citation(s) in RCA: 120] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]

Abstract

Protein families can be used to understand many aspects of genomes, both their "live" and their "dead" parts (i.e. genes and pseudogenes). Surveys of genomes have revealed that, in every organism, there are always a few large families and many small ones, with the overall distribution following a power-law. This commonality is equally true for both genes and pseudogenes, and exists despite the fact that the specific families that are enlarged differ greatly between organisms. Furthermore, because of family structure there is great redundancy in proteomes, a fact linked to the large number of dispensable genes for each organism and the small size of the minimal, indispensable sub-proteome. Pseudogenes in prokaryotes represent families that are in the process of being dispensed with. In particular, the genome sequences of certain pathogenic bacteria (Mycobacterium leprae, Yersinia pestis and Rickettsia prowazekii) show how an organism can undergo reductive evolution on a large scale (i.e. the dying out of families) as a result of niche change. There appears to be less pressure to delete pseudogenes in eukaryotes. These can be divided into two varieties, duplicated and processed, where the latter involves reverse transcription from an mRNA intermediate. We discuss these collectively in yeast, worm, fly, and human. The fly has few pseudogenes apparently because of its high rate of genomic DNA deletion. In the other three organisms, the distribution of pseudogenes on the chromosome and amongst different families is highly non-uniform. Pseudogenes tend not to occur in the middle of chromosome arms, and tend to be associated with lineage-specific (as opposed to highly conserved) families that have environmental-response functions. This may be because, rather than being dead, they may form a reservoir of diverse "extra parts" that can be resurrected to help an organism adapt to its surroundings. In yeast, there may be a novel mechanism involving the [PSI+] prion that potentially enables this resurrection. In worm, the pseudogenes tend to arise out of families (e.g. chemoreceptors) that are greatly expanded in it compared to the fly. The human genome stands out in having many processed pseudogenes. These have a character very different from those of the duplicated variety, to a large extent just representing random insertions. Thus, their occurrence tends to be roughly in proportion to the amount of mRNA for a particular protein and to reflect the extent of the intergenic sequences. Further information about pseudogenes is available at http://genecensus.org/pseudogene

Collapse