1
|
Škrabišová M, Dietz N, Zeng S, Chan YO, Wang J, Liu Y, Biová J, Joshi T, Bilyeu KD. A novel Synthetic phenotype association study approach reveals the landscape of association for genomic variants and phenotypes. J Adv Res 2022; 42:117-133. [PMID: 36513408 PMCID: PMC9788956 DOI: 10.1016/j.jare.2022.04.004] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2021] [Revised: 02/14/2022] [Accepted: 04/08/2022] [Indexed: 12/27/2022] Open
Abstract
INTRODUCTION Genome-Wide Association Studies (GWAS) identify tagging variants in the genome that are statistically associated with the phenotype because of their linkage disequilibrium (LD) relationship with the causative mutation (CM). When both low-density genotyped accession panels with phenotypes and resequenced data accession panels are available, tagging variants can assist with post-GWAS challenges in CM discovery. OBJECTIVES Our objective was to identify additional GWAS evaluation criteria to assess correspondence between genomic variants and phenotypes, as well as enable deeper analysis of the localized landscape of association. METHODS We used genomic variant positions as Synthetic phenotypes in GWAS that we named "Synthetic phenotype association study" (SPAS). The extreme case of SPAS is what we call an "Inverse GWAS" where we used CM positions of cloned soybean genes. We developed and validated the Accuracy concept as a measure of the correspondence between variant positions and phenotypes. RESULTS The SPAS approach demonstrated that the genotype status of an associated variant used as a Synthetic phenotype enabled us to explore the relationships between tagging variants and CMs, and further, that utilizing CMs as Synthetic phenotypes in Inverse GWAS illuminated the landscape of association. We implemented the Accuracy calculation for a curated accession panel to an online Accuracy calculation tool (AccuTool) as a resource for gene identification in soybean. We demonstrated our concepts on three examples of soybean cloned genes. As a result of our findings, we devised an enhanced "GWAS to Genes" analysis (Synthetic phenotype to CM strategy, SP2CM). Using SP2CM, we identified a CM for a novel gene. CONCLUSION The SP2CM strategy utilizing Synthetic phenotypes and the Accuracy calculation of correspondence provides crucial information to assist researchers in CM discovery. The impact of this work is a more effective evaluation of landscapes of GWAS associations.
Collapse
Affiliation(s)
- Mária Škrabišová
- Department of Biochemistry, Faculty of Science, Palacky University Olomouc, Olomouc 78371, Czech Republic
| | - Nicholas Dietz
- Division of Plant Sciences, University of Missouri, Columbia, MO 65201, USA
| | - Shuai Zeng
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO 65212, USA,Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO 65212, USA
| | - Yen On Chan
- Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO 65212, USA,MU Data Science and Informatics Institute, University of Missouri, Columbia, MO 65212, USA
| | - Juexin Wang
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO 65212, USA,Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO 65212, USA
| | - Yang Liu
- Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO 65212, USA,MU Data Science and Informatics Institute, University of Missouri, Columbia, MO 65212, USA
| | - Jana Biová
- Department of Biochemistry, Faculty of Science, Palacky University Olomouc, Olomouc 78371, Czech Republic
| | - Trupti Joshi
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO 65212, USA,Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO 65212, USA,MU Data Science and Informatics Institute, University of Missouri, Columbia, MO 65212, USA,Department of Health Management and Informatics, School of Medicine, University of Missouri, Columbia, MO 65212, USA,Corresponding authors at: Department of Health Management and Informatics, School of Medicine, 1201 E Rollins St, 271B Life Science Center, Columbia, MO 65201, USA (T. Joshi). Plant Genetics Research Unit, United States Department of Agriculture-Agricultural Research Service, 110 Waters Hall, University of Missouri, Columbia, MO 65211, USA (K.D. Bilyeu).
| | - Kristin D. Bilyeu
- Plant Genetics Research Unit, United States Department of Agriculture-Agricultural Research Service, University of Missouri, Columbia, MO 65211, USA,Corresponding authors at: Department of Health Management and Informatics, School of Medicine, 1201 E Rollins St, 271B Life Science Center, Columbia, MO 65201, USA (T. Joshi). Plant Genetics Research Unit, United States Department of Agriculture-Agricultural Research Service, 110 Waters Hall, University of Missouri, Columbia, MO 65211, USA (K.D. Bilyeu).
| |
Collapse
|
2
|
Zheng Z, Li Y, Li M, Li G, Du X, Hongyin H, Yin M, Lu Z, Zhang X, Shrestha N, Liu J, Yang Y. Whole-Genome Diversification Analysis of the Hornbeam Species Reveals Speciation and Adaptation Among Closely Related Species. FRONTIERS IN PLANT SCIENCE 2021; 12:581704. [PMID: 33643339 PMCID: PMC7902934 DOI: 10.3389/fpls.2021.581704] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/09/2020] [Accepted: 01/04/2021] [Indexed: 06/12/2023]
Abstract
Speciation is the key evolutionary process for generating biological diversity and has a central place in evolutionary and ecological research. How species diverge and adapt to different habitats is one of the most exciting areas in speciation studies. Here, we sequenced 55 individuals from three closely related species in the genus Carpinus: Carpinus tibetana, Carpinus monbeigiana, and Carpinus mollicoma to understand the strength and direction of gene flow and selection during the speciation process. We found low genetic diversity in C. tibetana, which reflects its extremely small effective population size. The speciation analysis between C. monbeigiana and C. mollicoma revealed that both species diverged ∼1.2 Mya with bidirectional gene flow. A total of 291 highly diverged genes, 223 copy number variants genes, and 269 positive selected genes were recovered from the two species. Genes associated with the diverged and positively selected regions were mainly involved in thermoregulation, plant development, and response to stress, which included adaptations to their habitats. We also found a great population decline and a low genetic divergence of C. tibetana, which suggests that this species is extremely vulnerable. We believe that the current diversification and adaption study and the important genomic resource sequenced herein will facilitate the speciation studies and serve as an important methodological reference for future research.
Collapse
Affiliation(s)
- Zeyu Zheng
- State Key Laboratory of Grassland Agro-Ecosystem, Institute of Innovation Ecology and School of Life Sciences, Lanzhou University, Lanzhou, China
| | - Ying Li
- State Key Laboratory of Grassland Agro-Ecosystem, Institute of Innovation Ecology and School of Life Sciences, Lanzhou University, Lanzhou, China
| | - Minjie Li
- State Key Laboratory of Grassland Agro-Ecosystem, Institute of Innovation Ecology and School of Life Sciences, Lanzhou University, Lanzhou, China
| | - Guiting Li
- State Key Laboratory of Grassland Agro-Ecosystem, Institute of Innovation Ecology and School of Life Sciences, Lanzhou University, Lanzhou, China
| | - Xin Du
- State Key Laboratory of Grassland Agro-Ecosystem, Institute of Innovation Ecology and School of Life Sciences, Lanzhou University, Lanzhou, China
| | - Hu Hongyin
- State Key Laboratory of Grassland Agro-Ecosystem, Institute of Innovation Ecology and School of Life Sciences, Lanzhou University, Lanzhou, China
| | - Mou Yin
- State Key Laboratory of Grassland Agro-Ecosystem, Institute of Innovation Ecology and School of Life Sciences, Lanzhou University, Lanzhou, China
| | - Zhiqiang Lu
- CAS Key Laboratory of Tropical Forest Ecology, Xishuangbanna Tropical Botanical Garden, Chinese Academy of Sciences, Kunming, China
| | - Xu Zhang
- State Key Laboratory of Grassland Agro-Ecosystem, Institute of Innovation Ecology and School of Life Sciences, Lanzhou University, Lanzhou, China
| | - Nawal Shrestha
- State Key Laboratory of Grassland Agro-Ecosystem, Institute of Innovation Ecology and School of Life Sciences, Lanzhou University, Lanzhou, China
| | - Jianquan Liu
- State Key Laboratory of Grassland Agro-Ecosystem, Institute of Innovation Ecology and School of Life Sciences, Lanzhou University, Lanzhou, China
| | - Yongzhi Yang
- State Key Laboratory of Grassland Agro-Ecosystem, Institute of Innovation Ecology and School of Life Sciences, Lanzhou University, Lanzhou, China
| |
Collapse
|
3
|
Kajiya-Kanegae H, Nagasaki H, Kaga A, Hirano K, Ogiso-Tanaka E, Matsuoka M, Ishimori M, Ishimoto M, Hashiguchi M, Tanaka H, Akashi R, Isobe S, Iwata H. Whole-genome sequence diversity and association analysis of 198 soybean accessions in mini-core collections. DNA Res 2021; 28:dsaa032. [PMID: 33492369 PMCID: PMC7934572 DOI: 10.1093/dnares/dsaa032] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2020] [Accepted: 01/04/2021] [Indexed: 12/11/2022] Open
Abstract
We performed whole-genome Illumina resequencing of 198 accessions to examine the genetic diversity and facilitate the use of soybean genetic resources and identified 10 million single nucleotide polymorphisms and 2.8 million small indels. Furthermore, PacBio resequencing of 10 accessions was performed, and a total of 2,033 structure variants were identified. Genetic diversity and structure analysis congregated the 198 accessions into three subgroups (Primitive, World, and Japan) and showed the possibility of a long and relatively isolated history of cultivated soybean in Japan. Additionally, the skewed regional distribution of variants in the genome, such as higher structural variations on the R gene clusters in the Japan group, suggested the possibility of selective sweeps during domestication or breeding. A genome-wide association study identified both known and novel causal variants on the genes controlling the flowering period. Novel candidate causal variants were also found on genes related to the seed coat colour by aligning together with Illumina and PacBio reads. The genomic sequences and variants obtained in this study have immense potential to provide information for soybean breeding and genetic studies that may uncover novel alleles or genes involved in agronomically important traits.
Collapse
Affiliation(s)
- Hiromi Kajiya-Kanegae
- Department of Agricultural and Environmental Biology, Graduate School of Agricultural and Life Sciences, The University of Tokyo, Tokyo 113-8657, Japan
| | - Hideki Nagasaki
- Kazusa DNA Research Institute, Kisarazu, Chiba 292-0818, Japan
| | - Akito Kaga
- Institute of Crop Science, National Agriculture and Food Research Organization (NARO), Tsukuba, Ibaraki 305-8518, Japan
| | - Ko Hirano
- Bioscience and Biotechnology Center, Nagoya University, Nagoya, Aichi 464-8601, Japan
| | - Eri Ogiso-Tanaka
- Institute of Crop Science, National Agriculture and Food Research Organization (NARO), Tsukuba, Ibaraki 305-8518, Japan
| | - Makoto Matsuoka
- Bioscience and Biotechnology Center, Nagoya University, Nagoya, Aichi 464-8601, Japan
| | - Motoyuki Ishimori
- Department of Agricultural and Environmental Biology, Graduate School of Agricultural and Life Sciences, The University of Tokyo, Tokyo 113-8657, Japan
| | - Masao Ishimoto
- Institute of Crop Science, National Agriculture and Food Research Organization (NARO), Tsukuba, Ibaraki 305-8518, Japan
| | | | - Hidenori Tanaka
- Faculty of Agriculture, University of Miyazaki, Miyazaki 889-2192, Japan
| | - Ryo Akashi
- Faculty of Agriculture, University of Miyazaki, Miyazaki 889-2192, Japan
| | - Sachiko Isobe
- Kazusa DNA Research Institute, Kisarazu, Chiba 292-0818, Japan
| | - Hiroyoshi Iwata
- Department of Agricultural and Environmental Biology, Graduate School of Agricultural and Life Sciences, The University of Tokyo, Tokyo 113-8657, Japan
| |
Collapse
|
4
|
Kim JY, Jeong S, Kim KH, Lim WJ, Lee HY, Jeong N, Moon JK, Kim N. Dissection of soybean populations according to selection signatures based on whole-genome sequences. Gigascience 2019; 8:giz151. [PMID: 31869408 PMCID: PMC6927394 DOI: 10.1093/gigascience/giz151] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2019] [Revised: 08/21/2019] [Accepted: 12/05/2019] [Indexed: 12/18/2022] Open
Abstract
BACKGROUND Domestication and improvement processes, accompanied by selections and adaptations, have generated genome-wide divergence and stratification in soybean populations. Simultaneously, soybean populations, which comprise diverse subpopulations, have developed their own adaptive characteristics enhancing fitness, resistance, agronomic traits, and morphological features. The genetic traits underlying these characteristics play a fundamental role in improving other soybean populations. RESULTS This study focused on identifying the selection signatures and adaptive characteristics in soybean populations. A core set of 245 accessions (112 wild-type, 79 landrace, and 54 improvement soybeans) selected from 4,234 soybean accessions was re-sequenced. Their genomic architectures were examined according to the domestication and improvement, and accessions were then classified into 3 wild-type, 2 landrace, and 2 improvement subgroups based on various population analyses. Selection and gene set enrichment analyses revealed that the landrace subgroups have selection signals for soybean-cyst nematode HG type 0 and seed development with germination, and that the improvement subgroups have selection signals for plant development with viability and seed development with embryo development, respectively. The adaptive characteristic for soybean-cyst nematode was partially underpinned by multiple resistance accessions, and the characteristics related to seed development were supported by our phenotypic findings for seed weights. Furthermore, their adaptive characteristics were also confirmed as genome-based evidence, and unique genomic regions that exhibit distinct selection and selective sweep patterns were revealed for 13 candidate genes. CONCLUSIONS Although our findings require further biological validation, they provide valuable information about soybean breeding strategies and present new options for breeders seeking donor lines to improve soybean populations.
Collapse
Affiliation(s)
- Jae-Yoon Kim
- Genome Editing Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Gwahak-ro 125, Yuseong-gu, Daejeon 34141, Republic of Korea
- Department of Bioinformatics, KRIBB School of Bioscience, University of Science and Technology (UST), Gajeong-ro 217, Yuseong-gu, Daejeon 34141, Republic of Korea
| | - Seongmun Jeong
- Genome Editing Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Gwahak-ro 125, Yuseong-gu, Daejeon 34141, Republic of Korea
| | - Kyoung Hyoun Kim
- Genome Editing Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Gwahak-ro 125, Yuseong-gu, Daejeon 34141, Republic of Korea
- Department of Bioinformatics, KRIBB School of Bioscience, University of Science and Technology (UST), Gajeong-ro 217, Yuseong-gu, Daejeon 34141, Republic of Korea
| | - Won-Jun Lim
- Genome Editing Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Gwahak-ro 125, Yuseong-gu, Daejeon 34141, Republic of Korea
- Department of Bioinformatics, KRIBB School of Bioscience, University of Science and Technology (UST), Gajeong-ro 217, Yuseong-gu, Daejeon 34141, Republic of Korea
| | - Ho-Yeon Lee
- Genome Editing Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Gwahak-ro 125, Yuseong-gu, Daejeon 34141, Republic of Korea
- Department of Bioinformatics, KRIBB School of Bioscience, University of Science and Technology (UST), Gajeong-ro 217, Yuseong-gu, Daejeon 34141, Republic of Korea
| | - Namhee Jeong
- National Institute of Crop Science, Rural Development Administration, Nongsaengmyeong-ro 370, Deokjin-gu, Jeon-Ju 54874, Republic of Korea
| | - Jung-Kyung Moon
- National Institute of Crop Science, Rural Development Administration, Nongsaengmyeong-ro 370, Deokjin-gu, Jeon-Ju 54874, Republic of Korea
| | - Namshin Kim
- Genome Editing Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Gwahak-ro 125, Yuseong-gu, Daejeon 34141, Republic of Korea
- Department of Bioinformatics, KRIBB School of Bioscience, University of Science and Technology (UST), Gajeong-ro 217, Yuseong-gu, Daejeon 34141, Republic of Korea
| |
Collapse
|