1
|
Deng CH, Naithani S, Kumari S, Cobo-Simón I, Quezada-Rodríguez EH, Skrabisova M, Gladman N, Correll MJ, Sikiru AB, Afuwape OO, Marrano A, Rebollo I, Zhang W, Jung S. Genotype and phenotype data standardization, utilization and integration in the big data era for agricultural sciences. Database (Oxford) 2023; 2023:baad088. [PMID: 38079567 PMCID: PMC10712715 DOI: 10.1093/database/baad088] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2023] [Revised: 10/17/2023] [Accepted: 11/28/2023] [Indexed: 12/18/2023]
Abstract
Large-scale genotype and phenotype data have been increasingly generated to identify genetic markers, understand gene function and evolution and facilitate genomic selection. These datasets hold immense value for both current and future studies, as they are vital for crop breeding, yield improvement and overall agricultural sustainability. However, integrating these datasets from heterogeneous sources presents significant challenges and hinders their effective utilization. We established the Genotype-Phenotype Working Group in November 2021 as a part of the AgBioData Consortium (https://www.agbiodata.org) to review current data types and resources that support archiving, analysis and visualization of genotype and phenotype data to understand the needs and challenges of the plant genomic research community. For 2021-22, we identified different types of datasets and examined metadata annotations related to experimental design/methods/sample collection, etc. Furthermore, we thoroughly reviewed publicly funded repositories for raw and processed data as well as secondary databases and knowledgebases that enable the integration of heterogeneous data in the context of the genome browser, pathway networks and tissue-specific gene expression. Based on our survey, we recommend a need for (i) additional infrastructural support for archiving many new data types, (ii) development of community standards for data annotation and formatting, (iii) resources for biocuration and (iv) analysis and visualization tools to connect genotype data with phenotype data to enhance knowledge synthesis and to foster translational research. Although this paper only covers the data and resources relevant to the plant research community, we expect that similar issues and needs are shared by researchers working on animals. Database URL: https://www.agbiodata.org.
Collapse
Affiliation(s)
- Cecilia H Deng
- Molecular and Digital Breeding, New Cultivar Innovation, The New Zealand Institute for Plant and Food Research Limited, 120 Mt Albert Road, Auckland 1025, New Zealand
| | - Sushma Naithani
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR 97331, USA
| | - Sunita Kumari
- Cold Spring Harbor Laboratory, 1 Bungtown Rd, Cold Spring Harbor, New York, NY 11724, USA
| | - Irene Cobo-Simón
- Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT, USA
- Institute of Forest Science (ICIFOR-INIA, CSIC), Madrid, Spain
| | - Elsa H Quezada-Rodríguez
- Departamento de Producción Agrícola y Animal, Universidad Autónoma Metropolitana-Xochimilco, Ciudad de México, México
- Centro de Ciencias de la Complejidad, Universidad Nacional Autónoma de México, Ciudad de México, México
| | - Maria Skrabisova
- Department of Biochemistry, Faculty of Science, Palacky University, Olomouc, Czech Republic
| | - Nick Gladman
- Cold Spring Harbor Laboratory, 1 Bungtown Rd, Cold Spring Harbor, New York, NY 11724, USA
- U.S. Department of Agriculture-Agricultural Research Service, NEA Robert W. Holley Center for Agriculture and Health, Cornell University, Ithaca, NY 14853, USA
| | - Melanie J Correll
- Agricultural and Biological Engineering Department, University of Florida, 1741 Museum Rd, Gainesville, FL 32611, USA
| | | | | | - Annarita Marrano
- Phoenix Bioinformatics, 39899 Balentine Drive, Suite 200, Newark, CA 94560, USA
| | | | - Wentao Zhang
- National Research Council Canada, 110 Gymnasium Pl, Saskatoon, Saskatchewan S7N 0W9, Canada
| | - Sook Jung
- Department of Horticulture, Washington State University, 303c Plant Sciences Building, Pullman, WA 99164-6414, USA
| |
Collapse
|
2
|
Li Z, Li C, Zhang R, Duan M, Tian H, Yi H, Xu L, Wang F, Shi Z, Wang X, Wang J, Su A, Wang S, Sun X, Zhao Y, Wang S, Zhang Y, Wang Y, Song W, Zhao J. Genomic analysis of a new heterotic maize group reveals key loci for pedigree breeding. FRONTIERS IN PLANT SCIENCE 2023; 14:1213675. [PMID: 37636101 PMCID: PMC10451083 DOI: 10.3389/fpls.2023.1213675] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/28/2023] [Accepted: 07/21/2023] [Indexed: 08/29/2023]
Abstract
Genome-wide analyses of maize populations have clarified the genetic basis of crop domestication and improvement. However, limited information is available on how breeding improvement reshaped the genome in the process of the formation of heterotic groups. In this study, we identified a new heterotic group (X group) based on an examination of 512 Chinese maize inbred lines. The X group was clearly distinct from the other non-H&L groups, implying that X × HIL is a new heterotic pattern. We selected the core inbred lines for an analysis of yield-related traits. Almost all yield-related traits were better in the X lines than those in the parental lines, indicating that the primary genetic improvement in the X group during breeding was yield-related traits. We generated whole-genome sequences of these lines with an average coverage of 17.35× to explore genome changes further. We analyzed the identity-by-descent (IBD) segments transferred from the two parents to the X lines and identified 29 and 28 IBD conserved regions (ICRs) from the parents PH4CV and PH6WC, respectively, accounting for 28.8% and 12.8% of the genome. We also identified 103, 89, and 131 selective sweeps (SSWs) using methods that involved the π, Tajima's D, and CLR values, respectively. Notably, 96.13% of the ICRs co-localized with SSWs, indicating that SSW signals concentrated in ICRs. We identified 171 annotated genes associated with yield-related traits in maize both in ICRs and SSWs. To identify the genetic factors associated with yield improvement, we conducted QTL mapping for 240 lines from a DH population (PH4CV × PH6WC, which are the parents of X1132X) for ten key yield-related traits and identified a total of 55 QTLs. Furthermore, we detected three QTL clusters both in ICRs and SSWs. Based on the genetic evidence, we finally identified three key genes contributing to yield improvement in breeding the X group. These findings reveal key loci and genes targeted during pedigree breeding and provide new insights for future genomic breeding.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | - Yuandong Wang
- Beijing Key Laboratory of Maize DNA Fingerprinting and Molecular Breeding, Maize Research Institute, Beijing Academy of Agriculture and Forestry Sciences, Beijing, China
| | - Wei Song
- Beijing Key Laboratory of Maize DNA Fingerprinting and Molecular Breeding, Maize Research Institute, Beijing Academy of Agriculture and Forestry Sciences, Beijing, China
| | - Jiuran Zhao
- Beijing Key Laboratory of Maize DNA Fingerprinting and Molecular Breeding, Maize Research Institute, Beijing Academy of Agriculture and Forestry Sciences, Beijing, China
| |
Collapse
|
3
|
Galić V, Mlinarić S, Marelja M, Zdunić Z, Brkić A, Mazur M, Begović L, Šimić D. Contrasting Water Withholding Responses of Young Maize Plants Reveal Link Between Lipid Peroxidation and Osmotic Regulation Corroborated by Genetic Analysis. FRONTIERS IN PLANT SCIENCE 2022; 13:804630. [PMID: 35873985 PMCID: PMC9296821 DOI: 10.3389/fpls.2022.804630] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/29/2021] [Accepted: 05/30/2022] [Indexed: 06/15/2023]
Abstract
Linking biochemistry and genetics of tolerance to osmotic stress is of interest for understanding plant adaptations to unfavorable conditions. The aims of this study were to investigate the variability in responses of panel of elite maize inbred lines to water withholding for stress-related traits through association study and to identify pathways linked to detected associations for better understanding of maize stress responses. Densely genotyped public and expired Plant Variety Protection Certificate (ex-PVP) inbred lines were planted in controlled conditions (16-h/8-h day/night, 25°C, 50% RH) in control (CO) and exposed to 10-day water withholding (WW). Traits analyzed were guaiacol peroxidase activity (GPOD), total protein content (PROT), lipid peroxidation (TBARS), hydrogen peroxide accumulation (H2O2), proline accumulation (proline), and current water content (CWC). Proline accumulation was found to be influenced by H2O2 and TBARS signaling pathways acting as an accumulation-switching mechanism. Most of the associations detected were for proline (29.4%) and TBARS (44.1%). Gene ontology (GO) enrichment analysis showed significant enrichment in regulation of integral membrane parts and peroxisomes along with regulation of transcription and polysaccharide catabolism. Dynamic studies involving inbreds with extreme phenotypes are needed to elucidate the role of this signaling mechanism in regulation of response to water deficit.
Collapse
Affiliation(s)
- Vlatko Galić
- Department of Maize Breeding and Genetics, Agricultural Institute Osijek, Osijek, Croatia
| | - Selma Mlinarić
- Department of Biology, Josip Juraj Strossmayer University of Osijek, Osijek, Croatia
| | - Matea Marelja
- Department of Biology, Josip Juraj Strossmayer University of Osijek, Osijek, Croatia
| | - Zvonimir Zdunić
- Department of Maize Breeding and Genetics, Agricultural Institute Osijek, Osijek, Croatia
- Centre of Excellence for Biodiversity and Molecular Plant Breeding (CroP-BioDiv), Zagreb, Croatia
| | - Andrija Brkić
- Department of Maize Breeding and Genetics, Agricultural Institute Osijek, Osijek, Croatia
| | - Maja Mazur
- Department of Maize Breeding and Genetics, Agricultural Institute Osijek, Osijek, Croatia
| | - Lidija Begović
- Department of Biology, Josip Juraj Strossmayer University of Osijek, Osijek, Croatia
| | - Domagoj Šimić
- Department of Maize Breeding and Genetics, Agricultural Institute Osijek, Osijek, Croatia
- Centre of Excellence for Biodiversity and Molecular Plant Breeding (CroP-BioDiv), Zagreb, Croatia
| |
Collapse
|
4
|
Sheng M, Ma X, Wang J, Xue T, Li Z, Cao Y, Yu X, Zhang X, Wang Y, Xu W, Su Z. KNOX II transcription factor HOS59 functions in regulating rice grain size. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2022; 110:863-880. [PMID: 35167131 DOI: 10.1111/tpj.15709] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/07/2021] [Revised: 01/30/2022] [Accepted: 02/10/2022] [Indexed: 06/14/2023]
Abstract
Plant Knotted1-like homeobox (KNOX) genes encode homeodomain-containing transcription factors. In rice (Oryza sativa L.), little is known about the downstream target genes of KNOX Class II subfamily proteins. Here we generated chromatin immunoprecipitation (ChIP)-sequencing datasets for HOS59, a member of the rice KNOX Class II subfamily, and characterized the genome-wide binding sites of HOS59. We conducted trait ontology (TO) analysis of 9705 identified downstream target genes, and found that multiple TO terms are related to plant structure morphology and stress traits. ChIP-quantitative PCR (qPCR) was conducted to validate some key target genes. Meanwhile, our IP-MS datasets showed that HOS59 was closely associated with BELL family proteins, some grain size regulators (OsSPL13, OsSPL16, OsSPL18, SLG, etc.), and some epigenetic modification factors such as OsAGO4α and OsAGO4β, proteins involved in small interfering RNA-mediated gene silencing. Furthermore, we employed CRISPR/Cas9 editing and transgenic approaches to generate hos59 mutants and overexpression lines, respectively. Compared with wild-type plants, the hos59 mutants have longer grains and increased glume cell length, a loose plant architecture, and drooping leaves, while the overexpression lines showed smaller grain size, erect leaves, and lower plant height. The qRT-PCR results showed that mutation of the HOS59 gene led to upregulation of some grain size-related genes such as OsSPL13, OsSPL18, and PGL2. In summary, our results indicate that HOS59 may be a repressor of the downstream target genes, negatively regulating glume cell length, rice grain size, plant architecture, etc. The identified downstream target genes and possible interaction proteins of HOS59 improve our understanding of the KNOX regulatory networks.
Collapse
Affiliation(s)
- Minghao Sheng
- State Key Laboratory of Plant Physiology and Biochemistry, College of Biological Sciences, China Agricultural University, Beijing, 100193, China
| | - Xuelian Ma
- State Key Laboratory of Plant Physiology and Biochemistry, College of Biological Sciences, China Agricultural University, Beijing, 100193, China
| | - Jiyao Wang
- State Key Laboratory of Plant Genomics and National Center for Plant Gene Research (Beijing), Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, 100101, China
| | - Tianxi Xue
- State Key Laboratory of Plant Physiology and Biochemistry, College of Biological Sciences, China Agricultural University, Beijing, 100193, China
| | - Zhongqiu Li
- State Key Laboratory of Plant Physiology and Biochemistry, College of Biological Sciences, China Agricultural University, Beijing, 100193, China
| | - Yaxin Cao
- State Key Laboratory of Plant Physiology and Biochemistry, College of Biological Sciences, China Agricultural University, Beijing, 100193, China
| | - Xinyue Yu
- State Key Laboratory of Plant Physiology and Biochemistry, College of Biological Sciences, China Agricultural University, Beijing, 100193, China
| | - Xinyi Zhang
- State Key Laboratory of Plant Physiology and Biochemistry, College of Biological Sciences, China Agricultural University, Beijing, 100193, China
| | - Yonghong Wang
- State Key Laboratory of Plant Genomics and National Center for Plant Gene Research (Beijing), Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, 100101, China
| | - Wenying Xu
- State Key Laboratory of Plant Physiology and Biochemistry, College of Biological Sciences, China Agricultural University, Beijing, 100193, China
| | - Zhen Su
- State Key Laboratory of Plant Physiology and Biochemistry, College of Biological Sciences, China Agricultural University, Beijing, 100193, China
| |
Collapse
|
5
|
Ma X, Yan H, Yang J, Liu Y, Li Z, Sheng M, Cao Y, Yu X, Yi X, Xu W, Su Z. PlantGSAD: a comprehensive gene set annotation database for plant species. Nucleic Acids Res 2021; 50:D1456-D1467. [PMID: 34534340 PMCID: PMC8728169 DOI: 10.1093/nar/gkab794] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2021] [Revised: 08/26/2021] [Accepted: 09/01/2021] [Indexed: 12/17/2022] Open
Abstract
With the accumulation of massive data sets from high-throughput experiments and the rapid emergence of new types of omics data, gene sets have become more diverse and essential for the refinement of gene annotation at multidimensional levels. Accordingly, we collected and defined 236 007 gene sets across different categories for 44 plant species in the Plant Gene Set Annotation Database (PlantGSAD). These gene sets were divided into nine main categories covering many functional subcategories, such as trait ontology, co-expression modules, chromatin states, and liquid-liquid phase separation. The annotations from the collected gene sets covered all of the genes in the Brassicaceae species Arabidopsis and Poaceae species Oryza sativa. Several GSEA tools are implemented in PlantGSAD to improve the efficiency of the analysis, including custom SEA for a flexible strategy based on customized annotations, SEACOMPARE for the cross-comparison of SEA results, and integrated visualization features for ontological analysis that intuitively reflects their parent-child relationships. In summary, PlantGSAD provides numerous gene sets for multiple plant species and highly efficient analysis tools. We believe that PlantGSAD will become a multifunctional analysis platform that can be used to predict and elucidate the functions and mechanisms of genes of interest. PlantGSAD is publicly available at http://systemsbiology.cau.edu.cn/PlantGSEAv2/.
Collapse
Affiliation(s)
- Xuelian Ma
- State Key Laboratory of Plant Physiology and Biochemistry, College of Biological Sciences, China Agricultural University, Beijing 100193, China
| | - Hengyu Yan
- State Key Laboratory of Plant Physiology and Biochemistry, College of Biological Sciences, China Agricultural University, Beijing 100193, China
| | - Jiaotong Yang
- State Key Laboratory of Plant Physiology and Biochemistry, College of Biological Sciences, China Agricultural University, Beijing 100193, China
| | - Yue Liu
- State Key Laboratory of Plant Physiology and Biochemistry, College of Biological Sciences, China Agricultural University, Beijing 100193, China
| | - Zhongqiu Li
- State Key Laboratory of Plant Physiology and Biochemistry, College of Biological Sciences, China Agricultural University, Beijing 100193, China
| | - Minghao Sheng
- State Key Laboratory of Plant Physiology and Biochemistry, College of Biological Sciences, China Agricultural University, Beijing 100193, China
| | - Yaxin Cao
- State Key Laboratory of Plant Physiology and Biochemistry, College of Biological Sciences, China Agricultural University, Beijing 100193, China
| | - Xinyue Yu
- State Key Laboratory of Plant Physiology and Biochemistry, College of Biological Sciences, China Agricultural University, Beijing 100193, China
| | - Xin Yi
- State Key Laboratory of Plant Physiology and Biochemistry, College of Biological Sciences, China Agricultural University, Beijing 100193, China
| | - Wenying Xu
- State Key Laboratory of Plant Physiology and Biochemistry, College of Biological Sciences, China Agricultural University, Beijing 100193, China
| | - Zhen Su
- State Key Laboratory of Plant Physiology and Biochemistry, College of Biological Sciences, China Agricultural University, Beijing 100193, China
| |
Collapse
|
6
|
Buzdin AV, Patrushev MV, Sverdlov ED. Will Plant Genome Editing Play a Decisive Role in "Quantum-Leap" Improvements in Crop Yield to Feed an Increasing Global Human Population? PLANTS (BASEL, SWITZERLAND) 2021; 10:1667. [PMID: 34451712 PMCID: PMC8398637 DOI: 10.3390/plants10081667] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/12/2021] [Revised: 08/04/2021] [Accepted: 08/07/2021] [Indexed: 02/08/2023]
Abstract
Growing scientific evidence demonstrates unprecedented planetary-scale human impacts on the Earth's system with a predicted threat to the existence of the terrestrial biosphere due to population increase, resource depletion, and pollution. Food systems account for 21-34% of global carbon dioxide (CO2) emissions. Over the past half-century, water and land-use changes have significantly impacted ecosystems, biogeochemical cycles, biodiversity, and climate. At the same time, food production is falling behind consumption, and global grain reserves are shrinking. Some predictions suggest that crop yields must approximately double by 2050 to adequately feed an increasing global population without a large expansion of crop area. To achieve this, "quantum-leap" improvements in crop cultivar productivity are needed within very narrow planetary boundaries of permissible environmental perturbations. Strategies for such a "quantum-leap" include mutation breeding and genetic engineering of known crop genome sequences. Synthetic biology makes it possible to synthesize DNA fragments of any desired sequence, and modern bioinformatics tools may hopefully provide an efficient way to identify targets for directed modification of selected genes responsible for known important agronomic traits. CRISPR/Cas9 is a new technology for incorporating seamless directed modifications into genomes; it is being widely investigated for its potential to enhance the efficiency of crop production. We consider the optimism associated with the new genetic technologies in terms of the complexity of most agronomic traits, especially crop yield potential (Yp) limits. We also discuss the possible directions of overcoming these limits and alternative ways of providing humanity with food without transgressing planetary boundaries. In conclusion, we support the long-debated idea that new technologies are unlikely to provide a rapidly growing population with significantly increased crop yield. Instead, we suggest that delicately balanced humane measures to limit its growth and the amount of food consumed per capita are highly desirable for the foreseeable future.
Collapse
Affiliation(s)
- Anton V Buzdin
- The Laboratory of Clinical and Genomic Bioinformatics, I.M. Sechenov First Moscow State Medical University, 119991 Moscow, Russia
- Moscow Institute of Physics and Technology, Dolgoprudny, Moscow Region, 141701 Moscow, Russia
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, 117997 Moscow, Russia
| | - Maxim V Patrushev
- Kurchatov Center for Genome Research, National Research Center Kurchatov Institute, 123182 Moscow, Russia
| | - Eugene D Sverdlov
- Kurchatov Center for Genome Research, National Research Center Kurchatov Institute, 123182 Moscow, Russia
- Institute of Molecular Genetics, National Research Center Kurchatov Institute, 123182 Moscow, Russia
| |
Collapse
|