1
|
Haidar S, Hooker J, Lackey S, Elian M, Puchacz N, Szczyglowski K, Marsolais F, Golshani A, Cober ER, Samanfar B. Harnessing Multi-Omics Strategies and Bioinformatics Innovations for Advancing Soybean Improvement: A Comprehensive Review. PLANTS (BASEL, SWITZERLAND) 2024; 13:2714. [PMID: 39409584 PMCID: PMC11478702 DOI: 10.3390/plants13192714] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/01/2024] [Revised: 09/26/2024] [Accepted: 09/26/2024] [Indexed: 10/20/2024]
Abstract
Soybean improvement has entered a new era with the advent of multi-omics strategies and bioinformatics innovations, enabling more precise and efficient breeding practices. This comprehensive review examines the application of multi-omics approaches in soybean-encompassing genomics, transcriptomics, proteomics, metabolomics, epigenomics, and phenomics. We first explore pre-breeding and genomic selection as tools that have laid the groundwork for advanced trait improvement. Subsequently, we dig into the specific contributions of each -omics field, highlighting how bioinformatics tools and resources have facilitated the generation and integration of multifaceted data. The review emphasizes the power of integrating multi-omics datasets to elucidate complex traits and drive the development of superior soybean cultivars. Emerging trends, including novel computational techniques and high-throughput technologies, are discussed in the context of their potential to revolutionize soybean breeding. Finally, we address the challenges associated with multi-omics integration and propose future directions to overcome these hurdles, aiming to accelerate the pace of soybean improvement. This review serves as a crucial resource for researchers and breeders seeking to leverage multi-omics strategies for enhanced soybean productivity and resilience.
Collapse
Affiliation(s)
- Siwar Haidar
- Agriculture and Agri-Food Canada, Ottawa Research and Development Centre, Ottawa, ON K1A 0C6, Canada; (S.H.)
- Department of Biology, Ottawa Institute of Systems Biology, Carleton University, Ottawa, ON K1S 5B6, Canada
| | - Julia Hooker
- Agriculture and Agri-Food Canada, Ottawa Research and Development Centre, Ottawa, ON K1A 0C6, Canada; (S.H.)
- Department of Biology, Ottawa Institute of Systems Biology, Carleton University, Ottawa, ON K1S 5B6, Canada
| | - Simon Lackey
- Agriculture and Agri-Food Canada, Ottawa Research and Development Centre, Ottawa, ON K1A 0C6, Canada; (S.H.)
- Department of Biology, Ottawa Institute of Systems Biology, Carleton University, Ottawa, ON K1S 5B6, Canada
| | - Mohamad Elian
- Agriculture and Agri-Food Canada, Ottawa Research and Development Centre, Ottawa, ON K1A 0C6, Canada; (S.H.)
- Department of Biology, Ottawa Institute of Systems Biology, Carleton University, Ottawa, ON K1S 5B6, Canada
| | - Nathalie Puchacz
- Agriculture and Agri-Food Canada, Ottawa Research and Development Centre, Ottawa, ON K1A 0C6, Canada; (S.H.)
| | - Krzysztof Szczyglowski
- Agriculture and Agri-Food Canada, London Research and Development Centre, London, ON N5V 4T3, Canada
| | - Frédéric Marsolais
- Agriculture and Agri-Food Canada, London Research and Development Centre, London, ON N5V 4T3, Canada
| | - Ashkan Golshani
- Department of Biology, Ottawa Institute of Systems Biology, Carleton University, Ottawa, ON K1S 5B6, Canada
| | - Elroy R. Cober
- Agriculture and Agri-Food Canada, Ottawa Research and Development Centre, Ottawa, ON K1A 0C6, Canada; (S.H.)
| | - Bahram Samanfar
- Agriculture and Agri-Food Canada, Ottawa Research and Development Centre, Ottawa, ON K1A 0C6, Canada; (S.H.)
- Department of Biology, Ottawa Institute of Systems Biology, Carleton University, Ottawa, ON K1S 5B6, Canada
| |
Collapse
|
2
|
Abstract
Applying computational statistics or machine learning methods to data is a key component of many scientific studies, in any field, but alone might not be sufficient to generate robust and reliable outcomes and results. Before applying any discovery method, preprocessing steps are necessary to prepare the data to the computational analysis. In this framework, data cleaning and feature engineering are key pillars of any scientific study involving data analysis and that should be adequately designed and performed since the first phases of the project. We call "feature" a variable describing a particular trait of a person or an observation, recorded usually as a column in a dataset. Even if pivotal, these data cleaning and feature engineering steps sometimes are done poorly or inefficiently, especially by beginners and unexperienced researchers. For this reason, we propose here our quick tips for data cleaning and feature engineering on how to carry out these important preprocessing steps correctly avoiding common mistakes and pitfalls. Although we designed these guidelines with bioinformatics and health informatics scenarios in mind, we believe they can more in general be applied to any scientific area. We therefore target these guidelines to any researcher or practitioners wanting to perform data cleaning or feature engineering. We believe our simple recommendations can help researchers and scholars perform better computational analyses that can lead, in turn, to more solid outcomes and more reliable discoveries.
Collapse
Affiliation(s)
- Davide Chicco
- Institute of Health Policy Management and Evaluation, University of Toronto, Toronto, Ontario, Canada
| | - Luca Oneto
- Dipartimento di Informatica Bioingegneria Robotica e Ingegneria dei Sistemi, Università di Genova, Genoa, Italy
- ZenaByte S.r.l., Genoa, Italy
| | - Erica Tavazzi
- Dipartimento di Ingegneria dell’Informazione, Università di Padova, Padua, Italy
| |
Collapse
|
3
|
Cao P, Zhao Y, Wu F, Xin D, Liu C, Wu X, Lv J, Chen Q, Qi Z. Multi-Omics Techniques for Soybean Molecular Breeding. Int J Mol Sci 2022; 23:4994. [PMID: 35563386 PMCID: PMC9099442 DOI: 10.3390/ijms23094994] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2022] [Revised: 04/22/2022] [Accepted: 04/28/2022] [Indexed: 02/04/2023] Open
Abstract
Soybean is a major crop that provides essential protein and oil for food and feed. Since its origin in China over 5000 years ago, soybean has spread throughout the world, becoming the second most important vegetable oil crop and the primary source of plant protein for global consumption. From early domestication and artificial selection through hybridization and ultimately molecular breeding, the history of soybean breeding parallels major advances in plant science throughout the centuries. Now, rapid progress in plant omics is ushering in a new era of precision design breeding, exemplified by the engineering of elite soybean varieties with specific oil compositions to meet various end-use targets. The assembly of soybean reference genomes, made possible by the development of genome sequencing technology and bioinformatics over the past 20 years, was a great step forward in soybean research. It facilitated advances in soybean transcriptomics, proteomics, metabolomics, and phenomics, all of which paved the way for an integrated approach to molecular breeding in soybean. In this review, we summarize the latest progress in omics research, highlight novel findings made possible by omics techniques, note current drawbacks and areas for further research, and suggest that an efficient multi-omics approach may accelerate soybean breeding in the future. This review will be of interest not only to soybean breeders but also to researchers interested in the use of cutting-edge omics technologies for crop research and improvement.
Collapse
Affiliation(s)
- Pan Cao
- College of Agriculture, Northeast Agricultural University, Harbin 150030, China; (P.C.); (Y.Z.); (F.W.); (D.X.); (C.L.)
| | - Ying Zhao
- College of Agriculture, Northeast Agricultural University, Harbin 150030, China; (P.C.); (Y.Z.); (F.W.); (D.X.); (C.L.)
| | - Fengjiao Wu
- College of Agriculture, Northeast Agricultural University, Harbin 150030, China; (P.C.); (Y.Z.); (F.W.); (D.X.); (C.L.)
| | - Dawei Xin
- College of Agriculture, Northeast Agricultural University, Harbin 150030, China; (P.C.); (Y.Z.); (F.W.); (D.X.); (C.L.)
| | - Chunyan Liu
- College of Agriculture, Northeast Agricultural University, Harbin 150030, China; (P.C.); (Y.Z.); (F.W.); (D.X.); (C.L.)
| | - Xiaoxia Wu
- College of Agriculture, Northeast Agricultural University, Harbin 150030, China; (P.C.); (Y.Z.); (F.W.); (D.X.); (C.L.)
| | - Jian Lv
- Department of Innovation, Syngenta Biotechnology China, Beijing 102206, China
| | - Qingshan Chen
- College of Agriculture, Northeast Agricultural University, Harbin 150030, China; (P.C.); (Y.Z.); (F.W.); (D.X.); (C.L.)
| | - Zhaoming Qi
- College of Agriculture, Northeast Agricultural University, Harbin 150030, China; (P.C.); (Y.Z.); (F.W.); (D.X.); (C.L.)
| |
Collapse
|