1
|
Yang X, Zhu D, Yang C, Zhou C. H-ACO with Consecutive Bases Pairing Constraint for Designing DNA Sequences. Interdiscip Sci 2024:10.1007/s12539-024-00614-1. [PMID: 38683280 DOI: 10.1007/s12539-024-00614-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Revised: 01/24/2024] [Accepted: 01/27/2024] [Indexed: 05/01/2024]
Abstract
DNA computing is a novel computing method that does not rely on traditional computers. The design of DNA sequences is a crucial step in DNA computing, and the quality of the sequence design directly affects the results of DNA computing. In this paper, a new constraint called the consecutive base pairing constraint is proposed to limit specific base pairings in DNA sequence design. Additionally, to improve the efficiency and capability of DNA sequence design, the Hierarchy-ant colony (H-ACO) algorithm is introduced, which combines the features of multiple algorithms and optimizes discrete numerical calculations. Experimental results show that the H-ACO algorithm performs well in DNA sequence design. Finally, this paper compares a series of constraint values and NUPACK simulation data with previous design results, and the DNA sequence set designed in this paper has more advantages.
Collapse
Affiliation(s)
- Xuwei Yang
- College of Computer Science and Technology, Zhejiang Normal University, Jinhua, 321004, China
| | - Donglin Zhu
- College of Computer Science and Technology, Zhejiang Normal University, Jinhua, 321004, China
| | - Can Yang
- Jinhua Polytechnic, Jinhua, 321000, China
| | - Changjun Zhou
- College of Computer Science and Technology, Zhejiang Normal University, Jinhua, 321004, China.
| |
Collapse
|
2
|
Du H, Zhou S, Yan W, Wang S. Study on DNA Storage Encoding Based IAOA under Innovation Constraints. Curr Issues Mol Biol 2023; 45:3573-3590. [PMID: 37185757 PMCID: PMC10136724 DOI: 10.3390/cimb45040233] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2023] [Revised: 04/09/2023] [Accepted: 04/13/2023] [Indexed: 05/17/2023] Open
Abstract
With the informationization of social processes, the amount of related data has greatly increased, making traditional storage media unable to meet the current requirements for data storage. Due to its advantages of a high storage capacity and persistence, deoxyribonucleic acid (DNA) has been considered the most prospective storage media to solve the data storage problem. Synthesis is an important process for DNA storage, and low-quality DNA coding can increase errors during sequencing, which can affect the storage efficiency. To reduce errors caused by the poor stability of DNA sequences during storage, this paper proposes a method that uses the double-matching and error-pairing constraints to improve the quality of the DNA coding set. First, the double-matching and error-pairing constraints are defined to solve problems of sequences with self-complementary reactions in the solution that are prone to mismatch at the 3' end. In addition, two strategies are introduced in the arithmetic optimization algorithm, including a random perturbation of the elementary function and a double adaptive weighting strategy. An improved arithmetic optimization algorithm (IAOA) is proposed to construct DNA coding sets. The experimental results of the IAOA on 13 benchmark functions show a significant improvement in its exploration and development capabilities over the existing algorithms. Moreover, the IAOA is used in the DNA encoding design under both traditional and new constraints. The DNA coding sets are tested to estimate their quality regarding the number of hairpins and melting temperature. The DNA storage coding sets constructed in this study are improved by 77.7% at the lower boundary compared to existing algorithms. The DNA sequences in the storage sets show a reduction of 9.7-84.1% in the melting temperature variance, and the hairpin structure ratio is reduced by 2.1-80%. The results indicate that the stability of the DNA coding sets is improved under the two proposed constraints compared to traditional constraints.
Collapse
Affiliation(s)
- Haigui Du
- Key Laboratory of Advanced Design and Intelligent Computing, Ministry of Education, School of Software Engineering, Dalian University, Dalian 116622, China
| | - Shihua Zhou
- Key Laboratory of Advanced Design and Intelligent Computing, Ministry of Education, School of Software Engineering, Dalian University, Dalian 116622, China
| | - WeiQi Yan
- School of Engineering, Computer and Mathematical Sciences, Auckland University of Technology, Auckland 1010, New Zealand
| | - Sijie Wang
- Key Laboratory of Advanced Design and Intelligent Computing, Ministry of Education, School of Software Engineering, Dalian University, Dalian 116622, China
| |
Collapse
|
3
|
Zhang J. Levy Equilibrium Optimizer algorithm for the DNA storage code set. PLoS One 2022; 17:e0277139. [PMID: 36395269 PMCID: PMC9671426 DOI: 10.1371/journal.pone.0277139] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2022] [Accepted: 10/10/2022] [Indexed: 11/18/2022] Open
Abstract
The generation of massive data puts forward higher requirements for storage technology. DNA storage is a new storage technology which uses biological macromolecule DNA as information carrier. Compared with traditional silicon-based storage, DNA storage has the advantages of large capacity, high density, low energy consumption and high durability. DNA coding is to store data information with as few base sequences as possible without errors. Coding is a key technology in DNA storage, and its results directly affect the performance of storage and the integrity of data reading and writing. In this paper, a Levy Equilibrium Optimizer (LEO) algorithm is proposed to construct a DNA storage code set that satisfies combinatorial constraints. The performance of the proposed algorithm is tested on 13 benchmark functions, and 4 new global optima are obtained. Under the same constraints, the DNA storage code set is constructed. Compared with previous work, the lower bound of DNA storage code set is improved by 4-13%.
Collapse
Affiliation(s)
- Jianxia Zhang
- School of Intelligent Engineering, Henan Institute of Technology, Xinxiang, China
- * E-mail:
| |
Collapse
|
4
|
Bekdaş G, Cakiroglu C, Kim S, Geem ZW. Optimization and Predictive Modeling of Reinforced Concrete Circular Columns. MATERIALS (BASEL, SWITZERLAND) 2022; 15:6624. [PMID: 36233966 PMCID: PMC9573187 DOI: 10.3390/ma15196624] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/07/2022] [Revised: 09/10/2022] [Accepted: 09/19/2022] [Indexed: 06/16/2023]
Abstract
Metaheuristic optimization techniques are widely applied in the optimal design of structural members. This paper presents the application of the harmony search algorithm to the optimal dimensioning of reinforced concrete circular columns. For the objective of optimization, the total cost of steel and concrete associated with the construction process were selected. The selected variables of optimization include the diameter of the column, the total cross-sectional area of steel, the unit costs of steel and concrete used in the construction, the total length of the column, and applied axial force and the bending moment acting on the column. By using the minimum allowable dimensions as the constraints of optimization, 3125 different data samples were generated where each data sample is an optimal design configuration. Based on the generated dataset, the SHapley Additive exPlanations (SHAP) algorithm was applied in combination with ensemble learning predictive models to determine the impact of each design variable on the model predictions. The relationships between the design variables and the objective function were visualized using the design of experiments methodology. Applying state-of-the-art statistical accuracy measures such as the coefficient of determination, the predictive models were demonstrated to be highly accurate. The current study demonstrates a novel technique for generating large datasets for the development of data-driven machine learning models. This new methodology can enhance the availability of large datasets, thereby facilitating the application of high-performance machine learning predictive models for optimal structural design.
Collapse
Affiliation(s)
- Gebrail Bekdaş
- Department of Civil Engineering, Istanbul University-Cerrahpasa, 34320 Istanbul, Turkey
| | - Celal Cakiroglu
- Department of Civil Engineering, Turkish-German University, 34820 Istanbul, Turkey
| | - Sanghun Kim
- Department of Civil and Environmental Engineering, Temple University, Philadelphia, PA 19122, USA
| | - Zong Woo Geem
- Department of Smart City & Energy, Gachon University, Seongnam 13120, Korea
| |
Collapse
|
5
|
Wu J, Zheng Y, Wang B, Zhang Q. Enhancing Physical and Thermodynamic Properties of DNA Storage Sets with End-constraint. IEEE Trans Nanobioscience 2021; 21:184-193. [PMID: 34662278 DOI: 10.1109/tnb.2021.3121278] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
With the explosion of data, DNA is considered as an ideal carrier for storage due to its high storage density. However, low-quality DNA sets hamper the widespread use of DNA storage. This work proposes a new method to design high-quality DNA storage sets. Firstly, random switch and double-weight offspring strategies are introduced in Double-strategy Black Widow Optimization Algorithm (DBWO). Experimental results of 26 benchmark functions show that the exploration and exploitation abilities of DBWO are greatly improved from previous work. Secondly, DBWO is applied in designing DNA storage sets, and compared with previous work, the lower bounds of storage sets are boosted by 9%-37%. Finally, to improve the poor stabilities of sequences, the End-constraint is proposed in designing DNA storage sets. By measuring the number of hairpin structures, melting temperature, and minimum free energy, it is evaluated that with our innovative constraint, DBWO can construct not only a larger number of storage sets, but also enhance physical and thermodynamic properties of DNA storage sets.
Collapse
|
6
|
Li X, Wei Z, Wang B, Song T. Stable DNA Sequence Over Close-Ending and Pairing Sequences Constraint. Front Genet 2021; 12:644484. [PMID: 34079580 PMCID: PMC8165483 DOI: 10.3389/fgene.2021.644484] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2020] [Accepted: 04/12/2021] [Indexed: 11/15/2022] Open
Abstract
DNA computing is a new method based on molecular biotechnology to solve complex problems. The design of DNA sequences is a multi-objective optimization problem in DNA computing, whose objective is to obtain optimized sequences that satisfy multiple constraints to improve the quality of the sequences. However, the previous optimized DNA sequences reacted with each other, which reduced the number of DNA sequences that could be used for molecular hybridization in the solution and thus reduced the accuracy of DNA computing. In addition, a DNA sequence and its complement follow the principle of complementary pairing, and the sequence of base GC at both ends is more stable. To optimize the above problems, the constraints of Pairing Sequences Constraint (PSC) and Close-ending along with the Improved Chaos Whale (ICW) optimization algorithm were proposed to construct a DNA sequence set that satisfies the combination of constraints. The ICW optimization algorithm is added to a new predator–prey strategy and sine and cosine functions under the action of chaos. Compared with other algorithms, among the 23 benchmark functions, the new algorithm obtained the minimum value for one-third of the functions and two-thirds of the current minimum value. The DNA sequences satisfying the constraint combination obtained the minimum of fitness values and had stable and usable structures.
Collapse
Affiliation(s)
- Xue Li
- The Key Laboratory of Advanced Design and Intelligent Computing, Ministry of Education, School of Software Engineering, Dalian University, Dalian, China
| | - Ziqi Wei
- School of Software, Tsinghua University, Beijing, China
| | - Bin Wang
- The Key Laboratory of Advanced Design and Intelligent Computing, Ministry of Education, School of Software Engineering, Dalian University, Dalian, China
| | - Tao Song
- College of Computer and Communication Engineering, China University of Petroleum, Qingdao, China
| |
Collapse
|
7
|
Zheng Y, Wu J, Wang B. CLGBO: An Algorithm for Constructing Highly Robust Coding Sets for DNA Storage. Front Genet 2021; 12:644945. [PMID: 34017354 PMCID: PMC8129200 DOI: 10.3389/fgene.2021.644945] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2020] [Accepted: 04/08/2021] [Indexed: 11/22/2022] Open
Abstract
In the era of big data, new storage media are urgently needed because the storage capacity for global data cannot meet the exponential growth of information. Deoxyribonucleic acid (DNA) storage, where primer and address sequences play a crucial role, is one of the most promising storage media because of its high density, large capacity and durability. In this study, we describe an enhanced gradient-based optimizer that includes the Cauchy and Levy mutation strategy (CLGBO) to construct DNA coding sets, which are used as primer and address libraries. Our experimental results show that the lower bounds of DNA storage coding sets obtained using the CLGBO algorithm are increased by 4.3–13.5% compared with previous work. The non-adjacent subsequence constraint was introduced to reduce the error rate in the storage process. This helps to resolve the problem that arises when consecutive repetitive subsequences in the sequence cause errors in DNA storage. We made use of the CLGBO algorithm and the non-adjacent subsequence constraint to construct larger and more highly robust coding sets.
Collapse
Affiliation(s)
- Yanfen Zheng
- The Key Laboratory of Advanced Design and Intelligent Computing, Ministry of Education, School of Software Engineering, Dalian University, Dalian, China
| | - Jieqiong Wu
- The Key Laboratory of Advanced Design and Intelligent Computing, Ministry of Education, School of Software Engineering, Dalian University, Dalian, China
| | - Bin Wang
- The Key Laboratory of Advanced Design and Intelligent Computing, Ministry of Education, School of Software Engineering, Dalian University, Dalian, China
| |
Collapse
|
8
|
Cao B, Zhang X, Wu J, Wang B, Zhang Q, Wei X. Minimum Free Energy Coding for DNA Storage. IEEE Trans Nanobioscience 2021; 20:212-222. [PMID: 33534710 DOI: 10.1109/tnb.2021.3056351] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
With the development of information technology, huge amounts of data are produced at the same time. How to store data efficiently and at low cost has become an urgent problem. DNA is a high-density and persistent medium, making DNA storage a viable solution. In a DNA data storage system, the first consideration is how to encode the data effectively into code words. However, DNA strands are prone to non-specific hybridization during the hybridization reaction process and are prone to errors during synthesis and sequencing. In order to reduce the error rate, a thermodynamic minimum free energy (MFE) constraint is proposed and applied to the construction of coding sets for DNA storage. The Brownian multi-verse optimizer (BMVO) algorithm, based on the Multi-verse optimizer (MVO) algorithm, incorporates the idea of Brownian motion and Nelder-Mead method, and it is used to design a better DNA storage coding set. In addition, compared with previous works, the coding set has been increasing by 4%-50% in size and has better thermodynamic properties. With the improvement of the quality of the DNA coding set, the accuracy of reading and writing and the robustness of the DNA storage system are also enhanced.
Collapse
|
9
|
Feature Selection for Colon Cancer Detection Using K-Means Clustering and Modified Harmony Search Algorithm. MATHEMATICS 2021. [DOI: 10.3390/math9050570] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
This paper proposes a feature selection method that is effective in distinguishing colorectal cancer patients from normal individuals using K-means clustering and the modified harmony search algorithm. As the genetic cause of colorectal cancer originates from mutations in genes, it is important to classify the presence or absence of colorectal cancer through gene information. The proposed methodology consists of four steps. First, the original data are Z-normalized by data preprocessing. Candidate genes are then selected using the Fisher score. Next, one representative gene is selected from each cluster after candidate genes are clustered using K-means clustering. Finally, feature selection is carried out using the modified harmony search algorithm. The gene combination created by feature selection is then applied to the classification model and verified using 5-fold cross-validation. The proposed model obtained a classification accuracy of up to 94.36%. Furthermore, on comparing the proposed method with other methods, we prove that the proposed method performs well in classifying colorectal cancer. Moreover, we believe that the proposed model can be applied not only to colorectal cancer but also to other gene-related diseases.
Collapse
|
10
|
Xue X, Jin H, Zhou D, Zhou C. Medical Image Protection Algorithm Based on Deoxyribonucleic Acid Chain of Dynamic Length. Front Genet 2021; 12:654663. [PMID: 33747054 PMCID: PMC7970129 DOI: 10.3389/fgene.2021.654663] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2021] [Accepted: 02/09/2021] [Indexed: 11/15/2022] Open
Abstract
Current image encryption algorithms have various deficiencies in effectively protecting medical images with large storage capacity and high pixel correlation. This article proposed a new image protection algorithm based on the deoxyribonucleic acid chain of dynamic length, which achieved image encryption by DNA dynamic coding, generation of DNA dynamic chain, and dynamic operation of row chain and column chain. First, the original image is encoded dynamically according to the binary bit from a pixel, and the DNA sequence matrix is scrambled. Second, DNA sequence matrices are dynamically segmented into DNA chains of different lengths. After that, row and column deletion operation and transposition operation of DNA dynamic chain are carried out, respectively, which made DNA chain matrix double shuffle. Finally, the encrypted image is got after recombining DNA chains of different lengths. The proposed algorithm was tested on a list of medical images. Results showed that the proposed algorithm showed excellent security performance, and it is immune to noise attack, occlusion attack, and all common cryptographic attacks.
Collapse
Affiliation(s)
- Xianglian Xue
- School of Computer Science and Engineering, Xi'an University of Technology, Xi'an, China.,Sections of Computer Teaching and Research, Shaanxi University of Chinese Medicine, Xianyang, China
| | - Haiyan Jin
- School of Computer Science and Engineering, Xi'an University of Technology, Xi'an, China.,Shaanxi Key Laboratory for Network Computing and Security Technology, Xi'an University of Technology, Xi'an, China
| | - Dongsheng Zhou
- Key Laboratory of Advanced Design and Intelligent Computing, Ministry of Education, Dalian University, Dalian, China
| | - Changjun Zhou
- College of Mathematics and Computer Science, Zhejiang Normal University, Jinhua, China
| |
Collapse
|
11
|
Xue X, Zhou D, Zhou C. New insights into the existing image encryption algorithms based on DNA coding. PLoS One 2020; 15:e0241184. [PMID: 33095816 PMCID: PMC7584250 DOI: 10.1371/journal.pone.0241184] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2020] [Accepted: 10/10/2020] [Indexed: 12/03/2022] Open
Abstract
Because a DNA nucleotide sequence has the characteristics of large storage capacity, high parallelism, and low energy consumption, DNA cryptography is favored by information security researchers. Image encryption algorithms based on DNA coding have become a research hotspot in the field of image encryption and security. In this study, based on a comprehensive review of the existing studies and their results, we present new insights into the existing image encryption algorithms based on DNA coding. First, the existing algorithms were summarized and classified into five types, depending on the type of DNA coding: DNA fixed coding, DNA dynamic coding, different types of base complement operation, different DNA sequence algebraic operations, and combinations of multiple DNA operations. Second, we analyzed and studied each classification algorithm using simulation and obtained their advantages and disadvantages. Third, the DNA coding mechanisms, DNA algebraic operations, and DNA algebraic combination operations were compared and discussed. Then, a new scheme was proposed by combining the optimal coding mechanism with the optimal DNA coding operation. Finally, we revealed the shortcomings of the existing studies and the future direction for improving image encryption methods based on DNA coding.
Collapse
Affiliation(s)
- Xianglian Xue
- Section of Computer Teaching and Research, Shaanxi University of Chinese Medicine, Xianyang, China
- Laboratory of Network Computer and Security Technology in Shaanxi Province, Xi’an University of Technology, Xi’an, China
- * E-mail:
| | - Dongsheng Zhou
- Key Laboratory of Advanced Design and Intelligent Computing, Ministry of Education, Dalian University, Dalian, China
| | - Changjun Zhou
- College of Mathematics and Computer Science, Zhejiang Normal University, Jinhua, China
| |
Collapse
|
12
|
Zhou S, He P, Kasabov N. A Dynamic DNA Color Image Encryption Method Based on SHA-512. ENTROPY (BASEL, SWITZERLAND) 2020; 22:E1091. [PMID: 33286859 PMCID: PMC7597187 DOI: 10.3390/e22101091] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/02/2020] [Revised: 09/23/2020] [Accepted: 09/25/2020] [Indexed: 11/16/2022]
Abstract
This paper presents a dynamic deoxyribonucleic acid (DNA) image encryption based on Secure Hash Algorithm-512 (SHA-512), having the structure of two rounds of permutation-diffusion, by employing two chaotic systems, dynamic DNA coding, DNA sequencing operations, and conditional shifting. We employed the SHA-512 algorithm to generate a 512-bit hash value and later utilized this value with the natural DNA sequence to calculate the initial values for the chaotic systems and the eight intermittent parameters. We implemented a two-dimensional rectangular transform (2D-RT) on the permutation. We used four-wing chaotic systems and Lorentz systems to generate chaotic sequences and recombined three channel matrices and chaotic matrices with intermittent parameters. We calculated hamming distances of DNA matrices, updated the initial values of two chaotic systems, and generated the corresponding chaotic matrices to complete the diffusion operation. After diffusion, we decoded and decomposed the DNA matrices, and then scrambled and merged these matrices into an encrypted image. According to experiments, the encryption method in this paper not only was able to withstand statistical attacks, plaintext attacks, brute-force attacks, and a host of other attacks, but also could reduce the complexity of the algorithm because it adopted DNA sequencing operations that were different from traditional DNA sequencing operations.
Collapse
Affiliation(s)
- Shihua Zhou
- Key Laboratory of Advanced Design and Intelligent Computing, Ministry of Education, School of Software Engineering, Dalian University, Dalian 116622, China;
- Knowledge Engineering and Discovery Research Institute, Auckland University of Technology, Auckland 1010, New Zealand;
| | - Pinyan He
- Key Laboratory of Advanced Design and Intelligent Computing, Ministry of Education, School of Software Engineering, Dalian University, Dalian 116622, China;
| | - Nikola Kasabov
- Knowledge Engineering and Discovery Research Institute, Auckland University of Technology, Auckland 1010, New Zealand;
- Intelligent Systems Research Center, Ulster University, Londonderry BT52 1SA, UK
| |
Collapse
|
13
|
Wan Y, Wang X, Chen Q, Lei X, Wang Y, Chen C, Hu H. A disease category feature database construction method of brain image based on deep convolutional neural network. PLoS One 2020; 15:e0232791. [PMID: 32479504 PMCID: PMC7263580 DOI: 10.1371/journal.pone.0232791] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2020] [Accepted: 04/21/2020] [Indexed: 11/19/2022] Open
Abstract
BACKGROUND Constructing a medical image feature database according to the category of disease can achieve a quick retrieval of images with similar pathological features. Therefore, this approach has important application values in the fields such as auxiliary diagnosis, teaching, research, and telemedicine. METHODS Based on the deep convolutional neural network, an image classifier applicable to brain disease was designed to distinguish between the image features of the different brain diseases with similar anatomical structures. Through the extraction and analysis of visual features, the images were labelled with the corresponding semantic features of a specific disease category, which can establish an association between the visual features of brain images and the semantic features of the category of disease which will permit to construct a disease category feature database of brain images. RESULTS Based on the similarity measurement and the matching strategy of high-dimensional visual feature, a high-precision retrieval of brain image with semantics category was achieved, and the constructed disease category feature database of brain image was tested and evaluated through large numbers of pathological image retrieval experiments, the accuracy and the effectiveness of the proposed approach was verified. CONCLUSION The disease category feature database of brain image constructed by the proposed approach achieved a quick and effective retrieval of images with similar pathological features, which is beneficial to the categorization and analysis of intractable brain diseases. This provides an effective application tool such as case-based image data management, evidence-based medicine and clinical decision support.
Collapse
Affiliation(s)
- Yanli Wan
- Institute of Medical Information, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China
| | - Xifu Wang
- School of Traffic and Transportation, Institute of System Engineering and Control, Beijing Jiaotong University, Beijing, China
| | - Quan Chen
- Institute of Medical Information, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China
| | - Xingyun Lei
- Institute of Medical Information, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China
| | - Yan Wang
- Institute of Medical Information, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China
| | - Chongde Chen
- Institute of Medical Information, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China
| | - Hongpu Hu
- Institute of Medical Information, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China
| |
Collapse
|
14
|
Yin Q, Cao B, Li X, Wang B, Zhang Q, Wei X. An Intelligent Optimization Algorithm for Constructing a DNA Storage Code: NOL-HHO. Int J Mol Sci 2020; 21:E2191. [PMID: 32235762 PMCID: PMC7139338 DOI: 10.3390/ijms21062191] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2020] [Revised: 03/07/2020] [Accepted: 03/18/2020] [Indexed: 11/16/2022] Open
Abstract
The high density, large capacity, and long-term stability of DNA molecules make them an emerging storage medium that is especially suitable for the long-term storage of large datasets. The DNA sequences used in storage need to consider relevant constraints to avoid nonspecific hybridization reactions, such as the No-runlength constraint, GC-content, and the Hamming distance. In this work, a new nonlinear control parameter strategy and a random opposition-based learning strategy were used to improve the Harris hawks optimization algorithm (for the improved algorithm NOL-HHO) in order to prevent it from falling into local optima. Experimental testing was performed on 23 widely used benchmark functions, and the proposed algorithm was used to obtain better coding lower bounds for DNA storage. The results show that our algorithm can better maintain a smooth transition between exploration and exploitation and has stronger global exploration capabilities as compared with other algorithms. At the same time, the improvement of the lower bound directly affects the storage capacity and code rate, which promotes the further development of DNA storage technology.
Collapse
Affiliation(s)
- Qiang Yin
- The Key Laboratory of Advanced Design and Intelligent Computing, Ministry of Education, School of Software Engineering, Dalian University, Dalian 116622, China
| | - Ben Cao
- The Key Laboratory of Advanced Design and Intelligent Computing, Ministry of Education, School of Software Engineering, Dalian University, Dalian 116622, China
| | - Xue Li
- The Key Laboratory of Advanced Design and Intelligent Computing, Ministry of Education, School of Software Engineering, Dalian University, Dalian 116622, China
| | - Bin Wang
- The Key Laboratory of Advanced Design and Intelligent Computing, Ministry of Education, School of Software Engineering, Dalian University, Dalian 116622, China
| | - Qiang Zhang
- The Key Laboratory of Advanced Design and Intelligent Computing, Ministry of Education, School of Software Engineering, Dalian University, Dalian 116622, China
| | - Xiaopeng Wei
- The Key Laboratory of Advanced Design and Intelligent Computing, Ministry of Education, School of Software Engineering, Dalian University, Dalian 116622, China
| |
Collapse
|