1
|
António J, Valente J, Mora C, Almeida A, Jardim S. DarwinGSE: Towards better image retrieval systems for intellectual property datasets. PLoS One 2024; 19:e0304915. [PMID: 38950045 PMCID: PMC11216576 DOI: 10.1371/journal.pone.0304915] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2023] [Accepted: 05/20/2024] [Indexed: 07/03/2024] Open
Abstract
A trademark's image is usually the first type of indirect contact between a consumer and a product or a service. Companies rely on graphical trademarks as a symbol of quality and instant recognition, seeking to protect them from copyright infringements. A popular defense mechanism is graphical searching, where an image is compared to a large database to find potential conflicts with similar trademarks. Despite not being a new subject, image retrieval state-of-the-art lacks reliable solutions in the Industrial Property (IP) sector, where datasets are practically unrestricted in content, with abstract images for which modeling human perception is a challenging task. Existing Content-based Image Retrieval (CBIR) systems still present several problems, particularly in terms of efficiency and reliability. In this paper, we propose a new CBIR system that overcomes these major limitations. It follows a modular methodology, composed of a set of individual components tasked with the retrieval, maintenance and gradual optimization of trademark image searching, working on large-scale, unlabeled datasets. Its generalization capacity is achieved using multiple feature descriptions, weighted separately, and combined to represent a single similarity score. Images are evaluated for general features, edge maps, and regions of interest, using a method based on Watershedding K-Means segments. We propose an image recovery process that relies on a new similarity measure between all feature descriptions. New trademark images are added every day to ensure up-to-date results. The proposed system showcases a timely retrieval speed, with 95% of searches having a 10 second presentation speed and a mean average precision of 93.7%, supporting its applicability to real-word IP protection scenarios.
Collapse
Affiliation(s)
- João António
- Techframe-Information Systems, SA, São Domingos de Rana, Portugal
| | - Jorge Valente
- Techframe-Information Systems, SA, São Domingos de Rana, Portugal
| | - Carlos Mora
- Smart Cities Research Center, Polytechnic Institute of Tomar, Tomar, Portugal
| | - Artur Almeida
- Techframe-Information Systems, SA, São Domingos de Rana, Portugal
| | - Sandra Jardim
- Smart Cities Research Center, Polytechnic Institute of Tomar, Tomar, Portugal
| |
Collapse
|
2
|
Farabi Maleki S, Yousefi M, Afshar S, Pedrammehr S, Lim CP, Jafarizadeh A, Asadi H. Artificial Intelligence for Multiple Sclerosis Management Using Retinal Images: Pearl, Peaks, and Pitfalls. Semin Ophthalmol 2024; 39:271-288. [PMID: 38088176 DOI: 10.1080/08820538.2023.2293030] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/27/2023] [Accepted: 11/23/2023] [Indexed: 03/28/2024]
Abstract
Multiple sclerosis (MS) is a complex autoimmune disease characterized by inflammatory processes, demyelination, neurodegeneration, and axonal damage within the central nervous system (CNS). Retinal imaging, particularly Optical coherence tomography (OCT), has emerged as a crucial tool for investigating MS-related retinal injury. The integration of artificial intelligence(AI) has shown promise in enhancing OCT analysis for MS. Researchers are actively utilizing AI algorithms to accurately detect and classify MS-related abnormalities, leading to improved efficiency in diagnosis, monitoring, and personalized treatment planning. The prognostic value of AI in predicting MS disease progression has garnered substantial attention. Machine learning (ML) and deep learning (DL) algorithms can analyze longitudinal OCT data to forecast the course of the disease, providing critical information for personalized treatment planning and improved patient outcomes. Early detection of high-risk patients allows for targeted interventions to mitigate disability progression effectively. As such, AI-driven approaches yielded remarkable abilities in classifying distinct MS subtypes based on retinal features, aiding in disease characterization and guiding tailored therapeutic strategies. Additionally, these algorithms have enhanced the accuracy and efficiency of OCT image segmentation, streamlined diagnostic processes, and reduced human error. This study reviews the current research studies on the integration of AI,including ML and DL algorithms, with OCT in the context of MS. It examines the advancements, challenges, potential prospects, and ethical concerns of AI-powered techniques in enhancing MS diagnosis, monitoring disease progression, revolutionizing patient care, the development of patient screening tools, and supported clinical decision-making based on OCT images.
Collapse
Affiliation(s)
| | - Milad Yousefi
- Faculty of Mathematics, Statistics and Computer Sciences, University of Tabriz, Tabriz, Iran
| | - Sayeh Afshar
- Nikookari Eye Center, Tabriz University of Medical Sciences, Tabriz, Iran
| | | | - Chee Peng Lim
- Institute for Intelligent Systems Research and Innovation (IISRI), Deakin University, Burwood, Australia
| | - Ali Jafarizadeh
- Nikookari Eye Center, Tabriz University of Medical Sciences, Tabriz, Iran
| | - Houshyar Asadi
- Institute for Intelligent Systems Research and Innovation (IISRI), Deakin University, Burwood, Australia
| |
Collapse
|
3
|
Valente J, António J, Mora C, Jardim S. Developments in Image Processing Using Deep Learning and Reinforcement Learning. J Imaging 2023; 9:207. [PMID: 37888314 PMCID: PMC10607786 DOI: 10.3390/jimaging9100207] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2023] [Revised: 09/24/2023] [Accepted: 09/28/2023] [Indexed: 10/28/2023] Open
Abstract
The growth in the volume of data generated, consumed, and stored, which is estimated to exceed 180 zettabytes in 2025, represents a major challenge both for organizations and for society in general. In addition to being larger, datasets are increasingly complex, bringing new theoretical and computational challenges. Alongside this evolution, data science tools have exploded in popularity over the past two decades due to their myriad of applications when dealing with complex data, their high accuracy, flexible customization, and excellent adaptability. When it comes to images, data analysis presents additional challenges because as the quality of an image increases, which is desirable, so does the volume of data to be processed. Although classic machine learning (ML) techniques are still widely used in different research fields and industries, there has been great interest from the scientific community in the development of new artificial intelligence (AI) techniques. The resurgence of neural networks has boosted remarkable advances in areas such as the understanding and processing of images. In this study, we conducted a comprehensive survey regarding advances in AI design and the optimization solutions proposed to deal with image processing challenges. Despite the good results that have been achieved, there are still many challenges to face in this field of study. In this work, we discuss the main and more recent improvements, applications, and developments when targeting image processing applications, and we propose future research directions in this field of constant and fast evolution.
Collapse
Affiliation(s)
- Jorge Valente
- Techframe-Information Systems, SA, 2785-338 São Domingos de Rana, Portugal; (J.V.); (J.A.)
| | - João António
- Techframe-Information Systems, SA, 2785-338 São Domingos de Rana, Portugal; (J.V.); (J.A.)
| | - Carlos Mora
- Smart Cities Research Center, Polytechnic Institute of Tomar, 2300-313 Tomar, Portugal;
| | - Sandra Jardim
- Smart Cities Research Center, Polytechnic Institute of Tomar, 2300-313 Tomar, Portugal;
| |
Collapse
|
4
|
Rangu S, Veramalla R, Salkuti SR, Kalagadda B. Efficient Approach to Color Image Segmentation Based on Multilevel Thresholding Using EMO Algorithm by Considering Spatial Contextual Information. J Imaging 2023; 9:jimaging9040074. [PMID: 37103225 PMCID: PMC10145584 DOI: 10.3390/jimaging9040074] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2023] [Revised: 03/15/2023] [Accepted: 03/17/2023] [Indexed: 04/28/2023] Open
Abstract
The process of image segmentation is partitioning an image into its constituent parts and is a significant approach for extracting interesting features from images. Over a couple of decades, many efficient image segmentation approaches have been formulated for various applications. Still, it is a challenging and complex issue, especially for color image segmentation. To moderate this difficulty, a novel multilevel thresholding approach is proposed in this paper based on the electromagnetism optimization (EMO) technique with an energy curve, named multilevel thresholding based on EMO and energy curve (MTEMOE). To compute the optimized threshold values, Otsu's variance and Kapur's entropy are deployed as fitness functions; both values should be maximized to locate optimal threshold values. In both Kapur's and Otsu's methods, the pixels of an image are classified into different classes based on the threshold level selected on the histogram. Optimal threshold levels give higher efficiency of segmentation; the EMO technique is used to find optimal thresholds in this research. The methods based on an image's histograms do not possess the spatial contextual information for finding the optimal threshold levels. To abolish this deficiency an energy curve is used instead of the histogram and this curve can establish the spatial relationship of pixels with their neighbor pixels. To study the experimental results of the proposed scheme, several color benchmark images are considered at various threshold levels and compared with other meta-heuristic algorithms: multi-verse optimization, whale optimization algorithm, and so on. The investigational results are illustrated in terms of mean square error, peak signal-to-noise ratio, the mean value of fitness reach, feature similarity, structural similarity, variation of information, and probability rand index. The results reveal that the proposed MTEMOE approach overtops other state-of-the-art algorithms to solve engineering problems in various fields.
Collapse
Affiliation(s)
- Srikanth Rangu
- Department of ECE, Kakatiya Institute of Technology and Science, Warangal 506015, India
| | - Rajagopal Veramalla
- Department of ECE, Kakatiya Institute of Technology and Science, Warangal 506015, India
| | - Surender Reddy Salkuti
- Department of Railroad and Electrical Engineering, Woosong University, Daejeon 34606, Republic of Korea
| | | |
Collapse
|
5
|
Bembenek M, Mandziy T, Ivasenko I, Berehulyak O, Vorobel R, Slobodyan Z, Ropyak L. Multiclass Level-Set Segmentation of Rust and Coating Damages in Images of Metal Structures. SENSORS (BASEL, SWITZERLAND) 2022; 22:7600. [PMID: 36236705 PMCID: PMC9571848 DOI: 10.3390/s22197600] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/03/2022] [Revised: 09/30/2022] [Accepted: 10/05/2022] [Indexed: 06/16/2023]
Abstract
This paper describes the combined detection of coating and rust damages on painted metal structures through the multiclass image segmentation technique. Our prior works were focused solely on the localization of rust damages and rust segmentation under different ambient conditions (different lighting conditions, presence of shadows, low background/object color contrast). This paper method proposes three types of damages: coating crack, coating flaking, and rust damage. Background, paint flaking, and rust damage are objects that can be separated in RGB color-space alone. For their preliminary classification SVM is used. As for paint cracks, color features are insufficient for separating it from other defect types as they overlap with the other three classes in RGB color space. For preliminary paint crack segmentation we use the valley detection approach, which analyses the shape of defects. A multiclass level-set approach with a developed penalty term is used as a framework for the advanced final damage segmentation stage. Model training and accuracy assessment are fulfilled on the created dataset, which contains input images of corresponding defects with respective ground truth data provided by the expert. A quantitative analysis of the accuracy of the proposed approach is provided. The efficiency of the approach is demonstrated on authentic images of coated surfaces.
Collapse
Affiliation(s)
- Michał Bembenek
- Department of Manufacturing Systems, Faculty of Mechanical Engineering and Robotics, AGH University of Science and Technology, 30-059 Kraków, Poland
| | - Teodor Mandziy
- Department of the Theory of Wave Processes and Optical Systems of Diagnostics, Karpenko Physico-Mechanical Institute of the NAS of Ukraine, 5 Naukova St., 79060 Lviv, Ukraine
| | - Iryna Ivasenko
- Department of the Theory of Wave Processes and Optical Systems of Diagnostics, Karpenko Physico-Mechanical Institute of the NAS of Ukraine, 5 Naukova St., 79060 Lviv, Ukraine
| | - Olena Berehulyak
- Department of the Theory of Wave Processes and Optical Systems of Diagnostics, Karpenko Physico-Mechanical Institute of the NAS of Ukraine, 5 Naukova St., 79060 Lviv, Ukraine
| | - Roman Vorobel
- Department of the Theory of Wave Processes and Optical Systems of Diagnostics, Karpenko Physico-Mechanical Institute of the NAS of Ukraine, 5 Naukova St., 79060 Lviv, Ukraine
- Department of Computer Sciences, University of Lodz, Pomorska Str. 149/153, 90-236 Lodz, Poland
| | - Zvenomyra Slobodyan
- Department of Corrosion and Corrosion Protection, Karpenko Physico-Mechanical Institute of the NAS of Ukraine, 5 Naukova St., 79060 Lviv, Ukraine
| | - Liubomyr Ropyak
- Department of Computerized Engineering, Ivano-Frankivsk National Technical University of Oil and Gas, 76019 Ivano-Frankivsk, Ukraine
| |
Collapse
|
6
|
A Novel Trademark Image Retrieval System Based on Multi-Feature Extraction and Deep Networks. J Imaging 2022; 8:jimaging8090238. [PMID: 36135404 PMCID: PMC9505340 DOI: 10.3390/jimaging8090238] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2022] [Revised: 08/28/2022] [Accepted: 08/31/2022] [Indexed: 11/23/2022] Open
Abstract
Graphical Search Engines are conceptually used in many development areas surrounding information retrieval systems that aim to provide a visual representation of results, typically associated with retrieving images relevant to one or more input images. Since the 1990s, efforts have been made to improve the result quality, be it through improved processing speeds or more efficient graphical processing techniques that generate accurate representations of images for comparison. While many systems achieve timely results by combining high-level features, they still struggle when dealing with large datasets and abstract images. Image datasets regarding industrial property are an example of an hurdle for typical image retrieval systems where the dimensions and characteristics of images make adequate comparison a difficult task. In this paper, we introduce an image retrieval system based on a multi-phase implementation of different deep learning and image processing techniques, designed to deliver highly accurate results regardless of dataset complexity and size. The proposed approach uses image signatures to provide a near exact representation of an image, with abstraction levels that allow the comparison with other signatures as a means to achieve a fully capable image comparison process. To overcome performance disadvantages related to multiple image searches due to the high complexity of image signatures, the proposed system incorporates a parallel processing block responsible for dealing with multi-image search scenarios. The system achieves the image retrieval through the use of a new similarity compound formula that accounts for all components of an image signature. The results shows that the developed approach performs image retrieval with high accuracy, showing that combining multiple image assets allows for more accurate comparisons across a broad spectrum of image typologies. The use of deep convolutional networks for feature extraction as a means of semantically describing more commonly encountered objects allows for the system to perform research with a degree of abstraction.
Collapse
|
7
|
Suryanto ME, Saputra F, Kurnia KA, Vasquez RD, Roldan MJM, Chen KHC, Huang JC, Hsiao CD. Using DeepLabCut as a Real-Time and Markerless Tool for Cardiac Physiology Assessment in Zebrafish. BIOLOGY 2022; 11:1243. [PMID: 36009871 PMCID: PMC9405297 DOI: 10.3390/biology11081243] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/12/2022] [Revised: 08/13/2022] [Accepted: 08/19/2022] [Indexed: 11/21/2022]
Abstract
DeepLabCut (DLC) is a deep learning-based tool initially invented for markerless pose estimation in mammals. In this study, we explored the possibility of adopting this tool for conducting markerless cardiac physiology assessment in an important aquatic toxicology model of zebrafish (Danio rerio). Initially, high-definition videography was applied to capture heartbeat information at a frame rate of 30 frames per second (fps). Next, 20 videos from different individuals were used to perform convolutional neural network training by labeling the heart chamber (ventricle) with eight landmarks. Using Residual Network (ResNet) 152, a neural network with 152 convolutional neural network layers with 500,000 iterations, we successfully obtained a trained model that can track the heart chamber in a real-time manner. Later, we validated DLC performance with the previously published ImageJ Time Series Analysis (TSA) and Kymograph (KYM) methods. We also evaluated DLC performance by challenging experimental animals with ethanol and ponatinib to induce cardiac abnormality and heartbeat irregularity. The results showed that DLC is more accurate than the TSA method in several parameters tested. The DLC-trained model also detected the ventricle of zebrafish embryos even in the occurrence of heart abnormalities, such as pericardial edema. We believe that this tool is beneficial for research studies, especially for cardiac physiology assessment in zebrafish embryos.
Collapse
Affiliation(s)
- Michael Edbert Suryanto
- Department of Chemistry, Chung Yuan Christian University, Taoyuan 320314, Taiwan
- Department of Bioscience Technology, Chung Yuan Christian University, Taoyuan 320314, Taiwan
| | - Ferry Saputra
- Department of Chemistry, Chung Yuan Christian University, Taoyuan 320314, Taiwan
- Department of Bioscience Technology, Chung Yuan Christian University, Taoyuan 320314, Taiwan
| | - Kevin Adi Kurnia
- Department of Chemistry, Chung Yuan Christian University, Taoyuan 320314, Taiwan
- Department of Bioscience Technology, Chung Yuan Christian University, Taoyuan 320314, Taiwan
| | - Ross D. Vasquez
- Department of Pharmacy, Research Center for Natural and Applied Sciences, University of Santo Tomas, Manila 1008, Philippines
| | - Marri Jmelou M. Roldan
- Faculty of Pharmacy, The Graduate School, University of Santo Tomas, Manila 1008, Philippines
| | - Kelvin H.-C. Chen
- Department of Applied Chemistry, National Pingtung University, Pingtung 90003, Taiwan
| | - Jong-Chin Huang
- Department of Applied Chemistry, National Pingtung University, Pingtung 90003, Taiwan
| | - Chung-Der Hsiao
- Department of Chemistry, Chung Yuan Christian University, Taoyuan 320314, Taiwan
- Department of Bioscience Technology, Chung Yuan Christian University, Taoyuan 320314, Taiwan
- Center for Nanotechnology, Chung Yuan Christian University, Taoyuan 320314, Taiwan
- Research Center for Aquatic Toxicology and Pharmacology, Chung Yuan Christian University, Taoyuan 320314, Taiwan
| |
Collapse
|