1
|
Mahajan S, Rani R. Word Level Script Identification Using Convolutional Neural Network Enhancement for Scenic Images. ACM T ASIAN LOW-RESO 2022. [DOI: 10.1145/3506699] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Abstract
Script identification from complex and colorful images is an integral part of the text recognition and classification system. Such images may contain twofold challenges: (1) Challenges related to the camera like blurring effect, non-uniform illumination and noisy background, and so on, and (2) Challenges related to the text shape, orientation, and text size. The present work in this area is much focused on non-Indian scripts. In contrast, Gurumukhi, Hindi, and English scripts play a vital role in communication among Indians and foreigners. In this article, we focus on the above said challenges in the field of identifying the script. Additionally, we have introduced a new dataset that contains Hindi, Gurumukhi, and English scripts from scenic images collected from different sources. We also proposed a CNN-based model, which is capable of distinguishing between the scripts with good accuracy. Performance of the method has been evaluated for own dataset, i.e., NITJDATASET and other benchmarked datasets available for Indian scripts, i.e., CVSI-2015 (Task-1 and Task 4) and ILST. This work is an extension to find the script from strict text background.
Collapse
Affiliation(s)
- Shilpa Mahajan
- National Institute of Technology, Jalandhar, Punjab, India
| | - Rajneesh Rani
- National Institute of Technology, Jalandhar, Punjab, India
| |
Collapse
|
2
|
Mukherjee J, Parui SK, Roy U. An Unsupervised and Robust Line and Word Segmentation Method for Handwritten and Degraded Printed Document. ACM T ASIAN LOW-RESO 2022. [DOI: 10.1145/3474118] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
Abstract
Segmentation of text lines and words in an unconstrained handwritten or a machine-printed degraded document is a challenging document analysis problem due to the heterogeneity in the document structure. Often there is un-even skew between the lines and also broken words in a document. In this article, the contribution lies in segmentation of a document page image into lines and words. We have proposed an unsupervised, robust, and simple statistical method to segment a document image that is either handwritten or machine-printed (degraded or otherwise). In our proposed method, the segmentation is treated as a two-class classification problem. The classification is done by considering the distribution of gap size (between lines and between words) in a binary page image. Our method is very simple and easy to implement. Other than the binarization of the input image, no pre-processing is necessary. There is no need of high computational resources. The proposed method is unsupervised in the sense that no annotated document page images are necessary. Thus, the issue of a training database does not arise. In fact, given a document page image, the parameters that are needed for segmentation of text lines and words are learned in an unsupervised manner. We have applied our proposed method on several popular publicly available handwritten and machine-printed datasets (ISIDDI, IAM-Hist, IAM, PBOK) of different Indian and other languages containing different fonts. Several experimental results are presented to show the effectiveness and robustness of our method. We have experimented on ICDAR-2013 handwriting segmentation contest dataset and our method outperforms the winning method. In addition to this, we have suggested a quantitative measure to compute the level of degradation of a document page image.
Collapse
Affiliation(s)
| | | | - Utpal Roy
- Visva-Bharati, Shantiniketan, West Bengal, India
| |
Collapse
|
3
|
Singh PK, Sarkar R, Abraham A, Nasipuri M. A Case Study on Handwritten
Indic
Script Classification: Benchmarking of the Results at Page, Block, Text-line, and Word Levels. ACM T ASIAN LOW-RESO 2022. [DOI: 10.1145/3476102] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Abstract
Handwritten script classification is still considered as a challenging research problem in the domain of document image analysis. Although some research attempts have been made by the researchers for solving the challenging issues, a comprehensive solution is yet to be achieved. The case study, undertaken here, analyzes the performances of various state-of-the art handwritten script classification methods for Indian scripts where features, needed for the script classification task, are extracted from the script images at four different granularity levels, i.e., page, block, text line, or word. The results of handwritten script classification at each level have been obtained and compared using eight different feature sets and six different state-of-the-art classifiers. Based on the classification results, an ideal level for performing the handwritten script classification task is suggested among these four classification levels. The results have also been improved by using two feature dimensionality reduction methods. All these experiments are done on two different handwritten
Indic
script databases, of which one is an in-house developed dataset and the other one is a freely available dataset. Finally, some future research directions that may be undertaken by the researchers as an application of the handwritten
Indic
script classification problem are also highlighted. The work presented here provides a basic foundation for the construction of a comprehensive handwritten script classification method for official Indian scripts.
Collapse
Affiliation(s)
- Pawan Kumar Singh
- Department of Information Technology, Jadavpur University, Kolkata, West Bengal, INDIA
| | - Ram Sarkar
- Department of Computer Science and Engineering, Jadavpur University, Kolkata, West Bengal, INDIA
| | - Ajith Abraham
- Machine Intelligence Research (MIR) Labs, Scientific Network for Innovation and Research Excellence, Auburn, Washington, USA and Center for Artificial Intelligence, Innopolis University, Russia
| | - Mita Nasipuri
- Department of Computer Science and Engineering, Jadavpur University, Kolkata, West Bengal, INDIA
| |
Collapse
|
4
|
Offline script recognition from handwritten and printed multilingual documents: a survey. INT J DOC ANAL RECOG 2021. [DOI: 10.1007/s10032-021-00365-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
5
|
A Hybrid Swarm and Gravitation-based feature selection algorithm for handwritten Indic script classification problem. COMPLEX INTELL SYST 2021. [DOI: 10.1007/s40747-020-00237-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
AbstractIn any multi-script environment, handwritten script classification is an unavoidable pre-requisite before the document images are fed to their respective Optical Character Recognition (OCR) engines. Over the years, this complex pattern classification problem has been solved by researchers proposing various feature vectors mostly having large dimensions, thereby increasing the computation complexity of the whole classification model. Feature Selection (FS) can serve as an intermediate step to reduce the size of the feature vectors by restricting them only to the essential and relevant features. In the present work, we have addressed this issue by introducing a new FS algorithm, called Hybrid Swarm and Gravitation-based FS (HSGFS). This algorithm has been applied over three feature vectors introduced in the literature recently—Distance-Hough Transform (DHT), Histogram of Oriented Gradients (HOG), and Modified log-Gabor (MLG) filter Transform. Three state-of-the-art classifiers, namely, Multi-Layer Perceptron (MLP), K-Nearest Neighbour (KNN), and Support Vector Machine (SVM), are used to evaluate the optimal subset of features generated by the proposed FS model. Handwritten datasets at block, text line, and word level, consisting of officially recognized 12 Indic scripts, are prepared for experimentation. An average improvement in the range of 2–5% is achieved in the classification accuracy by utilizing only about 75–80% of the original feature vectors on all three datasets. The proposed method also shows better performance when compared to some popularly used FS models. The codes used for implementing HSGFS can be found in the following Github link: https://github.com/Ritam-Guha/HSGFS.
Collapse
|