How to separate between Machine-Printed/Handwritten and Arabic/Latin Words?

Afef Kacem; Asma Saidani; Abdel Belaid


oai:doaj.org/article:8713a24e6602413eaf459eb4fbe02a59

Publication date: 1 April 2014
Publisher: 'Universitat Autonoma de Barcelona'

Abstract

This paper gathers several contributions to the identification of script and its nature (handwritten or machine-printed). Different sets of features have been employed successfully to discriminate between handwritten and machine-printed Arabic and Latin scripts. They include well-established features previously used in the literature and new structural features that are intrinsic to Arabic and Latin scripts. The performance of these features is studied throughout the paper. We also compare the performance of three classifiers used to identify the script at word level: Bayes (AODEsr), k-Nearest Neighbor (k-NN) and Decision Tree (J48). These classifiers were chosen to be sufficiently different from one another to test the feature contributions. Experiments were conducted with handwritten and machine-printed words covering a wide range of fonts. Experimental results show the capability of the proposed features to capture differences between scripts and the effectiveness of the three classifiers. Average identification precision and recall rates of 98.72% were achieved using a set of 58 features and the AODEsr classifier, which is slightly better than the rates reported in similar works.
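To make the reported metric concrete, the following is a minimal sketch of how average precision and recall could be computed for a word-level script classifier. It assumes four classes (the handwritten/machine-printed and Arabic/Latin combinations implied by the abstract) and macro averaging; the abstract does not state the averaging scheme, and the class labels, function name and toy data are hypothetical illustrations, not the authors' code or data.

```python
from collections import Counter

# Hypothetical labels: the four script/nature combinations implied by the abstract.
CLASSES = [
    "printed-arabic",
    "printed-latin",
    "handwritten-arabic",
    "handwritten-latin",
]


def macro_precision_recall(y_true, y_pred, classes=CLASSES):
    """Return (macro precision, macro recall) averaged over the given classes.

    Assumption: macro averaging, i.e. each class contributes equally.
    """
    true_positives = Counter()          # correct predictions per class
    predicted_count = Counter(y_pred)   # how often each class was predicted
    actual_count = Counter(y_true)      # how often each class really occurs
    for truth, guess in zip(y_true, y_pred):
        if truth == guess:
            true_positives[truth] += 1

    precisions = [
        true_positives[c] / predicted_count[c] if predicted_count[c] else 0.0
        for c in classes
    ]
    recalls = [
        true_positives[c] / actual_count[c] if actual_count[c] else 0.0
        for c in classes
    ]
    return sum(precisions) / len(classes), sum(recalls) / len(classes)


# Toy data (invented for illustration): two words per class, one mistake.
y_true = [
    "printed-arabic", "printed-arabic",
    "printed-latin", "printed-latin",
    "handwritten-arabic", "handwritten-arabic",
    "handwritten-latin", "handwritten-latin",
]
y_pred = list(y_true)
y_pred[-1] = "printed-latin"  # one handwritten Latin word mistaken for printed Latin

precision, recall = macro_precision_recall(y_true, y_pred)
print(f"macro precision = {precision:.4f}, macro recall = {recall:.4f}")
```

In this toy case the single error lowers the precision of the printed Latin class and the recall of the handwritten Latin class, which is why the two averages differ.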

Last time updated on 04/06/2019

This paper was published in Directory of Open Access Journals.
