Arabic Personal Name Matching: Names Written using Latin Alphabet
- 1 Ziane Achour University - Djelfa, Algeria
- 2 Universite de Laghouat, Algeria
- 3 Groupe de Recherche Rouennais en Informatique Fondamentale (GR2IF), Algeria
- 4 Universite de Ghardaia, Algeria
Abstract
Abstract: In many Arab countries’ public administrations, Arabic personal names are written with Latin alphabet, generally, in various ways by different writers. This has led to many problems when it comes to connecting these administrations. The aim of this study was to propose two new approaches for the pairwise matching of Arabic personal names. The first approach is based on string alignment and phonetic transcription. Appropriate scoring functions were defined to catch similarity between Arabic personal names. In the second approach, we use machine learning techniques to derive a suitable model for this problem. Precisely, we suggest using a Multi-Layer Perceptron (MLP) architecture and experiment with different configurations. Performances of the new models compare well with the best-performing similarity measures (Jaro, Jaro-Winkler, Double Metaphone and Edit Distance) in terms of precision, recall and F1. Even though the work was carried out for the (Algeria/French Alphabet) case, it can be adapted to any other (country/script) case, like (Egypt/English).
DOI: https://doi.org/10.3844/jcssp.2021.776.788
Copyright: © 2021 Attia Nehar, Slimane Bellaouar, Djelloul Ziadi and Khaled Moulay Omar. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
- 2,774 Views
- 1,415 Downloads
- 0 Citations
Download
Keywords
- Personal Name Matching
- Phonetic Transcription
- Phonetic Encoding
- Sequence Alignment
- Machine Learning