ARABIC SOFT SPELLING CORRECTION WITH T5

Mohammed Al-Qaraghuli; Ola Arif Jaafar

doi:10.5455/jjcit.71-1699768515

Details

Publication Date

Mon Jan 01 2024

Journal Name

Jordanian Journal Of Computers And Information Technology

Spelling correction is considered a challenging task for resource-scarce languages. Arabic is one such language: it lacks a large spelling correction dataset, so datasets injected with artificial errors are used to overcome this problem. In this paper, we trained the Text-to-Text Transfer Transformer (T5) model using artificial errors to correct Arabic soft spelling mistakes. Our T5 model can correct 97.8% of the artificial errors that were injected into the test set. Additionally, our T5 model achieves a character error rate (CER) of 0.77% on a set that contains real soft spelling mistakes. We achieved these results using a 4-layer T5 model trained with a 90% error injection rate and a maximum sequence length of 300 characters.
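As an illustration of the character error rate (CER) reported in the abstract, the following minimal Python sketch computes CER as the total character-level edit distance divided by the total number of reference characters. This is a common definition and is assumed here; the abstract does not state the exact formula the authors used. The function names and the example word pair are hypothetical and are not taken from the paper.

```python
def levenshtein(ref: str, hyp: str) -> int:
    """Character-level edit distance (insertions, deletions, substitutions)."""
    prev = list(range(len(hyp) + 1))  # distances from the empty reference prefix
    for i, r in enumerate(ref, start=1):
        curr = [i]  # deleting i reference characters to reach an empty hypothesis
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def cer(references: list[str], hypotheses: list[str]) -> float:
    """Assumed definition: total edit distance / total reference characters."""
    total_edits = sum(levenshtein(r, h) for r, h in zip(references, hypotheses))
    total_chars = sum(len(r) for r in references)
    return total_edits / total_chars


# Hypothetical example: the final letter differs between the two spellings.
reference = ["مدرسة"]
model_output = ["مدرسه"]
print(cer(reference, model_output))
```

In this made-up pair, one of the five reference characters is substituted, so the sketch reports a CER of one fifth. A corpus-level figure such as the 0.77% in the abstract would be obtained the same way over all test sentences, if the authors used this definition.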

Publication Date

Sat Jan 02 2021

Journal Name

Journal Of The College Of Languages (jcl)

Translating Food and Drink-Related Insults in Shakespeare’s (Henry IV) into Arabic

insults

linguistic context

procedures

source language

target language

Essam Tahir

This study highlights the problems of translating Shakespeare's food and drink-related insults (henceforth FDRIs) in Henry IV, Parts I and II, into Arabic. It adopts Vinay and Darbelnet's (1950s) model of direct and oblique translation to highlight the applicability of the different methods and procedures used by the two selected translators (Mashati, 1990; Habeeb, 1905). The present study tries to answer the following questions: (i) To what extent might the FDRIs in Henry IV pose a translational problem for the selected translators in finding suitable cultural equivalents for them? (ii) Why do the translators, in many cases, resort to a literal procedure which is almost not worka

Publication Date

Wed Aug 11 2021

Journal Name

Journal Of The College Of Languages

Pragmatic Analysis of the Translation of English Culture-specific Proverbs into Arabic

Abdali Hammood Shihan

Publication Date

Mon Aug 16 2021

Journal Name

Türkçe Sözlükte TDK Yer Alan Arapça Kelimeler Üzerine Bir Anlam Bilimi İncelemesi

A SEMANTICS REVIEW ON THE ARABIC WORDS IN THE TURKISH DICTIONARY (TDK)

LANGUAGE RELATION

ARABIC

TURKISH

BORROWING

SEMANTICS.

SARAH

Publication Date

Sun Apr 01 2018

Journal Name

Al–bahith Al–a'alami

The uses used in the political implications of Arabic-speaking foreign website

uses

political

implications

Arabic

speaking

website

Iman Sabih

Hussein Ali

This research is intended to highlight the uses of political content in foreign Arabic-speaking websites, such as "CNN" and "Euro News". The research problem stems from the main question: What is the nature of the use of the websites in the political content provided through them? A set of sub-questions elaborates the aspects of the research, which aims to achieve a set of objectives, including identifying the topics covered by the political content provided through the sample sites during the analysis period. The study uses descriptive research based on the researcher's observations, describing them accurately and defining the relations between their components.
The research conducted the des

Publication Date

Sun Jan 30 2022

Journal Name

Iraqi Journal Of Science

A Survey on Arabic Text Classification Using Deep and Machine Learning Algorithms

Farah A.

Nada A.Z.

Text categorization refers to the process of grouping text or documents into classes or categories according to their content. The text categorization process consists of three phases: preprocessing, feature extraction, and classification. Compared with English, only a few studies have addressed the categorization and classification of Arabic text. For a variety of applications, such as text classification and clustering, Arabic text representation is a difficult task because the Arabic language is noted for its richness, diversity, and complicated morphology. This paper presents a comprehensive analysis and comparison of research from the last five years based on the dataset, year, algorithms and the accuracy th

Publication Date

Wed Dec 11 2019

Journal Name

Journal Of The College Of Education For Women

Differences of Style between English and Arabic Political Discourse: A Contrastive Study

Arabic discourse

choice

English discourse

psychological status

style

Shifaa Hadi

Traditionally, style is defined as the expressive, emotive or aesthetic emphasis added linguistically to the discourse while its meaning remains the same. In the current study, however, style is defined as the linguistic choice that language users can make for specific purposes.

This study thus aims to analyze Arabic and English political speeches to find out whether there are differences of style between English and Arabic and whether the choices the language users make reveal any traits of their psychological status.

To fulfill the above aims, the study hypothesizes that English and Arabic speeches can be analyzed stylistically and that there are stylistic difference

Publication Date

Thu Nov 01 2018

Journal Name

Journal Of Economics And Administrative Sciences

Using Multidimensional Contingency Coefficient to Discrimination Arabic Poems for Sample of Poets

Arabic poetry

Arabic letters shapes

Arabic letters characteristics

Al-alta'reef

multidimensional contingency table

contingency measurements.

عبد المنعم صالح

مها عبد الكريم

The purpose of this paper is to discriminate between the poems of each poet based on the characteristics and attributes of the Arabic letters. Four categories are used for the Arabic letters. Letter frequencies are entered into a multidimensional contingency table in which each dimension has two or more levels, and the contingency coefficient is then calculated.

The paper's sample consists of six poets from different historical ages, with five poems per poet. The method was programmed in MATLAB. The efficiency of the proposed method is 53% for the whole sample and between 90% and 95% for each poet's poems.
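The abstract does not give the formula behind the contingency coefficient. As a rough sketch only, assuming the standard two-way Pearson contingency coefficient C = sqrt(chi2 / (chi2 + n)) rather than the multidimensional variant the paper describes, the calculation could look like the Python below. The function name and the table values are hypothetical and are not taken from the paper, which used MATLAB.

```python
import math


def contingency_coefficient(table: list[list[float]]) -> float:
    """Pearson's contingency coefficient for a two-way frequency table.

    Assumes every row total and column total is positive.
    """
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)

    # Pearson chi-square statistic: sum of (observed - expected)^2 / expected,
    # with expected counts taken from the independence model.
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected

    return math.sqrt(chi2 / (chi2 + n))


# Made-up counts: rows could stand for two poets, columns for two letter categories.
letter_counts = [[10, 20],
                 [20, 10]]
print(contingency_coefficient(letter_counts))
```

A value of 0 indicates that rows and columns are independent in the table, and larger values indicate stronger association. How the paper extends this to more than two dimensions is not stated in the abstract.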

Publication Date

Thu Dec 01 2022

Journal Name

Baghdad Science Journal

Using Graph Mining Method in Analyzing Turkish Loanwords Derived from Arabic Language

Arabic language

Data mining

Graph mining

Loanwords

Turkish language

Abbood Kirebut

Muneam Jabbar

Ahmed Hussein

Loanwords are words transferred from one language to another that become an essential part of the borrowing language. Loanwords pass from the source language to the recipient language for many reasons. Detecting these loanwords is a complicated task because there are no standard specifications for transferring words between languages, which leads to low accuracy. This work tries to enhance the accuracy of detecting loanwords, using the Turkish and Arabic languages as a case study. In this paper, the proposed system finds all possible loanwords using any set of characters, either alphabetically or randomly arranged. Then, it processes the distortion in the pronunciation, and solves the problem of the missing lette

Publication Date

Sat Jan 09 2016

Journal Name

مجلة العلوم التربوية والنفسية

Hill, the Employment of a Specimen in the Teaching of Arabic Grammar

Raed Rasim Yunis

The construction process of reception containing rebuild educated new gloss within the context of real-time knowledge with previous experience and learning environment, accounting for all of the real experiences and information beside Education backbones structural climate (Olive, 2002, p. 212). It is based on two basic principles. First: the natural science that we know from our experiences, we cannot say for sure Bhakaigah realism and clearly, but built by creative minds of certain interpretations be applicable in light of our expectations. Second: knowledge is built effectively by the active learner, who adapts new knowledge to the conceptual framework he has, since everyone has a conceptual framework that can break at any time and be replaced by a ne

Publication Date

Sat Jan 02 2021

Journal Name

Journal Of The College Of Languages (jcl)

Pragmatic Analysis of the Translation of English Culture-Specific Proverbs into Arabic

CSPs

culture

translation

translation strategy

Abdali H. Shihan

Translating culture-specific proverbs (CSPs) is a challenging task since they often occur in a peculiar context. Further, CSPs are intended to imply meanings that extend far beyond their literal meaning. As far as English and Arabic are concerned, translators often encounter problems in translating CSPs due to cultural differences between the source language (SL) and the target language (TL), as well as what seems to be a lack of equivalents for some CSPs.

In view of this, the present study aims at investigating the translation of CSPs in three English-Arabic dictionaries of proverbs, namely Dictionary of Common English Proverbs Translated and Explained (2004), One thousand and One English Pr
