Spelling correction is a challenging task for resource-scarce languages. Arabic is one such language: it lacks a large spelling correction dataset, so datasets injected with artificial errors are used instead. In this paper, we train the Text-to-Text Transfer Transformer (T5) model on artificial errors to correct Arabic soft spelling mistakes. Our T5 model corrects 97.8% of the artificial errors injected into the test set. It also achieves a character error rate (CER) of 0.77% on a set containing real soft spelling mistakes. We obtained these results with a 4-layer T5 model trained with a 90% error injection rate and a maximum sequence length of 300 characters.
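As an illustration of the CER metric reported above, the following is a minimal sketch that assumes CER is computed in the usual way: the total character-level Levenshtein distance between model outputs and reference texts, divided by the total number of reference characters. The function names `edit_distance` and `character_error_rate` are hypothetical and are not taken from the paper, whose exact evaluation procedure may differ.

```python
from typing import Sequence


def edit_distance(ref: str, hyp: str) -> int:
    """Character-level Levenshtein distance (insertions, deletions, substitutions)."""
    # prev[j] holds the distance between the processed prefix of ref and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # delete r from ref
                            curr[j - 1] + 1,      # insert h into ref
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def character_error_rate(references: Sequence[str], hypotheses: Sequence[str]) -> float:
    """Assumed definition: total edits divided by total reference characters."""
    total_edits = sum(edit_distance(r, h) for r, h in zip(references, hypotheses))
    total_chars = sum(len(r) for r in references)
    return total_edits / total_chars if total_chars else 0.0


# Illustrative pair: the output ends in ha where the reference has ta marbuta.
print(character_error_rate(["مدرسة"], ["مدرسه"]))
```

Under this definition, one wrong character in a five-character reference gives a CER of 0.2, so a CER of 0.77% corresponds to fewer than one erroneous character per hundred reference characters.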
Conditionality in Arabic, as presumably in other languages, is a broad notion that can be expressed through different syntactic constructions. Most specialists agree that conditional structures are probably the most complex class of compound expression in Arabic. They are used to express a condition on which the realization of what is stated in the main clause depends. Conditional structures are one of the main linguistic means available to an individual for expressing the capacity to imagine situations different from the real ones; to create possible worlds; to dream of past situations that could have been different; to conceal the fact...
Summary of the research: Our research, titled "Arabic Language in the Media between Warning and Development", attempts to trace the most prominent phenomena that accompanied the evolving use of Arabic in the media as these media developed and spread technically and globally. It examines how Arab researchers, linguists, and intellectuals divided into two camps, each demanding the opposite of the other regarding the use of Arabic in the media, and the arguments of each camp on the need to treat the media as one of the nation's cultural, historical, and civilizational pillars in order to enhance its position and maintain unity. The research then highlights the positions of hard-lin...
The present study investigates the relation between biliteral and triliteral roots as an introduction to understanding the nature of Semitic roots during their early stage of development, when they were not confined to a single pattern. The present research is not meant to settle the question of biliteral roots in the Semitic languages; rather, it is meant to confirm the predominance of triliteral roots in these languages, which is due, in part, to the analogy adopted by the majority of linguists. This tendency is frequently seen in languages that tend to overgeneralize the triliteral phenomenon, i.e., to transfer biliteral roots to the triliteral domain, that is, to subject them to the predominant pattern regarding the r...
The article is devoted to word-formation motivation, an issue that remains relevant and that has not only theoretical but also applied significance, beyond its role in disclosing formal-semantic relations between words of one language. The authors consider word-formation motivation consistently, in its varieties, in a comparative way, using material from languages as different as Russian and Arabic, and approach the mechanism of achieving semantic equivalence in translation. Today, for objective reasons, word-formation activity most strongly affects specialized (technical, medical, etc.) vocabulary, which grows from year to year in national dictionaries. This extensive material, selected...
This research focuses on the services that news websites (IMN, Youm7, Huffington Post Arabic) provide to their audience of Internet users, as well as the materials posted on their pages. It monitors and explains them to identify their types, features, and functions, whether informational or non-informational, and to assess the technical potential of each news site given the arrival of the latest information technology. The research used the analysis method to achieve its objectives over the period from 1/1 to 31/1/2017. The researchers used content analysis as the research tool to analyze the news sites and identify the services they provide through their pages. The research was divided into three parts, the...
Abstract
The research aims to examine the effect of Hawkins' strategy on students in the fourth grade of primary school in the General Directorate of Education in Baghdad / Karkh 3 for the academic year (2020-2021). The research was limited to the topics of the Arabic language grammar book for the fourth grade of primary school. The researcher formulated the research hypothesis: there is no statistically significant difference at the significance level (0.05) between the average scores of the experimental group students, who study using the Hawkins strategy, and the average scores of the control group students, who study by the traditional method, in the achievement test. The researcher set a number of...