Details

Publication Date

Tue Jan 25 2022

Journal Name

Iraqi Journal Of Science

Volume

58

Issue Number

1A

Choose Citation Style

Statistics

View publication

3

Statistics

Finding the Similarity between Two Arabic Texts

Arabic Text Similarity

Semantic Similarity

Keyword Extraction

N-Gram

Cosine Similarity Measure

Dice's Similarity Measure

Suhad Malallah kadhem

Aseel Qassim Abd Alameer

...Show More Authors

Calculating similarities between texts that have been written in one language or multiple languages still one of the most important challenges facing the natural language processing. This work offers many approaches that used for the texts similarity. The proposed system will find the similarity between two Arabic texts by using hybrid similarity measures techniques: Semantic similarity measure, Cosine similarity measure and N-gram ( using the Dice similarity measure). In our proposed system we will design Arabic SemanticNet that store the keywords for a specific field(computer science), by this network we can find semantic similarity between words according to specific equations. Cosine and N-gram similarity measures are used in order to find the similar characters sequences. The proposed system was executed by using Visual Basic 2012, and after testing it, it proved to be a worthy for finding the similarity between two Arabic texts (From the viewpoint of accuracy and search time).

View Publication Preview PDF

Quick Preview PDF