Mining Deviations in Document Writing Style through Vector Dissimilarity

Nasreen J. Kadhim

doi:10.24996/ijs.2024.65.4.44

Details

Publication Date

Tue Apr 30 2024

Journal Name

Iraqi Journal Of Science



Doubts about the originality of a document arise when a change in its writing style is noticed. Because such a change is evidence of plagiarism, the intrinsic approach to plagiarism detection uncovers plagiarized passages by analyzing the writing style of the suspicious document when no reference corpus is available for comparison. The proposed work aims to discover deviations in document writing style in several steps. First, the entire document is divided into disjoint segments, each corresponding to a paragraph in the original document. For the entire document and for each segment, center vectors comprising the average weights of their words are constructed. Second, the degree of closeness is calculated by applying cosine similarity to measure, for each segment, the deviation of its center vector from the center vector of the entire document. Additionally, the effect of word n-gram length on the performance of the proposed system is investigated by computing center vectors over word n-grams for n = 1, 2, and 3. The proposed method was evaluated using Precision, Recall, F-measure, Granularity, and Plagdet, with the PAN-PC-09 and PAN-PC-11 intrinsic plagiarism detection corpora as evaluation corpora. The results show that the proposed approach is comparable to state-of-the-art methods. Discovering deviations in writing style by computing the dissimilarity of weight vectors had a positive impact compared with calculating the difference between the word n-grams in a segment and their corresponding word n-grams in the suspicious document. Furthermore, with respect to word n-gram length, system performance was better with word bi-grams than with word uni-grams or word tri-grams.
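The two steps described in the abstract can be sketched in Python as follows. This is an illustrative sketch under stated assumptions, not the authors' implementation: the abstract does not specify the weighting scheme, so relative n-gram frequency stands in for the word weights; paragraphs are assumed to be separated by blank lines; and the function names (word_ngrams, center_vector, cosine_similarity, segment_deviations) are hypothetical.

```python
import math
import re
from collections import Counter


def word_ngrams(text, n):
    """Return the word n-grams of a text as space-joined strings."""
    words = re.findall(r"[a-z0-9']+", text.lower())
    return [" ".join(words[i:i + n]) for i in range(len(words) - n + 1)]


def center_vector(text, n):
    """Map each word n-gram to a weight.

    Assumption: the weight is the n-gram's relative frequency in the text;
    the abstract does not state which weighting the paper uses.
    """
    counts = Counter(word_ngrams(text, n))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {gram: count / total for gram, count in counts.items()}


def cosine_similarity(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(weight * v.get(gram, 0.0) for gram, weight in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def segment_deviations(document, n=1):
    """Return one deviation score (1 - cosine similarity) per paragraph.

    Assumption: segments are paragraphs separated by blank lines.
    """
    segments = [p.strip() for p in re.split(r"\n\s*\n", document) if p.strip()]
    doc_vector = center_vector(document, n)
    return [
        1.0 - cosine_similarity(center_vector(segment, n), doc_vector)
        for segment in segments
    ]
```

Segments with the largest deviation scores would be the candidates for a change in writing style. The abstract does not say how a score is turned into a decision, so any threshold applied on top of this sketch would be a further assumption. Calling segment_deviations with n set to 1, 2, or 3 corresponds to the uni-gram, bi-gram, and tri-gram settings that the abstract compares.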


Publication Date

Mon Mar 07 2022

Journal Name

Journal Of Educational And Psychological Researches

Google Classroom in Teaching Writing Composition for College Students

Educational technology

Computer technology

Google Classroom

writing skill

paragraph writing

Hind Salim Kashkool


Technology plays a vital role in all walks of life, including education. Google Classroom is a free educational tool that has recently gained popularity within a short period in many countries, including Iraq. The primary purpose of this study is to explore the use of Google Classroom in EFL learners' composition writing. The sample consisted of 35 second-year EFL college students from the Computer Science Department of the College of Science for Women who had used Google Classroom in their classroom for at least one semester. The students were asked to finish two incomplete paragraphs that contained only the main idea and to write a suitable conclusion for each one. The results sh


Publication Date

Mon Aug 01 2016

Journal Name

Journal Of Economics And Administrative Sciences

User (K-Means) for clustering in Data Mining with application

object

data mining

clustering

machine learning

algorithm

قتيبة نبيل

محي الدين خلف


Great scientific progress has led to a proliferation of information, which accumulates in large databases. It is therefore important to revise and compile this vast amount of data in order to extract hidden information, or to classify the data according to their relations with each other, so that they can be used for technical purposes.

Data mining (DM) is appropriate in this area. This research focuses on the K-Means algorithm for clustering data, applied in practice, where the effect on the variables can be observed by changing the sample size (n) and the number of clusters (K)


Publication Date

Tue Mar 30 2021

Journal Name

Baghdad Science Journal

Application of Data Mining Techniques on Tourist Expenses in Malaysia

Tourism

Data mining

Classification

JRIP

Random Tree

J48

REP Tree

Miao

Tan Shi


Tourism plays an important role in Malaysia's economic development, as it can boost business opportunities in the surrounding economy. Applying data mining to tourism data is a good way to predict areas of business opportunity. Data mining is a process that takes data as input and produces knowledge as output. The number of people travelling in Asian countries has increased in the past few years. Many entrepreneurs start their own businesses, but they face problems such as investing in the wrong business fields and poor service quality, which affect their business income. The objective of this paper is to use data mining technology to meet the business needs and customer needs of tourism enterprises and find the most effective


Publication Date

Wed Feb 20 2019

Journal Name

Iraqi Journal Of Physics

Assessment of nuclear radiation pollution in uranium mining-impacted soil

Natural radioactivity

Gamma-spectroscopy

mine

Najaf/ Iraq.

Raad Obid


Activities associated with uranium mining have generated significant quantities of waste materials containing uranium and other toxic metals. A qualitative and quantitative study was performed to assess the nuclear pollution resulting from drilling and exploration waste left on the surface layer of the soil surrounding the abandoned uranium mine hole located in the south of Najaf province, Iraq. To measure the specific activity, twenty-five surface soil samples were collected, prepared, and analyzed using a gamma-ray spectrometer based on a high-counting-efficiency NaI(Tl) scintillation detector. The results showed that the specific activities in Bq/kg are 37.31 to 1112.47 with mean of 268.16, 0.28 to 18.57 with


Publication Date

Wed Jun 29 2022

Journal Name

Journal Of The College Of Education For Women

Using Online Platforms to Improve Writing

improving writing

e-learning

Google Classroom

semiotic symbols

interactive writing

online platforms

Shahad Saleh Al Asadi

Nasser Al-Issa


Due to the difficulties that Iraqi students face when writing in the English language, this preliminary study aimed to improve students' writing skills by using online platforms remotely. Sixty first-year students from Al-Furat Al-Awsat Technical University participated in this study. Through these platforms, the researchers relied on stimuli such as images, icons, and short titles to allow for deeper and more accurate participation. Data were collected through corrections, observations, and feedback from the researchers and peers. In addition, pre- and post-tests were conducted. The quantitative data were analyzed with the SPSS statistical editor, whereas the qualitative data were analyzed using a pivot table in an Excel sheet. The resu


Publication Date

Thu Oct 01 2015

Journal Name

Engineering And Technology Journal

Genetic Based Optimization Models for Enhancing Multi- Document Text Summarization

Hilal

Nasreen J.


Publication Date

Mon Jan 01 2018

Journal Name

International Journal Of Data Mining, Modelling And Management

Association rules mining using cuckoo search algorithm

data mining

ARM

association rules mining

DCS

discrete cuckoo search

metaheuristic algorithm

Mohammed R.A.

MEHDI G. DUAIMI


Association rules mining (ARM) is a fundamental and widely used data mining technique for extracting useful information from data. Traditional ARM algorithms lose computational efficiency because they mine too many association rules that are not appropriate for a given user. Recent research in ARM investigates metaheuristic algorithms that search for only a subset of high-quality rules. In this paper, a modified discrete cuckoo search algorithm for association rules mining, DCS-ARM, is proposed for this purpose. The effectiveness of the algorithm is tested against a set of well-known transactional databases. Results indicate that the proposed algorithm outperforms the existing metaheuristic methods.


Publication Date

Sun Jan 01 2006

Journal Name

Journal Of The College Of Languages (jcl)

The Analysis of Errors Made by Iraqi Students in Writing

Huda Nafea’


Writing plays an effective role in developing one's thinking and enhancing learning. It is, in fact, a means of widening one's own views about the world for the numerous uses that it can serve (Samuel, 1988:28).

Despite the unquestionable significance of writing in the teaching-learning process, the traditional approach seems to be far from being able to put such significance into practice. Traditionalists give priority to formulating students' ideas before using a prescribed rhetorical framework and then submitting the written product for grading. Emphasis is, therefore, limited to the prewriting stage where a certain topic is explored, and the role of the teacher is confined to assigning the topic and


Publication Date

Mon Feb 01 2016

Journal Name

Al-academy

Formal Diversity in Altasweed plates hand writing: أشرف كامل عبدالامير

Ashraf


This research studies formal variations in scratch-linear paintings. Through surveys carried out by the researcher, it was found that, to his knowledge, earlier researchers had not addressed this subject, which gave the researcher a rationale for studying scratch plates and identifying the design variations in them. The first chapter states the research problem as the following question: What are the morphological variations in scratch-linear paintings? To address this problem, the research aims to identify the characteristics of scratch paintings and their design fundamentals and relationships, which are identified by the researcher Balentajat completed in all of I


Publication Date

Wed Jun 01 2011

Journal Name

Journal Of The College Of Languages (jcl)

Areas in phonetics and phonology Differences Between Speech and Writing

May Stephan

Rasha Abdul Ridha


In any language there is some amount of difference between written language (planned) and spoken language (spontaneous). Since planned speech could be considered a form of written language, it could be inferred that there are also differences between planned speech and spontaneous speech. Some of these differences are very clear in terms of syntax, lexis, phonology and discourse. These differences are highlighted in order to make a clear distinction between spontaneous and planned speech.

This paper attempts to show the differences between the two forms of a language (written and spoken English) as far as a number of linguistic features are tackle
