Mining Deviations in Document Writing Style through Vector Dissimilarity

Nasreen J. Kadhim

doi:10.24996/ijs.2024.65.4.44

Details

Publication Date

Tue Apr 30 2024

Journal Name

Iraqi Journal Of Science

DOI

10.24996/ijs.2024.65.4.44

Choose Citation Style

Statistics

View publication

17

Statistics

(1)

Mining Deviations in Document Writing Style through Vector Dissimilarity

Nasreen J. Kadhim

...Show More Authors

Doubts arise about the originality of a document when noticing a change in its writing style. This evidence to plagiarism has made the intrinsic approach for detecting plagiarism uncover the plagiarized passages through the analysis of the writing style for the suspicious document where a reference corpus to compare with is absent. The proposed work aims at discovering the deviations in document writing style through applying several steps: Firstly, the entire document is segmented into disjointed segments wherein each corresponds to a paragraph in the original document. For the entire document and for each segment, center vectors comprising average weight of their word are constructed. Second, the degree of closeness is calculated through applying Cosine similarity to measure for each segment, the deviation of its center vector from the center vector of the entire document. Additionally, word n-gram length will be investigated to show its effect on the proposed system performance wherein, center vectors are computed considering word n-grams for different values of n (n= 1, 2, and 3). Performance evaluation of the proposed method was accomplished through the use of Precision, Recall, F-measure, Granularity, and Plagdet as evaluation measures. Moreover, PAN-PC-09 and PAN-PC-11 were used for detecting intrinsic plagiarism as evaluation corpora. It is shown that the proposed approach has achieved results that are comparable to the state-of-the-art methods. Positive impact was observed through discovering deviations in document writing style by computing weight vectors dissimilarity rather than calculating the difference between the word n-grams that exist in segments and their corresponding word n-grams in the suspicious document. Furthermore, when considering the length of word n-gram, better results were recorded for system performance when word bi-grams was used compared to word uni-grams and word tri-grams.

View Publication

Publication Date

Wed Mar 15 2023

Journal Name

Al-academy

The role of Iraqi nature in the style of the artist Ayath Abdul Rahman Ameen

role

nature

style

artist

Ayath Al-Doori

Mahmmoud

...Show More Authors

This research aims to clarify the role of Iraqi nature in the style of the artist Ayath Al-Doori. Through it, the spotlight was shed on the aesthetic of Iraqi nature, its importance and its relationship to art, and its pioneering role in the style of the Doori artist. The research included two axes: the theoretical axis and the applied axis. The first theoretical axis deals with two topics: The first topic: the aesthetic of Iraqi nature as part of the life and methods of Iraqi artists from ancient times until today, including the periodic artist. The second topic: the league artist touched on the private life of the league artist. As for the second (applied) axis, it includes the research community, the research sample models, the resear

View Publication Preview PDF

Publication Date

Mon Apr 01 2019

Journal Name

Journal Of Educational And Psychological Researches

Moral Awareness and Its Relation to Authoritarian Parenting Style of Secondary School Students in Baghdad

moral awareness

authoritarian parenting style

Mohammed Abbas .M

Salwa Faiq Abd

...Show More Authors

This research aims at identifying the level of Moral Awareness and the level of Authoritarian Parenting Style of Secondary School Students in Baghdad. Additionally, the study seeks to identify the significant difference between these two variables in term of gender (male-female), as well as the correlation between Moral Awareness and Authoritarian Parenting Style. To do this, the researchers have adopted the scale of moral awareness prepared by the (Assl 2014), which the number of its items was finalized of (28) items. As for the Authoritarian Parenting Style scale, the researcher designed a questionnaire of (22) items as the number of its finalized form. The two instruments were applied on a sample of (140) male and female Students who

View Publication Preview PDF

Publication Date

Mon Mar 01 2010

Journal Name

Journal Of Computer Science

Dropping down the Maximum Item Set: Improving the Stylometric Authorship Attribution Algorithm in the Text Mining for Authorship Investigation

Mustafa T.K.

...Show More Authors

View Publication

(4)

(2)

Publication Date

Mon Feb 04 2019

Journal Name

Journal Of The College Of Education For Women

Disjuncts and Conjuncts: A Syntactic Type of Study to Evaluate Kurdish Students' Writing in English (At University Level)

Abbas Jassim

Jwan Iqbal

...Show More Authors

The principal concern of this study is Disjunct and Conjunct adverbials in the
English language. The study sets out to explore and clarify the types, nature and
structure of disjuncts and conjuncts. It also aims at testing student's performance to
evaluate the use and usage of the disjuncts and conjuncts in their written performance.
Two tests, accordingly, were given to some fifty students of at the Dept. of English, at
the college of languages (third and fourth stages) in the University of Sulaimani. The
hypothesis that the study was based on are those students use disjuncts and conjuncts
hardly enough in their writings and when doing so, they generally tend to stick only to
the most commonly used and familiar o

View Publication Preview PDF

Publication Date

Thu Nov 01 2018

Journal Name

Iraqi National Journal Of Nursing Specialties

Life – Style of patients with peptic ulcer A case control study

Samir

betool

...Show More Authors

Abstract A descriptive (retrospective) (a case-control) study was carried out at Al-Karama Teaching Hospital, Baghdad Teaching Hospital and Surgical Specialties Hospital, and Gastro-Intestinal Tract and Liver (GIT) Hospital for the period of December 1st, 2001 To March 15th 2002. To identify aspects of life-style that may contribute to the occurrence of peptic ulcer (P.U)as risk factors. And to find out the relationship between the demographic characteristic of the group. Non-probability (Purposive) sample of (100) cases who were admitted to the endoscopy department who were later on diagnosed as having

View Publication Preview PDF

Publication Date

Fri Jan 01 2021

Journal Name

Towards Implementation Of Sustainability Concepts In Developing Countries

Toward Resiliency Through Sustainable Urban Formation in Baghdad

Zaynab

...Show More Authors

View Publication

(3)

(1)

Publication Date

Sat Jan 01 2022

Journal Name

Indonesian Journal Of Electrical Engineering And Computer Science

Construct an efficient distributed denial of service attack detection system based on data mining techniques

Dhurgham

Amer

...Show More Authors

<span>Distributed denial-of-service (DDoS) attack is bluster to network security that purpose at exhausted the networks with malicious traffic. Although several techniques have been designed for DDoS attack detection, intrusion detection system (IDS) It has a great role in protecting the network system and has the ability to collect and analyze data from various network sources to discover any unauthorized access. The goal of IDS is to detect malicious traffic and defend the system against any fraudulent activity or illegal traffic. Therefore, IDS monitors outgoing and incoming network traffic. This paper contains a based intrusion detection system for DDoS attack, and has the ability to detect the attack intelligently, dynami

View Publication Preview PDF

(4)

(2)

Publication Date

Thu Jul 01 2021

Journal Name

University Of Northampton Pue

Validating a Proposed Data Mining Approach (SLDM) for Motion Wearable Sensors to Detect the Early Signs of Lameness in Sheep

motion wearable sensors

sensor data mining

supervised machine learning

CART ensemble classifier

sheep lameness detection

sheep behaviour classification

Zainab

...Show More Authors

View Publication

Publication Date

Mon Apr 03 2023

Journal Name

Journal Of Educational And Psychological Researches

The Effectiveness of Using Cubing Technique in the Iraqi EFL Secondary Students' Composition Writing, Vocabulary, and Meta-Cognitive Awareness

cubing technique

composition writing

vocabulary

meta-cognitive awareness

Sana

...Show More Authors

Abstract

The aim of this research is to determine how well the Cubing Technique affects the Iraqi EFL students' composition writing, vocabulary, and meta-cognitive awareness of writing strategies. The sample of (64) secondary-school female students in the fifth grade is drawn from two classrooms and split into two equal groups: the experimental group and the control group, each of which consists of (32) students. A quasi-experimental design is applied. The performance test and Meta-cognitive Writing Strategies questionnaire are given as a pre-test for equalizing the two groups after ensuring their validity and reliability. Then, they are administrated as a posttest in both groups. According to the results, the si

View Publication Preview PDF

Publication Date

Tue Dec 01 2020

Journal Name

Baghdad Science Journal

A Modified Support Vector Machine Classifiers Using Stochastic Gradient Descent with Application to Leukemia Cancer Type Dataset

Classification

Dimension Reduction

Feature Selection

Leukemia Diagnosis

Stochastic Gradient Descend.

Ghadeer JM

...Show More Authors

Support vector machines (SVMs) are supervised learning models that analyze data for classification or regression. For classification, SVM is widely used by selecting an optimal hyperplane that separates two classes. SVM has very good accuracy and extremally robust comparing with some other classification methods such as logistics linear regression, random forest, k-nearest neighbor and naïve model. However, working with large datasets can cause many problems such as time-consuming and inefficient results. In this paper, the SVM has been modified by using a stochastic Gradient descent process. The modified method, stochastic gradient descent SVM (SGD-SVM), checked by using two simulation datasets. Since the classification of different ca

View Publication Preview PDF

(11)

(7)

1 2 ... 10 11 12 13 ... 1490 1491