Mining Deviations in Document Writing Style through Vector Dissimilarity

Nasreen J. Kadhim

doi:10.24996/ijs.2024.65.4.44

Details

Publication Date

Tue Apr 30 2024

Journal Name

Iraqi Journal Of Science

Doubts about a document's originality arise when its writing style changes noticeably. Intrinsic plagiarism detection exploits this evidence: it uncovers plagiarized passages by analyzing the writing style of the suspicious document itself, without a reference corpus to compare against. The proposed work aims to discover deviations in a document's writing style in several steps. First, the entire document is segmented into disjoint segments, each corresponding to a paragraph in the original document. For the entire document and for each segment, a center vector comprising the average weights of its words is constructed. Second, the degree of closeness is calculated by applying cosine similarity to measure, for each segment, the deviation of its center vector from the center vector of the entire document. Additionally, the effect of word n-gram length on system performance is investigated by computing center vectors over word n-grams for n = 1, 2, and 3. The proposed method was evaluated using Precision, Recall, F-measure, Granularity, and Plagdet, with PAN-PC-09 and PAN-PC-11 as the evaluation corpora for intrinsic plagiarism detection. The results show that the proposed approach is comparable to state-of-the-art methods. Discovering style deviations by computing the dissimilarity of weight vectors had a positive impact compared with calculating the difference between the word n-grams in each segment and their corresponding word n-grams in the suspicious document. Furthermore, with respect to n-gram length, word bi-grams yielded better system performance than word uni-grams and word tri-grams.
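
The steps described in the abstract can be sketched roughly as follows. This is a minimal illustration and not the author's implementation: the abstract does not specify the weighting scheme, so relative n-gram frequency is assumed here, and the names `word_ngrams`, `center_vector`, `cosine_similarity`, and `segment_deviations` are hypothetical. Paragraphs are assumed to be separated by blank lines, and deviation is taken as one minus the cosine similarity.

```python
import math
import re
from collections import Counter


def word_ngrams(text, n):
    """Return the lowercased word n-grams of a text as tuples."""
    words = re.findall(r"[a-z']+", text.lower())
    return [tuple(words[i:i + n]) for i in range(len(words) - n + 1)]


def center_vector(text, n):
    """Map each n-gram to its relative frequency (an assumed weighting)."""
    counts = Counter(word_ngrams(text, n))
    total = sum(counts.values())
    return {g: c / total for g, c in counts.items()} if total else {}


def cosine_similarity(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(g, 0.0) for g, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def segment_deviations(document, n=2):
    """Split into paragraphs and score each one against the whole document.

    Returns (segments, deviations), where deviation = 1 - cosine similarity
    between the segment's center vector and the document's center vector.
    """
    segments = [p.strip() for p in re.split(r"\n\s*\n", document) if p.strip()]
    doc_vec = center_vector(document, n)
    deviations = [
        1.0 - cosine_similarity(center_vector(s, n), doc_vec) for s in segments
    ]
    return segments, deviations
```

Segments with the largest deviation would be the candidates for a style change. How a deviation is turned into a plagiarism decision (for example, a threshold) is not stated in the abstract and is left open here.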

Publication Date

Wed Feb 01 2023

Journal Name

International Journal Of Electrical And Computer Engineering (ijece)

Optimized Kalman filters for sensorless vector control induction motor drives

Extended Kalman filter

Induction motor

Multi objective optimization

Sensorless vector control

Unscented Kalman filter

Mohammed Khalil

Bajel Mohammed

Rashid

This paper compares an optimized unscented Kalman filter (UKF) with an optimized extended Kalman filter (EKF) for a sensorless direct field orientation control induction motor (DFOCIM) drive. The performance of the UKF and EKF depends on accurate selection of the state and noise covariance matrices. To this end, a multi-objective genetic algorithm is used to find the optimal values of these matrices. The objectives that the genetic algorithm minimizes are the mean square errors (MSE) between the actual and estimated speed, current, and flux. Simulation results show that the optimal state and noise covariance matrices can improve the estimation of speed, current, t

Publication Date

Fri Feb 08 2019

Journal Name

Journal Of The College Of Education For Women

Online Sumarians Cuneiform Detection Based on Symbol Structural Vector Algorithm

Kawther K.

Cuneiform images require many processing steps before their contents can be read, including image enhancement to clarify the objects (symbols) found in the image. The vector used for classifying a symbol is called the symbol structural vector (SSV); it is built from information about the wedges in the symbol. The experimental tests show results for some numbers with varying relevancy, including various drawings in the online method. The results show high accuracy, and the methods and algorithms were programmed in Visual Basic 6.0. More than one method was applied to extract information from digital images of cuneiform tablets in order to identify most of the signs of Sumerian cuneiform.

Publication Date

Thu Sep 15 2022

Journal Name

Knowledge And Information Systems

Multiresolution hierarchical support vector machine for classification of large datasets

Safaa

Support vector machine (SVM) is a popular supervised learning algorithm based on margin maximization. It has a high training cost and does not scale well to a large number of data points. We propose a multiresolution algorithm MRH-SVM that trains SVM on a hierarchical data aggregation structure, which also serves as a common data input to other learning algorithms. The proposed algorithm learns SVM models using high-level data aggregates and only visits data aggregates at more detailed levels where support vectors reside. In addition to performance improvements, the algorithm has advantages such as the ability to handle data streams and datasets with imbalanced classes. Experimental results show significant performance improvements in compa

Publication Date

Mon Oct 10 2016

Journal Name

Iraqi Journal Of Science

Satellite image classification using KL-transformation and modified vector quantization

Satellite images

classification

Modified Vector Quantization (MVQ)

Rafah Rasheed

Bushra Q.

This work presents satellite image classification for the Al Chabaish marshes and the surrounding district in Dhi Qar province for the years 1990, 2000, and 2015 using two software packages (MATLAB 7.11 and ERDAS Imagine 2014). A proposed supervised classification method (Modified Vector Quantization), implemented in MATLAB, and a supervised classification method (Maximum Likelihood Classifier) in ERDAS Imagine were used in order to obtain the most accurate results and to compare the two methods. The changes that took place in 2000 compared with 1990, and in 2015 compared with 2000, were calculated. The classification results indicated that water and vegetation decreased, while barren land, alluvial soil and shallow water

Publication Date

Mon Jun 05 2023

Journal Name

Journal Of Engineering

A Fuzzy Logic Controller Based Vector Control of IPMSM Drives

fuzzy logic controller

vector control

PID controller

IPMSM

Afaneen Anwer

This paper explores a fuzzy-logic-based speed controller for an interior permanent magnet synchronous motor (IPMSM) drive based on vector control. PI controllers have mostly been used in the speed control loop of field-oriented control of an IPMSM. The fundamentals of fuzzy logic algorithms as they relate to drive control applications are illustrated. A complete comparison between the two tuning algorithms of the classical PI controller and the fuzzy PI controller is presented. A simplified fuzzy logic controller (FLC) for the IPMSM drive was found to maintain high performance standards with a much simpler and less computationally intensive implementation. MATLAB Simulink results are given for different mechanical operating conditions. The simulated

Publication Date

Tue Dec 20 2022

Journal Name

2022 International Conference On Computer And Applications (icca)

Improve Data Mining Techniques with a High-Performance Cluster

Fadhil H.M.

Publication Date

Thu Mar 24 2022

Journal Name

Arab World English Journal

Collocation Networks of Selected Words in Academic Writing: A Corpus-Based Study

Eman

This study aims to shed light on the linguistic significance of collocation networks in academic writing. Following Firth’s principle, “You shall know a word by the company it keeps,” the study examines the shared collocations of three selected nodes (i.e., research, study, and paper) in an academic context. This is achieved by analyzing the selected nodes with the corpus linguistic tool GraphColl in #LancsBox version 5, which was announced in June 2020. The study focuses on the academic writing of two corpora that were designed and collected specifically to serve the purpose of the study. The corpora consist of a collection of abstracts extracted from two different academic journals that publish for writ

Publication Date

Thu Dec 01 2016

Journal Name

Swarm And Evolutionary Computation

A new multi-objective evolutionary framework for community mining in dynamic social networks

Bara'a A.

Haidar S.

Publication Date

Mon Apr 11 2011

Journal Name

Icgst

Employing Neural Network and Naive Bayesian Classifier in Mining Data for Car Evaluation

Data mining

Backpropagation Neural Network

Naïve Bayesian Classifier

Classification

Sarmad

Aida

Junaidah

Ealaf

Mohammed

In data mining, classification is a form of data analysis that can be used to extract models describing important data classes. Two well-known algorithms used in data mining classification are the Backpropagation Neural Network (BNN) and Naïve Bayesian (NB) classifiers. This paper investigates the performance of these two classification methods using the Car Evaluation dataset. Two models were built, one for each algorithm, and the results were compared. Our experimental results indicated that the BNN classifier yields higher accuracy than the NB classifier, but it is less efficient because it is time-consuming and difficult to analyze due to its black-box implementation.

Publication Date

Wed Jun 20 2018

Journal Name

Al-academy

Style in contemporary Iraqi painting before and after the war

ance

The events of the 2003 Iraq war raise the question of whether a shift in individual or collective style can occur in contemporary Iraqi art, similar to what is known in the history of modern art worldwide.

The researcher sought to examine this problem in an academic study and to find an explanation for these questions, starting from definitions that clarify the study's point of view. The title was set as "Style in contemporary Iraqi painting before and after the war". The study was a comparative analysis of two selected Iraqi artists whose work has had a recognized style since before the war, in order to read the transformation that took place in art before and after their tactics, and in order to off
