Text documents are unstructured and high dimensional. Effective feature selection is required to select the most important and significant feature from the sparse feature space. Thus, this paper proposed an embedded feature selection technique based on Term Frequency-Inverse Document Frequency (TF-IDF) and Support Vector Machine-Recursive Feature Elimination (SVM-RFE) for unstructured and high dimensional text classificationhis technique has the ability to measure the feature’s importance in a high-dimensional text document. In addition, it aims to increase the efficiency of the feature selection. Hence, obtaining a promising text classification accuracy. TF-IDF act as a filter approach which measures features importance of the text documents at the first stage. SVM-RFE utilized a backward feature elimination scheme to recursively remove insignificant features from the filtered feature subsets at the second stage. This research executes sets of experiments using a text document retrieved from a benchmark repository comprising a collection of Twitter posts. Pre-processing processes are applied to extract relevant features. After that, the pre-processed features are divided into training and testing datasets. Next, feature selection is implemented on the training dataset by calculating the TF-IDF score for each feature. SVM-RFE is applied for feature ranking as the next feature selection step. Only top-rank features will be selected for text classification using the SVM classifier. Based on the experiments, it shows that the proposed technique able to achieve 98% accuracy that outperformed other existing techniques. In conclusion, the proposed technique able to select the significant features in the unstructured and high dimensional text document.
Many of researchers have written about social responsibility and business strategy and competitive advantage, and they have given particular attention to the relationship between economic and social responsibility , but what is missing in this aspect is how the economic units that use their core competencies to advance social responsibility initiatives so that they can achieve a significant competitive advantage and create value for it ?
The current research aims to verify the view that "the economic and social objectives in the long term is not contradictory in nature but complementary objectives essential", as well as make sure that the s
... Show MoreAbstract
This research aims to know the effect of job burnout in the worker’s performance. The researcher presented a theoretical basis for job burnout and the worker's performance. In order to achieve the objectives of the research, a hypothesis was drawn up that determines the nature of the relationship between the independent variable of job burnout and its dimensions (reduced personal accomplishment, depersonalization, Emotional Exhaustion) and variable dependent performance of workers dimensions (productivity, job satisfaction, organizational commitment, creativity), And to represent the volume of this community according to (de Morgan, D. Morgan) glo
... Show MoreAnomaly detection is still a difficult task. To address this problem, we propose to strengthen DBSCAN algorithm for the data by converting all data to the graph concept frame (CFG). As is well known that the work DBSCAN method used to compile the data set belong to the same species in a while it will be considered in the external behavior of the cluster as a noise or anomalies. It can detect anomalies by DBSCAN algorithm can detect abnormal points that are far from certain set threshold (extremism). However, the abnormalities are not those cases, abnormal and unusual or far from a specific group, There is a type of data that is do not happen repeatedly, but are considered abnormal for the group of known. The analysis showed DBSCAN using the
... Show MoreThe purpose of the current investigation is to distinguish between working memory ( ) in five patients with vascular dementia ( ), fifteen post-stroke patients with mild cognitive impairment ( ), and fifteen healthy control individuals ( ) based on background electroencephalography (EEG) activity. The elimination of EEG artifacts using wavelet (WT) pre-processing denoising is demonstrated in this study. In the current study, spectral entropy ( ), permutation entropy ( ), and approximation entropy ( ) were all explored. To improve the classification using the k-nearest neighbors ( NN) classifier scheme, a comparative study of using fuzzy neighbourhood preserving analysis with -decomposition ( ) as a dimensionality reduction technique an
... Show MoreFourty -tow Libyan patients with hydatidosis, which were
referred to by the physician for the detection of hydatid cyst by X - rays, Ultrasound and CT-Scan. The infection rate in females and males was(69% )and (31% )respectively .The highest rate 69% was in the liver, followed by the lung( 23.8%), the brain (4.8%) and kidney
(2.4%).
A total of 42 serum samples were gathered from Libyan patients infected with hydatidosis, 33 serum samples from patients cases with other parasitic diseases than hydatidosis and 30 serum samples from healthy normal controls and were tested by Dot-ELIZA utilizing antigen B from sheep hy
... Show MoreThe aim of the research is to shed light on the dimensions of the strategic lens and its impact on achieving the pioneer tax performance and represented by the dimensions (strategic direction, growth, pilot indicator, renewal and modernization, efficiency and effectiveness) in the General Tax Authority. The questionnaire was adopted as a tool to collect data and information from the adult sample They are (91) who are on the site (Assistant Director General, Head of Division, First Division Deputy, Second Division Deputy, Division Officer, Division Officer Associate) The statistical program (SPSS) has been used to calculate (the mean, the standard deviation, the correlation coefficient, the difference coefficient, the F test, the
... Show MoreBackground: Ischemic heart disease is a major cause of the diastolic heart failure. Risk of heart failures was increased with microvascular coronary disease, which is characterized by left ventricular stiffness with impaired relaxation and reduced compliance. Aim of this study is to estimate the effect of the severity of myocardium ischemia on the left ventricle ejection fraction and left ventricular volume using SPECT with 99mTc MIBI and to compare the results with the echocardiography. The study included 117 subjects with ischemic heart disease were examined using SPECT and echocardiography techniques. The following
... Show MoreComputer science has evolved to become the basis for evolution and entered into all areas of life where the use of computer has been developed in all scientific, military, commercial and health institutions. In addition, it has been applied in residential and industrial projects due to the high capacity and ability to achieve goals in a shorter time and less effort. In this research, the computer, its branches, and algorithms will be invested in the psychological field. In general, in psychological fields, a questionnaire model is created according to the requirements of the research topic. The model contains many questions that are answered by the individuals of the sample space chosen by the researcher. Often,
... Show MoreIn this research, a group of gray texture images of the Brodatz database was studied by building the features database of the images using the gray level co-occurrence matrix (GLCM), where the distance between the pixels was one unit and for four angles (0, 45, 90, 135). The k-means classifier was used to classify the images into a group of classes, starting from two to eight classes, and for all angles used in the co-occurrence matrix. The distribution of the images on the classes was compared by comparing every two methods (projection of one class onto another where the distribution of images was uneven, with one category being the dominant one. The classification results were studied for all cases using the confusion matrix between every
... Show More