Text documents are unstructured and high dimensional. Effective feature selection is required to select the most important and significant feature from the sparse feature space. Thus, this paper proposed an embedded feature selection technique based on Term Frequency-Inverse Document Frequency (TF-IDF) and Support Vector Machine-Recursive Feature Elimination (SVM-RFE) for unstructured and high dimensional text classificationhis technique has the ability to measure the feature’s importance in a high-dimensional text document. In addition, it aims to increase the efficiency of the feature selection. Hence, obtaining a promising text classification accuracy. TF-IDF act as a filter approach which measures features importance of the text documents at the first stage. SVM-RFE utilized a backward feature elimination scheme to recursively remove insignificant features from the filtered feature subsets at the second stage. This research executes sets of experiments using a text document retrieved from a benchmark repository comprising a collection of Twitter posts. Pre-processing processes are applied to extract relevant features. After that, the pre-processed features are divided into training and testing datasets. Next, feature selection is implemented on the training dataset by calculating the TF-IDF score for each feature. SVM-RFE is applied for feature ranking as the next feature selection step. Only top-rank features will be selected for text classification using the SVM classifier. Based on the experiments, it shows that the proposed technique able to achieve 98% accuracy that outperformed other existing techniques. In conclusion, the proposed technique able to select the significant features in the unstructured and high dimensional text document.
In this paper, we suggest a descent modification of the conjugate gradient method which converges globally provided that the exact minimization condition is satisfied. Preliminary numerical experiments on some benchmark problems show that the method is efficient and promising.
The identity of the cultural identity of the party (the ego and the other) during a series of questions subject to the transformations of history and the process of culture, the search of identity and its characteristics within the novel discourse calls to address the cultural impact communication discourse aims to reveal the functional and intellectual benefits that contributed to the formation of cultural identity, and stand at the most important manifestations And the secretions that mimic a world remains part of it present in the imagination of the producer of the text is moving in accordance with the holistic perception of the world embedded between the lines of the trial.The question of identity as an intellectual dimension in orde
... Show MoreThe Life Cycle and Larval Development of Neoechinorhynchus iraqensis (Acanthocephala: Neoechinorhynchidae) in the Intermediate Host
Objective: This project aimed to study the morphological description and histological structure of ileum in Iraqi black partridge (