Text documents are unstructured and high dimensional. Effective feature selection is required to select the most important and significant features from the sparse feature space. Thus, this paper proposes an embedded feature selection technique based on Term Frequency-Inverse Document Frequency (TF-IDF) and Support Vector Machine-Recursive Feature Elimination (SVM-RFE) for unstructured and high-dimensional text classification. This technique can measure a feature's importance in a high-dimensional text document. In addition, it aims to increase the efficiency of feature selection and thereby obtain promising text classification accuracy. TF-IDF acts as a filter approach that measures the importance of features in the text documents at the first stage. SVM-RFE uses a backward feature elimination scheme to recursively remove insignificant features from the filtered feature subsets at the second stage. This research runs sets of experiments on a text dataset retrieved from a benchmark repository, comprising a collection of Twitter posts. Pre-processing steps are applied to extract relevant features. After that, the pre-processed features are divided into training and testing datasets. Next, feature selection is performed on the training dataset by calculating the TF-IDF score for each feature. SVM-RFE is then applied for feature ranking as the next feature selection step. Only the top-ranked features are selected for text classification using the SVM classifier. The experiments show that the proposed technique achieves 98% accuracy, outperforming other existing techniques. In conclusion, the proposed technique is able to select the significant features in unstructured and high-dimensional text documents.
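The first (filter) stage described above can be sketched in a few lines of Python. This is a minimal illustration and not the authors' implementation: the toy corpus, the function names `tfidf_scores` and `top_k_terms`, the unsmoothed `tf * log(N / df)` weighting, and the choice to rank each term by its maximum score across documents are all assumptions made for the example. The abstract does not state how the scores are aggregated or thresholded.

```python
import math
from collections import Counter


def tfidf_scores(docs):
    """Return one {term: tf-idf} dict per tokenized document.

    Assumes tf = count / document length and idf = log(N / df), without smoothing.
    """
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    scores = []
    for doc in docs:
        counts = Counter(doc)
        total = len(doc)
        scores.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return scores


def top_k_terms(docs, k):
    """Filter stage: keep the k terms with the highest maximum TF-IDF score.

    Ranking by the per-term maximum is an assumption of this sketch.
    Ties are broken alphabetically so the result is deterministic.
    """
    best = {}
    for doc_scores in tfidf_scores(docs):
        for term, score in doc_scores.items():
            best[term] = max(best.get(term, 0.0), score)
    ranked = sorted(best, key=lambda t: (-best[t], t))
    return ranked[:k]


# Hypothetical toy corpus standing in for pre-processed posts.
docs = [
    "great phone great battery".split(),
    "bad phone poor battery".split(),
    "great service".split(),
]
selected = top_k_terms(docs, 3)
print(selected)
```

The terms kept by this filter would then be passed to the second stage. One plausible way to approximate that stage is recursive feature elimination wrapped around a linear SVM, for example scikit-learn's `RFE` with `LinearSVC`, although the abstract does not say which tooling was used.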
Abstract
The research stems from a problem that centers on a number of questions: To what extent are banks interested in the topic of efficiency, and what is its role in raising the efficiency of the banking business and its development? Is the concept of banking efficiency used in Iraqi banks clear and specific to the Iraqi banking sector? How is banking sector efficiency measured, and what approaches are adopted in determining banking inputs and outputs? What is the level of efficiency in the research sample of banks, and what are the causes of its decline or rise in private banks individually and in the Iraqi banking sector in general?
The re
The research aims to evaluate Islamic electronic libraries and their service for downloading research and illustrated books, explaining their origins, features and types. The research was limited to libraries available on the Internet that provide a service for downloading research and illustrated books. The researcher relied on the survey approach to identify the libraries, and a sample of 20 libraries was selected for evaluation according to five criteria related to the preparation and publication of Islamic electronic libraries (the responsible party, the goals and objectives, the year, the services it provides, the sections and subject specializations of its contents) and five criteria related to the servi
Due to the great evolution in digital commercial cameras, several studies have addressed the use of such cameras in different civil and close-range applications such as 3D model generation. However, previous studies have not discussed a precise relationship between camera resolution and the accuracy of the models generated from that camera's images. Therefore, the current study aims to evaluate the accuracy of 3D building models derived from images captured by cameras of different resolutions. Digital photogrammetric methods were used to derive 3D models from the data of the various resolution cameras and to analyze their accuracies. This investigation involves selecting three different resolution cameras (low, medium and
Al-Rustamiya sewage treatment plant (WWTP) serves the east side of Baghdad city (Rusafa) and is considered one of the largest projects. It consists of three parts (old project F0, first extension F1, and second extension F2) that treat wastewater, and the effluent is discharged into the Diyala River and thus into the Tigris River. These plants were designed and constructed to manage wastewater so that it meets the Iraqi effluent standard for BOD5, COD, TSS and chloride concentrations of 40, 100, 60 and 600 mg/L, respectively. Data recorded from March to December 2011, provided by Al-Rustamiya WWTP, were considered in this study to evaluate the performance of the plant. The results indicated that the strength of the wastewater enterin
Warm mix asphalt (WMA) is a relatively new technology that enables the production and compaction of asphalt concrete mixtures at temperatures 15-40 °C lower than those of traditional hot mix asphalt (HMA). In the present work, six asphalt concrete mixtures were produced in the mix plant (1 ton each) in six different batches. Half of these mixes were WMA and the other half were HMA. Three types of fillers (limestone dust, Portland cement and hydrated lime) were used for each type of mix. Samples were then taken from these batches and transferred to the lab for performance testing, which included: Marshall characteristics, moisture susceptibility (indirect tension test), resilient modulus, permanent deformation (axial repeated load test)
Car drivers hear many kinds of noise inside their vehicles' cabins, and the most annoying are the noises generated by tires, engines, and outside wind. Noise affects the comfort of the passengers inside the cabin, and modern budget cars are noisier across many kinds of noise signals because they use a lot of plastic materials. For expensive and luxury cars, the problem is solved by using better sound insulation materials, but for budget cars, the approach used here is effective. It is called Active Noise Cancellation and can be implemented using analog or digital electronics. The analog version uses an operational amplifier and filters, while the digital version uses signal processor chips. In engineeri