Text documents are unstructured and high dimensional. Effective feature selection is required to select the most important and significant feature from the sparse feature space. Thus, this paper proposed an embedded feature selection technique based on Term Frequency-Inverse Document Frequency (TF-IDF) and Support Vector Machine-Recursive Feature Elimination (SVM-RFE) for unstructured and high dimensional text classificationhis technique has the ability to measure the feature’s importance in a high-dimensional text document. In addition, it aims to increase the efficiency of the feature selection. Hence, obtaining a promising text classification accuracy. TF-IDF act as a filter approach which measures features importance of the text documents at the first stage. SVM-RFE utilized a backward feature elimination scheme to recursively remove insignificant features from the filtered feature subsets at the second stage. This research executes sets of experiments using a text document retrieved from a benchmark repository comprising a collection of Twitter posts. Pre-processing processes are applied to extract relevant features. After that, the pre-processed features are divided into training and testing datasets. Next, feature selection is implemented on the training dataset by calculating the TF-IDF score for each feature. SVM-RFE is applied for feature ranking as the next feature selection step. Only top-rank features will be selected for text classification using the SVM classifier. Based on the experiments, it shows that the proposed technique able to achieve 98% accuracy that outperformed other existing techniques. In conclusion, the proposed technique able to select the significant features in the unstructured and high dimensional text document.
Earth dams are constructed mainly from soil. A homogenous earth dam is composed of only one material. The seepage through such dams is quite high. Upstream impervious blanket is one of the methods used to control seepage through the dam foundations. Bennet's method is one of the commonly used methods to design an impervious upstream blanket. Design charts are developed relating the length of blanket, total reservoir head, total base width of the dam (excluding downstream drainage), the coefficient of permeability of the blanket material, blanket thickness, foundation thickness, and coefficient of permeability of the foundation soil, based on the equations governing the Bennet's method for a homogenous earth dam with a blanket of uniform
... Show MoreVariable selection is an essential and necessary task in the statistical modeling field. Several studies have triedto develop and standardize the process of variable selection, but it isdifficultto do so. The first question a researcher needs to ask himself/herself what are the most significant variables that should be used to describe a given dataset’s response. In thispaper, a new method for variable selection using Gibbs sampler techniqueshas beendeveloped.First, the model is defined, and the posterior distributions for all the parameters are derived.The new variable selection methodis tested usingfour simulation datasets. The new approachiscompared with some existingtechniques: Ordinary Least Squared (OLS), Least Absolute Shrinkage
... Show More: Porous silicon (n-PS) films can be prepared by photoelectochemical etching (PECE) Silicon chips n - types with 15 (mA /cm2), in15 minutes etching time on the fabrication nano-sized pore arrangement. By using X-ray diffraction measurement and atomic power microscopy characteristics (AFM), PS was investigated. It was also evaluated the crystallites size from (XRD) for the PS nanoscale. The atomic force microscopy confirmed the nano-metric size chemical fictionalization through the electrochemical etching that was shown on the PS surface chemical composition. The atomic power microscopy checks showed the roughness of the silicon surface. It is also notified (TiO2) preparation nano-particles that were prepared by pulse laser eradication in e
... Show MoreIn this work, satellite images classification for Al Chabaish marshes and the area surrounding district in (Dhi Qar) province for years 1990,2000 and 2015 using two software programming (MATLAB 7.11 and ERDAS imagine 2014) is presented. Proposed supervised classification method (Modified Vector Quantization) using MATLAB software and supervised classification method (Maximum likelihood Classifier) using ERDAS imagine have been used, in order to get most accurate results and compare these methods. The changes that taken place in year 2000 comparing with 1990 and in year 2015 comparing with 2000 are calculated. The results from classification indicated that water and vegetation are decreased, while barren land, alluvial soil and shallow water
... Show MoreOlanzapine (OLZ) is classified as a typical antipsychotic drug utilized for the treatment of schizophrenia. Its oral bioavailability is 60% due to its low solubility and pre-systemic metabolism. Hence, the present work aims to formulate and evaluate OLZ nanoparticles dissolving microneedles (MNs) for transdermal delivery to overcome the problems associated with drug administration orally. OLZ nanoparticles were prepared by the nanoprecipitation method. The optimized OLZ nanoparticle formula was utilized for the fabrication of dissolving MNs by loading OLZ nanodispersion into polydimethylsiloxane (PDMS) micromould cavities, followed by casting the polymeric solution of polyvinylpyrrolidone(PVP-K30) and polyvinyl alcohol (PVA) to form
... Show MoreThis research aims at shedding light on the concept of insurance awareness and clarifying its role on marketing insurance services of a sample of (100) employees in the National Company for Insurance. Questionnaire is used as a main instrument for collecting data and information from the sample. Their answers were analyzed by using arithmetic means, standard deviation, centesimal weight, and the correlation coefficient ( , F, t) tests .The research reached several conclusions of which:1.The sample member's response to insurance awareness and marketing insurance services factors was in the medium level.2.There was a positive relationship of a moral sign between insurance awareness and marketing insurance services, that correlation coeffic
... Show MoreThe efficiency of internal combustion engines (ICE) is usually about thirty percent of the total energy of the fuel. The residual energy is lost in the exhaust gas, the lubrication, and the cooling water in the radiators. Recently much of the researcher’s efforts have focused on taking advantage of wasted energy of the exhaust gas. Using a thermoelectric generator (TEG) is one of the promising ways. However, TEG depends entirely on the temperature difference, which may be offered by the exhaust muffler. An experimental test has been conducted to study the thermal performance of a different muffler internal design. The researchers resort to the use of lost energy in an ICE using TEG, which is one of the ways to take adv
... Show More