Text documents are unstructured and high dimensional. Effective feature selection is required to select the most important and significant feature from the sparse feature space. Thus, this paper proposed an embedded feature selection technique based on Term Frequency-Inverse Document Frequency (TF-IDF) and Support Vector Machine-Recursive Feature Elimination (SVM-RFE) for unstructured and high dimensional text classificationhis technique has the ability to measure the feature’s importance in a high-dimensional text document. In addition, it aims to increase the efficiency of the feature selection. Hence, obtaining a promising text classification accuracy. TF-IDF act as a filter approach which measures features importance of the text documents at the first stage. SVM-RFE utilized a backward feature elimination scheme to recursively remove insignificant features from the filtered feature subsets at the second stage. This research executes sets of experiments using a text document retrieved from a benchmark repository comprising a collection of Twitter posts. Pre-processing processes are applied to extract relevant features. After that, the pre-processed features are divided into training and testing datasets. Next, feature selection is implemented on the training dataset by calculating the TF-IDF score for each feature. SVM-RFE is applied for feature ranking as the next feature selection step. Only top-rank features will be selected for text classification using the SVM classifier. Based on the experiments, it shows that the proposed technique able to achieve 98% accuracy that outperformed other existing techniques. In conclusion, the proposed technique able to select the significant features in the unstructured and high dimensional text document.
Mandali Basin is located between latitudes (33◦ 39 '00 "- 33◦ 54' 55") to the north and longitudes (45◦ 11 '00 "- 45◦ 40' 00") to the east, eastern Diyala province. The research study attributes hydrochemical properties groundwater upper part of the Mandali basin for 20 wells through the data from the analysis of the hydrological information bank of the General Directorate for drilling water wells 2007, hydrochemical study of the water tube wells for two seasons showed water surplus season (February) and season the water deficit (August) It's water colorless, odorless dominated by sulfate ion and sodium, and through hydrochemical formula and the type of water was found that most of the water area of study is the sodium sulfate ty
... Show MoreThe aim of this study was Identifying the relation of coordination and kinesthetic perception with artistic performance level in gymnastics skills for students in second class from the college of physical education/ university of Baghdad/ Al - jadreia .The searchers have been used the descriptive method in scanning style .The subject of this search has been taken (45) female - student in second class from the college of physical education/ university of Baghdad . The searchers have reached into specific conclusions concerning with statistic analysis about immoral joint relation between sensitive- kinetic coincidence and realization and with Artistic Performance Level in Gymnastics Skills for Women for second class .The an important recommen
... Show MoreIn this study, an approach inspired by a standardized calibration method was used to test a laser distance meter (LDM). A laser distance sensor (LDS) was tested with respect to an LDM and then a statistical indicator explained that the former functions in a similar manner as the latter. Also, regression terms were used to estimate the additive error and scale the correction of the sensors. The specified distance was divided into several parts with percent of longest one and observed using two sensors, left and right. These sensors were evaluated by using the regression between the measured and the reference values. The results were computed using MINITAB 17 package software and excel office package. The accuracy of the results in this wo
... Show MorePermeability estimation is a vital step in reservoir engineering due to its effect on reservoir's characterization, planning for perforations, and economic efficiency of the reservoirs. The core and well-logging data are the main sources of permeability measuring and calculating respectively. There are multiple methods to predict permeability such as classic, empirical, and geostatistical methods. In this research, two statistical approaches have been applied and compared for permeability prediction: Multiple Linear Regression and Random Forest, given the (M) reservoir interval in the (BH) Oil Field in the northern part of Iraq. The dataset was separated into two subsets: Training and Testing in order to cross-validate the accuracy
... Show MoreThe topic of strategic intelligence is considered as important topics that acquires the attention of organizations, Because of its role in supplying the decision-making centers by strategic ideas according to the opportunities and threats facing the organization, in an effort to improve the performance of their organizations to reach the high performance organization.
A lot of organizations lack to strategy guides the strategic intelligence towards achieving high performance organization.
This research aims to determine the level of strategic intelligence that characterized the leaders of diseases and kidney transplant center in Medicine city. What is the application level of the
... Show MoreAssessing performance efficiency is critical to the management need for oversight, planning, and continuous periodic evaluation of the multiple activities of Northern Cement State Company in order to determine the level of achievement of the objectives set, and to correct the deviations and delays that the evaluation shows and limitation of liability. What cannot be measured cannot be managed. The aim of this research is to highlight the impact of using BSC, financial and non-financial, to give comprehensive and clear picture of the company's performance and to measure the quality of its performance by using six-sigma and the level of deviations in achieving the planned goals. Therefore, four-key hypotheses were formulated for th
... Show More