Text documents are unstructured and high dimensional. Effective feature selection is required to select the most important and significant feature from the sparse feature space. Thus, this paper proposed an embedded feature selection technique based on Term Frequency-Inverse Document Frequency (TF-IDF) and Support Vector Machine-Recursive Feature Elimination (SVM-RFE) for unstructured and high dimensional text classificationhis technique has the ability to measure the feature’s importance in a high-dimensional text document. In addition, it aims to increase the efficiency of the feature selection. Hence, obtaining a promising text classification accuracy. TF-IDF act as a filter approach which measures features importance of the text documents at the first stage. SVM-RFE utilized a backward feature elimination scheme to recursively remove insignificant features from the filtered feature subsets at the second stage. This research executes sets of experiments using a text document retrieved from a benchmark repository comprising a collection of Twitter posts. Pre-processing processes are applied to extract relevant features. After that, the pre-processed features are divided into training and testing datasets. Next, feature selection is implemented on the training dataset by calculating the TF-IDF score for each feature. SVM-RFE is applied for feature ranking as the next feature selection step. Only top-rank features will be selected for text classification using the SVM classifier. Based on the experiments, it shows that the proposed technique able to achieve 98% accuracy that outperformed other existing techniques. In conclusion, the proposed technique able to select the significant features in the unstructured and high dimensional text document.
Objective(s): To evaluate nurses’ Practice toward neonatal endotracheal suctioning procedure, and to determine the effectiveness of the interventional program on nurses’ practices, as well as to find out the relationship between nurses’ practice and their demographic characteristics.
Methodology: A Pre-experimental, one group design, was carried out to achieve the objectives of the current study using the evaluation approach and the implementation of the education program for the period from January 17 to June 31, 2022. A non- probability, purposive sample of (24) nurses were selected from the Neonatal Intensive Care Unit at Pediatric Teaching Hospital/ Medical City Department. A checklist w
... Show MoreIn this work, there will be upgraded on the work of (Mahmood and Mahmood , 2018) by finding a general rule of the code for any text made from any number of words by using James e-Abacus Diagram in partition theory
The electronic payment systems are considered the most important infrastructure for the work of banks, particularly after a steady and remarkable development in information and communication technology, Which created the reality of the work of the infrastructure for these systems and these systems also become one of the most important components of infrastructure for the work of banks, cause it is one of the most important channels through which the transfer of cash, financial instruments between financial institutions in general and banking in particular.
In order to achieve the objectives of the research, the most important to identify the concept of electronic payment systems, and its divisions, and th
... Show MoreMachine learning-based techniques are used widely for the classification of images into various categories. The advancement of Convolutional Neural Network (CNN) affects the field of computer vision on a large scale. It has been applied to classify and localize objects in images. Among the fields of applications of CNN, it has been applied to understand huge unstructured astronomical data being collected every second. Galaxies have diverse and complex shapes and their morphology carries fundamental information about the whole universe. Studying these galaxies has been a tremendous task for the researchers around the world. Researchers have already applied some basic CNN models to predict the morphological classes
... Show MoreBackground: With the start of the current century, increased the interest in the role of the adipose tissue derived substances that named adipokines in the inflammatory diseases of the human being including the inflammatory periodontal disease, but scientific evidences were not clearly demonstrate the association between these adipokines and periodontal pathologies. Materials and Methods: Forty two subjects male only with normal body mass index were selected for the study with an age ranged (30-39 years). Samples were divided into three groups of 14 subjects in each group based on clinical periodontal parameters; clinically healthy gingiva (group I), gingivitis group (group II) and chronic periodontitis patients group (group III), from whom
... Show MoreOpenStreetMap (OSM) represents the most common example of online volunteered mapping applications. Most of these platforms are open source spatial data collected by non-experts volunteers using different data collection methods. OSM project aims to provide a free digital map for all the world. The heterogeneity in data collection methods made OSM project databases accuracy is unreliable and must be dealt with caution for any engineering application. This study aims to assess the horizontal positional accuracy of three spatial data sources are OSM road network database, high-resolution Satellite Image (SI), and high-resolution Aerial Photo (AP) of Baghdad city with respect to an analogue formal road network dataset obtain
... Show More
The current research variables have received increasing attention in the recent period because they are one of the important issues affecting the future of organizations, as a result of the speed of environmental variables that have greatly affected organizations and for the purpose of explaining the relationships and links between research variables, as this research presents a test "the type and direction of the relationship between strategic foresight capabilities As an independent variable and green creativity "as a respondent variable. A set of questions has arisen about the basic research problem, including what is the nature and level of interest in the research variables (strategic foresight capabilities an
... Show MoreRemote sensing techniques used in many studies for classfying and measuring of wildfires. Satellite Landsat8(OLI) imagery is used in the presented work. The satellite is considered as a near-polar orbit, with a high multispectral resolution for covering Wollemi National Park in Australia. The work aims to study and measure wildfire natural resources prior to and throughout fire breakout which occurred in Wollemi National Park in Australia for a year (October, 2019), as well as analyzing the harm resulting from such wildfires and their effects on earth and environment through recognizing satellite images for studied region prior to and throughout wildfires. A discussion of methods for computing the affecred area i
... Show MoreAn Experimental comparison between the current-voltage
characteristic and the efficiency conversion from solar to electric energy were studied for square and circular single crystal silicon solar
cell of equal area (35.28 cm2) . The results show that the solar shape is
an important factor in calculating the current-voltage characteristics and efficiency of the solar cell. It was shown that the performance effici
... Show More