Text documents are unstructured and high-dimensional. Effective feature selection is required to select the most important and significant features from the sparse feature space. This paper therefore proposes an embedded feature selection technique based on Term Frequency-Inverse Document Frequency (TF-IDF) and Support Vector Machine-Recursive Feature Elimination (SVM-RFE) for unstructured and high-dimensional text classification. This technique can measure a feature's importance in a high-dimensional text document. It also aims to increase the efficiency of feature selection and thereby obtain promising text classification accuracy. TF-IDF acts as a filter approach that measures the importance of the features of the text documents at the first stage. SVM-RFE uses a backward feature elimination scheme to recursively remove insignificant features from the filtered feature subsets at the second stage. The experiments use text documents retrieved from a benchmark repository comprising a collection of Twitter posts. Pre-processing is applied to extract relevant features, after which the pre-processed features are divided into training and testing datasets. Next, feature selection is performed on the training dataset by calculating the TF-IDF score for each feature. SVM-RFE is then applied for feature ranking as the next feature selection step. Only the top-ranked features are selected for text classification using the SVM classifier. The experiments show that the proposed technique achieves 98% accuracy, outperforming other existing techniques. In conclusion, the proposed technique is able to select the significant features in unstructured and high-dimensional text documents.
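The first-stage TF-IDF filter described in the abstract above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the whitespace tokenizer, the unsmoothed `log(N / df)` weighting, the summing of scores across documents, and the function names `tfidf_scores` and `top_k_terms` are all assumptions. The second stage (SVM-RFE) is not shown; it would take the surviving terms and recursively drop those with the smallest linear-SVM weights.

```python
import math
from collections import Counter


def tfidf_scores(docs):
    """Return {term: TF-IDF score summed over all documents}.

    Assumptions (not from the paper): whitespace tokenization,
    tf = count / document length, idf = log(N / df) without smoothing.
    """
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    scores = Counter()
    for tokens in tokenized:
        counts = Counter(tokens)
        for term, count in counts.items():
            tf = count / len(tokens)
            idf = math.log(n_docs / df[term])
            scores[term] += tf * idf
    return dict(scores)


def top_k_terms(docs, k):
    """Filter stage: keep the k terms with the highest summed TF-IDF score."""
    scores = tfidf_scores(docs)
    # Sort by descending score; break ties alphabetically for determinism.
    ranked = sorted(scores, key=lambda term: (-scores[term], term))
    return ranked[:k]
```

A term that occurs in every document receives an IDF of zero under this weighting, so it falls to the bottom of the ranking and is the first to be filtered out.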
This research presents results on the full-energy peak efficiency of a high-purity germanium (HPGe) detector for a point source as a function of photon energy and source-detector distance. The directions of the photons emitted from the source and the photon path lengths in the detector were determined by a Monte Carlo technique. A major advantage of this technique is its short computation time compared to experiments. Another advantage is the flexibility of inputting detector-related parameters (such as source-detector distance, detector radius, length, and attenuation coefficient) into the developed algorithm, which makes it an easy and flexible method to apply to other detector systems and configurations. It has been designed and writte
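The idea of sampling photon directions and path lengths can be illustrated with a short sketch. It is a simplified model under stated assumptions, not the algorithm of the paper: the point source sits on the axis of a bare cylindrical crystal, emission is isotropic, and a photon counts as detected with probability 1 - exp(-mu * path). That quantity is the interaction (total) efficiency; a full-energy peak efficiency would additionally need the photopeak fraction, which is not modeled here. The function name and parameters are hypothetical.

```python
import math
import random


def mc_point_source_efficiency(distance, radius, length, mu,
                               n_photons=200_000, seed=0):
    """Monte Carlo estimate for an on-axis point source and a cylinder.

    distance: source to front face, radius/length: crystal size,
    mu: attenuation coefficient in inverse length units (all assumed inputs).
    Returns (geometric efficiency, interaction efficiency).
    """
    rng = random.Random(seed)
    n_hit = 0
    sum_interaction = 0.0
    for _ in range(n_photons):
        # Isotropic emission: cos(theta) is uniform on [-1, 1].
        cos_t = rng.uniform(-1.0, 1.0)
        if cos_t <= 0.0:
            continue  # emitted away from the detector
        sin_t = math.sqrt(1.0 - cos_t * cos_t)
        # Radial offset at the front face is distance * tan(theta).
        if distance * sin_t > radius * cos_t:
            continue  # misses the front face
        n_hit += 1
        if (distance + length) * sin_t <= radius * cos_t:
            # Photon leaves through the back face.
            path = length / cos_t
        else:
            # Photon leaves through the side wall.
            path = radius / sin_t - distance / cos_t
        sum_interaction += 1.0 - math.exp(-mu * path)
    return n_hit / n_photons, sum_interaction / n_photons
```

The geometric part can be checked against the closed-form solid-angle fraction for an on-axis point source, 0.5 * (1 - d / sqrt(d^2 + R^2)).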
In this paper, the decomposition method is used to find approximate solutions for a system of linear Fredholm integral equations of the second kind. In this method, the solution of a functional equation is considered as the sum of an infinite series that usually converges to the solution; the Adomian decomposition method is applied to solve linear and nonlinear integral equations. Finally, numerical examples are presented to illustrate these considerations.
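As a worked illustration of the series construction described above, consider a single linear Fredholm equation of the second kind. The specific equation below (kernel, forcing term, and interval) is an assumed textbook-style example, not one taken from the paper, and a system would apply the same recursion to each component.

```latex
% Decomposition of a linear Fredholm equation of the second kind
\begin{align*}
u(x) &= f(x) + \lambda \int_a^b K(x,t)\,u(t)\,dt, \qquad
u(x) = \sum_{n=0}^{\infty} u_n(x), \\
u_0(x) &= f(x), \qquad
u_{n+1}(x) = \lambda \int_a^b K(x,t)\,u_n(t)\,dt .
\end{align*}
% Assumed example: f(x) = 2x/3, K(x,t) = x t, lambda = 1 on [0, 1]
\begin{align*}
u(x) &= \tfrac{2}{3}x + \int_0^1 x\,t\,u(t)\,dt, \\
u_0(x) &= \tfrac{2}{3}x, \\
u_1(x) &= x \int_0^1 t \cdot \tfrac{2}{3}t \,dt = \tfrac{2}{9}x, \\
u_2(x) &= x \int_0^1 t \cdot \tfrac{2}{9}t \,dt = \tfrac{2}{27}x, \\
u(x) &= \tfrac{2}{3}x \sum_{n=0}^{\infty} \left(\tfrac{1}{3}\right)^{n} = x .
\end{align*}
```

Substituting u(x) = x back into the equation gives 2x/3 + x/3 = x, so the series converges to the exact solution in this case.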
An analytical approach based on field data was used to determine the strength capacity of large-diameter bored piles. The deformations and settlements were also evaluated for both vertical and lateral loadings. The analytical predictions were compared to field data obtained from a prototype test pile used at the Tharthar-Tigris Canal Bridge. The two were found to be in acceptable agreement, with a deviation of 12%.
Following ASTM standard D1143M-07e1, 2010, a test schedule of five loading cycles was proposed for vertical loads, together with a series of cyclic loads to simulate horizontal loading. The load test results and analytical data of 1.95
A DC planar sputtering system is characterized by varying the discharge potential (250-2000 V) and the argon gas pressure (3.5×10⁻² to 1.5 mbar). The breakdown voltage for a silver electrode was studied with a uniform electric field at different discharge distances, as were the plasma parameters. The breakdown voltage is a function of the product of the argon gas pressure inside the chamber and the gap distance between the electrodes, represented as a Paschen curve. The current-voltage characteristic curves indicate that the electrical discharge plasma operates in the abnormal glow region. Plasma parameters were found from the current-voltage characteristics of a single probe positioned in the inter-cathode space. Typical values of the electron temperature an
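The dependence of the breakdown voltage on the pressure-distance product can be sketched with the standard Paschen relation V = B·pd / ln(A·pd / ln(1 + 1/γ)). The gas constants and the secondary-emission coefficient below are illustrative textbook-order values for argon in Torr and cm, assumed here for the sketch; they are not the fitted values from this study, which reports pressure in mbar.

```python
import math

# Assumed illustrative constants (not from the paper).
A_ARGON = 12.0    # 1 / (cm * Torr)
B_ARGON = 180.0   # V / (cm * Torr)
GAMMA = 0.07      # secondary electron emission coefficient


def breakdown_voltage(pd, a=A_ARGON, b=B_ARGON, gamma=GAMMA):
    """Paschen breakdown voltage in volts for pd > 0 given in Torr * cm.

    Returns None on the far left branch, where the formula predicts
    that no breakdown is possible.
    """
    denom = math.log(a * pd / math.log(1.0 + 1.0 / gamma))
    if denom <= 0.0:
        return None
    return b * pd / denom


def paschen_minimum(a=A_ARGON, b=B_ARGON, gamma=GAMMA):
    """Return (pd, voltage) at the minimum of the Paschen curve."""
    pd_min = math.e * math.log(1.0 + 1.0 / gamma) / a
    # At pd_min the logarithm in the denominator equals 1.
    return pd_min, b * pd_min
```

Sweeping `pd` on either side of `paschen_minimum()` reproduces the characteristic shape: the voltage rises steeply at low pd and more gradually at high pd.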
The aim of the study is to assess the risk factors that lead to myocardial infarction and their relation to some variables. The field study was carried out from the 1st of April to the end of September 2005. The sample consisted of 100 patients in Ibn-Albeetar and Baghdad Teaching Hospital. The results indicated the following: 45% of the patients were in the 41-50 age group, which was more exposed to the disease, and no significant difference was seen in level of education, marital status, weight, or height. The results show significant differences in risk factors such as hypertension, blood cholesterol level, and diabetes when analyzed by t-test at the level of P < 0.01, and there are significant differences in smoki
This study was designed to isolate some of the fungi that cause subclinical mastitis in cow milk in Al-Anbar governorate, Iraq. A total of 100 milk samples were collected from 25 cows in Al-Falluja city. Using the California mastitis test, 45% of the milk samples were positive for subclinical mastitis, while 55% were negative. Fungal isolation from the samples that were positive for the California mastitis test yielded 19 isolates of different types of fungi: Aspergillus fumigatus 2.2%, Aspergillus niger 4.4%, Candida albicans 26.6%, Blastomyces dermatitidis 4.4%, and Geotrichum candidum 4.4%. The isolation of these types of fungi may be attributed to many factor
Transmitting the highest capacity throughput over the longest possible distance without any regeneration stage is an important goal of any long-haul optical network system. Accordingly, Polarization-Multiplexed Quadrature Phase-Shift-Keying (PM-QPSK) was recently introduced to achieve a high bit rate with relatively high spectral efficiency. Unfortunately, the broad bandwidth required by PM-QPSK increases the linear and nonlinear impairments in the physical layer of the optical fiber network. Increasing attention has been paid to compensating for these impairments in recent years. In this paper, a single-mode fiber (SMF), single-channel PM-QPSK transceiver was simulated, with a mix of optical and electrical (Digi
Many approaches of different complexity already exist for edge detection in color images. Nevertheless, the question remains of how different the results are when computationally costly techniques are employed instead of simple ones. This paper presents a comparative study of two approaches to color edge detection that reduce noise in the image. The approaches are based on the Sobel operator and the Laplace operator. Furthermore, an efficient algorithm for implementing the two operators is presented. The operators have been applied to real images, and the results are presented in this paper. It is shown that the quality of the results increases when the second-derivative operator (the Laplace operator) is used. And noise reduced in a good
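The two operators compared in the abstract above can be sketched on a single image channel as follows. This is a minimal illustration under assumptions, not the efficient algorithm of the paper: the image is a plain list of lists, the standard 3×3 Sobel and 4-neighbor Laplace kernels are used, border pixels are left at zero, and a color image would be handled by applying the same functions to each channel. The names `apply_kernel` and `sobel_magnitude` are hypothetical.

```python
import math

# Standard 3x3 kernels: first derivative (Sobel) and second derivative (Laplace).
SOBEL_X = [[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]]
SOBEL_Y = [[-1, -2, -1], [0, 0, 0], [1, 2, 1]]
LAPLACE = [[0, 1, 0], [1, -4, 1], [0, 1, 0]]


def apply_kernel(image, kernel):
    """Correlate a 3x3 kernel with the interior of a 2-D list image.

    Border pixels are left at 0 (an assumed, simple boundary rule).
    """
    height, width = len(image), len(image[0])
    out = [[0] * width for _ in range(height)]
    for y in range(1, height - 1):
        for x in range(1, width - 1):
            out[y][x] = sum(
                kernel[j][i] * image[y + j - 1][x + i - 1]
                for j in range(3)
                for i in range(3)
            )
    return out


def sobel_magnitude(image):
    """Gradient magnitude from the horizontal and vertical Sobel responses."""
    gx = apply_kernel(image, SOBEL_X)
    gy = apply_kernel(image, SOBEL_Y)
    return [
        [math.hypot(a, b) for a, b in zip(row_x, row_y)]
        for row_x, row_y in zip(gx, gy)
    ]
```

On a vertical step edge, the Sobel magnitude peaks on the pixels adjacent to the step, while the Laplace response changes sign across it, so edges are located at its zero crossings.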
In dental and medical applications, poly-methyl methacrylate (PMMA) has been widely accepted due to its excellent biocompatibility and ease of fabrication. Yet some of the physical and mechanical characteristics of this compound are considered inferior. Seven groups of PMMA nano-composite samples were fabricated at laboratory temperature. These samples could be used in manufacturing complete or partial maxillary denture bases. The aim of this research is to prepare nano-composite materials consisting of PMMA as a matrix material and two different types of powder (prepared SnO2 nanoparticles and natural eggshell powder (ESP)) as strengthening materials. The selected additives were used in many cases as p
The need for optical fibers has emerged from their ability to transmit information with less attenuation and over long distances. In this work, four optical fibers with core radii from 1 μm to 4.75 μm in steps of 1.25 μm and a numerical aperture of 0.17 were studied, and their mode properties were calculated at a wavelength of 633 nm using RP Fiber Calculator (free version 2022). The effect of increasing the core radius on these properties was also studied. A multimode fiber is obtained when the radius of the fiber core is large compared to the operating wavelength, so that the operating wavelength is less than the cutoff wavelength of the mode. Otherwise, a single-mode fiber is obtained. It has been concluded that all the calculated p
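The single-mode versus multimode distinction described above can be checked with the normalized frequency (V-number), V = 2π·a·NA / λ. The sketch below assumes a step-index fiber, for which the fiber is single-mode when V is below about 2.405; the study itself used RP Fiber Calculator, and this is not a reproduction of that tool. The function names are hypothetical, while the radii, numerical aperture, and wavelength are the values quoted in the abstract.

```python
import math

# First zero of the Bessel function J0: single-mode cutoff for a
# step-index fiber (the step-index profile is an assumption here).
SINGLE_MODE_CUTOFF = 2.405


def v_number(core_radius_m, numerical_aperture, wavelength_m):
    """Normalized frequency V = 2 * pi * a * NA / lambda."""
    return 2.0 * math.pi * core_radius_m * numerical_aperture / wavelength_m


def is_single_mode(core_radius_m, numerical_aperture, wavelength_m):
    """True when only the fundamental mode is guided (V below the cutoff)."""
    v = v_number(core_radius_m, numerical_aperture, wavelength_m)
    return v < SINGLE_MODE_CUTOFF


if __name__ == "__main__":
    # Core radii from the abstract: 1 um to 4.75 um in steps of 1.25 um.
    for radius_um in (1.0, 2.25, 3.5, 4.75):
        v = v_number(radius_um * 1e-6, 0.17, 633e-9)
        print(f"a = {radius_um} um, V = {v:.3f}, "
              f"single-mode: {v < SINGLE_MODE_CUTOFF}")
```

Under the step-index assumption, only the 1 μm core falls below the cutoff at 633 nm; the three larger cores guide more than one mode.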