Text documents are unstructured and high dimensional. Effective feature selection is required to select the most important and significant features from the sparse feature space. This paper therefore proposes an embedded feature selection technique based on Term Frequency-Inverse Document Frequency (TF-IDF) and Support Vector Machine-Recursive Feature Elimination (SVM-RFE) for unstructured and high-dimensional text classification. This technique can measure a feature's importance in a high-dimensional text document. It also aims to increase the efficiency of feature selection and thereby obtain promising text classification accuracy. In the first stage, TF-IDF acts as a filter approach that measures the importance of the features in the text documents. In the second stage, SVM-RFE uses a backward feature elimination scheme to recursively remove insignificant features from the filtered feature subsets. This research runs sets of experiments on text documents retrieved from a benchmark repository comprising a collection of Twitter posts. Pre-processing is applied to extract relevant features, and the pre-processed features are then divided into training and testing datasets. Next, feature selection is performed on the training dataset by calculating the TF-IDF score for each feature. SVM-RFE is then applied for feature ranking as the next feature selection step. Only the top-ranked features are selected for text classification using the SVM classifier. The experiments show that the proposed technique achieves 98% accuracy, outperforming other existing techniques. In conclusion, the proposed technique is able to select the significant features in unstructured and high-dimensional text documents.
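The first (filter) stage described above can be illustrated with a minimal sketch. It assumes whitespace-tokenized documents, a plain `log(N / df)` inverse document frequency, and per-term scores summed over all documents; the abstract does not state which TF-IDF variant or aggregation the authors used, so `tfidf_scores` and `filter_top_k` are hypothetical helper names, and the SVM-RFE stage is not shown.

```python
import math
from collections import Counter

def tfidf_scores(docs):
    """Return {term: TF-IDF summed over all documents} for tokenized docs.

    Assumption: tf = count / document length, idf = log(N / df).
    """
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    scores = Counter()
    for doc in docs:
        for term, count in Counter(doc).items():
            tf = count / len(doc)
            idf = math.log(n_docs / df[term])
            scores[term] += tf * idf
    return dict(scores)

def filter_top_k(docs, k):
    """Keep the k highest-scoring terms (ties broken alphabetically)."""
    scores = tfidf_scores(docs)
    ranked = sorted(scores, key=lambda t: (-scores[t], t))
    return ranked[:k]

# Toy corpus standing in for pre-processed posts (illustrative only).
docs = [
    "great phone great battery".split(),
    "bad phone bad screen".split(),
    "great screen".split(),
]
top_terms = filter_top_k(docs, 2)
```

In a full pipeline, the terms kept by this filter would be passed to a recursive elimination step driven by a linear SVM's weights, which is the second stage the abstract describes.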
Background: The major focus of respiratory cytology is the diagnosis of lung cancer. Carcinoma of the lung is now reported to be the most commonly diagnosed non-cutaneous malignancy in the world, and Iraq has faced an increase in the incidence of this lethal type of cancer. Sputum cytology is a convenient method of screening for and diagnosing primary epithelial tumors of the lung; its preparation methods include the fresh smear, the Saccomanno smear, and the mailing container method.
Methods: A sputum cytological study was done on 50 patients suspected to have pulmonary carcinoma, with samples prepared by the fresh smear method, the Saccomanno method, and the mailing container method. One, two, or three samples were taken from each patient. Slides were prepared
When choosing a material for an engineering application, the designer must find the optimum match between the object's technical and economic needs and the performance and production requirements of the various material options. This study proposes an integrated (hybrid) strategy for selecting the optimal material for an engineering design according to the design requirements. The primary objective is to determine the best candidate material for drone wings based on Ashby's performance indices and then rank the results using a grey relational technique with the entropy weight method. Aluminum alloys, titanium alloys, composites, and wood have been suggested as suitable materials for manufacturing drone wings. The requirement
The double skin ventilated roof is one of the important passive cooling techniques; it aims to reduce solar heat gain through roofs by reducing both the conduction and convection heat transfer from the roof to the ceiling of buildings. A radiant barrier system (RBS), on the other hand, is very effective in blocking radiation heat transfer between the two skins. In this research, the effect of placing a thin layer of aluminium foil at different locations on the thermal insulation performance of a double skin ventilated roof model is investigated experimentally, and the optimum location, which transmits the least heat flux from the lower skin, is specified. The model is made of two parallel inclined galvanized steel plates. Galvanized steel has been used
This research seeks to identify the dimensions of staff performance development (training, incentives, management skills) and their impact on the settlement of compensation claims in the Iraqi insurance company. It aims to highlight the role of developing the performance of insurance company workers in settling insurance compensation. To examine this process, the research was applied in the general Iraqi insurance company, which was taken as the research community. A sample was drawn from this community, consisting of workers in the company's insurance department, and actual data related to the research sample were collected in addition to the financial compensation data.
In every security system, keeping important data confidential requires a high degree of protection. Steganography can be defined as a way of sending confidential texts through a secure medium of communication while protecting the information during transmission. It is a technology used to protect users' security and privacy. Communication is mostly carried out over networks through SMS, e-mail, and so on. The presented work proposes a text-hiding technique for protecting secret texts with Unicode characters. The similarities of glyphs provide invisibility and increase the hiding capacity. In conclusion, the proposed method succeeded in securing confidential data and achieving high p
Several maps of the chaotic firefly algorithm were used to select variables for data on blood and blood vessel diseases obtained from Nasiriyah General Hospital. The data were tested and found to follow the Gamma distribution, and it was concluded, using the mean square error criterion, that the Chebyshev map method is more efficient than the Sinusoidal map method.
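For readers unfamiliar with the two maps compared above, the sketch below generates a short sequence from each. It uses commonly cited forms of the Chebyshev map, x -> cos(order * arccos(x)), and the sinusoidal map, x -> a * x^2 * sin(pi * x). The abstract does not give the exact definitions or parameters used, so the values `order=4` and `a=2.3`, the starting point, and the helper names are assumptions for illustration only.

```python
import math

def chebyshev_map(x, order=4):
    """One step of the Chebyshev map on [-1, 1] (order is an assumed value)."""
    return math.cos(order * math.acos(x))

def sinusoidal_map(x, a=2.3):
    """One step of the sinusoidal map on [0, 1] (a=2.3 is an assumed value)."""
    return a * x * x * math.sin(math.pi * x)

def iterate(step, x0, n):
    """Return [x0, x1, ..., xn] produced by applying `step` n times."""
    xs = [x0]
    for _ in range(n):
        xs.append(step(xs[-1]))
    return xs

# Illustrative starting point; any non-fixed point in range would do.
cheb_seq = iterate(chebyshev_map, 0.7, 50)
sin_seq = iterate(sinusoidal_map, 0.7, 50)
```

In chaotic variants of the firefly algorithm, a sequence like this typically stands in for a parameter that would otherwise be fixed or drawn at random; how the study applied the maps is not stated in the abstract.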
The thermal performance of an indirect expansion solar assisted heat pump (IX-SAHP) was investigated experimentally under the Iraqi climate. An IX-SAHP system was designed, built, instrumented, and tested. Experimental tests were conducted by varying the controlling parameters, namely cooling water flow rate, heating water flow rate, ambient temperature, and solar radiation intensity, to investigate their effects on the thermal performance of the IX-SAHP. The investigation covered cooling water flow rates of 2, 3, 4, and 5 l/min and heating water flow rates of 2, 3, 4, and 5 l/min under the meteorological conditions of Baghdad from November 2014 to January 2015.
The results indicated that the performance of the IX-
In recent decades, there has been increasing interest in wastewater treatment because of its direct impact on the environment and public health. Over time, other forms of treatment have been developed and modified, including extended aeration, which is a suspended growth process. In this paper, a comparative study was conducted between the efficiency of the extended aeration plant and that of the trickling filter plant in the removal of BOD and COD. The comparison was made by determining the value of each pollutant before and after treatment and then calculating its removal ratio within each plant. The results showed that the percentage of removal of BOD in the trickling filte