Text documents are unstructured and high dimensional. Effective feature selection is required to select the most important and significant feature from the sparse feature space. Thus, this paper proposed an embedded feature selection technique based on Term Frequency-Inverse Document Frequency (TF-IDF) and Support Vector Machine-Recursive Feature Elimination (SVM-RFE) for unstructured and high dimensional text classificationhis technique has the ability to measure the feature’s importance in a high-dimensional text document. In addition, it aims to increase the efficiency of the feature selection. Hence, obtaining a promising text classification accuracy. TF-IDF act as a filter approach which measures features importance of the text documents at the first stage. SVM-RFE utilized a backward feature elimination scheme to recursively remove insignificant features from the filtered feature subsets at the second stage. This research executes sets of experiments using a text document retrieved from a benchmark repository comprising a collection of Twitter posts. Pre-processing processes are applied to extract relevant features. After that, the pre-processed features are divided into training and testing datasets. Next, feature selection is implemented on the training dataset by calculating the TF-IDF score for each feature. SVM-RFE is applied for feature ranking as the next feature selection step. Only top-rank features will be selected for text classification using the SVM classifier. Based on the experiments, it shows that the proposed technique able to achieve 98% accuracy that outperformed other existing techniques. In conclusion, the proposed technique able to select the significant features in the unstructured and high dimensional text document.
Abstract:
The research aimed to know favoured mass media for children and
modifying their behaviour ,the child became aquires the information from
mass media that he exposure them without any guidance , where upon the
quidance proqrammes becomes real danger whereas qet out their civil
style and converting to deadly poisons,and because of little study for this
supject the two researchers opined to perform astudy to know the favoured
mass media to the children and what are the mass media that modify their
behavior according to ther parent points of view ,after propring the research
measurement and the suilable statical methods it has shown that there are
mass media affect in children behavior ,they are st
The aim of this study was to increasing natural carotenoides production by a locally isolate Rodotorula mucilagenosa M. by determination of the optimal conditions for growth and production of this agents, for encouragest to use it in food application permute artificial pigments which harmfull for consumer health and envieronmental. The optimal condition of carotenoides production from Rhodotorula mucilaginosa M were studied. The results shows the best carbon and nitrogen source were glucose and yeast extract. The carotenoids a mount production was 47430 microgram ̸ litter and 47460 microgram ̸ litter, respectively, and the optimum temperature was 30°C, PH 6, that the carotenoides a mount was 47470 microgram ̸ litter and 47670 microgr
... Show MoreAbstract:
This investigation was carried out to study the nutritional adequacy for
infant milk formula, which imported by Iraqi Ministry of Trade, and are
available in local markets .Most of these formulas contained nearly the same
composition of nutrients which are ,Matines ,Sunny Boy , Salsabeel AL- Badie
,Moroug, ,Charton ,Materna Lery Celia ,Lacstar Lailac,Nactalia. yet they are
unbalanced for providing the daily nutritional requirements for infants whom
depend on bottle feeding for six times daily in their first six month of age. As
there were an increase in daily intake for protein content and most vitamins
that included D, E, C, B1, B2, Niacin, B6, B12, and Biotin as well as most
minerals namely Calci
Objective the research is to identify Over the Commitment of a Rushed Bank in Baghdad has applied social responsibility in accordance with ISO 26000 by measuring and diagnosing the gap between the actual reality in the bank and the requirements of the standard.
The fatty acids in the embryo's liver at ages (7, 11, 14 and 19) days incubation, small chicken aged (14) days after hatching and adult were analyzed, and found (5) fatty acids, the highest concentration of fatty acid in the adult of domesticated chicken and lowest concentration in small chicken age (14) days after hatching. Statistically, there were high significant differences at the probability level (P≤0.001) between all ages together, and the highest concentrations of Oleic acid (C18:1) and Linoleic acid (C18:2) were in embryo age (7) days incubation, while in embryo age (11) days incubation Stearic acid (C18:0) and α-Linolenic acid (C18:3) were higher concentration and Palmitic acid (C16:0) was the highest concentration in the adul
... Show MoreCollapsing building structures during recent earthquakes, especially in Northern and Eastern Kurdistan, including the 2003 earthquake in Cewlig; the 2011 earthquake in Van; and the 2017 earthquake near Halabja province, has raised several concerns about the safety of pre-seismic code buildings and emergency facilities in Erbil city. The seismic vulnerability assessment of the hospital buildings as emergency facilities is one of the necessities which have a critical role in the recovery period following earthquakes. This research aims to study in detail and to extend the present knowledge about the seismic vulnerability of the Rizgary public hospital building in Erbil city, which was constructed before releasing the seism
... Show MoreThe scientific and technological developments and their practical applications in all fields of life in general and in the education field in specific have led to the emergence of variables in the educational structure, teaching methods and in education in their modern form which is consistent in its entirety with the spirit of the age. We today live the age of knowledge increase full of wide ranging scientific and technological developments. Thus life demands human capabilities of a special kind able to develop and innovate. Here the increasing significance emerges for taking care of the human powers through educational systems much different from those current traditional systems. System
... Show MoreA study was carried out to determine the concentrations of trace metals in vegetables and fruits, which are locally available in the markets of Baghdad-samples of fourteen varieties of vegetables and fruits, belonging to Beta vulgaris, Brassica rapa, Daucus carota, Allium cepa, Eurica sativa, Malva silvestris, Coriandrum Sativum, Trigonella Foenum craecum, Anethum graveolens, Barassica oleracea, Phaseolus vulgaris, citrus reticulata, Py rus malus, and Punica granatum. Analysis for Cd,Pb, Mn, Fe, Co, Ni, Cu and Zn were determined by flame atomic absorption sp ectrophotometry. The results indicated that the Malva silvestris recorded the highest concentrations of Cd and Mn while Allium cepa showed the highest concentrations of Pb and Cu. But E
... Show MoreThe study of the chemical and physical factors that induce egg-laying is important for understanding mosquitoes' ecology. These substances may also help assess and control mosquito populations. With this in mind, we have highlighted the attractiveness of Culex pipiens gravid females concerning the containers' color and surface, which has enabled us to show that females of this species are always attracted to large containers of yellow. The ethological tests were made with four biopesticides on the attractiveness of C. pipiens females. It has been observed that the highest densities of the eggs are deposited in the container which contains the biopesticides extracts compared to that which includes the spring wat
... Show MoreA study was carried out to determine the concentrations of trace metals in vegetables and fruits, which are locally available in the markets of Baghdad-samples of fourteen varieties of vegetables and fruits, belonging to Beta vulgaris, Brassica rapa, Daucus carota, Allium cepa, Eurica sativa, Malva silvestris, Coriandrum Sativum, Trigonella Foenum craecum, Anethum graveolens, Barassica oleracea, Phaseolus vulgaris, citrus reticulata, Pyrus malus, and Punica granatum. Analysis for Cd,Pb, Mn, Fe, Co, Ni, Cu and Zn were determined by flame atomic absorption spectrophotometry. The results indicated that the Malva silvestris recorded the highest concentrations of Cd and Mn while Allium cepa showed the highest concentrations of Pb and Cu. But
... Show More