Text documents are unstructured and high-dimensional. Effective feature selection is required to select the most important and significant features from the sparse feature space. Thus, this paper proposes an embedded feature selection technique based on Term Frequency-Inverse Document Frequency (TF-IDF) and Support Vector Machine-Recursive Feature Elimination (SVM-RFE) for unstructured and high-dimensional text classification. This technique can measure a feature's importance in a high-dimensional text document. In addition, it aims to increase the efficiency of feature selection and thereby obtain promising text classification accuracy. TF-IDF acts as a filter approach that measures the importance of the features in the text documents at the first stage. SVM-RFE uses a backward feature elimination scheme to recursively remove insignificant features from the filtered feature subsets at the second stage. This research runs a set of experiments on a text dataset retrieved from a benchmark repository, comprising a collection of Twitter posts. Pre-processing is applied to extract relevant features. The pre-processed features are then divided into training and testing datasets. Next, feature selection is implemented on the training dataset by calculating the TF-IDF score for each feature. SVM-RFE is then applied for feature ranking as the next feature selection step. Only the top-ranked features are selected for text classification using the SVM classifier. The experiments show that the proposed technique achieves 98% accuracy, outperforming other existing techniques. In conclusion, the proposed technique is able to select the significant features in unstructured and high-dimensional text documents.
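The first (filter) stage described in the abstract above can be illustrated with a minimal sketch. It assumes the common tf-idf form, relative term frequency times log(N / document frequency), and ranks each term by its maximum score over all documents; the abstract specifies neither choice. The function names `tfidf_scores` and `top_k_terms` and the toy posts are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def tfidf_scores(docs):
    """Return one {term: tf-idf} dict per tokenized document."""
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    scores = []
    for doc in docs:
        counts = Counter(doc)
        total = len(doc)
        scores.append({
            # Relative term frequency times log inverse document frequency.
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return scores


def top_k_terms(scores, k):
    """Rank terms by their maximum tf-idf over all documents; keep the top k."""
    best = {}
    for doc_scores in scores:
        for term, value in doc_scores.items():
            best[term] = max(best.get(term, 0.0), value)
    # Sort by descending score; break ties alphabetically for determinism.
    ranked = sorted(best, key=lambda t: (-best[t], t))
    return ranked[:k]


# Toy, made-up posts (an assumption; not the benchmark data from the paper).
docs = [
    "great phone great battery".split(),
    "terrible battery life".split(),
    "great camera".split(),
]
scores = tfidf_scores(docs)
selected = top_k_terms(scores, k=3)
print(selected)
```

In the two-stage scheme the abstract describes, the terms kept by such a filter would then be passed to SVM-RFE, which repeatedly trains a linear SVM and drops the lowest-weighted features. That second stage is not shown here; a library implementation (for example scikit-learn's `RFE` wrapped around a linear SVM) is one possible choice, though the abstract does not say which tooling the authors used.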
The science of signs (semiology) belongs to the introduction to the language sciences and linguistics, which address the levels of language structure and its phonemic signs. Through it we can monitor and analyze the phonemic data of the actor and the ways he builds his linguistic speech, especially since the linguist Saussure emphasized that linguistics is only part of the science of signs, which is particularly advanced within logic, social psychology, and general psychology. Language in its origin, whatever the language and at whatever level, is not a separate, single, and unified language; in fact, languages are intertwined, multiple, varied, and renewed owing to the influence of the times and their development and the ...
In many oil-recovery systems, relative permeabilities (kr) are essential flow factors that affect fluid dispersion and output from petroleum resources. Traditionally, obtaining these crucial reservoir properties requires taking rock samples from the reservoir and performing suitable laboratory studies. Although kr is a function of fluid saturation, it is now well established that pore shape and distribution, absolute permeability, wettability, interfacial tension (IFT), and saturation history all influence kr values. These rock/fluid characteristics vary greatly from one reservoir region to the next, and it would be impossible to make kr measurements in all of them. The unsteady-state approach was used to calculate the relat...
The aim of the research is to determine the level of time management application and its impact on job performance, through a survey study in the General Company for Communication and Information Technology, and to provide recommendations that help employees optimize their use of time and improve performance, which is an important element in controlling the various functions of the company. To achieve the objectives of the research, a questionnaire based on two main variables was distributed to a random sample of (44) of the company's (308) employees, so the sample proportion was (14%). After the forms were collected from the sample, there were (6) incomplete forms that have been retri...
The researchers sought to determine the impact of customer contact on operations performance. (Customer contact involves two times: the first is the total time required to create a service, which includes the contact time; the second is the customer contact time itself, meaning the time during which the customer is physically present in the service process.) The study concentrates on cost (labor productivity), quality (patient-to-doctor ratio), speed (cycle time), and flexibility (the flexibility range). The innovation variable was excluded because it could not be measured at the Specialty Center for Dentistry in Al-Alwia, as the center lacks mechanisms t...
The research aimed to use HIIT exercises and to determine their effect on some physiological and physical indicators of young badminton players, and to identify the degree of competition anxiety and the performance of some offensive skills among these players. The research community was young badminton players. The research sample of (8) badminton players from the Athwari Club was chosen by the intentional method, and the scientific method was the experimental method with pre- and post-tests. The measurement tools were physiological tests (high and low blood pressure, pulse), physical tests (explosive force of arms and legs), the offensive skills, and the scale of competition an...
The study aimed to investigate the use of electronic supervision applications in developing the teaching performance of teachers in Oman. The study was based on the qualitative method, and the study population consisted of all first-cycle teachers in the Governorate of Muscat. The study sample comprised 24 female teachers. The interview was used as the tool for data collection. The study reached several results, including that there are difficulties in using electronic supervision applications: a weak network, dense curricula, lack of experience in applying technology, and the large number of tasks assigned to the teacher. These difficulties can be overcome by strengthening the network, training teachers, reducing th...
Many institutions, whether governmental or private, including media institutions, rely on evaluation to weigh and appraise matters, as well as to judge achievements and everything that concerns the institution. Through evaluation, institutions can recognize their weaknesses and strengths. It is also an effective tool for management review, through which the institution can review everything related to planning and decision-making, leadership, incentives, and other administrative matters.
Therefore, this study attempts to shed light on the evaluation process, its basics, and its importance in general and in the Kurdish media institutions as a model in part...
Two locally isolated microalgae (Chlorella vulgaris Beijerinck and Nitzschia palea (Kützing) W. Smith) were used in the current study to test their ability to produce biodiesel when stimulated by different nitrogen concentration treatments (0, 2, 4, 8 g/L), and to examine the effect of nitrogen concentration on the quantity of primary products (carbohydrate, protein) as well as the quantity and quality of lipid. The results revealed that nitrogen starvation led to a high lipid yield: in C. vulgaris and N. palea, the lipid content increased from 6.6% to 40% and from 40% to 60% of dry weight (DW), respectively. Also, in C. vulgaris, the highest carbohydrate content was 23% of DW in the zero-nitrate medium, and the highest protein content was 50% of DW in the 8 g/L treatment. Whil...