Text documents are unstructured and high-dimensional, so effective feature selection is required to pick the most significant features from a sparse feature space. This paper therefore proposes an embedded feature selection technique based on Term Frequency-Inverse Document Frequency (TF-IDF) and Support Vector Machine-Recursive Feature Elimination (SVM-RFE) for unstructured, high-dimensional text classification. The technique measures feature importance in high-dimensional text documents and aims to make feature selection more efficient, and thereby to obtain promising classification accuracy. In the first stage, TF-IDF acts as a filter that scores the importance of each feature in the text documents. In the second stage, SVM-RFE uses a backward elimination scheme to recursively remove insignificant features from the filtered feature subset. The experiments use text documents retrieved from a benchmark repository, comprising a collection of Twitter posts. Pre-processing is applied to extract relevant features, after which the pre-processed features are divided into training and testing datasets. Feature selection is then performed on the training dataset by calculating the TF-IDF score of each feature, followed by SVM-RFE for feature ranking. Only the top-ranked features are selected for text classification with the SVM classifier. In the experiments, the proposed technique achieved 98% accuracy, outperforming other existing techniques. In conclusion, the proposed technique is able to select the significant features in unstructured, high-dimensional text documents.
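The two-stage selection described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It uses only the Python standard library, so a small hand-written linear SVM (batch subgradient descent on the hinge loss) stands in for whatever SVM the paper used. The toy corpus, the labels, the use of mean TF-IDF as the filter score, and all names and parameters (`tfidf_matrix`, `train_linear_svm`, `svm_rfe`, `select_features`, `filter_keep`, `final_keep`) are assumptions made for illustration.

```python
import math
from collections import Counter

def tfidf_matrix(docs):
    """Return (vocab, rows); rows[i][j] is the TF-IDF of term j in document i."""
    tokenized = [d.lower().split() for d in docs]
    vocab = sorted({t for doc in tokenized for t in doc})
    df = Counter(t for doc in tokenized for t in set(doc))
    idf = {t: math.log(len(docs) / df[t]) + 1.0 for t in vocab}
    rows = []
    for doc in tokenized:
        counts = Counter(doc)
        rows.append([counts[t] / len(doc) * idf[t] for t in vocab])
    return vocab, rows

def train_linear_svm(X, y, epochs=200, lam=0.01, lr=0.1):
    """Fit (w, b) by batch subgradient descent on the L2-regularised hinge loss."""
    w, b, n = [0.0] * len(X[0]), 0.0, len(X)
    for _ in range(epochs):
        gw, gb = [lam * wj for wj in w], 0.0
        for xi, yi in zip(X, y):
            margin = yi * (sum(wj * xj for wj, xj in zip(w, xi)) + b)
            if margin < 1:  # only margin violators contribute to the subgradient
                for j, xj in enumerate(xi):
                    gw[j] -= yi * xj / n
                gb -= yi / n
        w = [wj - lr * gj for wj, gj in zip(w, gw)]
        b -= lr * gb
    return w, b

def svm_rfe(X, y, names, keep):
    """Stage 2: retrain, then drop the feature with the smallest squared weight."""
    active = list(range(len(names)))
    while len(active) > keep:
        Xa = [[row[j] for j in active] for row in X]
        w, _ = train_linear_svm(Xa, y)
        worst = min(range(len(active)), key=lambda k: w[k] ** 2)
        del active[worst]
    return [names[j] for j in active]

def select_features(docs, labels, filter_keep, final_keep):
    """Stage 1: keep the terms with the highest mean TF-IDF; stage 2: SVM-RFE."""
    vocab, X = tfidf_matrix(docs)
    mean_score = [sum(row[j] for row in X) / len(X) for j in range(len(vocab))]
    ranked = sorted(range(len(vocab)), key=lambda j: mean_score[j], reverse=True)
    kept = sorted(ranked[:filter_keep])
    Xf = [[row[j] for j in kept] for row in X]
    return svm_rfe(Xf, labels, [vocab[j] for j in kept], final_keep)

# Invented toy "posts": +1 = positive, -1 = negative (not data from the paper).
DOCS = [
    "great match today love it",
    "love this great team",
    "terrible service again hate it",
    "hate this terrible delay",
]
LABELS = [1, 1, -1, -1]

selected = select_features(DOCS, LABELS, filter_keep=8, final_keep=4)
print(selected)
```

In this sketch, terms that occur equally in both classes receive near-zero SVM weights, so the recursive stage discards them even though the TF-IDF filter ranked them highly. This is the kind of complementarity between the two stages that the abstract describes.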
Praise be to God, who opened His Book with praise of Himself, and prayers and peace be upon the one after whom there is no prophet, and upon his family, his companions, and those who follow them in righteousness until the Day of Judgment.
It is known to every researcher in jurisprudence and its principles that, with respect to the forms of legal obligation, textual indications are divided into command and prohibition. I saw the need to write a short study on prohibition. Since this topic is complex and has a great impact on disagreement among scholars, I decided to write on one issue within it: the absolute prohibition and its effect on the disagreement of jurists, and what is meant by the absolute p
Preserving the independence and sovereignty of the countries of the South Caucasus, establishing political and economic stability in the region, and supporting regional cooperation are the main elements of Turkish foreign policy towards the region. The South Caucasus, which shares historical and cultural ties with Turkey, is a bridge that links Turkey to Central Asia.
Turkey later developed advanced relations with Azerbaijan and Georgia. However, the same momentum was not achieved in terms of relations with Armenia because of the conflict over Nagorny Karabakh
The current study addresses the subject of shorthand in the structure of logo designs (Iraqi sports clubs as a model). The study comprises four chapters. The first chapter defines the research problem and its significance, as well as the aim of the research, which is to identify formal shorthand in the structure of logo designs (Iraqi sports clubs as a model). It also sets the temporal limits of the research: the logos of Iraqi sports clubs for the years 1956-1970, because these years cover the official logo designs of Iraqi sports clubs that were selected from this period in order to examine the state of design in the current research. The spatial limits are: the Republic of Iraq - Iraqi logo designs sports
Purpose: To contribute to the development of an appropriate program for the management of medical waste based on clear-cut principles in order to reach the overall goal of improving the public health and environment of the population in our country.
Design / Approach / Introduction: The research follows the analytical descriptive approach, collecting data with a checklist and analyzing the data with statistical treatments.
Results: There is a need to establish medical waste management in hospitals and to follow international standards at all stages of waste management, from sorting, collection, and transportation to treat
Background: Temporomandibular joint disorders are a group of heterogeneous pain and dysfunction conditions involving the masticatory system that reduce sufferers' quality of life. Arthrocentesis is a simple surgical procedure for the treatment of internal derangement that is less invasive than arthroscopy and more effective than conservative measures such as drugs, occlusal appliances, and physiotherapy. The aim of the study was to evaluate the effect of arthrocentesis with injection of hyaluronic acid in the treatment of internal derangement of the temporomandibular joint, in terms of restoring its function, reducing pain, and preventing further deterioration of temporomandibular joint dysfunction. Materials and methods: This study was perfo
Background: Shoulder pain is a common problem that can pose difficult diagnostic and therapeutic challenges for the family physician. It is the third most common musculoskeletal complaint in the general population and accounts for 5% of all general practitioners' musculoskeletal consultations. Objective: To determine the diagnostic performance of ultrasonography compared with physical examination for the detection of rotator cuff tears in painful shoulder syndrome. Method: A prospective comparative study was carried out at Al-Kindy Teaching Hospital from February 2007 to July 2011 on seventy patients (48 male, 22 female) with rotator cuff tears, aged 30-70 years (mean age 50 years), including physical and ultrasonogr
The study collected 240 samples divided into two groups: 120 samples from diabetic patients and 120 samples from healthy people. Each group included (90, 20, 10) samples from the mouth, urine, and vagina, respectively. The results showed (28.67, 4.00, 1.67) positive isolates of Candida in the mouth, urine, and vagina, respectively, of diabetic patients, compared with (9.33, 2.33, 5.00) positive isolates in the mouth, urine, and vagina, respectively, of healthy people. The rate of positive isolates was high in women, both diabetic and healthy, reaching 25.33 and 9.00 isolates, respectively, compared with the rate of isolates in men with Candida disease for diabetic patients and healthy people 14.67 and 2.0
The article examines metaphors as one of the fundamental means used by D. Rubina when writing the novel “Parsley Syndrome” to form images of dolls as equal heroes of the work. The author of the article continues research related to the work of Dina Ilinichna Rubina, a representative of modern Russian prose.
This paper aims to understand the properties of molecular hydrogen gas in a sample of extragalactic spirals. Observations of CO (J = 1-0) line emission are critical for spiral galaxies, particularly those with an energetic nucleus. In the compiled sample of spiral galaxies, a carbon monoxide CO (1-0) emission line can be observed. The gas kinematics and star formation of this sample are analyzed statistically using appropriate databases for atomic gas HI, molecular gas H2, infrared (1 μm-1000 μm), visual (at λ_blue-optical = 4400 Å), and radio (at ν_radio = 1.4 GHz and 5 GHz) data. The statistical analysis is performed with the STATISTICA software. The presence of a high scale of s