Text documents are unstructured and high dimensional. Effective feature selection is required to select the most important and significant features from the sparse feature space. This paper therefore proposes an embedded feature selection technique based on Term Frequency-Inverse Document Frequency (TF-IDF) and Support Vector Machine-Recursive Feature Elimination (SVM-RFE) for unstructured and high-dimensional text classification. This technique can measure a feature’s importance in a high-dimensional text document. In addition, it aims to increase the efficiency of feature selection and thereby obtain promising text classification accuracy. In the first stage, TF-IDF acts as a filter approach that measures the importance of features in the text documents. In the second stage, SVM-RFE uses a backward feature elimination scheme to recursively remove insignificant features from the filtered feature subsets. This research runs sets of experiments on text documents retrieved from a benchmark repository comprising a collection of Twitter posts. Pre-processing is applied to extract relevant features. The pre-processed features are then divided into training and testing datasets. Next, feature selection is performed on the training dataset by calculating the TF-IDF score for each feature. SVM-RFE is then applied for feature ranking as the next feature selection step. Only the top-ranked features are selected for text classification using the SVM classifier. The experiments show that the proposed technique achieves 98% accuracy and outperforms other existing techniques. In conclusion, the proposed technique is able to select the significant features in unstructured and high-dimensional text documents.
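The two-stage selection described in this abstract (a TF-IDF filter followed by SVM-RFE, then classification with an SVM) can be sketched in plain Python as follows. This is only an illustrative sketch and not the authors' implementation: the toy posts, the whitespace tokenizer, the unsmoothed IDF, the zero-score filter threshold, the subgradient-descent SVM trainer, and every function name are assumptions introduced here.

```python
import math
from collections import Counter


def tfidf(docs):
    """Return (vocab, matrix) where matrix[i][j] is tf * idf of vocab[j] in docs[i]."""
    tokens = [d.lower().split() for d in docs]          # assumed tokenizer: whitespace
    vocab = sorted({t for doc in tokens for t in doc})
    df = Counter(t for doc in tokens for t in set(doc))  # document frequency
    idf = {t: math.log(len(docs) / df[t]) for t in vocab}  # unsmoothed IDF (assumption)
    matrix = []
    for doc in tokens:
        tf = Counter(doc)
        matrix.append([tf[t] / len(doc) * idf[t] for t in vocab])
    return vocab, matrix


def train_svm(X, y, epochs=300, eta=0.1, lam=0.01):
    """Linear SVM fitted by subgradient descent on the L2-regularised hinge loss."""
    w, b = [0.0] * len(X[0]), 0.0
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            margin = yi * (sum(wj * xj for wj, xj in zip(w, xi)) + b)
            w = [wj * (1.0 - eta * lam) for wj in w]      # L2 shrinkage
            if margin < 1.0:                              # hinge-loss subgradient step
                w = [wj + eta * yi * xj for wj, xj in zip(w, xi)]
                b += eta * yi
    return w, b


def columns(X, idx):
    """Keep only the columns listed in idx."""
    return [[row[j] for j in idx] for row in X]


def svm_rfe(X, y, n_keep):
    """Backward elimination: retrain, drop the feature with the smallest w_j**2, repeat."""
    active = list(range(len(X[0])))
    while len(active) > n_keep:
        w, _ = train_svm(columns(X, active), y)
        weakest = min(range(len(active)), key=lambda k: w[k] ** 2)
        del active[weakest]
    return active


# Hypothetical training posts (+1 = positive, -1 = negative).
train_docs = ["good movie today", "good phone today",
              "bad movie today", "bad phone today"]
train_y = [1, 1, -1, -1]

vocab, X = tfidf(train_docs)

# Stage 1 (filter): keep features whose summed TF-IDF score is above zero.
scores = [sum(row[j] for row in X) for j in range(len(vocab))]
filtered = [j for j, s in enumerate(scores) if s > 0.0]

# Stage 2 (SVM-RFE): recursively eliminate the weakest remaining features.
kept = svm_rfe(columns(X, filtered), train_y, n_keep=2)
final_idx = [filtered[k] for k in kept]
selected = [vocab[j] for j in final_idx]

# Final classifier trained on the selected features only.
w, b = train_svm(columns(X, final_idx), train_y)
predictions = [1 if sum(wj * xj for wj, xj in zip(w, row)) + b > 0 else -1
               for row in columns(X, final_idx)]
print(selected, predictions)
```

In this toy setting the filter discards the word that occurs in every post, and the elimination loop then removes the words that appear equally often in both classes. For real data, a library such as scikit-learn (`TfidfVectorizer`, `LinearSVC`, `RFE`) would typically replace these hand-written helpers.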
Abstract
The perpetuity of the Quranic discourse requires that it be suitable for all ages.
Accordingly, the method of the Glorious Quran is a prerequisite for conscious
investigation and realization in order to detect the core of the texts, as the Quranic
discourse is considered a general address to humanity as a whole. For this
reason, the progress of the relevant studies necessitated that they cope with the
current developments in the requirements of the age and its cultural changes across ages.
The texts of the Glorious Quran enlightened human reason as being the
Creator’s miracle, for it is characterized by certain merits that make it different from
poetry and prose. It is a unique texture in its rheto
...
An experimental investigation based on thirty-three simple pullout cylinder specimens was conducted to study the bond-slip trend between concrete and steel reinforcement. Plain and deformed steel reinforcement bars were used in this investigation. The effects of bar diameter, concrete compressive strength and development length on the bond-slip relation were examined. The results showed that the bond strength increases with increasing compressive strength and with decreasing bar diameter and development length. A nonlinear regression analysis of the experimental results yields a mathematical correlation to predict the bond strength as a function of concrete compressive strength, reinforcing bar diameter and its yield stress. The minimum
...
In this research work, a simulator with time-domain visualizers and configurable parameters, built with a continuous-time simulation approach in Matlab R2019a, is presented for modeling and investigating the performance of optical fiber and free-space quantum channels as part of a generic quantum key distribution system simulator. The modeled optical fiber quantum channel is characterized by a maximum allowable distance of 150 km with an attenuation of 0.2 dB/km at λ = 1550 nm, while at λ = 900 nm and λ = 830 nm the attenuation values are 2 dB/km and 3 dB/km, respectively. The modeled free-space quantum channel is characterized by an attenuation of 0.1 dB/km at λ = 860 nm, also with a maximum allowable distance of 150 km. The simulator was investigated in terms of the execution of the BB84 prot
...
The demand for internet applications has increased rapidly. Meeting quality of service (QoS) requirements for varied internet applications is a challenging task. One important factor that significantly affects QoS is the transport layer, which provides end-to-end data transmission across a network. Currently, the most common transport protocols used by internet applications are TCP (Transmission Control Protocol) and UDP (User Datagram Protocol). There are also more recent transport protocols such as DCCP (Datagram Congestion Control Protocol), SCTP (Stream Control Transmission Protocol), and TFRC (TCP-Friendly Rate Control), which are in the standardization process of Internet Engineering Task
...
The aim of this investigation is to study and analyze the role of governance in the evaluation of the social performance of economic units, addressing first the concept of corporate governance and then social performance and its relationship to corporate governance.
The most important result of this research is that corporate governance is of extreme importance. It derives its importance from being an essential tool that contributes to the transparency and fair disclosure of the financial results of economic units and to the fight against financial and administrative corruption in those units, thus providing protection and confidence for all parties, and the evaluating soci
...
The objective of this study is to verify the overall performance and evaluate the wastewater quality of the wastewater treatment plant at the Abu Ghraib Dairy Factory, and to compare the results with the Iraqi Quality Standards (IQS) for effluent disposal and with the national determinants for the use of treated water in agricultural irrigation. The data comprised daily assessment records of the main parameters affecting wastewater [five-day biochemical oxygen demand (BOD5), chemical oxygen demand (COD), total dissolved solids (TDS), total suspended solids (TSS), phosphate (PO4), nitrate (NO3), and hydrogen ion concentration (pH)], obtained from the quality control department of the Abu Ghraib dairy plant and recorded from January 2017 to December 2020. Th
...