Today with increase using social media, a lot of researchers have interested in topic extraction from Twitter. Twitter is an unstructured short text and messy that it is critical to find topics from tweets. While topic modeling algorithms such as Latent Semantic Analysis (LSA) and Latent Dirichlet Allocation (LDA) are originally designed to derive topics from large documents such as articles, and books. They are often less efficient when applied to short text content like Twitter. Luckily, Twitter has many features that represent the interaction between users. Tweets have rich user-generated hashtags as keywords. In this paper, we exploit the hashtags feature to improve topics learned from Twitter content without modifying the basic topic model of LSA and LDA. Users who share the same hashtag at most discuss the same topic. We compare the performance of the two methods (LSA and LDA) using the topic coherence (with and without hashtags). The experiment result on the Twitter dataset showed that LSA has better coherence score with hashtags than that do not incorporate hashtags. In contrast, our experiments show that the LDA has a better coherence score without incorporating hashtags. Finally, LDA has a better coherence score than LSA and the best coherence result obtained from the LDA method was (0.6047) and the LSA method was (0.4744) but the number of topics in LDA was higher than LSA. Thus, LDA may cause the same tweets to discuss the same subject set into different clustering.
Arabic text categorization for pattern recognitions is challenging. We propose for the first time a novel holistic method based on clustering for classifying Arabic writer. The categorization is accomplished stage-wise. Firstly, these document images are sectioned into lines, words, and characters. Secondly, their structural and statistical features are obtained from sectioned portions. Thirdly, F-Measure is used to evaluate the performance of the extracted features and their combination in different linkage methods for each distance measures and different numbers of groups. Finally, experiments are conducted on the standard KHATT dataset of Arabic handwritten text comprised of varying samples from 1000 writers. The results in the generatio
... Show MoreSpeech is the essential way to interact between humans or between human and machine. However, it is always contaminated with different types of environment noise. Therefore, speech enhancement algorithms (SEA) have appeared as a significant approach in speech processing filed to suppress background noise and return back the original speech signal. In this paper, a new efficient two-stage SEA with low distortion is proposed based on minimum mean square error sense. The estimation of clean signal is performed by taking the advantages of Laplacian speech and noise modeling based on orthogonal transform (Discrete Krawtchouk-Tchebichef transform) coefficients distribution. The Discrete Kra
A novel median filter based on crow optimization algorithms (OMF) is suggested to reduce the random salt and pepper noise and improve the quality of the RGB-colored and gray images. The fundamental idea of the approach is that first, the crow optimization algorithm detects noise pixels, and that replacing them with an optimum median value depending on a criterion of maximization fitness function. Finally, the standard measure peak signal-to-noise ratio (PSNR), Structural Similarity, absolute square error and mean square error have been used to test the performance of suggested filters (original and improved median filter) used to removed noise from images. It achieves the simulation based on MATLAB R2019b and the resul
... Show MoreReinforcing asphalt concrete with polyester fibers considered as an active remedy to alleviate the harmful impact of fatigue deterioration. This study covers the investigation of utilizing two shapes of fibers size, 6.35 mm by 3.00 mm and 12.70 mm by 3.00 mm with mutual concentrations equal to 0.25 %, 0.50 % and 0.75 % by weight of mixture. Composition of asphalt mixture consists of different optimum (40-50) asphalt cement content, 12.50 mm nominal aggregate maximum size with limestone dust as a filler. Following the traditional asphalt cement and aggregate tests, three essential test were carried out on mixtures, namely: Marshall test (105 cylindrical specimens), indirect tensile strength test (21 cylindrical specimens)
... Show MoreThe research deals with A very important two subjects, computer aided process planning (CAPP) and Quality of product with its dimintions which identified by the producer organization, the goal of the research is to Highlight and know the role of the CAPP technology to improve quality of the product of (rotor) in the engines factory in the general company for electrical industries, The research depends case study style by the direct visits of researcher to the work location to apply the operational paths generated by specialized computer program designed by researcher, and research divides into four axes, the first regard to the general structure of the research, the second to the theoretical review, the t
... Show MoreThis study was conducted to evaluate the hydrocarbon biodegradation abilities of Enterobacter cloacae, Staphylococcus aureus, Sphingomonas paucimobilis, and Pentoae species which were isolated from different diesel-contaminated soil samples. The isolates were identified by the Vitek 2 system. Fourier-transform spectroscopy (FT-IR) tested the potential of these isolates to biodegrade the diesel according to the peak areas, a significant decrease in the area of the peaks at 2856-2928 cm−1 corresponds to aliphatic hydrocarbons. The appearance of small peaks at 900-1032 cm−1 refers to substituted benzene derivative compounds. An appearance of some new peaks at 3010- 3030 cm−1 which indicate the presence of alcohol (-OH) and ketones (RC=O)
... Show More