Estimating the semantic similarity between short texts plays an increasingly prominent role in many fields related to text mining and natural language processing applications, especially with the large increase in the volume of textual data that is produced daily. Traditional approaches for calculating the degree of similarity between two texts, based on the words they share, do not perform well with short texts because two similar texts may be written in different terms by employing synonyms. As a result, short texts should be semantically compared. In this paper, a semantic similarity measurement method between texts is presented which combines knowledge-based and corpus-based semantic information to build a semantic network that represents the relationship between the compared texts and extracts the degree of similarity between them. Representing a text as a semantic network is the best knowledge representation that comes close to the human mind's understanding of the texts, where the semantic network reflects the sentence's semantic, syntactical, and structural knowledge. The network representation is a visual representation of knowledge objects, their qualities, and their relationships. WordNet lexical database has been used as a knowledge-based source while the GloVe pre-trained word embedding vectors have been used as a corpus-based source. The proposed method was tested using three different datasets, DSCS, SICK, and MOHLER datasets. A good result has been obtained in terms of RMSE and MAE.
Abstract
This research aim to overcome the problem of dimensionality by using the methods of non-linear regression, which reduces the root of the average square error (RMSE), and is called the method of projection pursuit regression (PPR), which is one of the methods for reducing dimensions that work to overcome the problem of dimensionality (curse of dimensionality), The (PPR) method is a statistical technique that deals with finding the most important projections in multi-dimensional data , and With each finding projection , the data is reduced by linear compounds overall the projection. The process repeated to produce good projections until the best projections are obtained. The main idea of the PPR is to model
... Show MoreThe calculation of the oil density is more complex due to a wide range of pressuresand temperatures, which are always determined by specific conditions, pressure andtemperature. Therefore, the calculations that depend on oil components are moreaccurate and easier in finding such kind of requirements. The analyses of twenty liveoil samples are utilized. The three parameters Peng Robinson equation of state istuned to get match between measured and calculated oil viscosity. The Lohrenz-Bray-Clark (LBC) viscosity calculation technique is adopted to calculate the viscosity of oilfrom the given composition, pressure and temperature for 20 samples. The tunedequation of state is used to generate oil viscosity values for a range of temperatu
... Show MoreAlpha shape theory for 3D visualization and volumetric measurement of brain tumor progression using magnetic resonance images
Background: Polycystic ovary syndrome (PCOS) is common heterogeneous disorder syndrome in females, characterized by chronic oligoovulation, polycystic ovary, and hyperandrogenism. This study aimed to the association of ferritin and transforming growth factor- β1 (TGF-β1) levels with insulin resistance, cardiovascular and type 2 diabetes risks. Patients and methods: (61) Iraqi women with PCOS patients diagnosed according to the Rotterdam criteria, were subdivided according to their Body Mass Index (BMI) to: (20) lean women with normal BMI: (18-24), (17) overweight women with BMI: (25-29) and (25) obese women with BMI >30. For the the purpose of comparison, (20) healthy Iraqi women were enrolled as controls ma
... Show MoreBackground: Metabolic syndrome (MetS) is a collection of connected cardiovascular risk factors that characterizes the complicated illness. The waist circumference cutoff point fluctuation has so far defined Mets. Objective: This study aimed to determine the cutoff point for WC in healthy Iraqi adults. Methods: This cross-sectional survey establishes the standard value for WC among 300 healthy university students in Wasit city, Iraq. They are aged between 18-25 years. The receiver operator characteristic (ROC) curve was used WC to predict the presence of two or more risk factors for MetS, as defined by IDF. Results: The cutoff level yielding maximum sensitivity and specificity for predicting the presence of multiple risk factors was
... Show MoreThis work represents the set of measurements of radon and thoron concentrations levels of soil-gas in Al-Kufa city in Iraq using electric Radon meter (RAD-7). Radon and thoron concentration were measured in soil-gas in 20 location for three depth of (50, 100 and 150) cm.
The results show that the emanation rate of radon and thoron gas varied from location to anther, depending on the geological formation. The Radon concentration in soil has been found to vary from (12775±400) Bq/m3 at 150 cm depth in location (sample K2) to (41.45±17) Bq/m3, for depth 150 cm in location (sample K20). The thoron concentration in soil has been found to vary from (198±8.5) Bq/m3 at 150 cm depth in location samples (K1 & K2) to undetected in the mos
Background: Multifactor affect the pathogenesis of thrombosis in solid malignancy; however, a significant role is attributed to the cancer cells ability to interact with and activate the host hemostatic system. [1]
Hemostasis is highly correlated to tumor growth, angiogenesis and metastasis, modulation of these pathways reflects interesting and promising treatment options in the future. [1]
Most patients with cancer frequently suffer from chronic compensated DIC and have abnormal laboratory coagulation tests without clinical manifestations of thrombosis, which is a subclinical hypercoagulable state that can be detected by varying degrees of activation of blood clotting. The results of laboratory tests in th
... Show MoreWeibull distribution is considered as one of the most widely distribution applied in real life, Its similar to normal distribution in the way of applications, it's also considered as one of the distributions that can applied in many fields such as industrial engineering to represent replaced and manufacturing time ,weather forecasting, and other scientific uses in reliability studies and survival function in medical and communication engineering fields.
In this paper, The scale parameter has been estimated for weibull distribution using Bayesian method based on Jeffery prior information as a first method , then enhanced by improving Jeffery prior information and then used as a se
... Show MoreIn this work, an estimation of the key rate of measurement-device-independent quantum key distribution (MDI-QKD) protocol in free space was performed. The examined free space links included satellite-earth downlink, uplink and intersatellite link. Various attenuation effects were considered such as diffraction, atmosphere, turbulence and the efficiency of the detection system. Two cases were tested: asymptotic case with infinite number of decoy states and one-decoy state case. The estimated key rate showed the possibility of applying MDI-QKD in earth-satellite and intersatellite links, offering longer single link distance to be covered.