Feature selection (FS) constitutes a series of processes used to decide which relevant features/attributes to include and which irrelevant features to exclude for predictive modeling. It is a crucial task that aids machine learning classifiers in reducing error rates, computation time, overfitting, and improving classification accuracy. It has demonstrated its efficacy in myriads of domains, ranging from its use for text classification (TC), text mining, and image recognition. While there are many traditional FS methods, recent research efforts have been devoted to applying metaheuristic algorithms as FS techniques for the TC task. However, there are few literature reviews concerning TC. Therefore, a comprehensive overview was systematically studied by exploring available studies of different metaheuristic algorithms used for FS to improve TC. This paper will contribute to the body of existing knowledge by answering four research questions (RQs): 1) What are the different approaches of FS that apply metaheuristic algorithms to improve TC? 2) Does applying metaheuristic algorithms for TC lead to better accuracy than the typical FS methods? 3) How effective are the modified, hybridized metaheuristic algorithms for text FS problems?, and 4) What are the gaps in the current studies and their future directions? These RQs led to a study of recent works on metaheuristic-based FS methods, their contributions, and limitations. Hence, a final list of thirty-seven (37) related articles was extracted and investigated to align with our RQs to generate new knowledge in the domain of study. Most of the conducted papers focused on addressing the TC in tandem with metaheuristic algorithms based on the wrapper and hybrid FS approaches. Future research should focus on using a hybrid-based FS approach as it intuitively handles complex optimization problems and potentiality provide new research opportunities in this rapidly developing field.
The logistic regression model is one of the oldest and most common of the regression models, and it is known as one of the statistical methods used to describe and estimate the relationship between a dependent random variable and explanatory random variables. Several methods are used to estimate this model, including the bootstrap method, which is one of the estimation methods that depend on the principle of sampling with return, and is represented by a sample reshaping that includes (n) of the elements drawn by randomly returning from (N) from the original data, It is a computational method used to determine the measure of accuracy to estimate the statistics, and for this reason, this method was used to find more accurate estimates. The ma
... Show MoreOur research aimed to find a new material that can be an efficient heavy metal free flame retardant for plasticized poly(vinyl chloride) comparable to the conventional flame retardants. One of these extraordinary materials is Oxydtron using as an admixture for concrete. Oxydtron showed unexpected efficiency as a flame retardant agent and an excellent heat stabilizer as well. Limiting oxygen index (LOI), static heat stability, Congo-red, and differential scanning calorimetry (DSC) were carried out. The thermal tests proved that Oxydtron is suitable to improve plasticized poly(vinyl chloride) performance at high temperatures applications in terms of flame retarding and thermal stability
This paper discusses an optimal path planning algorithm based on an Adaptive Multi-Objective Particle Swarm Optimization Algorithm (AMOPSO) for two case studies. First case, single robot wants to reach a goal in the static environment that contain two obstacles and two danger source. The second one, is improving the ability for five robots to reach the shortest way. The proposed algorithm solves the optimization problems for the first case by finding the minimum distance from initial to goal position and also ensuring that the generated path has a maximum distance from the danger zones. And for the second case, finding the shortest path for every robot and without any collision between them with the shortest time. In ord
... Show MoreA total of 589 fishes, belonging to 23 species were collected from eight different localities
in north and mid Iraq during 1993. The parasitological inspection of such fishes revealed the
presence of 59 parasite species and two fungi. Among such parasites, five monogenetic
trematodes were recorded on the gills of some fishes for the first time in Iraq. These
included:- Ancyrocephalus vanbenedenii on Liza abu from Tigris river at Al-Zaafaraniya,
south of Baghdad; Dactylogyrus anchoratus on Cyprinus carpio from Tigris river at Al –
Zaafaranya D. minutus on C. carpio from both Tigris river at Al-Zaafaraniya and Euphrates
river at Al-Qadisiya dam lake; Discocotyle sagittata on L. abu from both the drainage system
at
The present work describes the adsorption of Ba2+ and Mg2+ions from aqueous solutions by activated alumina in single and binary system using batch adsorption. The effect of different parameters such as amount of alumina, concentration of metal ions, pH of solution, contact time and agitation speed on the adsorption process was studied. The optimum adsorbent dosage was found to be 0.5 g and 1.5 g for removal of Ba2+ and Mg2+, respectively. The optimum pH, contact time and agitation speed, were found to be pH 6, 2h and 300 rpm, respectively, for removal of both metal ions. The equilibrium data were analyzed by Langmuir and Freundlich isotherm models and the data fitted well to both isotherm modes as indicated by higher correlation of deter
... Show MoreIn this paper, Mann-Kendall test was used to investigate the existence of possible deterministic and stochastic climatic trends in (Baghdad,Basrah,Mosul,Al-Qaim) stations. The statistical test was applied to annual monthly mean of temperatures for the period (19932009). The values of S-statistic were (62, 44, 52, 64) by comparing these values with the table of null probability values for S we get a probability of (0.002, 0.026, 0.010, 0.002) this result is less than α for the 95% confidence level (α = 0.05) indicating a significant result at this level of confidence. Concluded that an increasing trend in concentration is present at the 95% confidence level and the variance of the S-statistic is calculated and it is com
... Show MorePyrolysis of high density polyethylene (HDPE) was carried out in a 750 cm3 stainless steel autoclave reactor, with temperature ranging from 470 to 495° C and reaction times up to 90 minute. The influence of the operating conditions on the component yields was studied. It was found that the optimum cracking condition for HDPE that maximized the oil yield to 70 wt. % was 480°C and 20 minutes. The results show that for higher cracking temperature, and longer reaction times there was higher production of gas and coke. Furthermore, higher temperature increases the aromatics and produce lighter oil with lower viscosity.