Voice Activity Detection (VAD) is considered as an important pre-processing step in speech processing systems such as speech enhancement, speech recognition, gender and age identification. VAD helps in reducing the time required to process speech data and to improve final system accuracy by focusing the work on the voiced part of the speech. An automatic technique for VAD using Fuzzy-Neuro technique (FN-AVAD) is presented in this paper. The aim of this work is to alleviate the problem of choosing the best threshold value in traditional VAD methods and achieves automaticity by combining fuzzy clustering and machine learning techniques. Four features are extracted from each speech segment, which are short term energy, zero-crossing rate, autocorrelation, and log energy. A modified version of fuzzy C-Means is then used to cluster speech segments into three clusters; two clusters for voice and one for unvoiced. After that, three feed forward neural networks are trained to adjust their weights, in which each network represents one cluster. To make the final decision regarding the class type of a given speech segment, the membership degrees of this segment in all clusters along with neural networks' decisions are given to a defuzzification step which finally gives the class type of that segment. The proposed FN-AVAD is tested on the public multimodal emotion database, Surrey AudioVisual Expressed Emotion (SAVEE), and the error rate was 2.08%. The achieved results are comparable to the results achieved by the current published works in the literature.
Schiff base derived from PVA and Erythroascorbic acid derivative (pentulosono-ɣ-lactone-2, 3-enedianisoate) was synthesized and characterized by Thin Layer Chromatography (TLC) and FTIR spectra, aldehyde was also characterized by (U.V-Vis), 1HNMR, 13CNMR and mass spectra. The inhibitory effect of prepared polymer on the activity of human serum Cholinesrerase has been studied in vitro. The polymer showed a remarkable activity at low concentration (4.5*10-3 – 4.5*10-8 M).
The complexes of the 2-hydroxy-4-Nitro phenyl piperonalidene with metal ions Cr(III), Ni(II), Pt(IV) and Zn(II) were prepared in ethanolic solution. These complexes were characterized by spectroscopic methods, conductivity, metal analyses and magnetic moment measurements. The nature of the complexes formed in ethanolic solution was study following the molar ratio method. From the spectral studies, monomer structures proposed for the nickel (II) and Zinc (II) complexes while dimeric structures for the chromium (III) and platinum (IV) were proposed. Octahedral geometry was suggested for all prepared complexes except zinc (II) has tetrahedral geometry, Structural geometries of these compounds were also suggested in gas phase by using
... Show MoreThe compound [L] was produced in the current study through the reaction of 4-aminoacetophenon with 4-methoxyaniline in the cold, concentrated HCl with 10% NaNO2. Curcumin, several transition metal complexes (Ni (II), La (III), and Hg (II)), and compound [L] were combined in EtOH to create new complexes. UV-vis spectroscopy, FTIR, AA, TGA-DSC, conductivity, chloride content, and elemental analysis (CHNS) were used to describe the structure of produced complexes. Biological activities against fungi, S. aureus (G+), Pseudomonas (G-), E. coli (G-), and Proteus (G-) were demonstrated using complexes. Depending on the outcomes of the aforementioned methods, octahedral formulas were given as the geometrical structures for each created comp
... Show MoreIn the present study, chitosan Schiff base has been prepared from chitosan reaction with p-chloro benzaldehyde. The AuNPs and AgNPs were manufactured by extract of onion peels as a reducing agent. The AuNPs and AgNPs that have been synthesized were characterized through UV-vis spectroscopy, XRD analyses and SEM microscopy. The polymer blends of the chitosan / PEG has been prepared by using the approach of solution casting. Chitosan Schiff base / PEG Au and Ag nanocomposites were synthesized, nanocomposites and polymer blends have been characterized by FTIR which confirm the formation of Schiff base by revealing a new band of absorption at 1693 cm-1 as a result of the (C=N) imine group. FESEM, DSC and TGA confirm the thermal stability
... Show MoreIn this paper we investigate the automatic recognition of emotion in text. We propose a new method for emotion recognition based on the PPM (PPM is short for Prediction by Partial Matching) character-based text compression scheme in order to recognize Ekman’s six basic emotions (Anger, Disgust, Fear, Happiness, Sadness, Surprise). Experimental results with three datasets show that the new method is very effective when compared with traditional word-based text classification methods. We have also found that our method works best if the sizes of text in all classes used for training are similar, and that performance significantly improves with increased data.
In Automatic Speech Recognition (ASR) the non-linear data projection provided by a one hidden layer Multilayer Perceptron (MLP), trained to recognize phonemes, and has previous experiments to provide feature enhancement substantially increased ASR performance, especially in noise. Previous attempts to apply an analogous approach to speaker identification have not succeeded in improving performance, except by combining MLP processed features with other features. We present test results for the TIMIT database which show that the advantage of MLP preprocessing for open set speaker identification increases with the number of speakers used to train the MLP and that improved identification is obtained as this number increases beyond sixty.
... Show MoreIn this paper, a subspace identification method for bilinear systems is used . Wherein a " three-block " and " four-block " subspace algorithms are used. In this algorithms the input signal to the system does not have to be white . Simulation of these algorithms shows that the " four-block " gives fast convergence and the dimensions of the matrices involved are significantly smaller so that the computational complexity is lower as a comparison with " three-block " algorithm .
Digital image is widely used in computer applications. This paper introduces a proposed method of image zooming based upon inverse slantlet transform and image scaling. Slantlet transform (SLT) is based on the principle of designing different filters for different scales.
First we apply SLT on color image, the idea of transform color image into slant, where large coefficients are mainly the signal and smaller one represent the noise. By suitably modifying these coefficients , using scaling up image by box and Bartlett filters so that the image scales up to 2X2 and then inverse slantlet transform from modifying coefficients using to the reconstructed image .
&nbs
... Show MoreThe purpose of this research is to enhance the role of organizational communication in organizations using IT technologies. The results showed that there is a strong relationship with information technology technologies in enhancing the role of organizational communication, which in turn helps to improve the performance of organizations in general