B-Scopus
Permanent URI for this communityhttps://dspace.psgrkcw.com/handle/123456789/3730
Browse
10 results
Search Results
Item EFFICIENT PREDICTION OF PHISHING WEBSITES USING SUPERVISED LEARNING ALGORITHMS (Conference Paper)(Elsevier B.V, 2012-12-09) Lakshmi V, Santhana; Vijaya, M SPhishing is one of the luring techniques used by phishing artist in the intention of exploiting the personal details of unsuspected users. Phishing website is a mock website that looks similar in appearance but different in destination. The unsuspected users post their data thinking that these websites come from trusted financial institutions. Several antiphishing techniques emerge continuously but phishers come with new technique by breaking all the antiphishing mechanisms. Hence there is a need for efficient mechanism for the prediction of phishing website. This paper employs Machine-learning technique for modelling the prediction task and supervised learning algorithms namely Multi layer perceptron, Decision tree induction and Naïve bayes classification are used for exploring the results. It has been observed that the decision tree classifier predicts the phishing website more accurately when comparing to other learning algorithms.Item GRAPH CONVOLUTIONAL NEURAL NETWORK FOR IC50 PREDICTION MODEL WITH DRUG SMILES GRAPHS AND GENE EXPRESSIONS OF AMYOTROPHIC LATERAL SCLEROSIS(Journal of Theoretical and Applied Information Technology, 2024-01) Devipriya, S; Vijaya, M SIC50 prediction for neurodegenerative disorders like Amyotrophic Lateral Sclerosis is crucial in biomedical studies. Traditional machine learning models that use molecular descriptors and gene expression for building IC50 prediction models produce less accuracy and also most of the descriptors created by different tools are irrelevant and undefined. In this paper, a Graph Convolutional Neural Network, a deep learning algorithm, is employed for constructing a more precise IC50 prediction model. The model leverages the structural properties of drug molecules represented in graph format, and incorporates gene expression data as global features. So, the model is able to learn drug-gene interactions better. The drug-gene interactivity is learned by the model without drug-induced gene expressions as it is not found for most of the diseases. The work is implemented with well-known and most relevant 80 drugs related to ALS based on the pIC50 values of 32 protein targets of ALS disorder. The Canonical Smiles graph and their corresponding IC50 values of 80 drugs have been derived from the ChEMBL databases. Based on information from the Repurposing Hub in the Depmap database gene expression data for drug-related genes connected with ALS-related conditions is collected. The predictive results show that the proposed GCNN model with fine-tuned hyperparameters achieves MAE of 0.18, RMSE of 0.16 and R2 Score of 0.90.Item SUPPORT VECTOR MACHINE BASED EPILEPSY PREDICTION USING TEXTURAL FEATURES OF MRI(Elsevier Ltd, 2010) Sujitha, V; Sivagami, P; Vijaya, M SEpilepsy is a disorder of the central nervous system, specifically the brain. It is a neurological malfunction affecting about 1% of the population and is the third most common neurological disorder following rheumatic heart disease and Alzheimer’s disease, but it imposes higher costs on society. Magnetic Resonance Imaging (MRI) is one of the most common diagnostic tests used for patients for epilepsy prediction. Shortage of radiologists and the large volume of MRI scan images that need to be analyzed may lead to labor intensive, expensive and inaccurate prediction. Hence there is a need to generate an efficient prediction model for making a correct diagnosis of epilepsy and accurate prediction of its type. This paper describes the modeling of epilepsy prediction using Support Vector Machines (SVM), a machine learning algorithm. The prediction model has been generated by training the support vector machine with descriptive features derived from MRI data of 350 patients and observed that the SVM based model with a Radial Basis Function (RBF) kernel produces 93.87% of prediction accuracy.Item TEMPORAL FUSION TRANSFORMER: A DEEP LEARNING APPROACH FOR MODELING AND FORECASTING RIVER WATER QUALITY INDEX(International Journal of Intelligent Systems and Applications in Engineering, 2023-07-23) Jitha P, Nair; Vijaya, M SWater quality is a major factor when it comes to human and environmental health. The WQI is a key performance indicator for water management effectiveness. Water quality changes over time due to several seasonal attributes and physiochemical properties. Asthe seasons change at each site, the weather records are transformed into time series data, and the values of the physiochemical parameters shift accordingly. This paper introduces a novel temporal fusion transformer architecture for modelling and forecasting river water quality index. The WQI prediction model for the Bhavani River utilizes the temporal fusion transformer to incorporate temporal features fromvarious scales of time series data obtained from monitoring stations.The performance results of the study are compared with other existing prediction models and demonstrated the effectiveness of the temporal fusion transformer approach for modelling and forecasting river water quality.Item GATED RECURRENT NEURAL NETWORK FOR AUTISM SPECTRUM DISORDER GENE PREDICTION(International Journal on Emerging Technologies, 2020) Pream Sudha, V; Vijaya, M SAutism Spectrum Disorder (ASD) is the fastest-growing complex disorder and the genetic ground of this comprehensive developmental disability is very difficult to research. Autism diagnosis for an average child is not done till the age of four, though it can be given at the age of 18 months to two years. Hence a computational model that enables the early diagnosis and personalized treatment is the need of the hour. In this research work, a deep learning based approach is proposed for Autism Spectrum Disorder (ASD) gene prediction. There are various contributors for Autism including genes, mutations, chromosomal settings influence of the environment, prenatal influences, family factors and problems during birth. Recurrent Neural Network (RNN) based Gated Recurrent Units (GRU) are implemented to develop a model that predicts ASD genes, mutations and gene susceptibility. GRUs with their internal memory capability are valuable to store and filter information using the update and reset gates. Also GRU offers a powerful tool to handle sequence data. The model is trained using three simulated datasets with features representing genes, mutations and gene susceptibility to ASD. Besides, the proposed approach is compared to original RNN and Long Short Term Memory Units (LSTM) for ASD prediction. The experimental results confirm that the proposed approach is promising with 82.5% accuracy and hence GRU RNN is found to be effective for ASD gene prediction.Item SUPPORT VECTOR REGRESSION FOR PREDICTING BINDING AFFINITY IN SPINOCEREBELLAR ATAXIA(Springer Link, 2019) Asha, P R; Vijaya, M SSpinocerebellar ataxia (SCA) is an inherited disorder. It arises mainly due to gene mutations, which affect gray matter in the brain causing neurodegeneration. There are certain types of SCA that are caused by repeat mutation in the gene, which produces differences in the formation of protein sequence and structures. Binding affinity is very essential to know how tightly the ligand binds with the protein. In this work, a binding affinity prediction model is built using machine learning. To build the model, predictor variables and their values such as binding energy, IC50, torsional energy and surface area for both ligand and protein are extracted from the complex using AutoDock, AutoDock Vina and PyMOL. A total of 17 structures and 18 drugs were used for learning the support vector regression (SVR) model. Experimental results proved that the SVR-based affinity prediction model performs better than other regression models.Item BINDING AFFINITY PREDICTION MODELS FOR SPINOCEREBELLAR ATAXIA USING SUPERVISED LEARNING(Springer Link, 2018-08-21) Asha, P R; Vijaya, M SSpinocerebellar Ataxia (SCA) is an inherited disorder flow in the family, even when one parent is affected. Disorder arises mainly due to mutations in the gene, which affects the gray matter in the brain and causes neuron degeneration. There are certain types of SCA that are caused by repeat mutation in the gene, which produces differences in the formation of protein sequence and structures. Binding affinity is essential to know how tightly the ligand binds to the protein. In this work, the binding affinity prediction model is built using machine learning. To build the model, features like Binding energy, IC50, Torsional energy and surface area for both ligand and protein are extracted from Auto dock, auto dock vina and PYmol from the complex. A total of 17 structures and 18 drugs were used for building the model. This paper proposes a predictive model using applied mathematics, machine learning regression techniques like rectilinear regression, Artificial neural network (ANN) and Random Forest (RF). Experimental results show that the model built using Random Forest outperforms in predicting the binding affinity.Item SUPPORT VECTOR MACHINE BASED EPILEPSY PREDICTION USING TEXTURAL FEATURES OF MRI (Conference Paper pdf)(Elsevier Ltd, 2010) Sujitha, V; Sivagami, P; Vijaya, M SEpilepsy is a disorder of the central nervous system, specifically the brain. It is a neurological malfunction affecting about 1% of the population and is the third most common neurological disorder following rheumatic heart disease and Alzheimer’s disease, but it imposes higher costs on society. Magnetic Resonance Imaging (MRI) is one of the most common diagnostic tests used for patients for epilepsy prediction. Shortage of radiologists and the large volume of MRI scan images that need to be analyzed may lead to labor intensive, expensive and inaccurate prediction. Hence there is a need to generate an efficient prediction model for making a correct diagnosis of epilepsy and accurate prediction of its type. This paper describes the modeling of epilepsy prediction using Support Vector Machines (SVM), a machine learning algorithm. The prediction model has been generated by training the support vector machine with descriptive features derived from MRI data of 350 patients and observed that the SVM based model with a Radial Basis Function (RBF) kernel produces 93.87% of prediction accuracy.Item EFFICIENT PREDICTION OF PHISHING WEBSITES USING SUPERVISED LEARNING ALGORITHMS(Elsevier Ltd, 2012) Santhana Lakshmi, V; Vijaya, M SPhishing is one of the luring techniques used by phishing artist in the intention of exploiting the personal details of unsuspected users. Phishing website is a mock website that looks similar in appearance but different in destination. The unsuspected users post their data thinking that these websites come from trusted financial institutions. Several antiphishing techniques emerge continuously but phishers come with new technique by breaking all the antiphishing mechanisms. Hence there is a need for efficient mechanism for the prediction of phishing website. This paper employs Machine-learning technique for modelling the prediction task and supervised learning algorithms namely Multi layer perceptron, Decision tree induction and Naïve bayes classification are used for exploring the results. It has been observed that the decision tree classifier predicts the phishing website more accurately when comparing to other learning algorithms.Item MEASURING WEB CONTENT CREDIBILITY USING PREDICTIVE MODELS(Springer Link, 2020-01-30) Manjula, R; Vijaya, M SWeb content credibility is a measure of believable and trustworthy of the web content that is perceived. Content can turn out to be unreliable if it is not up-to-date and it is not measured for quality or accuracy and therefore, web content credibility is important for the individuals to access the content or information. The analysis of content credibility is an important and challenging task as the content credibility is expressed on essential factors. This paper focus on building predictive models to discover and evaluate credibility of a web page content through machine learning technique. A corpus of 300 web page contents have been developed and the factors like Readability, Freshness, Duplicate Content are defined and captured to model the credibility of web content. Two different labeling such as binary labeling and numeric labeling are used for defining credibility. In case of binary labeling, the high and low credibility of web content are represented by 1 and 0, respectively, whereas in case of numeric labeling five-point scale rating is used to mark the content credibility. Accordingly, two independent datasets have been developed. Different regression algorithms such as Linear Regression, Logistic Regression, Support Vector Regression (SVR) are employed for building the predictive models. Various experiments have been carried out using two different datasets and the performance analysis shows that the Logistic Regression model outperforms well when compared to other prediction algorithms.