Department of Information Technology

Permanent URI for this communityhttps://dspace.psgrkcw.com/handle/123456789/161

Browse

Search Results

Now showing 1 - 3 of 3
  • Item
    SYNTHETIC MICRODATA GENERATION IN PRIVACY PRESERVING DATA MINING
    (VELS university, Chennai., 2011-03-11) Vijayarani S; Nithya S
    Data mining is the process of extracting the previously unknown patterns from large amount of data. Privacy preserving data mining is one of the reserve areas in data mining. It is used to provide the privacy for personally identifiable information in data mining. It also provides security to protect data. Privacy preserving Association Rule Mining, Privacy Preserving Clustering, Privacy Preserving classification, and etc., are some of the privacy preserving algorithms. The various techniques used in the privacy preserving data mining are statistical disclosure control, randomization, k-anonymity and I-diversity. In this paper, we havediscussed about the synthetic data generation concept, its techniques and methods which are used for protecting sensitive data in statistical disclosure control.
  • Item
    AN EFFICIENT CLUSTERING ALGORITHM FOR OUTLIER DETECTION
    (International Journal of Computer Applications, 2011-10) Vijayarani S; Nithya S
    With the help of data mining, an important and valuable knowledge is extracted from the large massive collection of data. There are s several techniques and algorithms are used for extracting the hidden patterns from the large data sets and finding the relationships between them. Clustering is one of the important techniques in data mining. Clustering algorithms are used for grouping the data items based on their similarity. Outlier Detection is a very important research problem in data mining. Clustering algorithms are used for detecting the outliers efficiently. In this research paper, we focused on outlier detection in health data sets such as Pima Indians Diabetes data set and Breast Cancer Wisconsin data set using partitioning clustering algorithms. The algorithms used in this research work are PAM, CLARA AND CLARANS and a new clustering algorithm ECLARANS is proposed for detecting outliers. In order to find the best clustering algorithm for outlier detection several performance measures are used. The experimental results show that the outlier detection accuracy is very good in the proposed ECLARANS clustering algorithm compared to the existing algorithms.
  • Item
    SENSITIVE OUTLIER PROTECTION IN PRIVACY PRESERVING DATA MINING
    (International Journal of Computer Applications, 2011-11) Vijayarani S; Nithya S
    Data mining is the extraction of hidden predictive information from large databases and also a powerful new technology with great potential to analyze important information in their data warehouses. Privacy preserving data mining is a latest research area in the field of data mining which generally deals with the side effects of the data mining techniques. Privacy is defined as “protecting individual’s information”. Protection of privacy has become an important issue in data mining research. Sensitive outlier protection is novel research in the data mining research field. Clustering is a division of data into groups of similar objects. One of the main tasks in data mining research is Outlier Detection. In data mining, clustering algorithms are used for detecting the outliers efficiently. In this paper we have used four clustering algorithms to detect outliers and also proposed a new privacy technique GAUSSIAN PERTURBATION RANDOM METHOD to protect the sensitive outliers in health data sets.