Volume-14 ~ Issue-4
- Citation
- Abstract
- Reference
- Full PDF
Abstract: Privacy Preserving Data Mining(PPDM) is a rising field of research in Data Mining and various approaches are being introduced by the researchers. One of the approaches is a sanitization process, that transforms the source database into a modified one by removing selective items so that the counterparts or adversaries cannot extract the hidden patterns from. This study address this concept and proposes a revised Item-based Maxcover Algorithm(IMA) which is aimed at less information loss in the large databases with minimal removal of items.
Keywords: Privacy Preserving Data Mining, Restrictive Patterns, Sensitive Transactions, Maxcover, Sanitized database.
[1] Verykios,V.S, Bertino.E, Fovino.I.N, ProvenzaL.P, Saygin.Y and Theodoridis.Y, "State-of-the-art in Privacy Preservation Data Mining", New York,ACM SIGMOD Record, vol.33, no.2, pp.50-57,2004.
[2] Atallah.M, Bertino.E, Elmagarmid.A, Ibrahim.M and Verykios.V.S, "Disclosure Limitation of Sensitive Rules", In Proc. of IEEE Knowledge and Data Engineering Workshop, pages 45–52, Chicago, Illinois, November 1999.
[3] Clifton.C and Marks.D, " Security and Privacy Implications of Data Mining", In Workshop on Data Mining and Knowledge Discovery, pages 15–19, Montreal, Canada, February 1996.
[4] Dasseni.E, Verykios.V.S, Elmagarmid.A.K and Bertino.E, " Hiding Association Rules by Using Confidence and Support", In Proc. of the 4th Information Hiding Workshop, pages 369– 383, Pittsburg, PA, April 2001.
[5] Saygin.Y, Verykios.V.S, and Clifton.C, "Using Unknowns to Prevent Discovery of Association Rules", SIGMOD Record, 30(4):45–54, December 2001.
[6] Oliveira.S.R.M and Zaiane.O.R, "Privacy preserving Frequent Itemset Mining", in the Proc. of the IEEE ICDM Workshop on Privacy, Security, and Data Mining, Pages 43-54, Maebashi City, Japan, December 2002.
[7] Oliveira.S.R.M and Zaiane.O.R, "An Efficient One-Scan Sanitization for Improving the Balance between Privacy and Knowledge Discovery", Technical Report TR 03-15, June 2003.
[8] Cynthia Selvi P, Mohamed Shanavas A.R, "An Effective Heuristic Approach for Hiding Sensitive Patterns in Databases", IOSR-Journal on Computer Engineering, Volume 5, Issue 1(Sep-Oct, 2012), PP 06-11, DOI. 10.9790/0661-0510611.
[9] Agrawal R, Imielinski T, and Swami.A, "Mining association rules between sets of items in large databases", Proceedings of 1993 ACM SIGMOD international conference on management of data, Washington, DC; 1993. p. 207-16.
[10] http://fimi.cs.helsinki.fi/data/
- Citation
- Abstract
- Reference
- Full PDF
Abstract: Mining knowledge from large amounts of spatial data is known as spatial data mining. It becomes a highly demanding field because huge amounts of spatial data have been collected in various applications ranging from geo-spatial data to bio-medical knowledge. The amount of spatial data being collected is increasing exponentially. So, it far exceeded human's ability to analyze. Recently, clustering has been recognized as a primary data mining method for knowledge discovery in spatial database. The development of clustering algorithms has received a lot of attention in the last few years and new clustering algorithms are proposed. DBSCAN is a pioneer density based clustering algorithm. It can find out the clusters of different shapes and sizes from the large amount of data containing noise and outliers. This paper shows the results of analyzing the properties of density based clustering characteristics of three clustering algorithms namely DBSCAN, k-means and SOM using synthetic two dimensional spatial data sets.
Keywords: Clustering, DBSCAN, K-Means, SOM, SOFM
[1] Kaufman L. and Rousseeuw P. J (1990), "Finding Groups in Data: An Introduction to Cluster Analysis", John Wiley & Sons.
[2] Ankerst M., Markus M. B., Kriegel H., Sander J(1999), "OPTICS: Ordering Points To Identify the Clustering Structure", Proc.ACM SIGMOD‟99 Int. Conf. On Management of Data, Philadelphia, PA, pp.49-60.
[3] Guha S, Rastogi R, Shim K (1998), "CURE: An efficient clustering algorithm for large databases", In: SIGMOD Conference, pp.73~84.
[4] Ester M., Kriegel H., Sander J., Xiaowei Xu (1996), "A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise", KDD‟96, Portland, OR, pp.226-231.
[5] Wang W., Yang J., Muntz R(1997), "STING: A statistical information grid approach to spatial data mining", In: Proc. of the 23rd VLDB Conf. Athens, pp.186~195.
[6] Raymond T. Ng and Jiawei Han (2002), "CLARANS: A Method for Clustering Objects for Spatial Data Mining", IEEE Transactions on Knowledge and Data Engineering, Vol. 14, No. 5.
[7] Rakesh A., Johanners G., Dimitrios G., Prabhakar R(1999), "Automatic subspace clustering of high dimensional data for data mining applications", In: Proc. of the ACM SIGMOD, pp.94~105. pp.975-982.
- Citation
- Abstract
- Reference
- Full PDF
Abstract: In medical science, the importance of the Electrocardiography is remarkable since heart diseases constitute one of the major causes of mortality in the world. Electrocardiogram (ECG) is the only way for doctors to see the cardiac actions of a particular person. It provides a graphic depiction of the electrical forces generated by the heart and then by analysing this graph doctors can tell about any abnormality present in heart. In the paper we focus on the QRS complex detection in electrocardiogram and the idea of further recognition of anomalies in QRS complexes based on some dimensional features of ECG is described. As medical information system is widely used and growing medical databases requires efficient classification method for efficient computer assisted analysis of ECG.
[1] Urszula Markowska-Kaczmar et al, "Mining of an Electrocardiogram" in XXI Autumn Meeting of Polish Information Processing Society Conference Proceedings pp. 169–175, 2005.
[2] Chia-Hung Lin et al, "Multiple Cardiac Arrhythmia Recognition Using Adaptive Wavelet Network", in proceedings of the 2005 IEEE Engineering in Medicine and Biology 27th Annual Conference Shanghai, China, September pp1-4, 2005.
[3] P. Sasikala, Dr. R.S.D. Wahidabanu, "Robust R Peak and QRS detection in Electrocardiogram Using Wavelet Transform", in International Journal of Advanced Computer Science and Applications, Vol. 1, No.6, December 2010.
[4] V.S. Chouhan et al, "Delineation of QRS-complex, P and T-wave in 12-lead ECG", in International Journal of Computer Science and Network Security, VOL.8 No.4, April 2008
[5] A. Dallali et al, " Integration of HRV, WT and Neural Networks for ECG Arrhythmias Classification", in ARPN Journal of Engineering and Applied Sciences, VOL. 6, NO. 5, May 2011.
[6] S. S. Mehta et al, "Comparative Study of QRS Detection in Single Lead and 12-Lead ECG Based on Entropy And Combined Entropy Criteria Using Support Vector Machine", in Journal of Theoretical and Applied Information Technology, 2007.
[7] Victor Dan Moga et al, "Wavelets As Methods For ECG Signal Processing", University of Medicine and Pharmacy Timisoara, Eftimie Murgu Square No. 1, Timisoara Romania
[8] C. Saritha, V. Sukanya, Y. Narasimha Murthy, " ECG Signal Analysis Using Wavelet Transforms", Anantapur, Andhrapradesh, India.
[9] K.V.L.Narayana et al, "Noise removal using adaptive noise cancelling, analysis of ECG using MATLAB", in International Journal of Engineering Science and Technology, Vol. 3 No. 4, Apr 2011.
[10] Jiapu Pan, Willis J. Tompkins, "A Real Time QRS Detection Algorithm", in IEEE Transactions on Biomedical Engineering, Vol. 3 No. 3, March 1983
