Research Article

Hierarchical Clustering Algorithm - A Comparative Study

by  N.Rajalingam, K.Ranjini
journal cover
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 19 - Issue 3
Published: April 2011
Authors: N.Rajalingam, K.Ranjini
10.5120/2340-3052
PDF

N.Rajalingam, K.Ranjini . Hierarchical Clustering Algorithm - A Comparative Study. International Journal of Computer Applications. 19, 3 (April 2011), 42-46. DOI=10.5120/2340-3052

                        @article{ 10.5120/2340-3052,
                        author  = { N.Rajalingam,K.Ranjini },
                        title   = { Hierarchical Clustering Algorithm - A Comparative Study },
                        journal = { International Journal of Computer Applications },
                        year    = { 2011 },
                        volume  = { 19 },
                        number  = { 3 },
                        pages   = { 42-46 },
                        doi     = { 10.5120/2340-3052 },
                        publisher = { Foundation of Computer Science (FCS), NY, USA }
                        }
                        %0 Journal Article
                        %D 2011
                        %A N.Rajalingam
                        %A K.Ranjini
                        %T Hierarchical Clustering Algorithm - A Comparative Study%T 
                        %J International Journal of Computer Applications
                        %V 19
                        %N 3
                        %P 42-46
                        %R 10.5120/2340-3052
                        %I Foundation of Computer Science (FCS), NY, USA
Abstract

Clustering is a data mining (machine learning) technique used to place data elements into related groups without advance knowledge on the group definitions. In this paper the authors provides an in depth explanation of implementation of agglomerative and divisive clustering algorithms for various types of attributes. Database - the details of the victims of Tsunami in Thailand during the year 2004, was taken as the test data. The algorithms are implemented using Visual programming and the formation of the clusters and running time needed of the algorithms using different linkages (agglomerative) to different types of data are taken for analysis.

References
  • Sung Young Jung, and Taek-Soo Kim, “An Agglomerative Hierarchical Clustering Using Partial Maximum Array and Incremental Similarity Computation Method”, Proceedings of the 2001 IEEE International Conference on Data Mining, p.265-272, November 29-December 02, 2001
  • R.J. Gil-Garcia; J.M. Badia-Contelles, “A General Framework for Agglomerative Hierarchical Clustering Algorithms A Pons-Porrata Pattern Recognition, 2006. ICPR 2006. 18th International Conference on Volume 2, 2006 Page(s):569 – 572
  • K.P.Soman, Shyam Diwakar, and V.Ajay, “Insight into Data Mining- Theory and Practice”, Eastern Economy Edition, Prentice Hall of India Pvt. Ltd, New Delhi, 2006
  • “Measuring Association d12 Between Clusters 1 and 2” in http://www.stat.psu.edu/online/courses/stat505/18_cluster/05_cluster_between.html
  • Margaret H.Dunham “Data Mining Introductory and Advance Topics”, Low price Edition – Pearson Education, Delhi, 2003.
  • “Euclidean Distance” in http://people.revoledu.com /kardi/tutorial/Similarity/EuclideanDistance.html
  • “Cluster analysis” in http://en.wikipedia.org/ wiki/Cluster_ analysis
  • “Levenshtein_Distance” in http://en.wikipedia.org/wiki/ Levenshtein_Distance
  • “Similarity Metrics” in http://www.dcs.shef.ac.uk/~sam/ stringmetrics.html#hamming
  • “Levenshtein_Distance” in http://www.dcs.shef.ac.uk/ ~sam/ stringmetrics.html#Levenshtein
  • “Tsunami victim list“ http://www.ems.narenthorn. thaigov. net/tsunami_e/tsunamilist.php
  • “Euclidean distance” in http://en.wikipedia.org/wiki/ Euclidean_distance#One-dimensional_distance
  • “Distance” in http://en.wikipedia.org/wiki/Distance# Mathematics
  • “Hierarchical Clustering Algorithms” in http://home.dei. polimi.it/matteucc/Clustering/tutorial_html/hierarchical.html
  • Hui-Chuan Lin (2009)“Survey and Implementation of Clustering Algorithms” an Unpublished master's thesis for master's degree, Hsinchu, Taiwan, Republic of China
  • Jiawei Han and Micheline Kamber, “Data Mining Concepts and Techniques”, Second Edition- Morgan Kaufmann Publishers, San Francisco, 2006.
  • Evangelos Petroutsos, “Mastering Visual Basic 6”, BPP publications, New Delhi.
  • Yu Zhong , Anil K. Jain , M.-P. Dubuisson-Jolly, “Object Tracking Using Deformable Templates”, IEEE Transactions on Pattern Analysis and Machine Intelligence, v.22 n.5, p.544-549, May 2000.
  • Gary Cornell, “Visual Basic 6 from the Groung Up”, Tata McGraw Hill, New Delhi, 2003.
Index Terms
Computer Science
Information Sciences
No index terms available.
Keywords

Agglomerative Divisive Clustering Tsunami Database Data mining

Powered by PhDFocusTM