Research Article

Predicting Primary Tumors using Multiclass Classifier Approach of Data Mining

by  Mehak Naib, Amit Chhabra
journal cover
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 96 - Issue 8
Published: June 2014
Authors: Mehak Naib, Amit Chhabra
10.5120/16813-6559
PDF

Mehak Naib, Amit Chhabra . Predicting Primary Tumors using Multiclass Classifier Approach of Data Mining. International Journal of Computer Applications. 96, 8 (June 2014), 9-13. DOI=10.5120/16813-6559

                        @article{ 10.5120/16813-6559,
                        author  = { Mehak Naib,Amit Chhabra },
                        title   = { Predicting Primary Tumors using Multiclass Classifier Approach of Data Mining },
                        journal = { International Journal of Computer Applications },
                        year    = { 2014 },
                        volume  = { 96 },
                        number  = { 8 },
                        pages   = { 9-13 },
                        doi     = { 10.5120/16813-6559 },
                        publisher = { Foundation of Computer Science (FCS), NY, USA }
                        }
                        %0 Journal Article
                        %D 2014
                        %A Mehak Naib
                        %A Amit Chhabra
                        %T Predicting Primary Tumors using Multiclass Classifier Approach of Data Mining%T 
                        %J International Journal of Computer Applications
                        %V 96
                        %N 8
                        %P 9-13
                        %R 10.5120/16813-6559
                        %I Foundation of Computer Science (FCS), NY, USA
Abstract

Data mining has been widely adopted in recent years in many fields, especially in the medical field. This paper highlights the prediction of unknown primary tumors in the dataset. The multiclass classifier with Random forest is used for classification of multiclass dataset as it gives much higher accuracy than binary classifiers. SMOTE method for this imbalanced dataset with Randomize technique is applied during preprocessing for reducing the biasness among classes. These all evaluations and results are carried out using WEKA 3. 6. 10 as a data mining tool.

References
  • Ba-Alwi FM. and Hintaya M. 2013. "Comparative Study for Analysis the Prognostic in Hepatitis Data: Data Mining Approach", International Journal of Scientific & Engineering Research, vol. 4, Issue 8, ISSN 2229-5518.
  • Bhuvaneswari N. and Yamuna S. 2013. "Information extraction of predicting blood cancer", International Journal of Computer Science, Vol. 1, Issue 4.
  • Damtew A. 2011. "Designing a predictive model for heart disease detection using data mining techniques", A Thesis Submitted to the School of Graduate Studies of Addis Ababa University in Partial Fulfillment of the Requirements for the Degree of Master of Science in Health Informatics.
  • Dataset collected, [http://tunedit. org/repo/UCI] accessed 2013.
  • Khaleel M. , Pradham S. and Dash G. N. 2013. "A Survey of Data Mining Techniques on Medical Data for Finding Locally Frequent Diseases", International Journal of Advanced Research in Computer Science and Software Engineering, Vol. 3, Issue 8, ISSN: 2277 128X.
  • Khan M. , Qamar S. and Massin L. 2012. " A Prototype of Cancer/Heart Disease Prediction Model Using Data Mining", International Journal of Applied Engineering Research, Vol. 7, No. 11, ISSN 0973-4562.
  • Lavanya D. and Rani K. 2012. "Ensemble decision tree classifier for breast cancer data", International Journal of Information Technology Convergence and Services (IJITCS), Vol. 2, No. 1.
  • Lokanayaki K. and Malathi A. 2013. , "Exploring on Various Prediction Model in Data Mining Techniques for Disease Diagnosis", International Journal of Computer Applications, Vol. 77, No. 5.
  • Primary Tumor, [http://www. wisegeek. com] accessed 23 march 2014.
  • Radaideh Q. , Shawakfa E. and Najjar M. 2006, "Mining Student Data Using Decision Trees" The International Arab Journal of Information Technology.
  • Ramamohan Y. , Vasantharao K. , Chakravarti C. and Ratnam A. S. K. 2012. "A Study of Data Mining Tools in Knowledge Discovery Process", International Journal of Soft Computing and Engineering (IJSCE), Vol. 2, Issue-3, ISSN: 2231-2307.
  • Vijiyarani S. and Sudha S. 2013. "Disease Prediction in Data Mining Technique – A Survey", International Journal of Computer Applications & Information Technology, Vol. II, Issue I, ISSN: 2278-7720.
Index Terms
Computer Science
Information Sciences
No index terms available.
Keywords

SMOTE WEKA Primary tumor Multiclass classifier Random forest

Powered by PhDFocusTM