Pattern Recognition in Breast Cancer Using Machine Learning
Name: M.SARAVANAKUMAR
AUTHOR Name: M.SARAVANAKUMAR MSC(CS)
Designation of 1st Author: THIRUTHANGAL-626130
SUPERVISOR NAME :Dr.S.KANNAN MCA.,PhD
Name of Department of : COMPUTER SCIENCE
Name of organization: SCHOOL OF INFORMATION TECHNOLOGY
City: SIVAKASI
Country: INDIA
ABSTRACT
Breast Cancer is the most often identified cancer among women and major reason for increasing mortality rate among women. As the diagnosis of this disease manually takes long hours and the lesser availability of systems, there is a need to develop the automatic diagnosis system for early detection of cancer. Data mining techniques contribute a lot in the development of such system. For the classification of benign and malignant tumor we have used classification techniques of machine learning in which the machine is learned from the past data and can predict the category of new input. This paper is a relative study on the implementation of models using Logistic Regression, Support Vector Machine (SVM) and Latent Dirichlet Allocation (LDA) is done on the dataset taken from the UCI repository. With respect to the results of accuracy, precision, sensitivity, specificity and False Positive Rate the efficiency of each algorithm is measured and compared. Breast Cancer is the most leading malignancy affecting 2.1 million women each year which leads to greatest number of deaths among women. Early treatment not only helps to cure cancer but also helps in prevention of its recurrence. And hence this system mainly focuses on prediction of breast cancer where it uses different machine learning algorithms for creating models like decision tree, logistic regression, random forest which are applied on pre-processed data which suspects greater accuracy for prediction. Amongst all the models, Random Forest Classification leads to best accuracy with 98.6% techniques