Malignant Melanoma Classification with Deep Learning

Detta är en Master-uppsats från KTH/Medicinteknik och hälsosystem

Författare: Jakob Kisselgof; [2019]

Nyckelord: Melanoma classification deep learning;

Sammanfattning: Malignant melanoma is the deadliest form of skin cancer. If correctly diagnosed in time, the expected five-year survival rate can increase up to 97 %. Therefore, exploring various methods for early detection can contribute with tools which can be used to improve detection of disease and finally to make sure that help is given in time. The purpose of this work was to investigate the performance and behavior of different convolutional neural network (CNN) architectures and to explore whether presegmenting clinical images would improve the prediction results on a binary classifier system. For the purposes of this paper, the two selected CNNs were Inception v3 and DenseNet201. The networks were pretrained on ImageNet and transfer learning techniques such as feature extraction and fine-tuning were used to extract the features of the training set. Batch size was varied and five-fold cross-validation was applied during training to find the optimal number of epochs for training. Evaluation was done on the ISIC test set, the PH2 dataset and a combined set of images from Karolinska University Hospital and FirstDerm, where the latter was also cropped to evaluate presegmentation. The achieved results for the ISIC test set were AUCs of 0.66 for Inception v3 and 0.71 for DenseNet201. For the PH2 test set, the AUCs were 0.82 and 0.73. The results for the Karolinska and FirstDerm set were 0.49 and 0.42. Presegmenting the latter test set resulted in AUCs of 0.58 and 0.51. In conclusion, quality of images could have a big impact on the classification performance. Batch size seems to affect the performance and could thus be an important hyperparameter to tune. Ultimately, the Inception v3 architecture seems to be less affected by different variability why selecting this architecture for a real-world clinical image application could be more suitable. However, the networks performed much worse than state of the art results in previous papers and the conclusions are based on rather inconclusive results. Therefore more research has to be done to verify the conclusions.

  HÄR KAN DU HÄMTA UPPSATSEN I FULLTEXT. (följ länken till nästa sida)