February 24, 2020

Data Science 1: Wisconsin Breast Cancer Database

Following on from my New Year’s resolution to explore data science, I’ve just released my first write-up. It’s based on the Wisconsin Breast Cancer Database and the classification of growths as benign or malignant - i.e. it’s the standard introduction to machine learning!

The techniques used are very basic: only histograms and a correlation matrix for the initial data analysis, and a linear SVC via sklearn for the predictive model. Still, it was a good introduction to the tooling (pandas, plotly, numpy, and sklearn) as well as to the overall workflow.
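For a rough idea of what that workflow looks like, here is a minimal sketch. It is not the code from the write-up. It assumes the copy of the Wisconsin data bundled with sklearn (`load_breast_cancer`, the "Diagnostic" variant, which may differ from the file I actually used). It also uses numpy for the histogram counts rather than plotly, and the feature scaling, split size and random seed are purely illustrative choices.

```python
import numpy as np
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import LinearSVC

# Assumption: sklearn's bundled Wisconsin Diagnostic dataset stands in
# for the data used in the write-up.
data = load_breast_cancer(as_frame=True)
X, y = data.data, data.target  # target: 0 = malignant, 1 = benign

# Initial analysis: histogram counts for one feature, plus the
# feature-to-feature correlation matrix (a pandas DataFrame).
counts, edges = np.histogram(X["mean radius"], bins=20)
corr = X.corr()

# Hold out a quarter of the rows for evaluation (illustrative split).
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# Linear SVC on standardised features; scaling is an added assumption
# that helps the solver converge.
model = make_pipeline(StandardScaler(), LinearSVC(max_iter=10000))
model.fit(X_train, y_train)

accuracy = model.score(X_test, y_test)
print(f"held-out accuracy: {accuracy:.3f}")
```

The `counts` and `corr` values can be handed to plotly (or any other plotting library) to draw the histograms and the correlation heatmap.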

You can find the initial data analysis, as well as the predictive model, on GitHub - fergusinlondon.github.io/data.

