Following on my New Years Resolution to explore Data Science, I’ve just released my first write-up. It’s based around the Wisconsin Breast Cancer Database, and the classification of benign and malignant growths - i.e it’s the standard introduction to machine learning!

The techniques used are very basic - using only histograms and a correlation matrix for the initial data analysis, and Linear SVC via `sklearn`

for the predictive model. It was a good introduction to the tooling - `pandas`

, `plotly`

, `numpy`

, and `sklearn`

- though, as well as the overall workflow.

You can find the initial data analysis, as well as the predictive model, on Github - fergusinlondon.github.io/data.