In this tutorial, we are going to use the K-Nearest Neighbors (KNN) algorithm to solve a classification problem. Firstly, what exactly do we mean by classification? Classification across a variable means that results are categorised into a particular group. e.g. […]

Continue reading »# Category: scikit-learn

# Decision Trees with Python

Let’s take a look at how we can construct decision trees in Python. A decision tree is a model used to solve classification and regression tasks. As we saw in our example for R, the model allows us to generate […]

Continue reading »# Linear and Logistic Regression Modelling in Python

The statsmodels and sklearn libraries are frequently used when it comes to generating regression output. While these libraries are frequently used in regression analysis, it is often the case that a user needs to work with different libraries depending on […]

Continue reading »# K-Means Clustering: Analysing City of London Traffic

A K-Means Clustering algorithm allows us to group observations in close proximity to the mean. This allows us to create greater efficiency in categorising the data into specific segments. In this instance, K-Means is used to analyse traffic clusters across […]

Continue reading »