Example 3.5 #86

crossxwill · 2020-01-24T08:08:59Z

The example below was supposed to show that K=5 is better than K=1. However, by the passage's own numbers, the test error for K=1 ($0.9K) is much lower than for K=5 ($46.4K).

The test set sample’s sale price is $176K and the neighbor’s prices, from closest to farthest, are: $175K, $128K, $100K, $120K, $125K. Using K = 1, the model would miss the true house price by $0.9K. This illustrates the concept of overfitting introduced in Section 1.2.1; the model is too aggressively using patterns in the training set to make predictions on new data points. For this model, increasing the number of neighbors might help alleviate the issue. Averaging all K = 5 points to make a prediction substantially cuts the error to $-46.4K.
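A quick way to check the arithmetic in the quoted passage is the minimal sketch below. It uses the rounded prices exactly as quoted; the variable and function names are illustrative, not from the book. With rounded inputs the K = 1 miss comes out as $1.0K rather than the $0.9K in the text, presumably because the book used unrounded prices.

```python
# Prices in thousands of dollars, rounded as quoted in the passage.
true_price = 176.0
neighbor_prices = [175.0, 128.0, 100.0, 120.0, 125.0]  # closest to farthest


def knn_error(k):
    """Return prediction minus true price when averaging the k closest neighbors."""
    prediction = sum(neighbor_prices[:k]) / k
    return prediction - true_price


for k in (1, 5):
    print(f"K = {k}: error = {knn_error(k):+.1f}K")
```

With these inputs, the K = 5 error is far larger in magnitude than the K = 1 error, so for this particular test point averaging more neighbors makes the prediction worse, not better.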
