Decision Tree Regressor#
A Decision Tree Regressor is the regression version of decision trees.
Instead of predicting a class label (like “Iris-setosa” or “Iris-versicolor”), it predicts a continuous value (like house price, temperature, sales).
The dataset is recursively split based on features, but instead of maximizing classification purity (Gini/Entropy), we minimize the variance (or mean squared error) in the target values.
How it Works (Step by Step)#
Start at the root node (whole dataset).
At each split:
Choose the feature and threshold that minimize a cost function.
Common cost functions for regression:
Mean Squared Error (MSE)
\[ MSE = \frac{1}{n}\sum_{i=1}^n (y_i - \hat{y})^2 \]
Mean Absolute Error (MAE)
\[ MAE = \frac{1}{n}\sum_{i=1}^n |y_i - \hat{y}| \]
Here, \(\hat{y}\) is the mean (for MSE) or median (for MAE) of the target values in that node.
Split recursively until a stopping criterion is met (max depth, minimum samples per leaf, etc.).
Prediction: For a new sample, traverse the tree and return the mean value of the leaf node it falls into.
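The split search in steps 1–3 can be sketched as a brute-force scan over features and candidate thresholds, picking the split with the lowest total squared error. This is a minimal illustrative sketch (the names `mse_cost` and `best_split` are hypothetical, not from any library):

```python
import numpy as np

def mse_cost(y):
    # Cost of a node: sum of squared deviations from the node mean.
    return np.sum((y - y.mean()) ** 2) if len(y) else 0.0

def best_split(X, y):
    # Exhaustively try every feature and every midpoint between
    # consecutive unique values as a threshold; keep the split whose
    # two children have the lowest combined squared error.
    best = (None, None, np.inf)  # (feature, threshold, cost)
    for f in range(X.shape[1]):
        values = np.unique(X[:, f])
        for t in (values[:-1] + values[1:]) / 2:  # candidate thresholds
            left, right = y[X[:, f] <= t], y[X[:, f] > t]
            cost = mse_cost(left) + mse_cost(right)
            if cost < best[2]:
                best = (f, t, cost)
    return best

# Toy data: the target jumps when the single feature exceeds ~5,
# so the best split should land between 3 and 6.
X = np.array([[1.0], [2.0], [3.0], [6.0], [7.0], [8.0]])
y = np.array([1.0, 1.2, 0.9, 5.0, 5.1, 4.9])
feature, threshold, cost = best_split(X, y)
print(feature, threshold)  # → 0 4.5
```

A real implementation applies this search recursively to each child node until a stopping criterion is reached, then stores the mean of each leaf as its prediction.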
Example Use Cases#
Predicting house prices from size, location, number of bedrooms.
Forecasting stock prices (though single trees are prone to overfitting on noisy financial data).
Estimating energy consumption from temperature & household data.
Advantages#
Simple and interpretable.
Captures non-linear relationships.
Handles both numerical and categorical features.
Disadvantages#
Prone to overfitting (deep trees fit noise).
Piecewise constant predictions (not smooth).
Sensitive to small changes in the training data (high variance).
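The overfitting risk above is easy to demonstrate: an unconstrained tree memorizes training noise, while limiting `max_depth` forces a smoother fit. A minimal sketch, assuming scikit-learn is installed (the noisy sine data here is made up for illustration):

```python
import numpy as np
from sklearn.tree import DecisionTreeRegressor

# Noisy sine curve: a deep tree memorizes the noise, a shallow one smooths it.
rng = np.random.RandomState(0)
X = np.sort(rng.uniform(0, 5, 80))[:, None]
y = np.sin(X).ravel() + rng.normal(0, 0.2, 80)

deep = DecisionTreeRegressor(random_state=0).fit(X, y)            # unlimited depth
shallow = DecisionTreeRegressor(max_depth=3, random_state=0).fit(X, y)

print(deep.score(X, y))     # ~1.0: fits the training noise exactly
print(shallow.score(X, y))  # lower training R^2, but generalizes better
```

In practice, tuning `max_depth` or `min_samples_leaf` via cross-validation, or switching to an ensemble (random forests, gradient boosting), is the usual remedy.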