Performance Metrics

ROC Curve: Plots TPR (Recall) vs. FPR (1 - Specificity) at different thresholds.
AUC (Area Under Curve): Measures how well the model separates classes.
- AUC = 1 → perfect model.
- AUC = 0.5 → random guessing.

Log Loss (Cross-Entropy Loss)#

Measures how well predicted probabilities match actual labels.

\[ \text{Log Loss} = -\frac{1}{N} \sum_{i=1}^N \big[ y_i \log(p_i) + (1-y_i) \log(1-p_i) \big] \]

Lower log loss = better model.

Summary:

Accuracy → overall correctness (works best for balanced data).
Precision → good when FP is costly.
Recall → good when FN is costly.
F1 / F-beta → balance precision and recall.
ROC-AUC → good for comparing models.
Log Loss → probability-based evaluation.

import numpy as np
import matplotlib.pyplot as plt

# Define sigmoid function
def sigmoid(z):
    return 1 / (1 + np.exp(-z))

# Define MSE cost for binary classification
def mse_cost(y, y_hat):
    return 0.5 * (y_hat - y)**2

# Define log loss cost
def log_loss(y, y_hat):
    eps = 1e-10  # avoid log(0)
    return -(y*np.log(y_hat + eps) + (1-y)*np.log(1 - y_hat + eps))

# Generate predictions from sigmoid
z = np.linspace(-10, 10, 200)
y_hat = sigmoid(z)

# Compute costs for y=1 and y=0
mse_y1 = mse_cost(1, y_hat)
mse_y0 = mse_cost(0, y_hat)
log_y1 = log_loss(1, y_hat)
log_y0 = log_loss(0, y_hat)

# Plotting
plt.figure(figsize=(12, 6))

# For y=1
plt.subplot(1, 2, 1)
plt.plot(y_hat, mse_y1, label="MSE", color="blue")
plt.plot(y_hat, log_y1, label="Log Loss", color="red")
plt.title("Cost when y = 1")
plt.xlabel("Predicted Probability (ŷ)")
plt.ylabel("Cost")
plt.legend()
plt.grid(True)

# For y=0
plt.subplot(1, 2, 2)
plt.plot(y_hat, mse_y0, label="MSE", color="blue")
plt.plot(y_hat, log_y0, label="Log Loss", color="red")
plt.title("Cost when y = 0")
plt.xlabel("Predicted Probability (ŷ)")
plt.ylabel("Cost")
plt.legend()
plt.grid(True)

plt.tight_layout()
plt.show()

../../../_images/16e6232994453118c10f1073352cb987a4712ed2867c346e0ed36cf9256f12a1.png

import numpy as np
import matplotlib.pyplot as plt
from sklearn.metrics import roc_curve, auc

# Simulated probabilities and true labels
y_true = np.array([1, 0, 1, 0, 1, 0, 1, 0, 1, 0])
y_scores_a = np.array([0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3, 0.2, 0.1, 0.05])
y_scores_b = np.array([0.7, 0.6, 0.5, 0.4, 0.3, 0.2, 0.1, 0.05, 0.02, 0.01])

# Compute ROC curves
fpr_a, tpr_a, _ = roc_curve(y_true, y_scores_a)
fpr_b, tpr_b, _ = roc_curve(y_true, y_scores_b)

# Compute AUC
auc_a = auc(fpr_a, tpr_a)
auc_b = auc(fpr_b, tpr_b)

# Plot
plt.figure(figsize=(8, 6))
plt.plot(fpr_a, tpr_a, color='blue', lw=2, label=f'Classifier A (AUC = {auc_a:.2f})')
plt.plot(fpr_b, tpr_b, color='red', lw=2, label=f'Classifier B (AUC = {auc_b:.2f})')
plt.plot([0, 1], [0, 1], color='gray', lw=1, linestyle='--', label='Random')
plt.xlabel('False Positive Rate (FPR)')
plt.ylabel('True Positive Rate (TPR)')
plt.title('ROC Curve')
plt.legend(loc='lower right')
plt.grid(True)
plt.show()

../../../_images/dd06a6d8e2d641c448196f4702ebfe832a4b37ed12135e6abcd438c0b60c2de0.png

Performance Metrics

Contents