Response versus gradient boosting trees, GLMs and neural networks under Tweedie loss and log-link

Hainaut, Donatien; Trufin, Julien; Denuit, Michel

doi:10.1080/03461238.2022.2037016

Reference: Scandinavian Actuarial Journal, 2022, 10, pages 841-866
Publication: Published, 2022-11-25

Peer-reviewed article

Abstract:

Thanks to its outstanding performance, boosting has rapidly gained wide acceptance among actuaries. To speed up calculations, boosting is often applied to gradients of the loss function rather than to responses (hence the name gradient boosting). When the model is trained by minimizing Poisson deviance, this amounts to applying the least-squares principle to raw residuals. This exposes gradient boosting to the same problems that led to the replacement of least squares by Poisson Generalized Linear Models (GLMs) for analyzing low counts (typically, the number of reported claims at policy level in personal lines). This paper shows that boosting can be conducted directly on the response under the Tweedie loss function and log-link, by adapting the weights at each step. Numerical illustrations demonstrate similar or better performance compared to gradient boosting when trees are used as weak learners, with a higher level of transparency since responses are used instead of gradients.
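The two Poisson statements in the abstract can be checked numerically. The sketch below is an illustration, not the authors' implementation. The data, the scores and the weak-learner output `h` are made up. Reading the reweighting identity in step (2) as the "adapted weights" is an assumption, since the abstract does not state the weights explicitly.

```python
import math

def unit_poisson_deviance(y, mu):
    """Poisson unit deviance, with y*log(y/mu) taken as 0 when y == 0."""
    log_term = y * math.log(y / mu) if y > 0 else 0.0
    return 2.0 * (log_term - (y - mu))

# Illustrative data (made up for this sketch): low claim counts and current scores.
y = [0, 0, 1, 0, 2, 0, 0, 1]
score = [-2.1, -1.8, -1.5, -2.4, -1.2, -2.0, -1.9, -1.6]  # log-link scores
mu = [math.exp(s) for s in score]                          # current means
h = [0.10, -0.05, 0.20, 0.00, 0.15, -0.10, 0.05, 0.12]     # hypothetical weak-learner output

# (1) Under the log-link, the negative gradient of the deviance with respect
#     to the score is 2 * (y - mu), i.e. proportional to the raw residual.
#     Checked here by central finite differences.
eps = 1e-6
neg_grad = [
    -(unit_poisson_deviance(yi, math.exp(si + eps))
      - unit_poisson_deviance(yi, math.exp(si - eps))) / (2 * eps)
    for yi, si in zip(y, score)
]
raw_residual = [yi - mi for yi, mi in zip(y, mu)]

# (2) Updating the score by h multiplies the mean by exp(h). The deviance of
#     y against mu * exp(h) equals the mu-weighted deviance of the ratio
#     y / mu against exp(h), so a learner can be fitted to responses with
#     weights that change at each step (assumed reading of the abstract).
direct = sum(
    unit_poisson_deviance(yi, mi * math.exp(hi))
    for yi, mi, hi in zip(y, mu, h)
)
reweighted = sum(
    mi * unit_poisson_deviance(yi / mi, math.exp(hi))
    for yi, mi, hi in zip(y, mu, h)
)

print(max(abs(g - 2 * r) for g, r in zip(neg_grad, raw_residual)))
print(abs(direct - reweighted))
```

Both printed differences should be at the level of numerical noise. The identity in step (2) follows from the Poisson deviance being homogeneous of degree one in the pair (response, mean).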
