Advanced Insurance Risk Modeling for Pseudo-New Customers Using Balanced Ensembles and Transfomer Architectures
| dc.contributor.affiliation | Departamento de Estadística e Investigación Operativa Aplicadas y Calidad | |
| dc.contributor.affiliation | Centro de Investigación en Gestión e Ingeniería de Producción | |
| dc.contributor.affiliation | Escuela Politécnica Superior de Alcoy | |
| dc.contributor.author | Solly, Finn L. | es_ES |
| dc.contributor.author | Soriano-González, Raquel | |
| dc.contributor.author | Juan, Angel A. | |
| dc.contributor.author | Guerrero, Antoni | es_ES |
| dc.contributor.funder | GENERALITAT VALENCIANA | es_ES |
| dc.contributor.funder | AGENCIA ESTATAL DE INVESTIGACION | es_ES |
| dc.contributor.funder | Agencia Estatal de Investigación | es_ES |
| dc.date.accessioned | 2026-06-18T07:00:11Z | |
| dc.date.available | 2026-06-18T07:00:11Z | |
| dc.date.issued | 2026-04-17 | es_ES |
| dc.description.abstract | [EN] In insurance portfolios, classifying customers without a prior history at a given company is particularly challenging due to the absence of historical behavior, extreme class imbalance, heavy-tailed loss distributions, and strict operational constraints. Traditional machine learning approaches, including the baseline methodology proposed in previous studies, typically optimize global predictive accuracy and therefore fail to capture business-critical outcomes, especially the identification of high-risk clients. This study extends the existing approach by evaluating two complementary business-aware classification strategies: (i) a balanced bagging ensemble specifically designed to handle class imbalance and maximize expected profit under explicit customer-omission constraints, and (ii) a lightweight Transformer-based architecture capable of learning richer feature representations. Both approaches incorporate the asymmetric financial cost structure of insurance and operate under operational selection limits. The empirical analysis is conducted on a proprietary large-scale auto insurance dataset comprising 51,618 customers and is complemented by validation on nine synthetic datasets to assess robustness. Model performance is evaluated using statistical tests (ANOVA, Friedman, and pair-wise comparisons) together with business-oriented metrics. The results show that both proposed approaches consistently outperform the baseline methodology (p < 0.001) in terms of profit, with the ensemble offering a better balance of performance and efficiency, while the Transformer shows stronger robustness and generalization under data perturbations. The balanced ensemble provides the most favourable trade-off between predictive performance, robustness, interpretability, and computational efficiency, making it suitable for deployment in regulated insurance environments, while the Transformer achieves competitive results and exhibits stronger generalization under data perturbations. The proposed approach aligns machine learning with actuarial portfolio optimization by explicitly integrating profit-driven objectives and operational constraints, offering two practical and scalable solutions for risk-based decision-making in real-world insurance settings. | es_ES |
| dc.description.accrualMethod | S | es_ES |
| dc.description.bibliographicCitation | Solly, FL.; Soriano-González, Raquel; Juan, Angel A.; Guerrero, A. (2026). Advanced Insurance Risk Modeling for Pseudo-New Customers Using Balanced Ensembles and Transfomer Architectures. Risks. 14(4). https://doi.org/10.3390/risks14040091 | es_ES |
| dc.description.issue | 4 | es_ES |
| dc.description.sponsorship | This work has been partially supported by the Spanish Ministry of Science, Innovation and Universities/AEI (PID2022-138860NB-I00, DIN2024-013395, AIA2025-163553-C44) and the Generalitat Valenciana (2024 CIAICO 117). | es_ES |
| dc.description.volume | 14 | es_ES |
| dc.identifier.doi | 10.3390/risks14040091 | es_ES |
| dc.identifier.eissn | 2227-9091 | es_ES |
| dc.identifier.uri | https://riunet.upv.es/handle/10251/236401 | |
| dc.language | Inglés | es_ES |
| dc.publisher | MDPI | es_ES |
| dc.relation.ispartof | Risks | es_ES |
| dc.relation.pasarela | S\580348 | es_ES |
| dc.relation.projectID | info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2022-138860NB-I00/ES/INTELIGENCIA ARTIFICIAL E INTERNET DE LAS COSAS PARA OPTIMIZAR EL CONSUMO ENERGETICO EN EL TRANSPORTE CON VEHICULOS ELECTRICOS/ | es_ES |
| dc.relation.projectID | info:eu-repo/grantAgreement/GVA//CIAICO%2F2024%2F117//ICSO Meta: Hybridizing Heuristic Algorithms with Machine Learning and Simulation for Energy-Efficient Transportation and Mobility/ | es_ES |
| dc.relation.projectID | info:eu-repo/grantAgreement/AEI//AIA-163553-C44//Marco Integrado de Predicción para la Movilidad Urbana y Gobernanza/ | es_ES |
| dc.relation.projectID | info:eu-repo/grantAgreement/AEI//DIN2024-013395/ | es_ES |
| dc.relation.publisherversion | https://doi.org/10.3390/risks14040091 | es_ES |
| dc.rights | Reconocimiento (by) | es_ES |
| dc.rights.accessRights | Abierto | es_ES |
| dc.subject | Insurance risk modeling | es_ES |
| dc.subject | Customer classification | es_ES |
| dc.subject | High-risk customer detection | es_ES |
| dc.subject | Balanced ensembles | es_ES |
| dc.subject | Transformer models | es_ES |
| dc.subject.ods | 09.- Desarrollar infraestructuras resilientes, promover la industrialización inclusiva y sostenible, y fomentar la innovación | es_ES |
| dc.title | Advanced Insurance Risk Modeling for Pseudo-New Customers Using Balanced Ensembles and Transfomer Architectures | es_ES |
| dc.type | Artículo | es_ES |
| dc.type.version | info:eu-repo/semantics/publishedVersion | es_ES |
| dspace.entity.type | Publication | |
| person.identifier | 685069 | |
| person.identifier | 490349 | |
| person.identifier.orcid | 0000-0002-1337-9561 | |
| person.identifier.orcid | 0000-0003-1392-1776 | |
| relation.isAuthorOfPublication | 5cbb2128-f347-4615-b110-5eb7538827c7 | |
| relation.isAuthorOfPublication | 55e15b2b-1d12-4a12-b048-e805538d51e1 | |
| relation.isAuthorOfPublication.latestForDiscovery | 5cbb2128-f347-4615-b110-5eb7538827c7 | |
| relation.isOrgUnitOfPublication | 73ebfca7-bf81-404f-861a-703ddec70645 | |
| relation.isOrgUnitOfPublication | 556fb9e2-3fb3-44c3-97e9-26873f979909 | |
| relation.isOrgUnitOfPublication | 96a57980-3fe4-46f5-8a98-7c3aac322fc5 | |
| relation.isOrgUnitOfPublication.latestForDiscovery | 73ebfca7-bf81-404f-861a-703ddec70645 | |
| upv.uuid | d0a0cc06-7763-4463-a452-3bd9fa997116 | es_ES |
Archivos
Bloque original
1 - 1 de 1
Cargando...
- Nombre:
- SollySoriano-GonzalezJuan - Advanced Insurance Risk Modeling for Pseudo-New Customers Using Balan....pdf
- Tamaño:
- 912.23 KB
- Formato:
- Adobe Portable Document Format
- Descripción:
- Versión editorial