Advanced Insurance Risk Modeling for Pseudo-New Customers Using Balanced Ensembles and Transfomer Architectures

dc.contributor.affiliationDepartamento de Estadística e Investigación Operativa Aplicadas y Calidad
dc.contributor.affiliationCentro de Investigación en Gestión e Ingeniería de Producción
dc.contributor.affiliationEscuela Politécnica Superior de Alcoy
dc.contributor.authorSolly, Finn L.es_ES
dc.contributor.authorSoriano-González, Raquel
dc.contributor.authorJuan, Angel A.
dc.contributor.authorGuerrero, Antonies_ES
dc.contributor.funderGENERALITAT VALENCIANAes_ES
dc.contributor.funderAGENCIA ESTATAL DE INVESTIGACIONes_ES
dc.contributor.funderAgencia Estatal de Investigaciónes_ES
dc.date.accessioned2026-06-18T07:00:11Z
dc.date.available2026-06-18T07:00:11Z
dc.date.issued2026-04-17es_ES
dc.description.abstract[EN] In insurance portfolios, classifying customers without a prior history at a given company is particularly challenging due to the absence of historical behavior, extreme class imbalance, heavy-tailed loss distributions, and strict operational constraints. Traditional machine learning approaches, including the baseline methodology proposed in previous studies, typically optimize global predictive accuracy and therefore fail to capture business-critical outcomes, especially the identification of high-risk clients. This study extends the existing approach by evaluating two complementary business-aware classification strategies: (i) a balanced bagging ensemble specifically designed to handle class imbalance and maximize expected profit under explicit customer-omission constraints, and (ii) a lightweight Transformer-based architecture capable of learning richer feature representations. Both approaches incorporate the asymmetric financial cost structure of insurance and operate under operational selection limits. The empirical analysis is conducted on a proprietary large-scale auto insurance dataset comprising 51,618 customers and is complemented by validation on nine synthetic datasets to assess robustness. Model performance is evaluated using statistical tests (ANOVA, Friedman, and pair-wise comparisons) together with business-oriented metrics. The results show that both proposed approaches consistently outperform the baseline methodology (p < 0.001) in terms of profit, with the ensemble offering a better balance of performance and efficiency, while the Transformer shows stronger robustness and generalization under data perturbations. The balanced ensemble provides the most favourable trade-off between predictive performance, robustness, interpretability, and computational efficiency, making it suitable for deployment in regulated insurance environments, while the Transformer achieves competitive results and exhibits stronger generalization under data perturbations. The proposed approach aligns machine learning with actuarial portfolio optimization by explicitly integrating profit-driven objectives and operational constraints, offering two practical and scalable solutions for risk-based decision-making in real-world insurance settings.es_ES
dc.description.accrualMethodSes_ES
dc.description.bibliographicCitationSolly, FL.; Soriano-González, Raquel; Juan, Angel A.; Guerrero, A. (2026). Advanced Insurance Risk Modeling for Pseudo-New Customers Using Balanced Ensembles and Transfomer Architectures. Risks. 14(4). https://doi.org/10.3390/risks14040091es_ES
dc.description.issue4es_ES
dc.description.sponsorshipThis work has been partially supported by the Spanish Ministry of Science, Innovation and Universities/AEI (PID2022-138860NB-I00, DIN2024-013395, AIA2025-163553-C44) and the Generalitat Valenciana (2024 CIAICO 117).es_ES
dc.description.volume14es_ES
dc.identifier.doi10.3390/risks14040091es_ES
dc.identifier.eissn2227-9091es_ES
dc.identifier.urihttps://riunet.upv.es/handle/10251/236401
dc.languageIngléses_ES
dc.publisherMDPIes_ES
dc.relation.ispartofRiskses_ES
dc.relation.pasarelaS\580348es_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2022-138860NB-I00/ES/INTELIGENCIA ARTIFICIAL E INTERNET DE LAS COSAS PARA OPTIMIZAR EL CONSUMO ENERGETICO EN EL TRANSPORTE CON VEHICULOS ELECTRICOS/es_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/GVA//CIAICO%2F2024%2F117//ICSO Meta: Hybridizing Heuristic Algorithms with Machine Learning and Simulation for Energy-Efficient Transportation and Mobility/es_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI//AIA-163553-C44//Marco Integrado de Predicción para la Movilidad Urbana y Gobernanza/es_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI//DIN2024-013395/es_ES
dc.relation.publisherversionhttps://doi.org/10.3390/risks14040091es_ES
dc.rightsReconocimiento (by)es_ES
dc.rights.accessRightsAbiertoes_ES
dc.subjectInsurance risk modelinges_ES
dc.subjectCustomer classificationes_ES
dc.subjectHigh-risk customer detectiones_ES
dc.subjectBalanced ensembleses_ES
dc.subjectTransformer modelses_ES
dc.subject.ods09.- Desarrollar infraestructuras resilientes, promover la industrialización inclusiva y sostenible, y fomentar la innovaciónes_ES
dc.titleAdvanced Insurance Risk Modeling for Pseudo-New Customers Using Balanced Ensembles and Transfomer Architectureses_ES
dc.typeArtículoes_ES
dc.type.versioninfo:eu-repo/semantics/publishedVersiones_ES
dspace.entity.typePublication
person.identifier685069
person.identifier490349
person.identifier.orcid0000-0002-1337-9561
person.identifier.orcid0000-0003-1392-1776
relation.isAuthorOfPublication5cbb2128-f347-4615-b110-5eb7538827c7
relation.isAuthorOfPublication55e15b2b-1d12-4a12-b048-e805538d51e1
relation.isAuthorOfPublication.latestForDiscovery5cbb2128-f347-4615-b110-5eb7538827c7
relation.isOrgUnitOfPublication73ebfca7-bf81-404f-861a-703ddec70645
relation.isOrgUnitOfPublication556fb9e2-3fb3-44c3-97e9-26873f979909
relation.isOrgUnitOfPublication96a57980-3fe4-46f5-8a98-7c3aac322fc5
relation.isOrgUnitOfPublication.latestForDiscovery73ebfca7-bf81-404f-861a-703ddec70645
upv.uuidd0a0cc06-7763-4463-a452-3bd9fa997116es_ES

Archivos

Bloque original

Mostrando 1 - 1 de 1
Cargando...
Miniatura
Nombre:
SollySoriano-GonzalezJuan - Advanced Insurance Risk Modeling for Pseudo-New Customers Using Balan....pdf
Tamaño:
912.23 KB
Formato:
Adobe Portable Document Format
Descripción:
Versión editorial