Study Evaluates Machine Learning Models for Health Insurance Risk Classification

Researchers tested three ensemble algorithms on a dataset of 59,381 insurance applicants to measure accuracy, fairness, and interpretability in underwriting decisions. The analysis compared performance across binary, three-class, and eight-class risk settings and examined disparities by age and body mass index.

1 source · May 18, 8:00 PM (22 days ago) · 1m read

A peer-reviewed study published on 19 May 2026 examined machine learning methods for classifying insurance applicants into risk categories. The work focused on balancing predictive accuracy with fairness and explainability in health insurance underwriting.

Three ensemble models were tested on a benchmark dataset of 59,381 applicants. Researchers applied Random Forest, XGBoost, and LightGBM across binary, three-class, and eight-class risk classification tasks.

In the binary setting, XGBoost recorded the highest test accuracy (0.831) and the highest Matthews Correlation Coefficient (0.624). Performance declined as the number of risk classes increased. Feature selection used the Boruta algorithm to reduce the input space. Body Mass Index and applicant age together accounted for more than 40 percent of total model importance.
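For readers unfamiliar with the two headline metrics, the following is a minimal sketch of how accuracy and the Matthews Correlation Coefficient are computed from a binary confusion matrix. The labels are toy values invented for illustration, not data from the study, and the helper names are hypothetical.

```python
import math

def confusion_counts(y_true, y_pred):
    """Return (tp, tn, fp, fn) for binary labels coded as 0/1."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return tp, tn, fp, fn

def accuracy(tp, tn, fp, fn):
    """Share of all predictions that are correct."""
    return (tp + tn) / (tp + tn + fp + fn)

def mcc(tp, tn, fp, fn):
    """Matthews Correlation Coefficient; 0.0 when the denominator is zero."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return 0.0 if denom == 0 else (tp * tn - fp * fn) / denom

# Toy labels (illustrative only, not from the study).
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
y_pred = [1, 1, 0, 0, 0, 1, 0, 1]

counts = confusion_counts(y_true, y_pred)
print("accuracy:", accuracy(*counts))
print("MCC:", mcc(*counts))
```

Unlike accuracy, MCC uses all four cells of the confusion matrix, which makes it more informative when the risk classes are imbalanced.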

Fairness metrics showed mild differences across age groups and larger differences across BMI categories. Statistical Parity Difference and Equal Opportunity Difference were used to quantify these disparities. Bootstrap resampling over 1,000 iterations and threshold sensitivity tests from 0.1 to 0.9 indicated stable performance.
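The two fairness metrics named above can be sketched as simple rate differences between two groups. This is an illustrative example under stated assumptions: the group labels, the toy predictions, and the sign convention (first group minus second group) are all hypothetical and are not taken from the study.

```python
def select(values, groups, group):
    """Keep only the values belonging to one group."""
    return [v for v, g in zip(values, groups) if g == group]

def positive_rate(y_pred):
    """Share of cases predicted positive (label 1)."""
    return sum(y_pred) / len(y_pred)

def true_positive_rate(y_true, y_pred):
    """Share of actual positives that were predicted positive."""
    preds_for_positives = [p for t, p in zip(y_true, y_pred) if t == 1]
    return sum(preds_for_positives) / len(preds_for_positives)

def statistical_parity_difference(y_pred, groups, group_a, group_b):
    """Difference in positive prediction rates: group_a minus group_b."""
    return (positive_rate(select(y_pred, groups, group_a))
            - positive_rate(select(y_pred, groups, group_b)))

def equal_opportunity_difference(y_true, y_pred, groups, group_a, group_b):
    """Difference in true positive rates: group_a minus group_b."""
    tpr_a = true_positive_rate(select(y_true, groups, group_a),
                               select(y_pred, groups, group_a))
    tpr_b = true_positive_rate(select(y_true, groups, group_b),
                               select(y_pred, groups, group_b))
    return tpr_a - tpr_b

# Toy data with two hypothetical BMI bands (not from the study).
groups = ["band_a"] * 4 + ["band_b"] * 4
y_true = [1, 1, 0, 0, 1, 1, 0, 0]
y_pred = [1, 1, 1, 0, 1, 0, 0, 0]

spd = statistical_parity_difference(y_pred, groups, "band_a", "band_b")
eod = equal_opportunity_difference(y_true, y_pred, groups, "band_a", "band_b")
print("SPD:", spd)
print("EOD:", eod)
```

A value of zero on either metric means the two groups are treated identically on that measure; larger absolute values indicate larger disparities.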

Ranking Generalisation Assessment confirmed consistent model behavior under sampling variations. The study provides a framework that combines accuracy, interpretability via SHAP values, fairness audits, and robustness checks for potential use in insurance underwriting.
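The bootstrap robustness check reported in the study (1,000 resamples) can be illustrated with a short sketch. It assumes a percentile interval on accuracy and uses toy labels; the study's actual resampling procedure, metric, and interval method are not described in the source, so every detail here is an assumption.

```python
import random

def accuracy(y_true, y_pred):
    """Share of predictions that match the true labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

def bootstrap_accuracy(y_true, y_pred, n_iterations=1000, seed=0):
    """Resample cases with replacement and return a 95% percentile interval."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    n = len(y_true)
    scores = []
    for _ in range(n_iterations):
        idx = [rng.randrange(n) for _ in range(n)]
        scores.append(accuracy([y_true[i] for i in idx],
                               [y_pred[i] for i in idx]))
    scores.sort()
    # Integer arithmetic avoids floating-point rounding in the index.
    lower = scores[n_iterations * 25 // 1000]
    upper = scores[n_iterations * 975 // 1000 - 1]
    return lower, upper, scores

# Toy labels (illustrative only, not from the study).
y_true = [1, 0, 1, 1, 0, 0, 1, 0, 1, 1, 0, 0, 1, 0, 1, 0, 1, 1, 0, 0]
y_pred = [1, 0, 1, 0, 0, 0, 1, 1, 1, 1, 0, 0, 1, 0, 0, 0, 1, 1, 0, 1]

lower, upper, scores = bootstrap_accuracy(y_true, y_pred)
print("point estimate:", accuracy(y_true, y_pred))
print("95% interval:", (lower, upper))
```

A narrow interval across resamples is the kind of evidence that would support a claim of stable performance.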

ai health-insurance risk-classification algorithmic-fairness
