Huzzle Labs
Launching Aug 2026 Benchmark for the insurance industry

InsureBench

InsureBench is a benchmark that measures how language models perform on insurance work. It spans three task families — underwriting, claims & coverage, and actuarial analysis — built from document-grounded cases that each resolve to a single verifiable answer. Models are evaluated pass@1 and scored against the recorded outcome, not the wording of the response.

An AI benchmark for insurance

InsureBench is an insurance AI benchmark. It measures how language models handle the document-grounded work the industry runs on: reading policies and supporting files, applying the terms, and producing a decision or a number that can be checked against a recorded outcome. The cases are drawn from real insurance work rather than synthetic exam questions.

The three task families

Underwriting

Models assess risk from application materials and supporting documents, decide whether to offer cover, and set terms such as limits, exclusions, and pricing inputs.

Claims & coverage

Models read the policy and the claim file, determine whether a loss is covered, identify the controlling clauses, and calculate the amount payable.

Actuarial

Models work through reserving, pricing, and exposure calculations, applying the relevant tables and assumptions to reach a numeric result.

How models are scored

Every case in the benchmark resolves to a single verifiable answer: a decision, a covered or not-covered determination, or a number. Models run pass@1, one attempt per case with no retries. Scores reflect the recorded outcome, not the style or fluency of the response.

A GDPval-style benchmark for insurance

InsureBench follows the approach of GDPval: evaluating models on real, economically valuable work instead of abstract puzzles. Where GDPval spans many occupations, InsureBench is a GDPval for insurance — an insurance benchmark built around the specific tasks underwriters, claims handlers, and actuaries carry out.

Leaderboard opening 2026. Built by Huzzle Labs.
Get in touch about InsureBench

Backed by 10X Founders·Angel Invest·Emerge·a16z Scout Fund·Thomas Wolf Hugging Face·Bernd Heinemann Allianz·Yaser Khalighi Stanford