DTeval

A Collective Intelligence Project

Demographics Findings Experiments

Experiments

Track A/B experiments comparing evaluation conditions: context formats, reasoning modes, and eval types.

Loading experiments...

A Collective Intelligence Project

View App on GitHub|View Eval Blueprints on GitHub|

Experiments | DTEF