petergpt/

bullshit-benchmark

BullshitBench measures whether AI models challenge nonsensical prompts instead of confidently answering them, created by Peter Gostev.

HTML

Stars

606

+65 today+154 /wk+158 /mo

Forks

Issues

Watchers

606

Star History

CreatedFeb 24, 2026

Last push21h ago

Date	Stars	Forks	Issues
Mar 4, 2026	606	30	7
Mar 3, 2026	464	26	7
Mar 2, 2026	409	21	7

petergpt/bullshit-benchmark — Star Growth & Stats | GitHubVC by AttentionVC