It would be useful to add benchmarks with failed `F` and fast evaluations for testing.
It would be useful to add benchmarks with failed
Fand fast evaluations for testing.