LLM Hallucination Benchmark: R1, o1, o3-mini, Gemini 2.0 Flash Think Exp 01-21 (github.com/lechmazur)
17 points | zone411 | a year ago