• Gemini 3 Flash often invents answers instead of admitting when it doesn’t know something
  • The problem arises with factual or high‑stakes questions
  • But it still tests as the most accurate and capable AI model

Gemini 3 Flash is fast and clever. But if you ask it something it doesn’t actually know, something obscure or tricky or just outside its training, it will almost always try to bluff its way through, according to a recent evaluation from the independent testing group Artificial Analysis.

It seems Gemini 3 Flash hit 91% on the “hallucination rate” portion of the AA-Omniscience benchmark. That means when it didn’t have the answer, it still gave one anyway almost all of the time, and that answer was entirely fictional.
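
To make that number concrete, here is a rough sketch of how a rate like this could be computed. It assumes the metric is simply the share of questions the model could not answer correctly where it gave a wrong answer instead of declining; the benchmark's exact scoring rules may differ, and the function name and the counts below are illustrative, not taken from the benchmark.

```python
def hallucination_rate(wrong_answers: int, abstentions: int) -> float:
    """Share of not-correctly-answered questions where the model answered anyway.

    Assumed definition for illustration only: wrong / (wrong + abstained).
    """
    missed = wrong_answers + abstentions
    if missed == 0:
        # Nothing was missed, so there was nothing to hallucinate about.
        return 0.0
    return wrong_answers / missed


# Made-up counts: out of 100 questions the model did not know,
# it guessed wrongly on 91 and admitted ignorance on 9.
rate = hallucination_rate(wrong_answers=91, abstentions=9)
print(f"{rate:.0%}")
```

Under this reading, a model that declined every question it could not answer would score 0%, regardless of how many questions it got right.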



