OpenAI’s new reasoning AI models hallucinate more

OpenAI’s recently launched o3 and o4-mini AI models are state-of-the-art in many respects. However, the new models still hallucinate, or make things up — in fact, they hallucinate more than several of OpenAI’s older models. Hallucinations have proven to be one of the biggest and most difficult problems to solve in AI, impacting even today’s…

AI isn’t very good at history, new paper finds

AI might excel at certain tasks like coding or generating a podcast. But it struggles to pass a high-level history exam, a new paper has found. A team of researchers has created a new benchmark to test three top large language models (LLMs) — OpenAI’s GPT-4, Meta’s Llama, and Google’s Gemini — on historical questions….
