Technology writer Lance Whitney tested six popular generative AI tools by asking each the same series of trick questions and found that every one hallucinated at least once. The AIs tested were ChatGPT (GPT-5.2), Google Gemini (Gemini 3 Flash), Microsoft Copilot (GPT-5), Claude AI (Claude 3.5 Sonnet), Meta