Hallucinations: The impact to user experience

, and Pulse Labs

July 13, 2023 . 10:35 AM

1 min read

Hallucinations - an industry term used to describe the situation where an AI platform generates responses that are inaccurate or don't make sense but are presented as factual - is an area where platforms and the public may not see the same thing.

Only 5% of user interactions rated are seen as containing hallucinations. Most responses are rated as accurate (95%).
That said, hallucination rates more than double for “experimentation” and “creative” types of prompts:
Experimentation: to learn what the AI can do. E.g., tell me what my meal plan should be if I want to grow muscle.
Creative: to create something new or unique. E.g., write me a song about fishing in the style of Michael Jackson.
In comparing platforms:
17% of users flag GPT-4 responses as possible hallucinations around “creative” prompts.
Bard has the fewest reports of hallucinations in creative.

Implication

The gap around a platform’s ability to manage hallucinations will likely gain more attention in three areas:

1) The consumer knows the topic intimately (a plumber would more likely see a hallucination in a response about plumbing vs. aerospace).

2) In topics where higher accuracy is expected such as health, finance, and law.

3) Topics that are often targeted for misinformation campaigns and if any of that is creeping into the training data and responses.

Lastly, another factor impacting hallucination is the recency gap - or new news - as there are time-limits on training data (e.g., GPT-4 leverages training data prior to October 2021, so anything after that would not be included in their results at this time).

Comments

Latest

3 Ways to Keep UX Research in Sync with Fast-Moving Product Teams

The best product data is what you have at hand while you're sketching wireframes or tinkering with a prototype. Is "on-stage" too theatrical for someone visible on a video call? Does "off-stage" clearly mean muted and camera-off? These hyper-specific, time-sensitive questions hit mid-flow in

, and Aisling Carty

May 29, 2025

Paid Members Public

Pulse Insights: The Power of Listening. Unlocking User Insights

In today’s fast-paced app development world, relying on assumptions about what your users need can lead to missed opportunities. Instead of guessing, imagine having access to real-time, user-prioritized feedback to guide your product decisions. Guessing what users want often results in: * Roadmaps that miss the mark with users * Misspent

, and Pulse Labs

March 13, 2025

Paid Members Public

Challenges of Generative AI in Product Development

Generative AI is transforming product development—but not without challenges. It’s enabling teams to work smarter, innovate faster, and enhance user experiences, but navigating its complexities is key to long-term success. Here are five challenges product teams should be aware of when adopting AI: 1) Addressing Bias in AI

, and Pulse Labs

February 6, 2025

Paid Members Public

Opportunities for Generative AI in Product Development

Generative AI is revolutionizing the way we design, market, and refine products. It’s giving teams the tools to better understand users, create more meaningful experiences, and connect with audiences in new ways. From improving designs to boosting engagement and expanding your reach, AI is opening up exciting opportunities. Here

, and Pulse Labs

February 4, 2025

Paid Members Public

Hallucinations: The impact to user experience

Implication

Comments

Related

Latest