Skip to content
Sign In Subscribe

Hallucinations: The impact to user experience

Photo by Aedrian / Unsplash

Hallucinations - an industry term used to describe the situation where an AI platform generates responses that are inaccurate or don't make sense but are presented as factual - is an area where platforms and the public may not see the same thing.

  • Only 5% of user interactions rated are seen as containing hallucinations. Most responses are rated as accurate (95%).
    That said, hallucination rates more than double for “experimentation” and “creative” types of prompts:
  • Experimentation: to learn what the AI can do. E.g., tell me what my meal plan should be if I want to grow muscle.
  • Creative: to create something new or unique. E.g., write me a song about fishing in the style of Michael Jackson.
  • In comparing platforms:
  • 17% of users flag GPT-4 responses as possible hallucinations around “creative” prompts.
  • Bard has the fewest reports of hallucinations in creative.

Implication

The gap around a platform’s ability to manage hallucinations will likely gain more attention in three areas:

1) The consumer knows the topic intimately (a plumber would more likely see a hallucination in a response about plumbing vs. aerospace).

2) In topics where higher accuracy is expected such as health, finance, and law.

3) Topics that are often targeted for misinformation campaigns and if any of that is creeping into the training data and responses.

Lastly, another factor impacting hallucination is the recency gap - or new news - as there are time-limits on training data (e.g., GPT-4 leverages training data prior to October 2021, so anything after that would not be included in their results at this time).

Comments

Latest

3 Ways to Keep UX Research in Sync with Fast-Moving Product Teams
UX

3 Ways to Keep UX Research in Sync with Fast-Moving Product Teams

The best product data is what you have at hand while you're sketching wireframes or tinkering with a prototype. Is "on-stage" too theatrical for someone visible on a video call? Does "off-stage" clearly mean muted and camera-off? These hyper-specific, time-sensitive questions hit mid-flow in

Members Public
Pulse Insights: The Power of Listening. Unlocking User Insights

Pulse Insights: The Power of Listening. Unlocking User Insights

In today’s fast-paced app development world, relying on assumptions about what your users need can lead to missed opportunities. Instead of guessing, imagine having access to real-time, user-prioritized feedback to guide your product decisions.  Guessing what users want often results in: * Roadmaps that miss the mark with users * Misspent

Members Public
Challenges of Generative AI in Product Development

Challenges of Generative AI in Product Development

Generative AI is transforming product development—but not without challenges. It’s enabling teams to work smarter, innovate faster, and enhance user experiences, but navigating its complexities is key to long-term success. Here are five challenges product teams should be aware of when adopting AI: 1) Addressing Bias in AI

Members Public
Opportunities for Generative AI in Product Development

Opportunities for Generative AI in Product Development

Generative AI is revolutionizing the way we design, market, and refine products. It’s giving teams the tools to better understand users, create more meaningful experiences, and connect with audiences in new ways. From improving designs to boosting engagement and expanding your reach, AI is opening up exciting opportunities.  Here

Members Public