AI Risk · Performance

Poor Document Retrieval Accuracy

Failures in the retrieval component of retrieval-augmented systems (e.g. RAG) can lead to inaccurate, irrelevant, outdated, or conflicting documents being surfaced.

📋 Description

In Retrieval-Augmented Generation (RAG) systems, the retrieval phase is critical as it determines which documents or chunks are passed to the language model to ground its responses. When this retrieval step performs poorly, the result is a model that may produce hallucinations, contradictory claims, or entirely irrelevant answers.

Common failure modes include:

- Conflicting Source Documents: When several retrieved passages contain opposing information, the model may merge or alternate between them, causing confusion or factual inaccuracy.
- Poor Update Latency: If the vector database isn't refreshed in a timely manner, recent documents (e.g., updated policies, terms, or prices) may not be included, leading to outdated responses despite recent ground-truth changes.
- Suboptimal Similarity Metrics: Relying on one retrieval strategy (e.g., BM25 or cosine similarity) can underperform in specific query types. Hybrid systems that blend semantic and keyword search tend to perform better across edge cases.
- Incompatible Embedding Spaces: Mixing embeddings generated by different models (e.g., OpenAI vs. Cohere) without alignment can degrade retrieval performance due to distance metric mismatch.

🔍 Public Examples and Common Patterns

- Incident 639: Customer Overcharged Due to Air Canada Chatbot's False Discount Claims - Air Canada was ordered to pay over $600 in damages for providing inaccurate bereavement discount information via its chatbot, leading to a customer overpaying for flights. The tribunal ruled the airline responsible for the chatbot's misinformation.

🛡️ Recommended Mitigations

📐 External Framework Mapping

- MITRE ATLAS: AML.T0071 - False RAG Entry Injection

📚 References

- Improving Embedding Accuracy for Document Retrieval Using Entity Relationship Maps and Model-Aware Contrastive Sampling

Cite this page

Trustible. "Poor Document Retrieval Accuracy." Trustible AI Governance Insights Center, 2026. https://trustible.ai/ai-risks/poor-document-retrieval-accuracy/

← All AI Risks Insights Center

Manage AI Risk with Trustible

Trustible's AI governance platform helps enterprises identify, assess, and mitigate AI risks like this one at scale.

Explore the Platform

Platform

Features

By Framework

By Industry