#004 Top 10 LLM+RAG Anti-Patterns
I find anti-patterns to be the best way to QUICKLY understand a NEW topic.
LLM + RAG Anti-Patterns with Business Problems & Solutions
1. Stuffing the LLM with Too Many Documents
Problem: Retrieval pulls 50+ long passages → context window bloats → model hallucinates or truncates.
Business Impact: Customer support chatbot gives irrelevant or cut-off answers → high support costs.
Solution:
Measure: Track token usage vs. retrieval relevance score.
Fix: Use Max Marginal Relevance (MMR) or reranking to keep the top 3–5 most relevant docs.
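A minimal MMR sketch in plain numpy, assuming you already have the query embedding and the candidate document embeddings in memory; the lambda_mult weight that trades relevance against redundancy is an illustrative default, not a standard value.

```python
import numpy as np

def mmr(query_vec, doc_vecs, k=5, lambda_mult=0.7):
    """Max Marginal Relevance: pick k docs that balance query relevance
    against redundancy with the docs already selected."""
    def cos(a, b):
        return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-9))

    query_sims = [cos(query_vec, d) for d in doc_vecs]
    selected, candidates = [], list(range(len(doc_vecs)))
    while candidates and len(selected) < k:
        best_idx, best_score = None, -np.inf
        for i in candidates:
            # penalize docs that look like something we already kept
            redundancy = max((cos(doc_vecs[i], doc_vecs[j]) for j in selected), default=0.0)
            score = lambda_mult * query_sims[i] - (1 - lambda_mult) * redundancy
            if score > best_score:
                best_idx, best_score = i, score
        selected.append(best_idx)
        candidates.remove(best_idx)
    return selected  # indices of the docs to keep in the prompt
```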
2. Embedding Everything Without Normalization
Problem: Raw text with boilerplate, disclaimers, stopwords gets embedded.
Business Impact: Search recalls irrelevant "legal" or "footer" text instead of meaningful business content.
Solution:
Measure: Audit recall quality with top-k evaluation.
Fix: Clean & normalize text (remove boilerplates, dedupe, split semantically).
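A rough cleaning-and-dedup sketch; the boilerplate patterns are hypothetical placeholders to be replaced with whatever disclaimers and footers actually recur in your own corpus.

```python
import hashlib
import re

# Hypothetical boilerplate markers; swap in patterns mined from your own documents.
BOILERPLATE_PATTERNS = [
    r"(?im)^confidential.*$",
    r"(?im)^all rights reserved.*$",
    r"(?im)^unsubscribe.*$",
]

def clean_text(text: str) -> str:
    """Strip boilerplate lines and collapse whitespace before embedding."""
    for pattern in BOILERPLATE_PATTERNS:
        text = re.sub(pattern, "", text)
    return re.sub(r"\s+", " ", text).strip()

def dedupe(chunks):
    """Drop exact-duplicate chunks before they reach the embedding model."""
    seen, unique = set(), []
    for chunk in chunks:
        digest = hashlib.md5(chunk.lower().encode("utf-8")).hexdigest()
        if digest not in seen:
            seen.add(digest)
            unique.append(chunk)
    return unique
```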
3. Over-Reliance on Default Vector Similarity
Problem: Using only cosine similarity, ignoring domain semantics (e.g., treating "premium" and "subscription" as near-synonyms when the domain distinguishes them).
Business Impact: Insurance RAG system retrieves wrong policy clauses → regulatory compliance risk.
Solution:
Measure: Evaluate retrieval F1 with domain-specific benchmarks.
Fix: Fine-tune embeddings or hybrid retrieval (BM25 + dense).
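One way to sketch the hybrid side, assuming the rank_bm25 package for the lexical signal and precomputed dense embeddings; the alpha weight and min-max normalization are illustrative choices, not a standard recipe.

```python
import numpy as np
from rank_bm25 import BM25Okapi  # pip install rank-bm25

def hybrid_scores(query, query_vec, docs, doc_vecs, alpha=0.5):
    """Blend lexical (BM25) and dense (cosine) retrieval scores.
    alpha=1.0 -> pure dense, alpha=0.0 -> pure BM25."""
    bm25 = BM25Okapi([d.lower().split() for d in docs])
    lexical = np.array(bm25.get_scores(query.lower().split()))
    dense = doc_vecs @ query_vec / (
        np.linalg.norm(doc_vecs, axis=1) * np.linalg.norm(query_vec) + 1e-9
    )
    # min-max normalize each signal so the weighted sum is comparable
    def norm(x):
        return (x - x.min()) / (x.max() - x.min() + 1e-9)
    return alpha * norm(dense) + (1 - alpha) * norm(lexical)
```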
4. Ignoring Recency in Data
Problem: Index not refreshed frequently → outdated retrieval.
Business Impact: Financial chatbot uses last year's rates → customers misled, legal exposure.
Solution:
Measure: Track % of queries answered with stale docs.
Fix: Incremental indexing, metadata filters (date-aware retrieval).
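A sketch of date-aware filtering, assuming each retrieved chunk carries an updated_at ISO timestamp in its metadata and that a 90-day freshness window is acceptable (both assumptions to tune per domain); it also yields the staleness rate used in the Measure step above.

```python
from datetime import datetime, timedelta

MAX_AGE = timedelta(days=90)  # assumption: anything older than a quarter is "stale"

def filter_fresh(candidates, now=None):
    """Split retrieved chunks into fresh vs. stale by their source date.
    Each candidate is assumed to carry metadata['updated_at'] as an ISO date."""
    now = now or datetime.utcnow()
    fresh, stale = [], []
    for c in candidates:
        updated = datetime.fromisoformat(c["metadata"]["updated_at"])
        (fresh if now - updated <= MAX_AGE else stale).append(c)
    staleness_rate = len(stale) / max(len(candidates), 1)  # feeds the "% stale" metric
    return fresh, staleness_rate
```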
5. Chunking Without Overlap or Semantics
Problem: Arbitrary splitting (e.g., every 512 tokens) → broken meaning across chunks.
Business Impact: Medical assistant misses critical context → wrong treatment recommendations.
Solution:
Measure: Evaluate recall on multi-chunk queries.
Fix: Semantic chunking with overlaps & document structure awareness.
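A simple paragraph-aware chunker with overlap, as one possible sketch; the max_words and overlap_words values are placeholders, and production pipelines usually count tokens rather than words.

```python
def chunk_with_overlap(text, max_words=200, overlap_words=40):
    """Split on paragraph boundaries first, then pack paragraphs into chunks
    of roughly max_words, carrying a trailing overlap into the next chunk."""
    paragraphs = [p.strip() for p in text.split("\n\n") if p.strip()]
    chunks, current = [], []
    for para in paragraphs:
        current.extend(para.split())
        if len(current) >= max_words:
            chunks.append(" ".join(current))
            current = current[-overlap_words:]  # overlap preserves cross-chunk context
    if current:
        chunks.append(" ".join(current))
    return chunks
```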
6. No Grounding in Retrieved Sources
Problem: LLM answers but doesn't cite sources.
Business Impact: Legal research tool delivers hallucinated case law → damages trust & liability.
Solution:
Measure: Track % answers with cited references.
Fix: Chain-of-thought prompting with "include citation spans" or structured output with metadata.
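A rough sketch of grounding via numbered sources plus a citation-coverage check; the prompt wording and the [n] citation convention are assumptions, not a fixed standard.

```python
import re

def build_grounded_prompt(question, passages):
    """Number each retrieved passage and instruct the model to cite by [n]."""
    sources = "\n".join(f"[{i + 1}] {p}" for i, p in enumerate(passages))
    return (
        "Answer the question using ONLY the sources below. "
        "Cite every claim with its source number in square brackets.\n\n"
        f"Sources:\n{sources}\n\nQuestion: {question}\nAnswer:"
    )

def citation_coverage(answer, n_sources):
    """Share of answer sentences carrying at least one valid [n] citation —
    the '% answers with cited references' metric from the Measure step."""
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", answer) if s.strip()]
    cited = sum(
        1 for s in sentences
        if any(int(m) <= n_sources for m in re.findall(r"\[(\d+)\]", s))
    )
    return cited / max(len(sentences), 1)
```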
7. One-Size-Fits-All Prompting
Problem: Same prompt template for FAQs, financial reports, and contracts.
Business Impact: Poor precision in specialized queries (e.g., compliance rules).
Solution:
Measure: Track answer accuracy across task types.
Fix: Context-specific prompt templates (FAQ mode vs. compliance mode).
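A toy prompt router as a sketch: a keyword heuristic picks between an FAQ and a compliance template. In practice the routing step might be a small classifier, and the template wording here is illustrative.

```python
# Hypothetical templates; the field names {context} and {question} are placeholders.
TEMPLATES = {
    "faq": "Answer briefly and in plain language.\nContext:\n{context}\nQ: {question}\nA:",
    "compliance": (
        "You are answering a compliance question. Quote the exact clause, "
        "name the policy document, and do not paraphrase obligations.\n"
        "Context:\n{context}\nQuestion: {question}\nAnswer:"
    ),
}

def pick_template(question: str) -> str:
    """Crude router: keyword match stands in for a real query classifier."""
    keywords = ("regulation", "policy", "clause", "compliance", "audit")
    mode = "compliance" if any(k in question.lower() for k in keywords) else "faq"
    return TEMPLATES[mode]
```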
8. Ignoring Query Understanding
Problem: Treat user query as raw text → retrieval ignores intent (e.g., "cheapest plan" vs. "most affordable long-term").
Business Impact: Sales chatbot suggests wrong product bundle → revenue loss.
Solution:
Measure: Compare retrieval precision with/without query rewriting.
Fix: Add query rephrasing step (LLM reformulates for retrieval).
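A sketch of a query-rewriting step, assuming a generic llm callable (prompt in, completion out) and a retriever returning objects with an id attribute; both interfaces are hypothetical stand-ins for whatever your stack uses.

```python
REWRITE_PROMPT = (
    "Rewrite the user query as a precise search query for a product catalogue. "
    "Make implicit constraints explicit (budget, time horizon, plan type).\n"
    "User query: {query}\nSearch query:"
)

def rewrite_query(query, llm):
    """'llm' is assumed to be any callable mapping a prompt to a completion string."""
    return llm(REWRITE_PROMPT.format(query=query)).strip()

def retrieve_with_rewrite(query, llm, retriever, k=5):
    """Retrieve with both the original and rewritten query, then union the hits."""
    rewritten = rewrite_query(query, llm)
    hits = {doc.id: doc for doc in retriever(query, k) + retriever(rewritten, k)}
    return list(hits.values())[:k]
```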
9. Lack of Evaluation Pipeline
Problem: No systematic way to measure hallucinations, grounding, latency.
Business Impact: System deployed → business learns about problems only via angry customers.
Solution:
Measure: Track precision@k, factual consistency, and latency with a RAG eval harness.
Fix: Automated regression testing + business KPI dashboards.
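A minimal eval-harness sketch: precision@k plus a regression gate, assuming a labelled eval set and a retriever that returns (doc_id, score) pairs; the 0.6 threshold is a placeholder to be agreed with the business.

```python
def precision_at_k(retrieved_ids, relevant_ids, k=5):
    """Fraction of the top-k retrieved doc ids that are labelled relevant."""
    top = retrieved_ids[:k]
    return sum(1 for doc_id in top if doc_id in relevant_ids) / max(len(top), 1)

def run_regression(eval_set, retriever, k=5, threshold=0.6):
    """eval_set: list of {'query': str, 'relevant_ids': set} built from labelled data.
    Fails loudly if average precision@k drops below the agreed threshold."""
    scores = []
    for case in eval_set:
        retrieved = [doc_id for doc_id, _ in retriever(case["query"], k)]
        scores.append(precision_at_k(retrieved, case["relevant_ids"], k))
    avg = sum(scores) / max(len(scores), 1)
    assert avg >= threshold, f"retrieval regression: precision@{k}={avg:.2f} < {threshold}"
    return avg
```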
10. Overlooking Latency & Cost
Problem: Every query hits embedding store + long LLM call.
Business Impact: High infra bills + slow user experience → customer churn.
Solution:
Measure: Track cost per query & response latency.
Fix: Cache frequent queries, use lightweight rerankers before LLM, and tiered infra (fast embeddings + deep retrieval only when needed).
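A sketch of caching plus tiered retrieval, assuming hypothetical cheap_retriever, reranker, and llm callables; the confidence threshold that decides when to escalate to the deep tier is an assumption to calibrate on your own traffic.

```python
import hashlib

def _key(query: str) -> str:
    """Normalize the query so trivially different phrasings hit the same cache entry."""
    return hashlib.sha256(query.strip().lower().encode("utf-8")).hexdigest()

class CachedRAG:
    """Answer from cache when possible; fall back to cheap retrieval, and only
    run the expensive reranker + LLM path when the cheap score is inconclusive."""
    def __init__(self, cheap_retriever, reranker, llm, confidence=0.8):
        self.cache = {}
        self.cheap_retriever = cheap_retriever
        self.reranker = reranker
        self.llm = llm
        self.confidence = confidence

    def answer(self, query: str) -> str:
        k = _key(query)
        if k in self.cache:                        # tier 0: exact-match cache
            return self.cache[k]
        docs, score = self.cheap_retriever(query)  # tier 1: fast embeddings
        if score < self.confidence:
            docs = self.reranker(query, docs)      # tier 2: deep retrieval only when needed
        result = self.llm(query, docs)
        self.cache[k] = result
        return result
```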

