RAG Fundamentals
Combine retrieval with generation.
Architecture
Retriever: find relevant docs. Generator: generate response. Query: combine context.
Implementation
Chunk documents. Embed with vectors. Store in vector DB. Retrieve top-k.
Tools
LangChain: framework. LlamaIndex: data indexing. Chroma, Pinecone: vector stores.
Key Takeaways
- Retrieve context for LLM
- Chunk + embed + store
- LangChain, LlamaIndex help