Build a lightweight 'internal memory' retrieval system for clinical LLMs that avoids expensive external vector DB lookups. Focus on high-speed, relevant context retrieval for real-time medical decision-making.