Jillian Fisher, Jennifer Neville, Chan Young Park
Implement a memory selection engine that retrieves context based on response utility (how much it changes the model output) rather than raw semantic similarity.
Suggested repo: rums-rag
"Stop retrieving based on similarity; start retrieving based on what changes the output."
Estimated effort: 40h
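
A minimal sketch of the selection idea, not the authors' implementation. It assumes the model can be wrapped as a function that returns a probability distribution over candidate answers, and it scores each memory by the total variation distance between the response with that memory and the response with no memory. The names `select_by_utility`, `total_variation`, and `toy_respond` are hypothetical, and `toy_respond` is a hard-coded stand-in for a real model call.

```python
from typing import Callable, Dict, List, Optional, Sequence, Tuple

Distribution = Dict[str, float]
# Assumed model interface: (query, optional memory) -> distribution over answers.
RespondFn = Callable[[str, Optional[str]], Distribution]


def total_variation(p: Distribution, q: Distribution) -> float:
    """Total variation distance between two discrete distributions."""
    keys = set(p) | set(q)
    return 0.5 * sum(abs(p.get(k, 0.0) - q.get(k, 0.0)) for k in keys)


def select_by_utility(
    query: str,
    memories: Sequence[str],
    respond: RespondFn,
    k: int = 1,
) -> List[Tuple[float, str]]:
    """Rank memories by how much each one shifts the model's response."""
    baseline = respond(query, None)  # response with no memory in context
    scored = [
        (total_variation(respond(query, memory), baseline), memory)
        for memory in memories
    ]
    # Sort on the score only, highest utility first.
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:k]


def toy_respond(query: str, memory: Optional[str]) -> Distribution:
    """Hard-coded stand-in for a model call (assumption, for illustration)."""
    if memory is not None and "vegetarian" in memory:
        return {"tofu curry": 0.9, "steak": 0.1}
    return {"tofu curry": 0.5, "steak": 0.5}


memories = [
    "User asked about dinner recipes last week.",  # topically similar, no effect
    "User is vegetarian.",                         # less similar, shifts the answer
]
top = select_by_utility("What should I cook for dinner?", memories, toy_respond, k=1)
print(top[0][1])  # the memory that changes the output the most
```

In this toy setup the first memory overlaps more with the query's wording but leaves the response unchanged, so it scores zero, while the second memory shifts the distribution and is selected. A real engine would replace `toy_respond` with actual model calls and would likely need a cheaper proxy for the utility score, since this sketch calls the model once per candidate memory.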