arXiv6h ago

Response-Aware User Memory Selection for LLM Personalization

Jillian Fisher, Jennifer Neville, Chan Young Park

View original ↗

Analysis

Viral velocity

low

Implementation gapYES

Novelty7/10

Categorypaper

Topics

raginference

Opportunity Brief

Implement a memory selection engine that retrieves context based on response-utility (how it changes the model output) rather than raw semantic similarity.

Suggested repo: rums-rag

"Stop retrieving based on similarity; start retrieving based on what changes the output."

Estimated effort: 40h