doctaj
View original ↗Develop a privacy-preserving 'clean room' tool that allows developers to scrub PII from legacy enterprise data (Slack, email dumps) before vectorizing them for RAG. This bridges the gap between massive dark data availability and compliance-compliant AI training.
Suggested repo: scrub-ai
"Turn your toxic data dumps into compliant AI gold."
Estimated effort: 60h