/u/Lines25
View original ↗Small-scale researchers frequently struggle with batching logic on consumer hardware. Creating a simplified 'Auto-Batcher' library that optimizes gradient accumulation for sub-8GB VRAM cards would be a huge community win.
Suggested repo: nano-batch
"Train bigger models on your tiny VRAM."
Estimated effort: 30h