feed trends discover showcase archive

Feed Trends Discover Showcase Archive Dashboard

Submit Showcase

Trending now

Llm + Rl + Training66 Agents + Workflow56 Math + Games56

View all trends →

hypedar

AI trend radar for developers. Catch emerging papers, repos, and discussions before the hype peaks.

About GitHub Discord

By the makers of hypedar

Codepawl

Open-source tools for developers.

Explore our tools →

About Privacy Terms X

© 2026 Codepawl

Built by Codepawl·© 2026

About·Terms·Privacy·Security

GitHub·Discord·X

feed trends discover showcase archive

Training + Rl + Inference

24.0

Build a library for 'plasticity-preserving' RL that maintains an ensemble of policies to prevent catastrophic forgetting. This is vital for developers working on long-running, continuous agent environments.

+0

emergingimplementation gap

rltraininginference

Signals (11)

Compressed-Sensing-Guided, Inference-Aware Structured Reduction for Large Language Models

StoSignSGD: Unbiased Structural Stochasticity Fixes SignSGD for Training Large Language Models

Target-Oriented Pretraining Data Selection via Neuron-Activated Graph

GroupDPO: Memory efficient Group-wise Direct Preference Optimization

Improving Reasoning Capabilities in Small Models through Mixture-of-Layers Distillation with Stepwise Attention on Key Information

Discover and Prove: An Open-source Agentic Framework for Hard Mode Automated Theorem Proving in Lean 4

Optimizing Stochastic Gradient Push under Broadcast Communications

MOONSHOT : A Framework for Multi-Objective Pruning of Vision and Large Language Models

DALM: A Domain-Algebraic Language Model via Three-Phase Structured Generation

Beyond Single-Model Optimization: Preserving Plasticity in Continual Reinforcement Learning

Lightweight Geometric Adaptation for Training Physics-Informed Neural Networks