feed trends discover showcase archive

Feed Trends Discover Showcase Archive Dashboard

Submit Showcase

Trending now

Math + Games56 Design + Ui + Agents51 Fine Tuning + Reasoning + Inference47

View all trends →

hypedar

AI trend radar for developers. Catch emerging papers, repos, and discussions before the hype peaks.

About GitHub Discord

By the makers of hypedar

Codepawl

Open-source tools for developers.

Explore our tools →

About Privacy Terms X

© 2026 Codepawl

Built by Codepawl·© 2026

About·Terms·Privacy·Security

GitHub·Discord·X

feed trends discover showcase archive

Safety + Rl + Fine Tuning

19.0

Explore and implement AltTrain, a structural fine-tuning approach to reasoning models. This changes how models process logic to bake-in safety without sacrificing performance.

+0

emergingimplementation gap

rlreasoningfine-tuningsafetyagents

Signals (8)

Human-Guided Harm Recovery for Computer Use Agents

SaFeR-Steer: Evolving Multi-Turn MLLMs via Synthetic Bootstrapping and Feedback Dynamics

Subliminal Transfer of Unsafe Behaviors in AI Agent Distillation

Preregistered Belief Revision Contracts

Reasoning Structure Matters for Safety Alignment of Reasoning Models

Harmonizing Multi-Objective LLM Unlearning via Unified Domain Representation and Bidirectional Logit Distillation

Shifting the Gradient: Understanding How Defensive Training Methods Protect Language Model Integrity

ARES: Adaptive Red-Teaming and End-to-End Repair of Policy-Reward System