Build a simplified 'SGLang-lite' for edge devices, focused on small-model inference (1B-3B parameters) that prioritizes latency over throughput. This would target the on-device AI niche.
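To make the latency-over-throughput tradeoff concrete, a minimal sketch of what such a runtime's core loop might look like: one request at a time, batch size 1, greedy decoding, with per-token latency tracked instead of aggregate throughput. This is a hypothetical illustration only; `decode_latency_first` and the toy model stub are invented for this example and are not SGLang code.

```python
# Hypothetical latency-first decode loop for an edge runtime (not SGLang's API).
# Batch size 1 and greedy decoding minimize time-to-first-token and
# per-token latency, at the cost of the batched throughput a server would get.
import time

def decode_latency_first(model_step, prompt_ids, max_new_tokens=8, eos_id=-1):
    """Generate tokens one at a time for a single request (batch size 1).

    model_step: callable taking the token ids so far and returning the next
        token id -- a stand-in for a real 1B-3B model forward pass.
    """
    ids = list(prompt_ids)
    per_token_latency = []
    for _ in range(max_new_tokens):
        t0 = time.perf_counter()
        next_id = model_step(ids)          # one forward pass, no batching
        per_token_latency.append(time.perf_counter() - t0)
        if next_id == eos_id:              # stop as soon as the model is done
            break
        ids.append(next_id)
    return ids, per_token_latency

# Toy "model": emits last token + 1, signalling EOS once it reaches 5.
toy_model = lambda ids: ids[-1] + 1 if ids[-1] < 5 else -1

out, latencies = decode_latency_first(toy_model, [1])
print(out)  # [1, 2, 3, 4, 5]
```

On an edge device the metric that matters is the worst entry in `per_token_latency`, not tokens/sec across many requests, which is why this sketch never queues or batches requests.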