Cs-Lg
Displacement-Resistant Extensions of DPO with Nonconvex $f$-Divergences
On the Convergence of Multicalibration Gradient Boosting
Soft Forward-Backward Representations for Zero-shot Reinforcement Learning with General Utilities
DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos
SuReNav: Superpixel Graph-based Constraint Relaxation for Navigation in Over-constrained Environments
Fair Transit Stop Placement: A Clustering Perspective and Beyond
Method for noise-induced regularization in quantum neural networks
Simmering: Sufficient is better than optimal for training neural networks
Deep Concept Identification for Generative Design
Toward Reasoning on the Boundary: A Mixup-based Approach for Graph Anomaly Detection
Prism: Spectral Parameter Sharing for Multi-Agent Reinforcement Learning
Advances in Battery Energy Storage Management: Control and Economic Synergies
Emergent Low-Rank Training Dynamics in MLPs with Smooth Activations
RuleSmith: Multi-Agent LLMs for Automated Game Balancing
Learning Rate Scaling across LoRA Ranks and Transfer to Full Finetuning
SCONE: A Practical, Constraint-Aware Plug-in for Latent Encoding in Learned DNA Storage
Is Gradient Ascent Really Necessary? Memorize to Forget for Machine Unlearning
SR4-Fit: An Interpretable and Informative Classification Algorithm Applied to Prediction of U.S. House of Representatives Elections
Sample-Efficient Policy Space Response Oracles with Joint Experience Best Response
Towards Generalizable Reasoning: Group Causal Counterfactual Policy Optimization for LLM Reasoning
Shallow Diffuse: Robust and Invisible Watermarking through Low-Dimensional Subspaces in Diffusion Models
Adaptive Transfer Clustering: A Unified Framework