All Papers

Source: | Tag:
Title Authors Year Venue Tags Source Date
Layer Normalization Jimmy Lei Ba, Jamie Ryan Kiros, et al. 2016 arXiv
manual 2026-03-28
TurboQuant: Online Vector Quantization with Near-optimal Distortion Rate Amir Zandieh, Majid Daliri, et al. 2025 arXiv
manual 2026-03-28
T-MAP: Red-Teaming LLM Agents with Trajectory-aware Evolutionary Search Hyomin Lee, Sangwoo Park, et al. 2026 arXiv
auto 2026-03-28
Prune as You Generate: Online Rollout Pruning for Faster and Better RLVR Haobo Xu, Sirui Chen, et al. 2026 arXiv
auto 2026-03-28
SEVerA: Verified Synthesis of Self-Evolving Agents Debangshu Banerjee, Changming Xu, et al. 2026 arXiv
auto 2026-03-28
Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills Jingwei Ni, Yihao Liu, et al. 2026 arXiv
auto 2026-03-28
Train at Moving Edge: Online-Verified Prompt Selection for Efficient RL Training of Large Reasoning Model Jiahao Wu, Ning Lu, et al. 2026 arXiv
auto 2026-03-28