| Title | Authors | Year | Venue | Tags | Source | Date |
|---|---|---|---|---|---|---|
| Layer Normalization | Jimmy Lei Ba, Jamie Ryan Kiros, et al. | 2016 | arXiv | | manual | 2026-03-28 |
| TurboQuant: Online Vector Quantization with Near-optimal Distortion Rate | Amir Zandieh, Majid Daliri, et al. | 2025 | arXiv | | manual | 2026-03-28 |
| T-MAP: Red-Teaming LLM Agents with Trajectory-aware Evolutionary Search | Hyomin Lee, Sangwoo Park, et al. | 2026 | arXiv | | auto | 2026-03-28 |
| Prune as You Generate: Online Rollout Pruning for Faster and Better RLVR | Haobo Xu, Sirui Chen, et al. | 2026 | arXiv | | auto | 2026-03-28 |
| SEVerA: Verified Synthesis of Self-Evolving Agents | Debangshu Banerjee, Changming Xu, et al. | 2026 | arXiv | | auto | 2026-03-28 |
| Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills | Jingwei Ni, Yihao Liu, et al. | 2026 | arXiv | | auto | 2026-03-28 |
| Train at Moving Edge: Online-Verified Prompt Selection for Efficient RL Training of Large Reasoning Model | Jiahao Wu, Ning Lu, et al. | 2026 | arXiv | | auto | 2026-03-28 |