SSRL Trains LLMs to Search Their Own Parameters 5.5× Faster Than External Methods
SSRL uses reinforcement learning (RL) to teach LLMs to search their own internal knowledge. The result is 5.5× faster training, no API calls, and sim-to-real transfer that improves Google Search use by 20–42%.
April 26, 2026