What's new? Composer 1.5 uses 20x more RL steps and a thinking-token system for code reasoning; it applies self ...