I've read that Claude Code uses context caching better than, or at least differently from, generic clients, which makes a massive difference to Claude token usage. I also thought the detection stuff just switched you over to bonus usage even if your subscription hadn't been used up.