I understand prompt caching is an effective way for optimising for input tokens. What are other options for optimising input tokens? My input tokens maximise in using an xml file.
I have avoided using TOONs [1] because it works only with uniform arrays, not nested objects nor non-uniform structures.
[1]https://github.com/toon-format/toon