What is Prompt Caching? Why long prompts and multiple rounds of agents are trying to find a way to eat the cache
Prompt caching refers to caching the recurring prompt prefixes in the model request, and then trying to reuse the processing results directly when enc...