Bandit-Based, Budget-Aware LLM Routing with Preference-Informed LinUCB (PILOT)
206
Treat LLM routing as a contextual bandit and use a preference-informed LinUCB plus a knapsack budget policy to adaptively, cost-effectively pick the right model per query.