ModelCascade – Route LLM calls to your own GPU first, cloud secondgithub.com/wayneColt1 pointwayneIA2 months ago