Ternative – C++/CUDA inference engine for ternary LLMs with runtime LoRAgithub.com/michelangeloromerochisco3 pointsmichelangeloroa month ago