Show HN: RLHF and LoRA fine-tuning of mistralai 7B with DeepSpeed learning (github.com/genji970)
1 point by genji970 a year ago

Trains a 7B model on multiple GPUs with LoRA and RLHF, using an external dataset.
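For anyone wondering what the multi-GPU side of a setup like this might look like, here is a rough sketch of a DeepSpeed-style config built in Python. It is not taken from the linked repo: the helper name `build_ds_config`, the GPU count, the batch sizes, the ZeRO stage, and the choice of bf16 are all assumptions picked for illustration. Only the config keys themselves are standard DeepSpeed ones.

```python
import json


def build_ds_config(micro_batch_per_gpu=1, grad_accum_steps=8, num_gpus=4):
    """Build an illustrative DeepSpeed config dict (hypothetical values)."""
    return {
        "train_micro_batch_size_per_gpu": micro_batch_per_gpu,
        "gradient_accumulation_steps": grad_accum_steps,
        # DeepSpeed expects the global batch size to equal
        # micro batch per GPU * gradient accumulation steps * number of GPUs.
        "train_batch_size": micro_batch_per_gpu * grad_accum_steps * num_gpus,
        # Assumed precision setting; the original project may use fp16 instead.
        "bf16": {"enabled": True},
        # Assumed ZeRO stage; stage 2 shards optimizer state and gradients.
        "zero_optimization": {"stage": 2},
    }


ds_config = build_ds_config()
ds_config_json = json.dumps(ds_config, indent=2)
print(ds_config_json)
```

The resulting JSON could be saved to a file and handed to a training script. With LoRA only the small adapter matrices are trained, so the optimizer state that ZeRO shards across GPUs stays small compared with full fine-tuning of a 7B model.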