Show HN: RLHF and LoRA fine-tuning of Mistral 7B with DeepSpeed

Heykuki News

1 point

a year ago

Trains a 7B model with LoRA and RLHF on an external dataset, using multiple GPUs.
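
For a rough sense of why LoRA makes a 7B model practical to fine-tune, here is a minimal sketch of the parameter arithmetic. The function name, the 4096 x 4096 matrix size, and the rank of 8 are illustrative assumptions, not values taken from the project.

```python
def lora_param_count(d_in: int, d_out: int, rank: int) -> int:
    """Trainable parameters LoRA adds to one d_out x d_in weight matrix."""
    # LoRA keeps the original weight W frozen and learns two small factors:
    # B with shape (d_out, rank) and A with shape (rank, d_in).
    return rank * (d_in + d_out)


# Illustrative values only (assumed, not from the post).
d_in = d_out = 4096
rank = 8

full = d_in * d_out                         # parameters in the frozen matrix
lora = lora_param_count(d_in, d_out, rank)  # trainable LoRA parameters

print(f"full: {full}, LoRA: {lora}, ratio: {lora / full:.4%}")
```

Under these assumptions only a small fraction of each adapted matrix is trainable, which keeps gradient and optimizer-state memory low on each GPU.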

No comments
