carperai/trlx
на сайте с 04 мая 2023, 16:10
trlX is a distributed training framework designed from the ground up to focus on fine-tuning large language models with reinforcement learning using either a provided reward function or a reward-labeled dataset.