TextRL

на сайте с 27 марта 2023, 01:17
Text generation with reinforcement learning using huggingface's transformer. RLHF (Reinforcement Learning with Human Feedback) Implementation of ChatGPT for human interaction to improve generation model with reinforcement learning.