TextRL

On the site since March 27, 2023 01:17
Text generation with reinforcement learning using Hugging Face Transformers. A ChatGPT-style implementation of RLHF (Reinforcement Learning from Human Feedback) that uses human interaction to improve a generation model through reinforcement learning.
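
The idea of improving a generator from feedback can be illustrated with a toy policy-gradient (REINFORCE) loop. This is a minimal sketch in plain Python and does not use TextRL's or Hugging Face's actual API: the three-word vocabulary, `reward_fn` (a stand-in for a human rating), and all hyperparameters are assumptions made up for illustration.

```python
import math
import random

# Hypothetical toy vocabulary; a real model would have tens of thousands of tokens.
VOCAB = ["good", "bad", "okay"]


def softmax(logits):
    """Turn raw logits into a probability distribution."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def reward_fn(token):
    """Stand-in for human feedback (assumption): only "good" is rewarded."""
    return 1.0 if token == "good" else 0.0


def train(steps=500, lr=0.1, seed=0):
    """Run REINFORCE on a one-token 'generation' task and return final probabilities."""
    rng = random.Random(seed)
    logits = [0.0] * len(VOCAB)  # the policy's only parameters
    baseline = 0.0               # running average of reward, reduces variance
    for _ in range(steps):
        probs = softmax(logits)
        # Sample a token from the current policy (the "generated text").
        idx = rng.choices(range(len(VOCAB)), weights=probs)[0]
        reward = reward_fn(VOCAB[idx])
        advantage = reward - baseline
        # Gradient of log pi(idx) w.r.t. logit j is (1[j == idx] - probs[j]).
        for j in range(len(VOCAB)):
            grad = (1.0 if j == idx else 0.0) - probs[j]
            logits[j] += lr * advantage * grad
        baseline += 0.05 * (reward - baseline)
    return softmax(logits)


final_probs = train()
print(dict(zip(VOCAB, final_probs)))
```

In a full RLHF setup the logits would come from a language model, the sampled output would be a whole sequence, and the reward would come from human ratings or a learned reward model; the update rule above only shows the general shape of the idea.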