lucidrains/PaLM-rlhf-pytorch

On the site since May 04, 2023 16:07
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Maybe I'll add retrieval functionality too, à la RETRO