lucidrains/PaLM-rlhf-pytorch

On the site since May 4, 2023, 16:07
Implementation of RLHF (Reinforcement Learning from Human Feedback) on top of the PaLM architecture. Maybe I'll add retrieval functionality too, à la RETRO.