OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
-
Updated
Jan 27, 2023 - Python
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on small scale model (any generation model in hugging face's transformers)
Implementation of Reinforcement Learning from Human Feedback (RLHF)
Add a description, image, and links to the rlhf topic page so that developers can more easily learn about it.
To associate your repository with the rlhf topic, visit your repo's landing page and select "manage topics."