rlhf
Here are 119 public repositories matching this topic...
RAG law system based on Google Search and Gemini Pro
Updated Mar 14, 2024 - Python
This repository was committed while carrying out key tasks that underpin modern generative AI concepts. In particular, we focused on three coding tasks involving large language models. Additional necessary details are given in the README.md file.
Updated Mar 28, 2024 - Jupyter Notebook
Projects and Models built in Python leveraging PyTorch, implementing Reinforcement Learning algorithms for reward-based tasks.
Updated May 7, 2024 - Jupyter Notebook
This repository is dedicated to small projects and some theoretical material that I used to get into NLP and LLMs in a practical and efficient way.
Updated Jun 6, 2024 - Jupyter Notebook
Applying quantum computing principles to large language models for more reliable, interpretable, and steerable systems.
Updated Jan 5, 2024 - Python
Survey of preference alignment algorithms
Updated Feb 25, 2024
Intelligent AI chatbot that can learn from the user
Updated Mar 22, 2024 - Python
Some experiments with activation steering in LLMs
Updated Jan 21, 2024 - Python
Researching the reinforcement learning algorithm of ChatGPT
Updated Apr 7, 2023 - Jupyter Notebook
Reinforcement Learning Tutorial (强化学习教程)
Updated Sep 10, 2023
[ICML 2024] Code for the paper "Confronting Reward Overoptimization for Diffusion Models: A Perspective of Inductive and Primacy Biases"
Updated May 20, 2024 - Python
Language Models Resist Alignment
Updated Jun 7, 2024 - Python
Code for my thesis titled "Eliciting latent knowledge from language reward models" for the MPhil in Machine Learning and Machine Intelligence at the University of Cambridge
Updated Oct 5, 2023 - Jupyter Notebook
This project fine-tunes an LLM (FLAN-T5) for text summarisation using a PEFT approach. LoRA is used for fine-tuning, and evaluation is based on ROUGE scores.
Updated Aug 8, 2023 - Jupyter Notebook
An alternative RLHF reward model formulation from a social choice perspective
Updated Apr 7, 2024 - Python
Open efforts to implement ChatGPT-like models and beyond.
Updated May 10, 2023
After RLHF and SFT showed promising results, a new technique named SPIN was introduced in 2024
Updated Jan 17, 2024