rlhf

This repository was commited under the action of executing important tasks on which modern Generative AI concepts are laid on. In particular, we focussed on three coding actions of Large Language Models. Extra and necessary details are given in the README.md file.

aws python3 pytorch lora rnn-pytorch attention-is-all-you-need fine-tuning hate-speech-detection huggingface huggingface-transformers foundation-models large-language-models generative-ai rlhf flan-t5 peft-fine-tuning-llm ml-m5-2xlarge low-rank-ada

Updated Mar 28, 2024
Jupyter Notebook

akain0 / Reinforcement-Learning-

Star

Projects and Models built in Python leveraging PyTorch, implementing Reinforcement Learning algorithms for reward-based tasks.

reinforcement-learning reinforcement-learning-algorithms a3c lstm-neural-networks bellman-equation rlhf

Updated May 7, 2024
Jupyter Notebook

AMfeta99 / NLP_LLM

Star

This repository is dedicated to small projects and some theoretical material that I used to get into NLP and LLM in a practical and efficient way.

Updated Jun 6, 2024
Jupyter Notebook

ChukwumaChukwuma / enyimba2_ai

Star

Applying quantum computing principles to large language models for more reliable, interpretable, and steerable systems.

machine-learning natural-language-processing reinforcement-learning ai artificial-intelligence quantum-computing llms generative-ai rlhf llama2

Updated Jan 5, 2024
Python

BARUDA-AI / Awesome-Preference-Optimization

Star

Survey of preference alignment algorithms

alignment direct preference-learning rlhf preference-alignment

Updated Feb 25, 2024

10mudassir007 / AI-CHATBOT

Star

Intelligent AI Chatbot that has the capability to learn from the user

python nlp ai learning-python chatbot nltk nlp-machine-learning nltk-python rlhf

Updated Mar 22, 2024
Python

shreyansh26 / LLM-Activation-Steering-Experiments

Star

Some experiments with activation steering in LLMs

red-teaming rlhf llama2 llama2-7b

Updated Jan 21, 2024
Python

saschaschramm / tiny-chatgpt

Star

Researching the reinforcement learning algorithm of ChatGPT

gae temporal-differencing-learning ppo chatgpt rlhf general-advantage-estimation

Updated Apr 7, 2023
Jupyter Notebook

OpenRL-Lab / RL_Tutorial

Star

Reinforcement Learning Tutorial (强化学习教程)

reinforcement-learning deep-reinforcement-learning tutorials pytorch dqn on-policy rlhf

Updated Sep 10, 2023

ZiyiZhang27 / tdpo

Star

[ICML 2024] Code for the paper "Confronting Reward Overoptimization for Diffusion Models: A Perspective of Inductive and Primacy Biases"

reinforcement-learning alignment text-to-image diffusion-models stable-diffusion human-feedback rlhf

Updated May 20, 2024
Python

Nips20262 / Nips20262

Star

Language Models Resist Alignment

alignment theory llm rlhf instruction-tuning unalignment

Updated Jun 7, 2024
Python

AugustasMacijauskas / mlmi-thesis

Star

Code for my thesis titled "Eliciting latent knowledge from language reward models" for the MPhil in Machine Learning and Machine Intelligence at the University of Cambridge

alignment interpretability rlhf

Updated Oct 5, 2023
Jupyter Notebook

navneet1083 / textsum-tune

Star

This project is based on fine-tuning LLM models (FLAN-T5) for text summarisation task using PEFT approach. All evaluation metrics being computed on ROUGE scoring and LoRA optimisation techniques being used for fine-tuning.

lora ppo peft ppo-agent huggingface-transformers rlhf flan-t5 llm-training