[ml][ot] low-end rlhf ala chatgpt, huggingface

Undescribed Horrific Abuse, One Victim & Survivor of Many gmkarl at gmail.com
Sun Mar 12 14:41:52 PDT 2023


i guess huggingface put themselves in charge of machine learning
technology trickling down to the masses in a controlled way
they released a lengthy tutorial that explains the parts, on using a
new patch to their libraries to make custom large models on low end
hardware. it combines the chatgpt rlhf approach with the adapter peft
approach and appears to be roughly what is presently the normative
cutting edge

https://www.reddit.com/r/MachineLearning/comments/11p3a0j/d_finetuning_20b_llms_with_rlhf_on_a_24gb/
https://huggingface.co/blog/trl-peft


More information about the cypherpunks mailing list