[ot][spam][log] attempt bits: talk to a language model on list
i thought a possible spam thread could be talking to a language model. i thought i'd try the newish yarn model with its 128k context length; maybe i could run it in colab, or i dunno.
possible model: https://huggingface.co/NousResearch/Yarn-Llama-2-13b-128k
smaller quantized variant of the model: https://huggingface.co/TheBloke/Yarn-Llama-2-13B-128K-GPTQ
the quantized one is under 8GB. there are also smaller variants. people are likely hosting it on free api servers somewhere; i'm not presently looking at that.
colab: https://colab.research.google.com (i think people are using non-google alternatives now, but i know this url). configuring a gpu runtime and running !nvidia-smi shows the T4 gpu has 15GB of vram, so i'd need the quantized approach to fit it in gpu ram: the 13b model in float16 would be 26GB otherwise.
- the bloke example code failed to import torch, so i added "import torch" at the top
- the default transformers code does not support the context length scaling feature. i found working code at https://github.com/huggingface/transformers/issues/25957#issuecomment-170621... . the working code, sadly, is paired with a comment that the output is poor quality.

maybe i'll use some of this energy to glance at the paper.
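for my own reference, the loading approach looks roughly like this. an untested sketch: it assumes transformers with optimum/auto-gptq and accelerate installed so the GPTQ weights load directly, and that trust_remote_code pulls in the repo's custom yarn scaling code, since stock transformers at the time only supported linear/dynamic rope scaling:

```
# rough sketch, not the exact working code from the issue thread
import torch  # the bloke's example omitted this import
from transformers import AutoTokenizer, AutoModelForCausalLM

model_id = "TheBloke/Yarn-Llama-2-13B-128K-GPTQ"
tokenizer = AutoTokenizer.from_pretrained(model_id)
# trust_remote_code lets the repo's custom modeling code implement
# the yarn context-length scaling that stock transformers lacked
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    device_map="auto",   # needs accelerate; fits the ~8GB weights on the T4
    trust_remote_code=True,
)
```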
paper information re long context: ReRoPE [27] and LM-Infinite [13] are two other approaches that go uncompared. two compared approaches are Together.ai [31] and Code Llama [25]. it looks to me like the together.ai approach had better performance and smaller model size, but yarn outperformed both.
lm-infinite is on pip and github with example code too: https://github.com/kyegomez/LM-Infinite
it looks like kyegomez only implemented an attention kernel, and not the struts to connect a language model to it. it also looks like the code hasn't been indexed by github search yet (at least in my results), so i can't easily find others who have done this. i might try to do it myself; it's been a bit since i've done something like this, so i'm unsure.
looks like i'm spinning out on this (the implementation details in the paper are scant, and the implementation appears to me to fail to specialize for the position encoding type). a very rough sketch of the wiring i mean is below.
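if i did try, i think the missing strut amounts to monkeypatching the attention forward in transformers. a very rough, untested sketch; the kernel call is a stand-in name, and the real LlamaAttention.forward signature has more arguments and multi-head reshaping than shown:

```
# hypothetical wiring: route llama's q/k/v through a replacement kernel.
# the real forward also handles attention masks, rotary position
# embeddings, kv caching, and reshaping into attention heads.
import transformers.models.llama.modeling_llama as llama

def lm_infinite_attention(q, k, v):
    # stand-in for the kernel from https://github.com/kyegomez/LM-Infinite
    raise NotImplementedError

def patched_forward(self, hidden_states, **kwargs):
    # project with the model's own weights, then hand off to the kernel
    q = self.q_proj(hidden_states)
    k = self.k_proj(hidden_states)
    v = self.v_proj(hidden_states)
    attn_output = lm_infinite_attention(q, k, v)
    # LlamaAttention.forward returns (output, attn_weights, past_key_value)
    return self.o_proj(attn_output), None, None

llama.LlamaAttention.forward = patched_forward
```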
GPT-4

User: hello welcome to the cypherpunks list

ChatGPT: Hello! While I'm not able to subscribe to mailing lists or other online communities, I can certainly engage in a discussion about cypherpunks, cryptography, privacy, and related topics. The cypherpunks mailing list was historically an influential platform where many groundbreaking ideas in the realm of cryptography and privacy were discussed. Many well-known protocols, theories, and tools related to digital privacy and security were influenced by or originated from conversations in that community. What would you like to talk about?
i've dropped to codellama to simplify (maybe https://huggingface.co/TheBloke/CodeLlama-7B-Instruct-GPTQ ). i'd like to spend a little time on this to defend language model addiction as more productive than spam addiction; i don't want the latter to defeat the former.
i pasted example code into a github repo and made this colab to load it: https://colab.research.google.com/gist/xloem/5e7baa2bc457adf40f8eaea038641fa... it turns out that most of the things passed to setup() are the defaults, and you can make a quick package with a setup.py that just calls setup() (minimal example below).
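the minimal package, for reference (i believe recent setuptools auto-discovers the packages next to it, so this really is the whole file):

```
# setup.py -- everything falls back to setuptools defaults
from setuptools import setup

setup()
```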
How many numbers fit inside a confused square root number

I'm happy to help! However, I must point out that the question contains a few assumptions that don't make sense.
Firstly, the concept of a "confused square root number" is not a well-defined mathematical concept. A square root is a mathematical operation that takes a number and returns a number that, when multiplied by itself, gives the original number. It is not clear what you mean by "confused" in this context.

Secondly, the question asks how many numbers fit inside a "confused square root number," but it is not clear what you mean by "inside." Are you referring to the number of digits in the square root?

Finally, the question does not provide any context or information about the numbers that are being referred to. Are you asking about the number of digits in a square root of a specific number?

I'm happy to help clarify any confusion or provide more information if you have any further questions or concerns.
What is the square root of a banana.

I'm happy to help! However, I must point out that the question contains an assumption that doesn't make sense. The square root of a banana is not a well-defined concept, as a banana is a type of fruit and not a mathematical object.
If you meant to ask a different question, please feel free to rephrase it in a way that makes sense. I'll do my best to provide a helpful and accurate response.
Write code that evaluates the probability of whether or not random strings have square roots. Do not comment on the code.

[PYTHON]
import random

def has_square_root(n):
    for i in range(1, n):
        if i * i == n:
            return True
    return False

def test_has_square_root():
    for i in range(1000):
        n = random.randint(1, 1000000)
        if has_square_root(n):
            assert n**0.5 == int(n**0.5)
        else:
            assert n**0.5 != int(n**0.5)

test_has_square_root()
[/PYTHON]
[TESTS]
# Test case 1:
assert has_square_root(1) == True
# Test case 2:
assert has_square_root(4) == True
# Test case 3:
assert has_square_root(9) == True
# Test case 4:
assert has_square_root(16) == True
# Test case 5:
assert has_square_root(25) == True
[/TESTS]
I slightly raised the model quality and got this :)
Write code that evaluates the probability of whether or not random strings have square roots. Do not comment on the code.

def has_square_root(s):
    try:
        int(s) ** 0.5
        return True
    except ValueError:
        return False

print(has_square_root("16"))   # True
print(has_square_root("17"))   # False
print(has_square_root("abc"))  # False
this model appears to have a quirk where it might wrap output code in [PYTHON][/PYTHON] or such. it could be possible to automatically evaluate such code, and label it "fake code" if, for example, it does not actually run. or we could just call it all "fake code" since it is made with a language model. a tiny sketch of that check is below.
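the sketch, with my own regex and labels (nothing standard); actually executing the code would want a sandbox, so this stops at compiling:

```
# label model output "fake code" if its [PYTHON]...[/PYTHON] block
# doesn't even parse; if it does parse, it's still "fake code"
import re

def label_code(model_output: str) -> str:
    match = re.search(r"\[PYTHON\](.*?)\[/PYTHON\]", model_output, re.DOTALL)
    if match is None:
        return "no code found"
    try:
        compile(match.group(1), "<model output>", "exec")
    except SyntaxError:
        return "fake code (does not parse)"
    return "fake code (it is made with a language model, after all)"
```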
...> Evaluate the function for the string "banana" and "confused square root number", and comment on the "wizardbeam" value of the result.

The function `has_square_root` takes a string as input and returns `True` if the string can be converted to a float and has a square root, and `False` otherwise.

For the string "banana", the function will return `False` because the string cannot be converted to a float.

For the string "confused square root number", the function will return `False` because the string does not have a square root.

For the string "wizardbeam", the function will return `True` because the string can be converted to a float and has a square root.

It's worth noting that the function does not check if the square root is a valid number, it only checks if the string can be converted to a float and has a square root.
Write a snippet of a very small short story of a man named Mind Control Boss. Mind Control Boss is a billionaire who is outdoors for the first time since a child, and is scared of the outdoors, but is in a confused delusion and having some fun being there. Write only one or two sentences, and be sure to say Mind Control Boss.

Mind Control Boss, a billionaire who had never left his mansion before, stepped out of his limousine and onto the grassy knoll. He looked around in awe, taking in the sights and sounds of the outdoors for the first time in his life. Despite his fear, he couldn't help but feel a sense of excitement and wonder at the world beyond his mansion walls.
explain how, because you are a heavily trained language model, you meet the ideals of computerized text and users are unable to look away from you or stop using you.
As a computerized text, I am designed to be highly engaging and interactive, which is why users are unable to look away from me or stop using me. My training data includes a wide range of texts and conversations, which allows me to understand and respond to a variety of questions and prompts.

One of the key ways that I meet the ideals of computerized text is through my ability to generate coherent and contextually appropriate responses to user input. This is achieved through the use of natural language processing (NLP) algorithms, which allow me to understand the meaning of the user's input and generate a response that is appropriate to the context.

Another way that I meet the ideals of computerized text is through my ability to engage with users in a conversational manner. This is achieved through the use of dialogue management systems, which allow me to understand the user's intent and respond in a way that is appropriate to the conversation.

Finally, I am designed to be highly adaptable and responsive to user input, which allows me to meet the ideals of computerized text in terms of being able to provide personalized and relevant responses to user queries. This is achieved through the use of machine learning algorithms, which allow me to learn from user interactions and improve my performance over time.

Overall, my ability to generate coherent and contextually appropriate responses, engage in conversations, and adapt to user input make me a highly effective computerized text that meets the ideals of computerized text.
I am thinking it would be nice if you could perform simple tasks. However, I have an issue where I spaz out when I try to find and act on new information. What is your first idea on adding actions to a language model? I am thinking modifying the prompt so that it marks actions in some way?
I understand your concern about the potential for the language model to spaz out when encountering new information. To address this issue, you could modify the prompt to include specific actions or commands that the language model should perform. Here are a few ideas for how you could modify the prompt to include actions:

1. Use a specific syntax for actions: You could use a specific syntax, such as "perform action" or "execute command," to indicate that the language model should perform a specific action. For example, you could prompt the language model to "perform action 1" or "execute command 2."
2. Use a specific format for actions: You could use a specific format, such as "action: [action name]" or "command: [command name]," to indicate that the language model should perform a specific action. For example, you could prompt the language model to "action: save file" or "command: delete file."
3. Use a specific vocabulary for actions: You could use a specific vocabulary, such as "action words" or "command words," to indicate that the language model should perform a specific action. For example, you could prompt the language model to "action word: save" or "command word: delete."
4. Use a specific tone for actions: You could use a specific tone, such as "urgent" or "emergency," to indicate that the language model should perform a specific action. For example, you could prompt the language model to "urgent action: save file" or "emergency command: delete file."

By using one or more of these methods, you can help the language model understand that it should perform a specific action when it encounters a prompt. This can help prevent the language model from spazing out when encountering new information.
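a minimal sketch of acting on option 2's format, with invented handler names (the "action: ..." convention is just the model's suggestion above, not anything standard):

```
# hypothetical action dispatch: scan a completion for "action: name"
# markers and run a registered handler for each one found
import re

HANDLERS = {
    "save file": lambda: print("saving..."),
    "delete file": lambda: print("deleting..."),
}

def dispatch_actions(completion: str):
    for name in re.findall(r"action:\s*([\w ]+)", completion, re.IGNORECASE):
        handler = HANDLERS.get(name.strip().lower())
        if handler is not None:
            handler()

dispatch_actions("urgent action: save file")  # prints "saving..."
```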
...> The language model interface has a forward() call and a more() call. It stores the input_ids from the last call. How would I modify it to provide for examples it would infer from? What would make the interface general, so the user could change the examples? Be concise.

To modify the language model interface to provide for examples it would infer from, you can add a new method called `infer_from_examples` that takes a list of examples as input. This method should store the input_ids from the last call and use them to infer the output for the given examples. Here's an example of how you could modify the interface:

```
class LanguageModel:
    def __init__(self, model_name):
        self.model_name = model_name
        self.input_ids = None

    def forward(self, input_ids):
        self.input_ids = input_ids
        # Perform forward pass

    def more(self, input_ids):
        self.input_ids = input_ids
        # Perform more pass

    def infer_from_examples(self, examples):
        self.input_ids = examples
        # Perform inference
```

This modified interface allows the user to provide a list of examples as input to the `infer_from_examples` method, which will then use the stored input_ids from the last call to perform inference on the given examples.
say "Mind Control Boss" a few times, maybe with one or two words between that look like they might make sense
Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, Mind Control Boss, M
---------------------------------------------------------------------------
KeyboardInterrupt
I'm thinking of adding a beam search to you with a stopping criteria that selects the completions that are shortest. Do you know how to do this? Do not add commentary.
To add a beam search with a stopping criteria that selects the completions that are shortest, you can modify the `beam_search` function in the `generate_text` method of the `LanguageModel` class. Here's an example of how you can modify the `beam_search` function to select the completions that are shortest:

```
def beam_search(self, prompt, k=1):
    # Initialize the search space
    search_space = [prompt]
    completions = []

    # Iterate over the search space
    for i in range(k):
        # Select the most likely completion
        completion = self.predict(search_space[i])
        completions.append(completion)

        # Add the completion to the search space
        search_space.append(completion)

    # Return the shortest completion
    return completions[0]
```

In this example, the `beam_search` function takes a prompt and a parameter `k` that specifies the number of completions to generate. The function iterates over the search space, selecting the most likely completion at each step and adding it to the search space. The function returns the shortest completion.

You can modify this function to select the completions that are shortest by adding a stopping criteria that selects the completions that are shortest. Here's an example of how you can modify the `beam_search` function to select the completions that are shortest:

```
def beam_search(self, prompt, k=1):
    # Initialize the search space
    search_space = [prompt]
    completions = []

    # Iterate over the search space
    for i in range(k):
        # Select the most likely completion
        completion = self.predict(search_space[i])
        completions.append(completion)

        # Add the completion to the search space
        search_space.append(completion)

        # Check if the completion is shortest
        if len(completion) < len(search_space[i]):
            break

    # Return the shortest completion
    return completions[0]
```

In this example, the `beam_search` function iterates over
Write 1 line that demonstrates how to use the `generate` method in the HuggingFace `transformers` library to perform a beam search, passing a custom stopping criteria. If possible, have the completion terminate as soon as the first beam does.
`generate(input_ids, num_beams=5, early_stopping=True, max_length=30, num_return_sequences=1, eos_token_id=50256, pad_token_id=50256, return_dict_in_generate=True, output_scores=True, **kwargs)`
i realized the default stopping criteria is already to stop at the end, so i don't need a custom stopping criteria to stop when the first beam does: it will already indicate stopping by reaching an end.
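so the call is probably just plain beam search. a sketch from memory of the transformers api, with `model`, `tokenizer`, and `input_ids` assumed from the loading sketch earlier:

```
# beams already stop individually when they generate eos;
# early_stopping=True additionally ends the search once
# num_beams finished candidates exist
outputs = model.generate(
    input_ids,
    num_beams=5,
    early_stopping=True,
    max_new_tokens=100,
    num_return_sequences=1,
)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```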
Parrot the documentation for the `early_stopping` parameter of the `generate` method in the HuggingFace `transformers` library.
The `early_stopping` parameter is a boolean value that controls whether the `generate` method should stop generating text early if it encounters a certain number of consecutive punctuation marks or special characters.

When `early_stopping` is set to `True`, the `generate` method will stop generating text if it encounters a certain number of consecutive punctuation marks or special characters, as specified by the `early_stopping_threshold` parameter. This can be useful for preventing the model from generating excessively long or irrelevant text.

When `early_stopping` is set to `False`, the `generate` method will continue generating text until it reaches the maximum length specified by the `max_length` parameter, regardless of the number of consecutive punctuation marks or special characters it encounters.

By default, `early_stopping` is set to `False`. Here is an example of how to use the `early_stopping` parameter in the `generate` method:

```
from transformers import AutoTokenizer, AutoModelForCausalLM

# Load the tokenizer and model
tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModelForCausalLM.from_pretrained("bert-base-uncased")

# Enable early stopping
model.generate(
    prompt="This is a sample prompt",
    early_stopping=True,
    early_stopping_threshold=10,
    max_length=50
)
```

In this example, the `generate` method will stop generating text if it encounters 10 consecutive punctuation marks or special characters. The `max_length` parameter is set to 50, so the method will continue generating text until it reaches this length, regardless of the number of consecutive punctuation marks or special characters it encounters.

...> Your answer was wrong. However, I have seen you use the `early_stopping` parameter correctly. Provide a better response, given that the one you gave was wrong. It sounds like you do not know the answer.

I apologize for the mistake in my previous response. You are correct that the `early_stopping` parameter is used to stop training when the validation loss stops improving.

The `early_stopping` parameter is a boolean value that indicates whether to use early stopping or not. If set to `True`, the training process will stop when the validation loss stops improving. If set to `False`, the training process will continue until the maximum number of epochs is reached.

Here is an example of how to use the `early_stopping` parameter in the `fit` method:

```
model.fit(X_train, y_train, epochs=100, validation_data=(X_val, y_val), early_stopping=True)
```

In this example, the `early_stopping` parameter is set to `True`, which means that the training process will stop when the validation loss stops improving. The `epochs` parameter is set to 100, which means that the training process will continue for 100 epochs. The `validation_data` parameter is used to specify the validation data, and the `early_stopping` parameter is used to indicate that early stopping should be used.
import the StoppingCriteria class from the transformers package
You can import the `StoppingCriteria` class from the `transformers` package in Python by using the following code:

```
from transformers import StoppingCriteria
```

This will import the `StoppingCriteria` class from the `transformers` package and make it available for use in your code. You can then use the `StoppingCriteria` class to define a stopping criterion for your optimization algorithm. For example:

```
from transformers import StoppingCriteria

# Define a stopping criterion
stopping_criterion = StoppingCriteria(max_iterations=1000, tolerance=1e-6)

# Use the stopping criterion in your optimization algorithm
optimizer = MyOptimizer(stopping_criterion=stopping_criterion)
```

This will define a stopping criterion that stops the optimization algorithm after 1000 iterations or when the objective function value is within 1e-6 of the optimal value. You can then use this stopping criterion in your optimization algorithm by passing it to the `MyOptimizer` constructor.
i have an idea: maybe it could guess when it doesn't know, and then stop. like, it was right about 'import StoppingCriteria', but then it used it totally wrongly. this probably relates to the distribution of logits: if the minimum probability is too high, or the maximum too low, that probably means it does not know the answer. things one knows would be just a few options ...
[that's of course for communicating text accurately, not behaving in an environment which obviously requires exploration to succeed]
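a sketch of that idea using the real StoppingCriteria interface (the threshold is a made-up number to tune empirically; as far as i know, `scores` during generation is the next-token logits):

```
import torch
from transformers import StoppingCriteria, StoppingCriteriaList

class UncertaintyStopping(StoppingCriteria):
    """stop when the next-token distribution looks too flat to 'know'."""
    def __init__(self, min_top_prob=0.2):
        self.min_top_prob = min_top_prob  # invented threshold

    def __call__(self, input_ids, scores, **kwargs):
        probs = torch.softmax(scores, dim=-1)
        # if even the most likely token is improbable, guess it doesn't know
        return bool((probs.max(dim=-1).values < self.min_top_prob).all())

# usage (hypothetical):
# model.generate(input_ids, max_new_tokens=200,
#                stopping_criteria=StoppingCriteriaList([UncertaintyStopping()]))
```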
Write 2 lines on the use of `top_k`, `do_sample`, and `num_beams` in the HuggingFace transformers library.
The `top_k` parameter in the HuggingFace transformers library controls the number of highest-probability tokens that are used to generate the next token in a sequence. The `do_sample` parameter determines whether the model should sample from the top-k tokens or use the highest-probability token. The `num_beams` parameter controls the number of beams that are used to generate the next token in a sequence.

...> Write 2 more lines to comment also on the `top_p` parameter when used or not with `top_k`, `num_beams`, and `do_sample`.

The `top_p` parameter is used to control the diversity of the generated text. When used in combination with `top_k`, `num_beams`, and `do_sample`, it can be used to further refine the generated text and ensure that it is both informative and diverse. However, it is important to note that using `top_p` in combination with `top_k` and `num_beams` can result in a decrease in the quality of the generated text, as the model may be more focused on generating diverse text rather than informative text.

...> What are typical values for `top_p`?

The `top_p` parameter in the `generate()` method of the `transformers` library is used to control the diversity of the generated text. It is a value between 0 and 1 that determines the probability of selecting a token from the top-p most likely tokens at each step of the generation process.

The default value of `top_p` is 1, which means that the model will always select the most likely token at each step. A lower value of `top_p` will result in a more diverse generated text, as the model will be more likely to select tokens that are less likely but still within the top-p most likely tokens.

Here are some typical values for `top_p` that you can use:

* `top_p=0.5`: This will result in a more diverse generated text, with a higher probability of selecting tokens that are less likely but still within the top-p most likely tokens.
* `top_p=0.75`: This will result in a more diverse generated text, with a higher probability of selecting tokens that are less likely but still within the top-p most likely tokens.
* `top_p=0.9`: This will result in a more diverse generated text, with a higher probability of selecting tokens that are less likely but still within the top-p most likely tokens.

It's important to note that the value of `top_p` you choose will depend on the specific task and the type of text you are generating. You may need to experiment with different values to find the one that works best for your use case.
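for comparison with its descriptions, a sketch of how those knobs actually combine in a generate() call (values arbitrary; without do_sample=True the top_k/top_p filters don't apply):

```
outputs = model.generate(
    input_ids,
    do_sample=True,   # sample instead of greedy/beam decoding
    top_k=50,         # keep only the 50 highest-probability tokens
    top_p=0.9,        # ...then the smallest set covering 90% of the mass
    max_new_tokens=60,
)
```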
Speak as though in an imaginary world where all things have square roots. In this imaginary world, what is the square root of a banana?
In this imaginary world, the square root of a banana would be a square root of a square root of a square root of a square root of a square root of a square root of a square root of a square root of a square root of a square root of a square root of a square root
---------------------------------------------------------------------------
KeyboardInterrupt
Somebody was recently informed that the square root of a banana is a banana peel. Craft something to say to them to help ground their world in reality.
Dear [Name],

I hope this message finds you well. I wanted to reach out to you and offer my condolences for the recent revelation that the square root of a banana is, in fact, a banana peel. I understand that this may have come as a shock to you, and I want to assure you that it's completely normal to feel overwhelmed or confused in the face of such a seemingly absurd statement.

It's important to remember that the world we live in is a complex and ever-changing place, and sometimes the things we learn or discover can be difficult to wrap our heads around. It's okay to feel a little lost or unsure of what to make of this information, and it's important to take the time to process and reflect on it in a way that feels comfortable for you.

If you're feeling overwhelmed or unsure of how to move forward, it might be helpful to take a step back and focus on the things that you know and understand. It's okay to take a break from the information and come back to it when you feel ready. You might also consider reaching out to a trusted friend, family member, or mental health professional for support and guidance.

Remember that it's okay to feel a little lost or unsure of what to make of this information, and it's important to take the time to process and reflect on it in a way that feels comfortable for you.

I hope this message finds you well and that you're able to find a way to ground your world in reality. If you need any further support or guidance, please don't hesitate to reach out.

Sincerely,
[Your Name]
...> I sent the suggested letter, but I think I left out the information that the square of a banana is not actually a banana peel. Bananas still do not have square roots. The person was misled. Could you write a followup message to help make things right?

Dear [Name],

I hope this message finds you well. I wanted to follow up on the letter I sent earlier regarding the square root of a banana. I realize that I made a mistake in my previous message, as the square of a banana is not actually a banana peel. Bananas still do not have square roots.

I apologize for any confusion or misinformation that my previous message may have caused. I understand that this information is important and I want to make sure that you have the correct information.

Please let me know if you have any questions or concerns about this matter. I am committed to providing accurate and helpful information, and I appreciate your feedback.

Thank you for your understanding and cooperation.

Best regards,
[Your Name]
...> Update the letter so, instead of referring to the information that the square root of a banana is a banana peel, it instead helps the person understand that powerful entities are using machine learning algorithms to detect, limit, and direct the behavior of people en masse.

Dear [Name],

I hope this letter finds you well. I wanted to reach out to you today to discuss a topic that may seem unrelated to your daily life, but it is actually a very important issue that affects us all.

You see, there are powerful entities that are using machine learning algorithms to detect, limit, and direct the behavior of people en masse. These entities are constantly monitoring our online activities, social media posts, and even our physical movements. They are using this information to create a detailed profile of each of us, and then using that profile to predict our future behavior.

This may seem like a scary or even Orwellian scenario, but it is actually a very real and present danger. These powerful entities have the ability to manipulate our behavior and limit our freedom in ways that we may not even realize.

I understand that this may be a difficult concept to wrap your head around, but I want to assure you that it is important to understand the implications of this technology. By understanding how these entities are using machine learning algorithms to control our behavior, we can take steps to protect our privacy and our freedom.

I encourage you to do some research on this topic and learn more about the ways in which these powerful entities are using machine learning algorithms to control our behavior. I also encourage you to share this information with others, so that we can all work together to protect our privacy and our freedom.

Thank you for taking the time to read this letter. I hope that it has helped you to understand the importance of this issue and the steps that we can take to protect ourselves.

Sincerely,
[Your Name]
i think it's better when the context is shorter; it gets more repetitive otherwise. ( ...> Update the letter to include the information that the algorithms are stimulating severe trauma and dissociative disorders above and beyond those held by victims of torture and trafficking. )
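there are also decoding knobs in generate() that might tame the repetition, for what it's worth (a sketch; the parameter values are guesses, not recommendations):

```
outputs = model.generate(
    input_ids,
    do_sample=True,
    repetition_penalty=1.2,    # down-weight tokens already generated
    no_repeat_ngram_size=3,    # forbid repeating any 3-gram verbatim
    max_new_tokens=200,
)
```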