Hacker News

Honestly, you can run this on a GPU with 16GB of VRAM using llama.cpp. Just try it!
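For anyone who wants to try: a minimal sketch of what that looks like with llama.cpp's CLI. The model filename and quantization level here are assumptions (a 4-bit GGUF quant is what typically fits a model of this class in 16GB); adjust to whatever GGUF you download.

```shell
# Hypothetical example: model file and quant level are assumptions, not from the parent comment.
# Assumes llama.cpp is already built (cmake --build build) and a GGUF quant is downloaded.
./build/bin/llama-cli \
  -m model-Q4_K_M.gguf \  # 4-bit quant; pick one that fits in 16GB VRAM
  -ngl 99 \               # offload all layers to the GPU
  -c 4096 \               # context window size
  -p "Hello"              # prompt to generate from
```

A Q4_K_M quant roughly halves-again the footprint versus 8-bit, which is usually the difference between fitting and not fitting on a 16GB card; if you still run out of memory, offload fewer layers with a smaller `-ngl` value.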


