I'm calling bullshit. There's no reason a shitty chatbot needs anywhere near that amount of VRAM and processing power for personal use, it doesn't need to be able to spit out instant responses, it just needs to be able to get there eventually, I'm pretty sure you could use a near identical model on regular consumer hardware if it was changed to make use of storage drive space rather than primarily VRAM
I'm calling bullshit. There's no reason a shitty chatbot needs anywhere near that amount of VRAM and processing power for personal use, it doesn't need to be able to spit out instant responses, it just needs to be able to get there eventually, I'm pretty sure you could use a near identical model on regular consumer hardware if it was changed to make use of storage drive space rather than primarily VRAM