Submitted by minimaxir t3_11fbccz in MachineLearning
londons_explorer t1_jam6oyr wrote
Reply to comment by LetterRip in [D] OpenAI introduces ChatGPT and Whisper APIs (ChatGPT API is 1/10th the cost of GPT-3 API) by minimaxir
Aren't biases only a tiny fraction of the total memory usage? Is it even worth trying to quantize them more aggressively than the weights?
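For a rough sense of scale, here is a minimal sketch, assuming a plain dense layer with one weight per (input, output) pair and one bias per output. The function name `bias_fraction` and the layer widths are hypothetical, picked only for illustration, and are not taken from any particular model.

```python
def bias_fraction(d_in: int, d_out: int) -> float:
    """Fraction of a dense layer's parameters that are biases."""
    weights = d_in * d_out  # one weight per (input, output) pair
    biases = d_out          # one bias per output unit
    return biases / (weights + biases)


# Hypothetical square layer widths, chosen only for illustration.
for width in (768, 4096, 12288):
    print(f"width {width}: biases are {bias_fraction(width, width):.6%} of parameters")
```

Under that assumption the fraction simplifies to 1 / (d_in + 1), so it shrinks as the layer gets wider, and squeezing the biases harder than the weights would save very little memory.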