INT4 LoRA fine-tuning vs QLoRA: A user asked how INT4 LoRA fine-tuning and QLoRA differ in precision and speed. Another member explained that QLoRA with HQQ requires the quantized weights to be frozen, does not use tinygemm, and instead dequantizes the weights and calls torch.matmul.
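
To make the "dequantize, then torch.matmul" path concrete, here is a minimal sketch of a QLoRA-style linear layer. It is an illustration under stated assumptions, not HQQ's or torchao's actual code: `quantize_int4`, `dequantize_int4`, and `QLoRALinearSketch` are hypothetical names, the quantizer is plain min/max rounding rather than HQQ's optimization, and the 4-bit values are stored one per `uint8` instead of being packed.

```python
import torch


def quantize_int4(w, group_size=64):
    """Asymmetric 4-bit group-wise quantization (plain min/max, NOT HQQ's solver)."""
    g = w.reshape(-1, group_size)                      # one row per group
    w_min = g.min(dim=1, keepdim=True).values          # zero-point per group
    w_max = g.max(dim=1, keepdim=True).values
    scale = (w_max - w_min).clamp(min=1e-8) / 15.0     # 4 bits -> 16 levels (0..15)
    q = torch.round((g - w_min) / scale).clamp(0, 15).to(torch.uint8)
    return q, scale, w_min


def dequantize_int4(q, scale, zero, shape):
    """Rebuild a floating-point weight matrix from 4-bit codes."""
    return (q.to(scale.dtype) * scale + zero).reshape(shape)


class QLoRALinearSketch(torch.nn.Module):
    """Frozen quantized base weight plus a trainable low-rank (LoRA) update."""

    def __init__(self, weight, rank=8, alpha=16.0, group_size=64):
        super().__init__()
        q, scale, zero = quantize_int4(weight, group_size)
        # Buffers, not Parameters: the quantized weights receive no gradients.
        self.register_buffer("q", q)
        self.register_buffer("scale", scale)
        self.register_buffer("zero", zero)
        self.shape = tuple(weight.shape)
        out_features, in_features = self.shape
        # Only the LoRA factors are trainable; B starts at zero as in LoRA.
        self.lora_a = torch.nn.Parameter(torch.randn(rank, in_features) * 0.01)
        self.lora_b = torch.nn.Parameter(torch.zeros(out_features, rank))
        self.scaling = alpha / rank

    def forward(self, x):
        # Dequantize on the fly, then use an ordinary dense matmul.
        w = dequantize_int4(self.q, self.scale, self.zero, self.shape)
        base = torch.matmul(x, w.t())
        update = torch.matmul(torch.matmul(x, self.lora_a.t()), self.lora_b.t())
        return base + self.scaling * update


torch.manual_seed(0)
weight = torch.randn(32, 128)          # stand-in for a pretrained weight
layer = QLoRALinearSketch(weight)
x = torch.randn(4, 128)
y = layer(x)
y.sum().backward()                     # gradients reach only lora_a / lora_b
```

In this sketch the forward pass pays for a dequantization on every call and then runs a regular floating-point matmul, which is the trade-off the explanation above points at when it contrasts this path with a dedicated INT4 kernel.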