paragekbote/gemma3-torchao-quant-sparse

A swift setup of gemma-3-4b with INT8 weight-only quantization and sparsity for efficient inference.

Public

70 runs

License

GitHub

Weights