paragekbote/gemma3-torchao-quant-sparse

A swift setup of gemma-3-4b with INT8 weight-only quantization and sparsity for efficient inference.

Public
68 runs
  1. 396049cb

    Latest
  2. Author
    @paragekbote
  3. Author
    @paragekbote
  4. Author
    @paragekbote
  5. Author
    @paragekbote