adirik/mamba-370m

Base version of Mamba 370M, a 370 million parameter state space language model

Public
53 runs

Want to make some of these yourself?

Run this model