Mamba-3

1 comment

I would love to see a purely Mamba-based 120B model, and whether it would outcompete the open-weights OpenAI model.