The mamba paper Diaries
This model inherits from PreTrainedModel. Examine the superclass documentation for your generic solutions the library implements for all its design (like downloading or preserving, resizing the enter embeddings, pruning heads This dedicate doesn't belong to any department on this repository, and may belong to some fork beyond the repository. arX