Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Try with RWKV models / MAMBA #60

Open
thiswillbeyourgithub opened this issue Jan 2, 2025 · 0 comments
Open

Try with RWKV models / MAMBA #60

thiswillbeyourgithub opened this issue Jan 2, 2025 · 0 comments

Comments

@thiswillbeyourgithub
Copy link

Hi,

As the rwkv models and mamba architectures are decently well known now, and huggingface compatible I was thinking that maybe there were some low hanging fruits regarding steerability of those models via repeng.

Has anyone tried or is there a reason this is not possible? The inner details of those architecture are somewhat beyond me but the idea of injecting 1D activations is somewhat universal still

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant