Low-rank adaptation (LoRA) was originally proposed for fine-tuning: it reparameterizes the weight update as the product of two lower-dimensional matrices A and B. In pre-training, however, plain low-rank adaptation does not reach the performance of full-parameter training, and recent works have extended LoRA to close this gap. LoRA-The-Explorer (LTE) trains multiple LoRA heads in parallel, while GaLore projects gradients into lower-dimensional matrices and updates weights in a smaller subspace. In this project, we applied LTE and GaLore to foundation models for language and vision tasks to validate their effectiveness (a short sketch of both ideas follows the results below). Our key results are as follows:
- Using multiple LoRA heads with a small rank performs on par with, or slightly better than, full-parameter training of a GPT model.
- Fine-tuning vision transformers (ViTs) with LoRA achieves better accuracy in fewer epochs than full-parameter fine-tuning.
- Applying LoRA to the attention layers is the most effective placement and strikes a good balance between model performance and the number of trainable parameters.
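As a rough illustration of the methods summarized above, the sketch below reparameterizes a frozen linear layer with one or more low-rank A/B pairs: a single pair is plain LoRA, and training several pairs in parallel and periodically merging them into the frozen weight is the core idea of LTE. The class name, initialization constants, and head-averaging scheme are our own simplifications for illustration and are not taken from the LTE code used in this repository.

```python
import torch
import torch.nn as nn

class MultiHeadLoRALinear(nn.Module):
    """Frozen linear layer plus N parallel low-rank adapters.

    With num_heads=1 this is plain LoRA (y = x W^T + scale * x (BA)^T);
    with several heads it approximates the parallel-adapter setup used by LTE.
    """

    def __init__(self, base: nn.Linear, rank: int = 4, num_heads: int = 1, alpha: float = 8.0):
        super().__init__()
        self.base = base
        self.base.weight.requires_grad_(False)      # pretrained weight stays frozen
        if self.base.bias is not None:
            self.base.bias.requires_grad_(False)
        self.scale = alpha / rank
        in_f, out_f = base.in_features, base.out_features
        # A starts small and random, B starts at zero, so training begins at the base model.
        self.A = nn.ParameterList([nn.Parameter(torch.randn(rank, in_f) * 0.01)
                                   for _ in range(num_heads)])
        self.B = nn.ParameterList([nn.Parameter(torch.zeros(out_f, rank))
                                   for _ in range(num_heads)])

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        y = self.base(x)
        for A, B in zip(self.A, self.B):
            # Each head contributes a rank-r correction; heads are averaged.
            y = y + self.scale * (x @ A.t() @ B.t()) / len(self.A)
        return y

    @torch.no_grad()
    def merge_heads(self) -> None:
        """Fold the averaged low-rank updates into the frozen weight and reset them.
        LTE performs a merge of this kind periodically during pre-training."""
        for A, B in zip(self.A, self.B):
            self.base.weight += self.scale * (B @ A) / len(self.A)
            B.zero_()
```

In line with the third result above, adapters like this would typically wrap only the attention projections (query, key, value, and output linear layers), leaving the rest of the network frozen.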
Sources:
Initial setup (also see source repo):
- Run python3 data/cnn_dailymail/prepare.py to prepare the dataset.
- Run python3 train_gpt_lte.py to start training.
Sources:
Dataset:
Source code:
- meloravit.py: LoRA-ViT + MeLo + GaLore optimizer (a simplified sketch of the GaLore update appears below).
- ltegalorevit.py: combined version of LTE and GaLore that we got mostly working.
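For context on the GaLore optimizer referenced above: its core idea is to project each 2-D weight gradient onto a low-rank subspace obtained from an SVD of the gradient (refreshed every few hundred steps), keep the optimizer state in that small subspace, and project the update back before applying it. The snippet below is a deliberately simplified, single-matrix sketch with a plain SGD step in place of the Adam statistics used in the paper; the function name and hyperparameter defaults are ours, not the official GaLore implementation.

```python
from typing import Optional
import torch

def galore_step(weight: torch.Tensor, grad: torch.Tensor, proj: Optional[torch.Tensor],
                step: int, rank: int = 8, update_gap: int = 200, lr: float = 1e-3) -> torch.Tensor:
    """One simplified GaLore-style update for a 2-D weight (SGD instead of Adam).

    Returns the current projection matrix so the caller can reuse it between steps.
    """
    # Refresh the low-rank subspace every `update_gap` steps from the gradient's SVD.
    if proj is None or step % update_gap == 0:
        U, _, _ = torch.linalg.svd(grad, full_matrices=False)
        proj = U[:, :rank]                 # (out_features, rank) orthonormal basis
    low_rank_grad = proj.t() @ grad        # project into the subspace: (rank, in_features)
    # In the real optimizer, Adam moments would be stored at this (rank, in_features) size,
    # which is where the memory savings come from.
    full_grad = proj @ low_rank_grad       # project back to the full weight shape
    with torch.no_grad():
        weight -= lr * full_grad           # plain SGD step for illustration
    return proj
```

A training loop would call something like this once per weight matrix after the backward pass, passing back the returned projection so the SVD is only recomputed every `update_gap` steps.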
- LoRA: Low-rank adaptation of large language models. E. J. Hu, Y. Shen, P. Wallis, Z. Allen-Zhu, Y. Li, S. Wang, L. Wang, and W. Chen. arXiv preprint arXiv:2106.09685, 2021.
- Training neural networks from scratch with parallel low-rank adapters. M. Huh, B. Cheung, J. Bernstein, P. Isola, and P. Agrawal. arXiv preprint arXiv:2402.16828, 2024.
- GaLore: Memory-efficient LLM training by gradient low-rank projection. J. Zhao, Z. Zhang, B. Chen, Z. Wang, A. Anandkumar, and Y. Tian. arXiv preprint arXiv:2403.03507, 2024.