Please use this identifier to cite or link to this item:
https://dspace.iiti.ac.in/handle/123456789/17516
| Title: | VectorFit : Adaptive Singular & Bias Vector Fine-Tuning of Pre-Trained Foundation Models |
| Authors: | Hegde, Suhas Kaur, Shilpy Tiwari, Aruna |
| Issue Date: | 2025 |
| Publisher: | IOS Press BV |
| Citation: | Hegde, S., Kaur, S., & Tiwari, A. (2025). VectorFit: Adaptive Singular & Bias Vector Fine-Tuning of Pre-Trained Foundation Models. In I. Lynce, N. Murano, M. Vallati, S. Villata, F. Chesani, M. Milano, A. Omicini, & M. Dastani (Eds.), Front. Artif. Intell. Appl. (Vol. 413, pp. 4522–4529). IOS Press BV Scopus. https://doi.org/10.3233/FAIA251353 |
| Abstract: | Popular PEFT methods reduce trainable parameter count for fine-tuning by parameterizing new low-rank or sparse trainable weights in parallel to the frozen pre-trained weights W. However, these weights are trained from scratch, and there exists a performance gap between these methods and full fine-tuning, especially in low-budget settings. We introduce VectorFit, a new way of parameterization that efficiently utilizes the existing knowledge embedded in W by adaptively training their singular vectors and biases. We show that utilizing the structural and transformational properties of W in this way can lead to high-rank incremental weight matrices ?W, comparable to that of full fine-tuning. VectorFit delivers superior results with 9× fewer trainable parameters than the leading PEFT methods. Through comprehensive experiments across 19 datasets covering a wide range of language and vision tasks such as natural language understanding and generation, question answering, image classification, and image generation, we demonstrate that VectorFit surpasses baselines in terms of performance as a function of parameter-efficiency. © 2025 The Authors. |
| URI: | https://dx.doi.org/10.3233/FAIA251353 https://dspace.iiti.ac.in:8080/jspui/handle/123456789/17516 |
| ISBN: | 978-1614993605 9781643685830 9781586038311 9781614994183 9781614999409 9781607507987 1586035770 9781643685694 9781643685427 9781607500490 |
| ISSN: | 09226389 1879-8314 |
| Type of Material: | Conference Paper |
| Appears in Collections: | Department of Computer Science and Engineering |
Files in This Item:
There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
Altmetric Badge: