Skip to content

thisisiron/QFormer_Pretraining

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

45 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Pre-training Q-Former

This repository contains code for pre-training Q-Former using the transformers library. The code supports training converting pre-trained LAVIS BLIP-2 models to the PyTorch transformers format.

Features

  • Pre-train Q-Former from scratch using transformers library
  • Convert LAVIS BLIP-2 Q-Former weights to transformers format

Usage

From LAVIS BLIP-2

To run the script for pre-training Q-Former from lavis, use the following command:

Citation

@article{blip2,
    title={BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models},
    author={Junnan Li and Dongxu Li and Silvio Savarese and Steven Hoi},
    journal={arXiv:2301.12597},
    year={2023}
}

Releases

No releases published

Packages

No packages published

Languages