ShareGPT4V Model Card

Model details

Model type: This is the vision tower of ShareGPT4V-13B fine-tuned with our ShareGPT4V dataset.

Model date: This vision tower was trained in Nov 2023.

Paper or resources for more information: [Project] [Paper] [Code]

License

Intended use

Primary intended uses: The primary use of this vision tower is research on large multimodal models and chatbots.

Primary intended users: The primary intended users of the model are researchers and hobbyists in computer vision, natural language processing, machine learning, and artificial intelligence.

Training dataset

1.2M high-quality image-text pairs

Downloads last month: 74

Paper for Lin-Chen/ShareGPT4V-13B_Pretrained_vit-large336-l12

ShareGPT4V: Improving Large Multi-Modal Models with Better Captions

Paper • 2311.12793 • Published Nov 21, 2023 • 18