Instructions to use microsoft/git-base-textcaps with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use microsoft/git-base-textcaps with Transformers:
# Use a pipeline as a high-level helper
# Warning: Pipeline type "image-to-text" is no longer supported in transformers v5.
# You must load the model directly (see below) or downgrade to v4.x with:
# pip install "transformers<5.0.0"
from transformers import pipeline

pipe = pipeline("image-to-text", model="microsoft/git-base-textcaps")

# Load model directly
from transformers import AutoProcessor, AutoModelForImageTextToText

processor = AutoProcessor.from_pretrained("microsoft/git-base-textcaps")
model = AutoModelForImageTextToText.from_pretrained("microsoft/git-base-textcaps")
- Notebooks
- Google Colab
- Kaggle
Update README.md
README.md
CHANGED
@@ -3,8 +3,8 @@ language: en
 license: mit
 tags:
 - vision
-- image-to-text
 model_name: microsoft/git-base-textcaps
+pipeline_tag: image-to-text
 ---

 # GIT (GenerativeImage2Text), base-sized, fine-tuned on TextCaps
@@ -63,4 +63,4 @@ During validation, one resizes the shorter edge of each image, after which cente

 ## Evaluation results

-For evaluation results, we refer readers to the [paper](https://arxiv.org/abs/2205.14100).
+For evaluation results, we refer readers to the [paper](https://arxiv.org/abs/2205.14100).
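The metadata change in this commit moves `image-to-text` out of the `tags:` list and into a top-level `pipeline_tag:` key in the README front matter. The following is a minimal sketch of that transformation in plain Python, not the tooling that produced the commit. The helper name `promote_task_tag` is hypothetical, the sample front matter assumes the README opens with `---` followed by `language: en` (inferred from the hunk header), and the sketch uses simple line matching rather than a YAML parser.

```python
def promote_task_tag(readme: str, task: str = "image-to-text") -> str:
    """Hypothetical helper: drop `- <task>` from the front matter and
    add a top-level `pipeline_tag: <task>` key just before the closing `---`."""
    lines = readme.split("\n")
    # Front matter must start on the first line with `---`.
    if not lines or lines[0].strip() != "---":
        return readme
    # Find the closing `---` of the front matter.
    end = next((i for i in range(1, len(lines)) if lines[i].strip() == "---"), None)
    if end is None:
        return readme
    # Simplification: removes any list item equal to the task, wherever it
    # appears in the front matter (not only under `tags:`).
    header = [ln for ln in lines[1:end] if ln.strip() != f"- {task}"]
    # Add the key only if it is not already present.
    if not any(ln.startswith("pipeline_tag:") for ln in header):
        header.append(f"pipeline_tag: {task}")
    return "\n".join([lines[0], *header, *lines[end:]])


# Sample input: the first two lines are an assumption, the rest follows the diff.
OLD = """---
language: en
license: mit
tags:
- vision
- image-to-text
model_name: microsoft/git-base-textcaps
---

# GIT (GenerativeImage2Text), base-sized, fine-tuned on TextCaps
"""

NEW = promote_task_tag(OLD)
```

Applied to `OLD`, the result keeps `- vision` under `tags:` and places `pipeline_tag: image-to-text` directly after `model_name`, matching the first hunk above. A real tool would more likely parse the front matter as YAML so that an emptied `tags:` list or differently indented items are handled correctly.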