paligemma source for verification 2026-05-22
This commit is contained in:
18
README.md
Normal file
18
README.md
Normal file
@@ -0,0 +1,18 @@
|
|||||||
|
# PaliGemma - Google Vision-Language Model
|
||||||
|
|
||||||
|
Source: https://huggingface.co/google/paligemma2-3b-ft-docci-448
|
||||||
|
License: gemma
|
||||||
|
Type: Vision-Language Model (VLM)
|
||||||
|
|
||||||
|
## Installation
|
||||||
|
pip install transformers accelerate pillow
|
||||||
|
|
||||||
|
## Usage
|
||||||
|
```python
|
||||||
|
from transformers import PaliGemmaProcessor, PaliGemmaForConditionalGeneration
|
||||||
|
model = PaliGemmaForConditionalGeneration.from_pretrained("google/paligemma2-3b-ft-docci-448")
|
||||||
|
```
|
||||||
|
|
||||||
|
## Model Weights
|
||||||
|
Downloaded automatically via HuggingFace Hub on first use.
|
||||||
|
Cached at: ~/.cache/huggingface/hub/models--google--paligemma2-3b-ft-docci-448/
|
||||||
Reference in New Issue
Block a user