Loading…
Saved in:
Publication Year: | 2024 |
---|---|
Subject Terms: |
Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence, Computer Science - Computation and L
|
Description: |
We introduce FlexCap, a vision-language model that generates region-specific descriptions of varying lengths. FlexCap is trained to produce
|
Database: | arXiv |