Image Comprehension
3 papers with code • 0 benchmarks • 1 datasets
This task has no description! Would you like to contribute one?
Benchmarks
These leaderboards are used to track progress in Image Comprehension
No evaluation results yet. Help compare methods by
submitting
evaluation metrics.
Most implemented papers
Hierarchical Open-vocabulary Universal Image Segmentation
Open-vocabulary image segmentation aims to partition an image into semantic regions according to arbitrary text descriptions.
RegionBLIP: A Unified Multi-modal Pre-training Framework for Holistic and Regional Comprehension
To this end, we propose to extract features corresponding to regional objects as soft prompts for LLM, which provides a straightforward and scalable approach and eliminates the need for LLM fine-tuning.
InternLM-XComposer: A Vision-Language Large Model for Advanced Text-image Comprehension and Composition
We propose InternLM-XComposer, a vision-language large model that enables advanced image-text comprehension and composition.