Answer Questions About Images
VQA models.
Methods
Bottom-up attention. Show-ask-tell. Transformer-based.
Datasets
VQA. Visual Genome. CLEVR.
Applications
Accessibility. Education. Image search.
Key Takeaways
- Combine image and text
- Visual reasoning
- Accessible applications