Questions
- DALL-E 2 classifies images. (True/False)
- ViT classifies images. (True/False)
- BERT was initially designed to generate images. (True/False)
- CLIP is an image-clipping application. (True/False)
- BERT uses CLIP to identify images. (True/False)
- DALL-E 3 cannot be accessed through an API. (True/False)
- Gradio is a transformer model. (True/False)
- ViT can assign an image to a class that is not in its list of labels. (True/False)
- ViT requires a prompt to respond. (True/False)
- GPT-4V will most likely evolve into a more broadly multimodal system. (True/False)