Questions
- A zero-shot method trains the parameters once. (True/False)
- Gradient updates are performed when running zero-shot models. (True/False)
- GPT models only have a decoder stack. (True/False)
- It is impossible to train a 117M-parameter GPT model on a local machine. (True/False)
- It is impossible to train the GPT-2 model with a specific dataset. (True/False)
- A GPT-2 model cannot be conditioned to generate text. (True/False)
- A GPT-2 model can analyze the context of an input and produce completion content. (True/False)
- We cannot interact with a 345M-parameter GPT model on a machine with fewer than 8 GPUs. (True/False)
- Supercomputers with 285,000 CPU cores do not exist. (True/False)
- Supercomputers with thousands of GPUs are game-changers in AI. (True/False)