Questions
- Why is it important to convert datasets into a specific format for Detectron2?
- It is hard to directly perform a regression of the number of people in an image. What is the key insight that allowed the VGG architecture to perform crowd counting?
- Explain self-supervision in the case of image colorization.
- How did we convert a 3D point cloud into an image that is compatible with YOLO?
- What is a simple way to handle videos using architectures that work only with images?
 
