Computer Vision: Python OCR and Object Detection Quick Starter [Video]

By Abhilash Nelson
About this video

This course is a quick starter for anyone who wants to explore optical character recognition (OCR), image recognition, object detection, and object recognition using Python without having to deal with all the complexities and mathematics associated with a typical deep learning process. Starting with an introduction to OCR technology, you'll get your system ready for Python coding by installing Anaconda packages and the necessary libraries and dependencies.

As you advance, you'll work with convolutional neural networks (CNNs), the Keras library, and pre-trained models such as VGGNet 16 and VGGNet 19 to perform image recognition on sample images. The course then focuses on object recognition and shows you how to use the MobileNet-SSD and Mask R-CNN pre-trained models to detect and label objects in live video from the computer's webcam as well as in saved video. Toward the end, you'll learn how the YOLO model and its lite version, Tiny YOLO, speed up the process of detecting objects in a single image.

By the end of the course, you'll have developed a solid understanding of OCR and the methods involved and gained the confidence to perform optical character recognition using Python with ease.

All resources and code files for this course are placed here:

Publication date: October 2020
Duration: 4 hours 41 minutes

About the Author
  • Abhilash Nelson

    Abhilash Nelson is a pioneering, talented, and security-oriented Android/iOS mobile and PHP/Python web application developer with more than 8 years of IT experience in designing, implementing, integrating, testing, and supporting impactful web and mobile applications. He has a master's degree in computer science and engineering, and his PHP/Python programming experience is an added advantage when building server-based Android and iOS client applications. Abhilash is currently a senior solution architect who manages projects from start to finish to ensure high-quality, innovative, and functional design.
