You're reading from Hands-On Markov Models with Python

Product typeBook

Published inSep 2018

Reading LevelIntermediate

PublisherPackt

ISBN-139781788625449

Edition1st Edition

Languages

Python

Concepts

Statistics

Authors (2):

Ankur Ankan

Abinash Panda

View More author details

Natural Language Processing

Automatic speech recognition has a lot of potential applications, such as audio transcription, dictation, audio search, and virtual assistants. I am sure that everyone has interacted with at least one of the virtual assistants by now, be it Apple's Siri, Amazon's Alexa, or Google's Assistant. At the core of all these speech recognition systems are a set of statistical models over the different words or sounds in a language. And since speech has a temporal structure, HMMs are the most natural framework to model it.

HMMs are virtually at the core of all speech recognition systems and the core concepts in modeling haven't changed much in a long time. But over time, a lot of sophisticated techniques have been developed to build better systems. In the following sections, we will try to cover the main concepts leading to the development...

Part-of-speech tagging

The first problem that we will look into is known as part-of-speech tagging (POS tagging). According to Wikipedia, POS tagging, also known as grammatical tagging or word-category disambiguation, is the process of marking up a word in a text as corresponding to a particular part of speech based on both its definition and its context, that is, its relationship with adjacent and related words in a phrase, sentence, or paragraph. A simpler version of this, which is usually taught in schools, is classifying words as noun, verbs, adjectives, and so on.

POS tagging is not as easy as it sounds because the same word can take different parts of speech in different contexts. A simple example of this is the word dogs. The word dogs is usually considered a noun, but in the following sentence, it acts like a verb:

The sailor dogs the hatch.

Correct grammatical tagging...

Speech recognition

In the 1950s, Bell Labs was the pioneer in speech recognition. The early designed systems were limited to a single speaker and had a very limited vocabulary. After around 70 years of work, the current speech-recognition systems are able to work with speech from multiple speakers and can recognize thousands of words in multiple languages. A detailed discussion of all the techniques used is beyond the scope of this book as enough work has been done on each technique to have a book on itself.

But the general workflow for a speech-recognition system is to first capture the audio by converting the physical sound into an electrical signal using a microphone. The electrical signal generated by the microphone is analog and needs to be converted to a digital form for storage and processing, for which analog-to-digital converters are used. Once we have the speech in digital...

Summary

In this chapter, we looked into two of the major applications of HMMs: POS tagging and speech recognition. We coded the POS tagger using a most-frequent tag algorithm and used the pomegranate package to build one based on HMM. We compared the performance using both these methods and saw that an HMM-based approach outperforms the most-frequent tag method. Then, we used the SpeechRecognition package to transcribe audio to text using Google's Web Speech API. We looked into using the package with both audio files and live audio from a microphone.

In the next chapter, we will explore more applications of HMMs, specifically in the field of image recognition.

The rest of the chapter is locked

You have been reading a chapter from

Hands-On Markov Models with Python

Published in: Sep 2018Publisher: PacktISBN-13: 9781788625449

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

undefined

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at $15.99/month. Cancel anytime

Authors (2)

Ankur Ankan

Ankur Ankan is a BTech graduate from IIT (BHU), Varanasi. He is currently working in the field of data science. He is an open source enthusiast and his major work includes starting pgmpy with four other members. In his free time, he likes to participate in Kaggle competitions.
Read more about Ankur Ankan

Abinash Panda

Abinash Panda has been a data scientist for more than 4 years. He has worked at multiple early-stage start-ups and helped them build their data analytics pipelines. He loves to munge, plot, and analyze data. He has been a speaker at Python conferences. These days, he is busy co-founding a start-up. He has contributed to books on probabilistic graphical models by Packt Publishing.
Read more about Abinash Panda

Other recommended products

Related to this chapter

Statistics Crash Course for Beginners

Through both theoretical and practical study with Python, this course will get you up to speed with all you need to know about statistics in programming—a core study of machine learning.

BookMar 2021329 pages

Hands-On Reinforcement Learning with R

Reinforcement Learning is an exciting part of machine learning. It has uses in technology from autonomous cars to game playing, and creates algorithms that can adapt to environmental changes. This book helps to understand how to implement RL with R, and explores interesting practical examples, such as using tabular Q-learning to control robots.

BookDec 2019362 pages

Mastering Machine Learning Algorithms

This book is your guide to quickly get to grips with the most widely used machine learning algorithms. As a data science professional, this book will help you design and train better machine learning models to solve a variety of complex problems, and make the machine learn your requirements.

BookMay 2018576 pages

Bayesian Analysis with Python

Bayesian inference uses probability distributions and Bayes' theorem to build flexible models. The book uses PyMC3 to abstract all the mathematical and computational details from this process allowing readers to solve a wide range of problems in data science.

BookDec 2018356 pages4

Scala for Machine Learning

Scala is becoming the language of choice for software engineers and data scientists who analyze large data sets. This trend has been reinforced by the wide acceptance of Scala based frameworks such as Apache Spark and Kafka. As a functional language, Scala is particularly suited for extracting knowledge through supervised, unsupervised or reinforcement learning techniques. Being object-oriented, Scala ensured the construction of robust and maintainable software solutions. This book introduces the most common used machine learning models as implemented in Scala

BookSep 2017740 pages

Mastering Java Machine Learning

Master key Java machine learning libraries and their applications with the help of real-world case studies. Explore advanced machine learning techniques such as anomaly detection, stream learning, active learning, semi-supervised learning, probabilistic graph modeling, text mining, deep learning, and big data batch and stream machine learning.

BookJul 2017556 pages

Mastering Machine Learning Algorithms

A new second edition of the bestselling guide to exploring and mastering the most important algorithms for solving complex machine learning problems, updated to include Python 3.8 and TensorFlow 2.x as well as the latest in new algorithms and techniques.

BookJan 2020798 pages

Mastering Reinforcement Learning with Python

This book focuses on expert-level explanations and implementations of scalable reinforcement learning algorithms and approaches. Starting with the fundamentals, the book covers state-of-the-art methods from bandit problems to meta-reinforcement learning. You’ll also explore practical examples inspired by real-life problems from the industry.

BookDec 2020544 pages

Mastering Predictive Analytics with R

R offers a free and open source environment that is perfect for both learning and deploying predictive modeling solutions in the real world. With its constantly growing community and plethora of packages, R offers the functionality to deal with a truly vast array of problems. Updated with revamped examples and to the latest version of R, this book is designed to be both a guide and a reference for moving beyond the basics of predictive modeling.

BookAug 2017448 pages

Python Deep Learning

The book will help you learn deep neural networks and their applications in computer vision, generative models, and natural language processing. It will also introduce you to the area of reinforcement learning, where you’ll learn the state-of-the-art algorithms to teach the machines how to play games like Go and Atari.

BookJan 2019386 pages

Personalised recommendations for you

Based on your interests and search pattern

Et al.

Ever wonder why speech recognition systems don't understand the Scottish accent, or what would happen if an astronaut only ate mac 'n' cheese, or other spurious reflections you'd have at a bar? We did, then collated those deliberations into absurd research articles with fake figures and methodologies inspired by even more fictionally absurd studies.

BookAug 2023230 pages5

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages4

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages5

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages1

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages5

Mastering Tableau 2023

This book is a comprehensive resource to mastering your Tableau skills and becoming a BI expert. As you progress, you will learn how to build advanced dashboards and improve your storytelling to derive key business insight, as well as make you well-versed with advanced functionalities of Tableau in the business intelligence domain.

BookAug 2023684 pages

Building AI Applications with ChatGPT APIs

This guide covers all ChatGPT API features for effortless creation of robust AI powered apps. With its help, you’ll be able to leverage ChatGPT’s cutting-edge NLP models to take your app development skills to the next level. You’ll also work on ten exciting projects that will give you the practical know-how that you can apply to your existing applications.

BookSep 2023258 pages5

Building AI Applications with ChatGPT APIs

This guide covers all ChatGPT API features for effortless creation of robust AI powered apps. With its help, you’ll be able to leverage ChatGPT’s cutting-edge NLP models to take your app development skills to the next level. You’ll also work on ten exciting projects that will give you the practical know-how that you can apply to your existing applications.

BookSep 2023258 pages2

Data Engineering with AWS

Embark on a journey to master data engineering pipelines on AWS! Our book offers a hands-on experience of AWS services for ingesting, transforming, and consuming data. Whether you're an absolute beginner or someone with basic data engineering experience, this guide is an indispensable resource.

BookOct 2023636 pages5

Modern Data Architecture on AWS

Every organization wants an agile, performant, and cost-effective data platform that meets all their current and future business needs. Purpose-built AWS analytics services and their features play a big part in building such a modern data platform. This book brings to you all the design and architectural patterns that’ll help you achieve this goal.

BookAug 2023420 pages5

Practical Guide to Applied Conformal Prediction in Python

Discover the power of Conformal Prediction with the "Practical Guide to Applied Conformal Prediction in Python." Master the latest techniques to quantify uncertainty in machine learning and computer vision models, and seamlessly apply them to your industry applications.

BookDec 2023240 pages

TinyML Cookbook

With over 70 project-based recipes, the TinyML Cookbook is a practical guide that will help you to get the most out of your microcontrollers. It provides a comprehensive understanding of the theoretical foundations while giving you hands-on experience training ML models for deployment on Arduino Nano 33 BLE Sense, Raspberry Pi Pico, and SparkFun RedBoard Artemis Nano microcontrollers.

BookNov 2023664 pages