Reader small image

You're reading from  Natural Language Understanding with Python

Product typeBook
Published inJun 2023
PublisherPackt
ISBN-139781804613429
Edition1st Edition
Right arrow
Author (1)
Deborah A. Dahl
Deborah A. Dahl
author image
Deborah A. Dahl

Deborah A. Dahl is the principal at Conversational Technologies, with over 30 years of experience in natural language understanding technology. She has developed numerous natural language processing systems for research, commercial, and government applications, including a system for NASA, and speech and natural language components on Android. She has taught over 20 workshops on natural language processing, consulted on many natural language processing applications for her customers, and written over 75 technical papers. Th is is Deborah's fourth book on natural language understanding topics. Deborah has a PhD in linguistics from the University of Minnesota and postdoctoral studies in cognitive science from the University of Pennsylvania.
Read more about Deborah A. Dahl

Right arrow

What is unsupervised learning?

The applications that we worked with in earlier chapters were based on data that was manually categorized by human annotators. For example, each review in the movie review corpus that we have used several times was read by a human annotator and assigned a category, positive or negative, based on the human’s opinion. The review-category pairs were then used to train models, using the machine learning algorithms that we previously learned about to categorize new reviews. This whole process is called supervised learning because the training process is, in effect, supervised by the training data. The training data labeled by humans is referred to as the gold standard or ground truth.

Supervised approaches have some disadvantages, however. The most obvious disadvantage is the cost of developing the ground-truth data because of the cost of human annotators. Another consideration is the possibility that the manual annotations from different annotators...

lock icon
The rest of the page is locked
Previous PageNext Page
You have been reading a chapter from
Natural Language Understanding with Python
Published in: Jun 2023Publisher: PacktISBN-13: 9781804613429

Author (1)

author image
Deborah A. Dahl

Deborah A. Dahl is the principal at Conversational Technologies, with over 30 years of experience in natural language understanding technology. She has developed numerous natural language processing systems for research, commercial, and government applications, including a system for NASA, and speech and natural language components on Android. She has taught over 20 workshops on natural language processing, consulted on many natural language processing applications for her customers, and written over 75 technical papers. Th is is Deborah's fourth book on natural language understanding topics. Deborah has a PhD in linguistics from the University of Minnesota and postdoctoral studies in cognitive science from the University of Pennsylvania.
Read more about Deborah A. Dahl