We've explored in this chapter how to use spaCy as part of our pipelines, and in particular how to extract POS-tags. We discussed what POS-tags are, and how they can be useful in different kinds of analysis. We soon moved on to training your own POS-tagger in spaCy and looked at different examples where we use POS-tags. We will now explore other spaCy functionalities such as NER-tagging and dependency parsing.
Chapter 1, What is Text Analysis, and Chapter 2, Python Tips for Text Analysis, introduced text analysis and Python, and Chapter 3, SpaCy's Language Models, and Chapter 4, Gensim - Vectorizing Text and Transformations and n-grams, helped us set-up our code for more advanced text analysis. This chapter will discuss the first of such advanced techniques – part of speech tagging, popularly called POS-tagging. We will study what parts of speech exist, how to identify them in our documents, and what possible uses these POS-tags have.
- What is POS-tagging?
- spaCy for POS-tagging
- Training your POS-tagger
- POS-tagging examples