Search icon
Arrow left icon
All Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Newsletters
Free Learning
Arrow right icon
Natural Language Processing and Computational Linguistics

You're reading from  Natural Language Processing and Computational Linguistics

Product type Book
Published in Jun 2018
Publisher Packt
ISBN-13 9781788838535
Pages 306 pages
Edition 1st Edition
Languages
Author (1):
Bhargav Srinivasa-Desikan Bhargav Srinivasa-Desikan
Profile icon Bhargav Srinivasa-Desikan

Table of Contents (22) Chapters

Title Page
Copyright and Credits
Packt Upsell
Contributors
Preface
What is Text Analysis? Python Tips for Text Analysis spaCy's Language Models Gensim – Vectorizing Text and Transformations and n-grams POS-Tagging and Its Applications NER-Tagging and Its Applications Dependency Parsing Topic Models Advanced Topic Modeling Clustering and Classifying Text Similarity Queries and Summarization Word2Vec, Doc2Vec, and Gensim Deep Learning for Text Keras and spaCy for Deep Learning Sentiment Analysis and ChatBots Other Books You May Enjoy Index

Exploring documents

Once we have our topic model of choice set up, we can use it to analyze our corpus, and also get some more insight into the nature of our topic models. While it is certainly useful to know what kind of topics are present in our dataset, to go one step further we should be able to, for example, cluster or classify our documents based on what topics they are made out of.

In our Jupyter notebook example from Chapter 8, Topic Models, let's start looking at document-topic proportions. What exactly are these? When we were looking at topics in the previous chapter, we were observing topic-word proportions - what are the odds of certain words appearing in certain topics. We previously mentioned that we assumed that documents are generated from topics - by identifying document-topic proportions, we can see exactly how the topics generated the documents.

So, do...

lock icon The rest of the chapter is locked
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $15.99/month. Cancel anytime}