Search icon
Subscription
0
Cart icon
Close icon
You have no products in your basket yet
Save more on your purchases!
Savings automatically calculated. No voucher code required
Arrow left icon
All Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Newsletters
Free Learning
Arrow right icon
Mastering Clojure Data Analysis

You're reading from  Mastering Clojure Data Analysis

Product type Book
Published in May 2014
Publisher
ISBN-13 9781783284139
Pages 340 pages
Edition 1st Edition
Languages
Author (1):
Eric Richard Rochester Eric Richard Rochester
Profile icon Eric Richard Rochester

Table of Contents (17) Chapters

Mastering Clojure Data Analysis
Credits
About the Author
About the Reviewers
www.PacktPub.com
Preface
1. Network Analysis – The Six Degrees of Kevin Bacon 2. GIS Analysis – Mapping Climate Change 3. Topic Modeling – Changing Concerns in the State of the Union Addresses 4. Classifying UFO Sightings 5. Benford's Law – Detecting Natural Progressions of Numbers 6. Sentiment Analysis – Categorizing Hotel Reviews 7. Null Hypothesis Tests – Analyzing Crime Data 8. A/B Testing – Statistical Experiments for the Web 9. Analyzing Social Data Participation 10. Modeling Stock Data Index

Chapter 3. Topic Modeling – Changing Concerns in the State of the Union Addresses

A huge source of data right now is the volumes of unstructured, natural-language data that's everywhere on the Internet. Think of all the news articles, blog posts, Twitter posts, and YouTube comments as well as the thousands of other ways that people can create and share textual content online. What they're saying may be important to you, and being able to track what subjects they are talking about is incredibly useful to become aware of the trends and conversations.

A tool to explore the information a group of text documents discusses is called topic modeling. This is a technique to identify the "topics" discussed in a collection of documents, although as we'll see, "topics" is defined a little differently here than it is in informal conversation. The strength of these models is that they don't assume that each document talks only about one thing. Instead, they model documents as collections of topics. This...

lock icon The rest of the chapter is locked
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $15.99/month. Cancel anytime}