Reader small image

You're reading from  Data Science for Web3

Product typeBook
Published inDec 2023
PublisherPackt
ISBN-139781837637546
Edition1st Edition
Concepts
Right arrow
Author (1)
Gabriela Castillo Areco
Gabriela Castillo Areco
author image
Gabriela Castillo Areco

Gabriela Castillo Areco holds an M.Sc. in big data science from the TECNUM School of Engineering, University of Navarra. With extensive experience in both the business and data facets of blockchain technology, Gabriela has undertaken roles as a data scientist, machine learning analyst, and blockchain consultant in both large corporations and small ventures. She served as a professor of new crypto businesses at Torcuato di Tella University and is currently a member of the BizOps data team at IOV Labs.
Read more about Gabriela Castillo Areco

Right arrow

Technical requirements

In this chapter, we’ll utilize tools from the libraries that were introduced in Chapter 7 – that is, scikit-learn and Keras. Additionally, we will employ NLTK, a Python library that proves valuable for working with human language data. NLTK includes a range of modules and functions that empower us to execute tasks such as tokenization, stemming, and part-of-speech tagging on our selected databases. This library streamlines the process of processing extensive text datasets so that they’re ready to be integrated with machine learning or deep learning models.

If you have not worked with NLTK before, it can be installed with the following code:

pip install nltk

The documentation for nltk can be found at https://www.nltk.org. Another essential library when handling text manipulation and cleaning is re, short for Regular Expression. A regular expression is a sequence of characters that defines a search pattern. Here’s an example...

lock icon
The rest of the page is locked
Previous PageNext Page
You have been reading a chapter from
Data Science for Web3
Published in: Dec 2023Publisher: PacktISBN-13: 9781837637546

Author (1)

author image
Gabriela Castillo Areco

Gabriela Castillo Areco holds an M.Sc. in big data science from the TECNUM School of Engineering, University of Navarra. With extensive experience in both the business and data facets of blockchain technology, Gabriela has undertaken roles as a data scientist, machine learning analyst, and blockchain consultant in both large corporations and small ventures. She served as a professor of new crypto businesses at Torcuato di Tella University and is currently a member of the BizOps data team at IOV Labs.
Read more about Gabriela Castillo Areco