
You're reading from Deep Learning with MXNet Cookbook

Product type: Book
Published in: Dec 2023
Reading level: Beginner
Publisher: Packt
ISBN-13: 9781800569607
Edition: 1st Edition
Author (1)
Andrés P. Torres

Andrés P. Torres is the Head of Perception at Oxa, a global leader in industrial autonomous vehicles, where he leads the design and development of state-of-the-art algorithms for autonomous driving. Previously, Andrés was an advisor and Head of AI at Maekersuite, an early-stage content generation startup, where he developed several AI-based algorithms for mobile phones and the web. Before that, he was a Software Development Manager at Amazon Prime Air, developing software to optimize operations for autonomous drones.

Understanding text datasets – loading, managing, and visualizing the Enron Email dataset

Another DL field that has grown considerably in recent years is natural language processing (NLP). As with computer vision (CV), this field aims to surpass human performance on real-world datasets.

In this recipe, we will explore one of the simplest NLP tasks: text classification. Given a set of sentences and paragraphs, our task is to assign each text the correct label from a given set of labels (classes).

One of the most classic text classification tasks is deciding whether a received email is spam or not (ham). Such datasets are binary text classification datasets: there are only two labels to assign, 0 and 1, or ham and spam.
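
As a minimal sketch of what such a binary dataset looks like in code, the snippet below pairs a few texts with ham/spam labels and encodes them as 0 and 1. The example emails, the `LABEL_TO_ID` mapping, and all variable names are hypothetical illustrations, not taken from the Enron data or from the recipe's own code.

```python
from collections import Counter

# Hypothetical toy examples; these are NOT taken from the Enron dataset.
emails = [
    ("Meeting moved to 3pm, see updated agenda attached", "ham"),
    ("WIN a FREE prize now, click here", "spam"),
    ("Please review the Q3 report before Friday", "ham"),
    ("Cheap meds, limited offer, buy now", "spam"),
]

# Binary label encoding (an assumed convention): ham -> 0, spam -> 1.
LABEL_TO_ID = {"ham": 0, "spam": 1}

# Split the (text, label) pairs into parallel lists.
texts = [text for text, _ in emails]
labels = [LABEL_TO_ID[label] for _, label in emails]

# Count how many examples fall into each class.
class_counts = Counter(labels)

print(labels)
print(dict(class_counts))
```

Counting the examples per class, as in the last step, is a cheap check worth running before training, since spam datasets are often imbalanced.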

In our specific scenario, we will use a real-world email dataset. This set of emails was made public by the US Government during the investigation of the Enron scandal in the early 2000s. The dataset was first published in 2004 and is composed of emails from ~150 users,...
