Imagine embarking on a journey through an expansive ocean of data, where untold stories, patterns, and insights wait to be discovered. Welcome to the world of data exploration in machine learning (ML). In this chapter, I encourage you to put on your analytical lenses as we delve deep into the heart of your data, armed with powerful techniques and heuristics, to uncover its secrets. Beneath the surface of raw numbers and statistics lies a treasure trove of patterns that, once revealed, can transform your data into a valuable asset. The journey begins with exploratory data analysis (EDA), a crucial phase in which we unravel the mysteries of the data, laying the foundation for automated labeling and, ultimately, for building smarter and more accurate ML models. In this age of generative AI, the preparation of quality training data is essential for fine-tuning domain-specific large language models (LLMs); fine-tuning involves curating additional domain-specific labeled data for training publicly available LLMs. So, fasten your seatbelt for a captivating voyage into the art and science of data exploration for data labeling.
First, let’s start with the question: What is data exploration? It is the initial phase of data analysis, where raw data is examined, visualized, and summarized to uncover patterns, trends, and insights. It serves as a crucial step in understanding the nature of the data before applying advanced analytics or ML techniques.
In this chapter, we will explore tabular data using various libraries and packages in Python, including Pandas, NumPy, and Seaborn. We will also plot different bar charts and histograms to visualize data to find the relationships between various features, which is useful for labeling data. We will be exploring the Income dataset located in this book’s GitHub repository (a link for which is located in the Technical requirements section). A good understanding of the data is necessary in order to define business rules, identify matching patterns, and, subsequently, label the data using Python labeling functions.
By the end of this chapter, we will be able to generate summary statistics for the given dataset. We will derive aggregates of the features for each target group. We will also learn how to perform univariate and bivariate analyses of the features in the given dataset. We will create a report using the ydata-profiling library.
We're going to cover the following main topics:

Exploring data using Pandas
Generating summary statistics
Performing univariate and bivariate analysis
Data profiling using the ydata-profiling library

One of the following Python IDEs and software tools needs to be installed before running the notebook in this chapter:
https://docs.anaconda.com/navigator/install/#system-requirements
The Python source code and the entire notebook created in this chapter are available in this book’s GitHub repository:
https://github.com/PacktPublishing/Data-Labeling-in-Machine-Learning-with-Python
You also need to create an Azure account and add an OpenAI resource for working with generative AI. To sign up for a free Azure subscription, visit https://azure.microsoft.com/free. To request access to the Azure OpenAI service, visit https://aka.ms/oaiapply.
Once you have provisioned the Azure OpenAI service, deploy an LLM – either GPT-3.5 Turbo or GPT-4 – from Azure OpenAI Studio. Then copy the keys for the OpenAI resource from Azure OpenAI Studio and set up the following environment variables:
os.environ['AZURE_OPENAI_KEY'] = 'your_api_key'
os.environ['AZURE_OPENAI_ENDPOINT'] = 'your_azure_openai_endpoint'
Your endpoint should look like this: https://YOUR_RESOURCE_NAME.openai.azure.com/.
In this section, we will gain an understanding of what EDA is. We will see why we need to perform it and discuss its advantages. We will also look at the life cycle of an ML project and learn about the role of data labeling in this cycle.
EDA comprises data discovery, data collection, data cleaning, and data exploration. These steps are part of any machine learning project. The data exploration step includes tasks such as data visualization, summary statistics, correlation analysis, and data distribution analysis. We will dive deep into these steps in the upcoming sections.
Here are some real-world examples of EDA:
EDA is a critical process for any ML or data science project, and it allows us to understand the data and gain some valuable insights into the data domain and business.
In this chapter, we will use various Python libraries, such as Pandas, and call the describe and info functions on Pandas to generate data summaries. We will discover anomalies in the data and any outliers in the given dataset. We will also figure out the various data types and any missing values in the data. We will understand whether any data type conversions are required, such as converting string to float, for performing further analysis. We will also analyze the data formats and see whether any transformations are required to standardize them, such as the date format. We will analyze the counts of different labels and understand whether the dataset is balanced or imbalanced. We will understand the relationships between various features in the data and calculate the correlations between features.
To summarize, we will understand the patterns in the given dataset and also identify the relationships between various features in the data samples. Finally, we will come up with a strategy and domain rules for data cleaning and transformation. This helps us to predict labels for unlabeled data.
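To make this concrete, here is a minimal sketch of what such a domain rule might look like as a Python labeling function. The toy DataFrame and the rule's thresholds are purely illustrative assumptions, not values derived from the actual Income dataset:

```python
import pandas as pd

# Toy stand-in for unlabeled data; all values are hypothetical
df_unlabeled = pd.DataFrame({
    "age": [25, 45, 52, 19],
    "hours.per.week": [30, 50, 60, 20],
})

def label_income(row):
    # Illustrative domain rule (an assumption, not derived from the dataset):
    # long working hours plus middle age suggest the higher income class
    if row["hours.per.week"] > 40 and row["age"] > 35:
        return ">50K"
    return "<=50K"

# Apply the rule to every row to produce a predicted label column
df_unlabeled["income_label"] = df_unlabeled.apply(label_income, axis=1)
print(df_unlabeled["income_label"].tolist())
```

In later chapters, rules like this one are refined using the patterns and correlations discovered during EDA.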
We will plot various data visualizations using Python libraries such as seaborn and matplotlib. We will create bar charts, histograms, heatmaps, and other charts to visualize the importance of features in the dataset and how they depend on each other.
The following are the major steps in an ML project:
Figure 1.1 – ML project life cycle diagram
Let’s look at them in detail.
The first step in every ML project is to understand the business problem and define clear goals that can be measured at the end of the project.
In this step, you identify and gather potential data sources that may be relevant to your project’s objectives. This involves finding datasets, databases, APIs, or any other sources that may contain the data needed for your analysis and modeling.
The goal of data discovery is to understand the landscape of available data and assess its quality, relevance, and potential limitations.
Data discovery can also involve discussions with domain experts and stakeholders to identify what data is essential for solving business problems or achieving the project’s goals.
After identifying the various data sources, data engineers develop data pipelines to extract the data and load it into the target data lake, and perform data preprocessing tasks such as data cleaning and de-duplication, making the data readily available to ML engineers and data scientists for further processing.
Data exploration follows data discovery and is primarily focused on understanding the data, gaining insights, and identifying patterns or anomalies.
During data exploration, you may perform basic statistical analysis, create data visualizations, and conduct initial observations to understand the data’s characteristics.
Data exploration can also involve identifying missing values, outliers, and potential data quality issues, but it typically does not involve making systematic changes to the data.
During data exploration, you assess the available labeled data and determine whether it’s sufficient for your ML task. If you find that the labeled data is small and insufficient for model training, you may identify the need for additional labeled data.
Data labeling involves acquiring or generating more labeled examples to supplement your training dataset. You may need to manually label additional data points or use programming techniques such as data augmentation to expand your labeled dataset. The process of assigning labels to data samples is called data annotation or data labeling.
Most of the time, it is too expensive or time-consuming to outsource the manual data labeling task. Also, data is often not allowed to be shared with external third-party organizations due to data privacy. So, automating the data labeling process with an in-house development team using Python helps to label the data quickly and at an affordable cost.
Most of the data science books available on the market lack information about this important step. So, this book aims to cover the various methods to programmatically label data using Python, as well as the annotation tools available on the market.
After obtaining a sufficient amount of labeled data, you proceed with traditional data preprocessing tasks, such as handling missing values, encoding features, scaling, and feature engineering.
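As a small illustration of one of these preprocessing tasks, the following is a hedged sketch of encoding a categorical feature into numeric indicator columns with Pandas. The toy column values are assumptions for demonstration only:

```python
import pandas as pd

# Toy categorical column; the values are illustrative
df_small = pd.DataFrame({"workclass": ["Private", "State-gov", "Private"]})

# One-hot encode the categorical feature into indicator columns
encoded = pd.get_dummies(df_small, columns=["workclass"])
print(encoded.columns.tolist())
```

Each distinct category becomes its own column, which most ML models can consume directly.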
Once the data is adequately prepared, then that dataset is fed into the model by ML engineers to train the model.
After the model is trained, the next step is to evaluate the model on a validation dataset to see how good the model is and avoid bias and overfitting.
You can evaluate the model’s performance using various metrics and techniques and iterate on the model-building process as needed.
Finally, you deploy your model into production and monitor for continuous improvement using ML Operations (MLOps). MLOps aims to streamline the process of taking ML models to production and maintaining and monitoring them.
In this book, we will focus on data labeling. In a real-world project, the datasets that sources provide us with for analytics and ML are not clean and not labeled. So, we need to explore unlabeled data to understand correlations and patterns and help us define the rules for data labeling using Python labeling functions. Data exploration helps us to understand the level of cleaning and transformation required before starting data labeling and model training.
This is where Python helps us to explore and perform a quick analysis of raw data using various libraries (such as Pandas, Seaborn, and ydata-profiling libraries), otherwise known as EDA.
Pandas is an open source library used for data analysis and manipulation. It provides various functions for data wrangling, cleaning, and merging operations. Let us see how to explore data using the pandas library. For this, we will use the Income dataset located on GitHub and explore it to find the following insights:
Let us first read the data into a DataFrame using the pandas library.
A DataFrame is a structure that represents two-dimensional data with columns and rows, and it is similar to a SQL table. To get started, ensure that you create the requirements.txt file and add the required Python libraries as follows:
Figure 1.2 – Contents of the requirements.txt file
Next, run the following command from your Python notebook cell to install the libraries added in the requirements.txt file:
%pip install -r requirements.txt
Now, let's import the required Python libraries using the following import statements:
# import libraries for loading dataset
import pandas as pd
import numpy as np

# import libraries for plotting
import matplotlib.pyplot as plt
import seaborn as sns
from matplotlib import rcParams
%matplotlib inline
plt.style.use('dark_background')

# ignore warnings
import warnings
warnings.filterwarnings('ignore')
Next, in the following code snippet, we read the adult_income.csv file into the DataFrame (df):
# loading the dataset
df = pd.read_csv("<your file path>/adult_income.csv", encoding='latin-1')
Now the data is loaded into df.
Let us see the size of the DataFrame using the following code snippet:
df.shape
We will see the shape of the DataFrame as a result:
Figure 1.3 – Shape of the DataFrame
So, we can see that there are 32,561 observations (rows) and 15 features (columns) in the dataset.
Let us print the 15 column names in the dataset:
df.columns
We get the following result:
Figure 1.4 – The names of the columns in our dataset
Now, let’s see the first five rows of the data in the dataset with the following code:
df.head()
We can see the output in Figure 1.5:
Figure 1.5 – The first five rows of data
Let's see the last five rows of the dataset using tail, as shown in the following figure:
df.tail()
We will get the following output.
Figure 1.6 – The last five rows of data
As we can see, education and education.num are redundant columns, as education.num is just the ordinal representation of the education column. So, we will remove the redundant education.num column from the dataset, as one column is enough for model training. We will also drop the race column from the dataset using the following code snippet, as we will not use it here:
# As we observe, education and education.num are the same, so we can drop one of them
df.drop(['education.num'], axis=1, inplace=True)
df.drop(['race'], axis=1, inplace=True)
Here, axis=1 refers to the column axis, which means that you are specifying that you want to drop a column. In this case, you are dropping the columns labeled education.num and race from the DataFrame.
Now, let's print the columns using info() to make sure the race and education.num columns have been dropped from the DataFrame:
df.info()
We will see the following output:
Figure 1.7 – Columns in the DataFrame
We can see in the preceding data there are now only 13 columns as we deleted 2 of them from the previous total of 15 columns.
In this section, we have seen what a Pandas DataFrame is and loaded a CSV dataset into one. We also saw the various columns in the DataFrame and their data types. In the following section, we will generate the summary statistics for the important features using Pandas.
In this section, we will derive the summary statistics for numerical columns.
Before generating summary statistics, we will identify the categorical columns and numerical columns in the dataset. Then, we will calculate the summary statistics for all numerical columns.
We will also calculate the mean value of each numerical column for the target class. Summary statistics are useful to gain insights about each feature’s mean values and their effect on the target label class.
Let's print the categorical columns using the following code snippet:
# categorical columns
categorical_column = [column for column in df.columns if df[column].dtypes == 'object']
print(categorical_column)
We will get the following result:
Figure 1.8 – Categorical columns
Now, let's print the numerical columns using the following code snippet:
# numerical columns
numerical_column = [column for column in df.columns if df[column].dtypes != 'object']
print(numerical_column)
We will get the following output:
Figure 1.9 – Numerical columns
Now, let’s generate summary statistics (i.e., mean, standard deviation, minimum value, maximum value, and lower (25%), middle (50%), and higher (75%) percentiles) using the following code snippet:
df.describe().T
We will get the following results:
Figure 1.10 – Summary statistics
As shown in the results, the mean value of age is 38.5 years, the minimum age is 17 years, and the maximum age is 90 years in the dataset. As we have only five numerical columns in the dataset, we can only see five rows in this summary statistics table.
Now, let’s calculate the average age of the people for each income group range using the following code snippet:
df.groupby("income")["age"].mean()
We will see the following output:
Figure 1.11 – Average age by income group
As shown in the results, we have used the groupby clause on the target variable and calculated the mean of the age in each group. The mean age is 36.78 for people with an income group of less than or equal to $50K. Similarly, the mean age is 44.2 for the income group greater than $50K.
Now, let’s calculate the average hours per week of the people for each income group range using the following code snippet:
df.groupby("income")["hours.per.week"].mean()
We will get the following output:
Figure 1.12 – Average hours per week by income group
As shown in the results, the average hours per week for the income group <= $50K is 38.8 hours. Similarly, the average hours per week for the income group > $50K is 45.47 hours.
Alternatively, we can write a generic, reusable function for calculating the mean of any numerical column grouped by a categorical column as follows:
def get_groupby_stats(categorical, numerical):
    groupby_df = df[[categorical, numerical]].groupby(categorical).mean().dropna()
    print(groupby_df.head())
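To see this helper in action, here is a self-contained sketch that defines it over a toy DataFrame and calls it (note that head() is a method and must be called with parentheses). The toy values are illustrative only:

```python
import pandas as pd

# Toy DataFrame mirroring the income/age columns of the dataset
df = pd.DataFrame({
    "income": ["<=50K", ">50K", "<=50K", ">50K"],
    "age": [30, 45, 40, 50],
})

def get_groupby_stats(categorical, numerical):
    # Mean of the numerical column for each category of the categorical column
    groupby_df = df[[categorical, numerical]].groupby(categorical).mean().dropna()
    print(groupby_df.head())  # head() is a method and must be called
    return groupby_df

stats = get_groupby_stats("income", "age")
```

Calling get_groupby_stats("income", "hours.per.week") on the real dataset would reproduce the earlier groupby result.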
If we want to get aggregations of multiple columns for each target income group, then we can calculate aggregations as follows:
columns_to_show = ["age", "hours.per.week"]
df.groupby(["income"])[columns_to_show].agg(['mean', 'std', 'max', 'min'])
We get the following results:
Figure 1.13 – Aggregations for multiple columns
As shown in the results, we have calculated the summary statistics for age and hours per week for each income group.
We learned how to calculate the aggregate values of features for the target group using reusable functions. This aggregate value gives us a correlation of those features for the target label value.
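One way to quantify such a correlation is to encode the target label as 0/1 and include it in a Pandas correlation matrix. The following is a minimal sketch on a toy DataFrame; the values are assumptions for illustration, not drawn from the real dataset:

```python
import pandas as pd

# Toy data; column names follow the chapter's dataset, values are invented
df_toy = pd.DataFrame({
    "income": ["<=50K", ">50K", "<=50K", ">50K", ">50K"],
    "age": [22, 45, 30, 50, 48],
    "hours.per.week": [25, 50, 35, 45, 60],
})

# Encode the target as 0/1 so it can join a numeric correlation matrix
df_toy["income_binary"] = (df_toy["income"] == ">50K").astype(int)
corr = df_toy[["age", "hours.per.week", "income_binary"]].corr()
print(corr["income_binary"])
```

Features with a correlation close to +1 or -1 against income_binary are strong candidates for labeling rules.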
In this section, we are going to explore each variable separately. We are going to summarize the data for each feature and analyze the pattern present in it.
Univariate analysis is an analysis using individual features. We will also perform a bivariate analysis later in this section.
Now, let us do a univariate analysis for the age, education, work class, hours per week, and occupation features.
First, let’s get the counts of unique values for each column using the following code snippet:
df.nunique()
Figure 1.14 – Unique values for each column
As shown in the results, there are 73 unique values for age, 9 unique values for workclass, 16 unique values for education, 15 unique values for occupation, and so on.
Now, let us see the unique value counts for age in the DataFrame:
df["age"].value_counts()
The result is as follows:
Figure 1.15 – Value counts for age
We can see in the results that there are 898 observations (rows) with the age of 36. Similarly, there are 6 observations with the age of 83.
Histograms are used to visualize the distribution of continuous data. Continuous data is data that can take on any value within a range (e.g., age, height, weight, temperature, etc.).
Let us plot a histogram using Seaborn to see the distribution of age in the dataset:
# univariate analysis
sns.histplot(data=df['age'], kde=True)
We get the following results:
Figure 1.16 – The histogram of age
As we can see in the age histogram, there are many people in the age range of 23 to 45 in the given observations in the dataset.
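If we want to turn this visual observation into numbers, one option is to bucket ages into ranges with pd.cut and count the observations per bucket. This is a small sketch on toy ages; the bin edges are an illustrative choice, not taken from the chapter:

```python
import pandas as pd

ages = pd.Series([18, 25, 33, 41, 55, 67, 80])  # toy ages

# Bucket ages into coarse ranges; the bin edges are an illustrative choice
bins = [0, 22, 45, 65, 120]
labels = ["<=22", "23-45", "46-65", "65+"]
age_groups = pd.cut(ages, bins=bins, labels=labels)
print(age_groups.value_counts().to_dict())
```

On the real dataset, the same binning would confirm that most observations fall in the 23-45 range.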
Now, let us check the distribution of education in the given dataset:
df['education'].value_counts()

Let us plot the bar chart for education:

colors = ["white", "red", "green", "blue", "orange", "yellow", "purple"]
df.education.value_counts().plot.bar(color=colors, legend=True)
Figure 1.17 – The bar chart of education
As we see, the HS-grad count is higher than that for the Bachelors degree holders. Similarly, the Masters degree holders count is lower than the Bachelors degree holders count.
Now, let's see the distribution of workclass in the dataset:
df['workclass'].value_counts()
Let's plot the bar chart to visualize the distribution of the different values of workclass:
Figure 1.18 – Bar chart of workclass
As shown in the workclass bar chart, there are more private-sector employees than any other kind.
Let's see the unique values for the income target variable and the distribution of income:
df['income'].value_counts()
The result is as follows:
Figure 1.19 – Distribution of income
As shown in the results, there are 24,720 observations with an income of less than or equal to $50K and 7,841 observations with an income greater than $50K. In the real world, most people have an income of $50K or less and only a smaller portion earn more than $50K, assuming the income is in US dollars and for 1 year. As this ratio closely reflects the real-world scenario, we do not need to balance the minority class dataset using synthetic data.
Figure 1.20 – Bar chart of income
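A quick numeric check of class balance can be done with value_counts(normalize=True), which returns proportions instead of raw counts. The following sketch uses a toy Series whose 76/24 split loosely mirrors the dataset's ratio:

```python
import pandas as pd

# Toy target column; the 76/24 split loosely mirrors the dataset's ratio
income = pd.Series(["<=50K"] * 76 + [">50K"] * 24)

# normalize=True turns counts into proportions, a quick class-balance check
proportions = income.value_counts(normalize=True)
print(proportions.to_dict())
```

Proportions make it immediately obvious how skewed the target classes are, regardless of dataset size.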
In this section, we have seen the size of the data, column names, and data types, and the first and last five rows of the dataset. We also dropped some unnecessary columns. We performed univariate analysis to see the unique value counts and plotted the bar charts and histograms to understand the distribution of values for important columns.
Let's do a bivariate analysis of age and income to find the relationship between them. Bivariate analysis is the analysis of two variables to find the relationship between them. We will plot a histogram using the Python Seaborn library to visualize the relationship between age and income:
# Bivariate analysis of age and income
sns.histplot(data=df, kde=True, x='age', hue='income')
The plot is as follows:
Figure 1.21 – Histogram of age with income
From the preceding histogram, we can see that income is more likely to be greater than $50K for the age group between 30 and 60. Similarly, for the age group under 30, income is mostly less than $50K.
Now let's plot the histogram to do a bivariate analysis of education and income:
#Bivariate Analysis of education and Income sns.histplot(data=df,y='education', hue='income',multiple="dodge");
Here is the plot:
Figure 1.22 – Histogram of education with income
From the preceding histogram, we can see that income is greater than $50K for the majority of adults with a Masters education. On the other hand, income is less than $50K for the majority of HS-grad adults.
Now, let's plot the histogram to do a bivariate analysis of workclass and income:
#Bivariate Analysis of work class and Income sns.histplot(data=df,y='workclass', hue='income',multiple="dodge");
We get the following plot:
Figure 1.23 – Histogram of workclass and income
From the preceding histogram, we can see that income is greater than $50K for Self-emp-inc adults. On the other hand, income is less than $50K for the majority of Private and Self-emp-not-inc employees.
Now let's plot the histogram to do a bivariate analysis of sex and income:
#Bivariate Analysis of Sex and Income sns.histplot(data=df,y='sex', hue='income',multiple="dodge");
Figure 1.24 – Histogram of sex and income
From the preceding histogram, we can see that income is more than $50K for a larger share of male adults, while income is less than $50K for most female employees.
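Alongside histograms, a row-normalized cross-tabulation is a compact way to compare income distributions across a categorical feature. Here is a minimal sketch on a toy sample; the rows are illustrative, not drawn from the real dataset:

```python
import pandas as pd

# Toy sample; column names follow the chapter's dataset, rows are invented
df_toy = pd.DataFrame({
    "sex": ["Male", "Male", "Female", "Female", "Male"],
    "income": [">50K", "<=50K", "<=50K", "<=50K", ">50K"],
})

# Row-normalized cross-tabulation: share of each income class per sex value
table = pd.crosstab(df_toy["sex"], df_toy["income"], normalize="index")
print(table)
```

Each row sums to 1, so the table directly shows the per-group income proportions that the histograms visualize.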
In this section, we have learned how to analyze data using the Seaborn visualization library.
Alternatively, we can explore data using the ydata-profiling library with a few lines of code.
In this section, let us explore the dataset and generate a profiling report with various statistics using the ydata-profiling library (https://docs.profiling.ydata.ai/4.5/). The ydata-profiling library is a Python library for easy EDA, profiling, and report generation.
Let us see how to use ydata-profiling for fast and efficient EDA:

1. Install the ydata-profiling library using pip as follows:

pip install ydata-profiling

2. Import ProfileReport from the library. Then, we can use Pandas profiling to generate reports:

from ydata_profiling import ProfileReport

3. Load the dataset into a DataFrame:

df = pd.read_csv('adult.csv', na_values=-999)

4. Run pip with the upgrade command to make sure we have the latest profiling library:

%pip install ydata-profiling --upgrade

5. Generate the report by creating a ProfileReport instance from the DataFrame and displaying it:

report = ProfileReport(df)
report
We can also generate the report using the profile_report() function on the Pandas DataFrame.
After running the preceding cell, all the data loaded in df will be analyzed and the report will be generated. The time taken to generate the report depends on the size of the dataset.
The output of the preceding cell is a report with sections. Let us understand the report that is generated.
The generated profiling report contains the following sections:
Under the Overview section in the report, there are three tabs:
As shown in the following figure, the Overview tab shows statistical information about the dataset – that is, the number of columns (number of variables) in the dataset; the number of rows (number of observations), duplicate rows, and missing cells; the percentage of duplicate rows and missing cells; and the number of Numeric and Categorical variables:
Figure 1.25 – Statistics of the dataset
The Alerts tab under Overview shows all the variables that are highly correlated with each other and the number of cells that have zero values, as follows:
Figure 1.26 – Alerts
The Reproduction tab under Overview shows the duration it took for the analysis to generate this report, as follows:
Figure 1.27 – Reproduction
Let us walk through the Variables section in the report.
Under the Variables section, we can select any variable in the dataset under the dropdown and see the statistical information about the dataset, such as the number of unique values for that variable, missing values for that variable, the size of that variable, and so on.
In the following figure, we selected the age variable in the dropdown and can see the statistics for that variable:
Figure 1.28 – Variables
As shown in the following figure, this report also contains the Interactions plot to show how one variable relates to another variable:
Figure 1.29 – Interactions
Now, let's see the Correlations section of the report. We can see the correlation between the various variables in the Heatmap tab, and the various correlation coefficients in the Table tab.
Figure 1.30 – Correlations
Heatmaps use color intensity to represent values. The colors typically range from cool to warm hues, with cool colors (e.g., blue or green) indicating low values and warm colors (e.g., red or orange) indicating high values. Rows and columns of the matrix are represented on both the x axis and y axis of the heatmap. Each cell at the intersection of a row and column represents a specific value in the data.
The color intensity of each cell corresponds to the magnitude of the value it represents. Darker colors indicate higher values, while lighter colors represent lower values.
As we can see in the preceding figure, the intersection cell between income and hours per week shows a high-intensity blue color, which indicates there is a high correlation between income and hours per week. Similarly, the intersection cell between income and capital gain shows a high-intensity blue color, indicating a high correlation between those two features.
This section of the report shows the counts of total values present within the data and provides a good understanding of whether there are any missing values.
Under Missing values, we can see two tabs:
In Figure 1.31, the Count tab shows that all variables have a count of 32,561, which is the count of rows (observations) in the dataset. That indicates that there are no missing values in the dataset.
Figure 1.31 – Missing values count
The following Matrix plot indicates where the missing values are (if there are any missing values in the dataset):
Figure 1.32 – Missing values matrix
This section shows the sample data for the first 10 rows and the last 10 rows in the dataset.
Figure 1.33 – Sample data
This section shows the most frequently occurring rows and the number of duplicates in the dataset.
Figure 1.34 – Duplicate rows
We have seen how to analyze the data using Pandas and how to visualize it by plotting various charts, such as bar charts and histograms, using Matplotlib, Seaborn, and ydata-profiling. Next, let us see how to perform data analysis using an OpenAI LLM and the LangChain Pandas DataFrame agent by asking questions in natural language.
Artificial intelligence is transforming how people analyze and interpret data. Exciting generative AI systems allow anyone to have natural conversations with their data, even if they have no coding or data science expertise. This democratization of data promises to uncover insights and patterns that may have previously remained hidden.
One pioneering system in this space is LangChain’s Pandas DataFrame agent, which leverages the power of large language models (LLMs) such as Azure OpenAI’s GPT-4. LLMs are AI systems trained on massive text datasets, allowing them to generate human-like text. LangChain provides a framework to connect LLMs with external data sources.
By simply describing in plain English what you want to know about your data stored in a Pandas DataFrame, this agent can automatically respond in natural language.
The user experience feels like magic. You upload a CSV dataset and ask a question by typing or speaking. For example, “What were the top 3 best-selling products last year?” The agent interprets your intent and writes and runs Pandas and Python code to load the data, analyze it, and formulate a response...all within seconds. The barrier between human language and data analysis dissolves.
Under the hood, the LLM generates Python code based on your question, which gets passed to the LangChain agent for execution. The agent handles running the code against your DataFrame, capturing any output or errors, and iterating if necessary to refine the analysis until an accurate human-readable answer is reached.
By collaborating, the agent and LLM remove the need to worry about syntax, APIs, parameters, or debugging data analysis code. The system understands what you want to know and makes it happen automatically through the magic of generative AI.
This natural language interface to data analysis opens game-changing potential. Subject-matter experts without programming skills can independently extract insights from data in their field. Data-driven decisions can happen faster. Exploratory analysis and ideation are simpler. A future where analytics is available to everyone through AI assistants has arrived.
Let’s see how the agent works behind the scenes to send a response.
When a user sends a query to the LangChain create_pandas_dataframe_agent agent and the LLM, the following steps are performed behind the scenes:
We can represent this with the following diagram:
Figure 1.35 – LangChain Pandas agent flow for Data analysis
Let's see how to perform data analysis and find insights about the income dataset using the LangChain create_pandas_dataframe_agent agent and the LLM.
The key steps are importing the necessary LangChain modules, loading data into a DataFrame, instantiating an LLM, and creating the DataFrame agent by passing the required objects. The agent can now analyze the data through natural language queries.
First, let’s install the required libraries. To install the LangChain library, open your Python notebook and type the following:
%pip install langchain %pip install langchain_experimental
This installs the langchain and langchain_experimental packages so you can import the necessary modules.
Let's import AzureChatOpenAI, the Pandas DataFrame agent, and the other required libraries:
from langchain.chat_models import AzureChatOpenAI
from langchain_experimental.agents import create_pandas_dataframe_agent
import os
import pandas as pd
import numpy as np
import seaborn as sns
import matplotlib.pyplot as plt
import openai
Let’s configure the OpenAI endpoint and keys. Your OpenAI endpoint and key values are available in the Azure OpenAI portal:
openai.api_type = "azure"
openai.api_base = "your_endpoint"
openai.api_version = "2023-09-15-preview"
openai.api_key = "your_key"

# We are assuming that you have all model deployments on the same Azure OpenAI
# service resource. If not, change these settings to point to different resources.
gpt4_endpoint = openai.api_base  # Your endpoint will look something like this: https://YOUR_AOAI_RESOURCE_NAME.openai.azure.com/
gpt4_api_key = openai.api_key  # Your key will look something like this: 00000000000000000000000000000000
gpt4_deployment_name = "your model deployment name"
Let’s load CSV data into Pandas DataFrame.
The adult.csv dataset is the dataset we want to analyze; we have placed this CSV file in the same folder where we are running this Python code:
df = pd.read_csv("adult.csv")
Let’s instantiate the GPT-4 LLM.
Assuming you have deployed the GPT-4 model in Azure OpenAI Studio as per the Technical requirements section, we pass the GPT-4 endpoint, key, and deployment name to create the instance of GPT-4 as follows:
gpt4 = AzureChatOpenAI(
    openai_api_base=gpt4_endpoint,
    openai_api_version="2023-03-15-preview",
    deployment_name=gpt4_deployment_name,
    openai_api_key=gpt4_api_key,
    openai_api_type=openai.api_type,
    temperature=0.0,
)

Setting the temperature to 0.0 makes the model return the most deterministic, and therefore most reproducible, outputs.
Let’s create a Pandas DataFrame agent. To create it, we need to pass the gpt4 model instance and the DataFrame:
agent = create_pandas_dataframe_agent(gpt4, df, verbose=True)
We pass the gpt4 LLM instance and the DataFrame, and set verbose to True so that the agent prints its intermediate steps. Finally, let’s ask a question and run the agent.
As illustrated in Figure 1.36, when we ask the following questions to the LangChain agent in the Python notebook, the question is passed to the LLM. The LLM generates Python code for this query and sends it back to the agent. The agent then executes this code in the Python environment with the CSV file, obtains a response, and the LLM converts that response to natural language before sending it back to the agent and the user:
agent("how many rows and how many columns are there?")
Output:
Figure 1.36 – Agent response for row and column count
We try the next question:
agent("sample first 5 records and display?")
Here’s the output:
Figure 1.37 – Agent response for first five records
This way, the LangChain Pandas DataFrame agent facilitates interaction with the DataFrame by interpreting natural language queries, generating corresponding Python code, and presenting the results in a human-readable format.
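To make the agent’s behavior concrete: for the two questions above, the code it generates typically reduces to the standard Pandas calls df.shape and df.head(). Here is a minimal sketch of those equivalents, using a small synthetic DataFrame in place of adult.csv (the column names here are illustrative, not the full dataset schema):

```python
import pandas as pd

# Small synthetic stand-in for the adult.csv DataFrame
df = pd.DataFrame({
    "age": [39, 50, 38, 53, 28],
    "education": ["Bachelors", "Bachelors", "HS-grad", "Masters", "Bachelors"],
})

# "how many rows and how many columns are there?" -> df.shape
rows, cols = df.shape
print(rows, cols)  # 5 2

# "sample first 5 records and display?" -> df.head()
first_five = df.head(5)
print(len(first_five))  # 5
```

This is exactly why the agent pattern works well for tabular data: most natural language questions map onto a short, deterministic Pandas expression.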
You can try these questions and see the responses from the agent:
query = "calculate the average age of the people for each income group?"
query = "provide summary statistics for this dataset"
query = "provide count of unique values for each column"
query = "draw the histogram of the age"
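For intuition, the Pandas code the agent typically generates for the first and third of these queries looks roughly like the following. This is a minimal sketch over a small synthetic DataFrame (the column names are assumed to match the Income dataset), not the agent’s literal output:

```python
import pandas as pd

# Synthetic stand-in for the Income dataset
df = pd.DataFrame({
    "age": [25, 38, 47, 52, 31, 60],
    "income": ["<=50K", "<=50K", ">50K", ">50K", "<=50K", ">50K"],
})

# "calculate the average age of the people for each income group"
avg_age = df.groupby("income")["age"].mean()
print(avg_age[">50K"])  # 53.0

# "provide count of unique values for each column"
unique_counts = df.nunique()
print(unique_counts["income"])  # 2
```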
Next, let’s try the following query to plot the bar chart:
query = "draw the bar chart for the column education"
results = agent(query)
The LangChain agent responded with a bar chart showing the counts for the different education levels, as follows.
Figure 1.38 – Agent response for bar chart
The plot for the following query shows a comparison of income for two education levels, Masters and HS-grad. We can see that income is less than $5,000 for education.num values of 8 to 10 (HS-grad) compared to higher education levels:

query = "Compare the income of those have Masters with those have HS-grad using KDE plot"
results = agent(query)
Here’s the output:
Figure 1.39 – Agent response for comparison of income
Next, let’s try the following query to find any outliers in the data:
query = "Are there any outliers in terms of age. Find out using Box plot."
results = agent(query)
This plot shows outliers in age greater than 80 years.
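Box plots flag outliers using the standard 1.5 × IQR rule, which you can also compute directly in Pandas. Here is a minimal sketch on synthetic ages; the roughly 80-year threshold in the figure depends on the actual quartiles of adult.csv:

```python
import pandas as pd

# Synthetic ages, including one extreme value
ages = pd.Series([22, 30, 35, 40, 45, 50, 55, 90])

# Standard box-plot rule: points beyond 1.5 * IQR from the quartiles are outliers
q1, q3 = ages.quantile(0.25), ages.quantile(0.75)
iqr = q3 - q1
outliers = ages[(ages < q1 - 1.5 * iqr) | (ages > q3 + 1.5 * iqr)]
print(outliers.tolist())  # [90]
```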
Figure 1.40 – Agent response for outliers
We have seen how to perform data analysis and find insights about the Income dataset using natural language, with the power of LangChain and OpenAI LLMs.
In this chapter, we have learned how to use Pandas and matplotlib to analyze a dataset and understand the data and correlations between various features. This understanding of data and patterns in the data is required to build the rules for labeling raw data before using it for training ML models and fine-tuning LLMs.
We also went through various examples of aggregating columns and categorical values using groupby and mean. Then, we created reusable functions that can be called with one or more column names to compute their aggregates.
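As a reminder of that pattern, such a reusable aggregation helper can be sketched as follows. The name mean_by is hypothetical; the chapter’s own functions may differ, but the groupby-then-mean shape is the same:

```python
import pandas as pd

def mean_by(df, group_col, value_cols):
    """Group by one column and return the mean of one or more numeric columns."""
    return df.groupby(group_col)[value_cols].mean()

# Small illustrative DataFrame
df = pd.DataFrame({
    "education": ["Masters", "HS-grad", "Masters", "HS-grad"],
    "age": [40, 30, 50, 34],
})
result = mean_by(df, "education", ["age"])
print(result.loc["Masters", "age"])  # 45.0
```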
Finally, we saw a fast and easy way to explore data using the ydata-profiling library with a single line of Python code. Using this library, we need not remember many Pandas functions; one line of code performs a detailed analysis of the data and produces a report of statistics for each variable, including missing values, correlations, interactions, and duplicate rows.
Once we get a good sense of our data using EDA, we will be able to build the rules for creating labels for the dataset.
In the next chapter, we will see how to build these rules using Python libraries such as snorkel and compose to label an unlabeled dataset. We will also explore other methods, such as pseudo-labeling and K-means clustering, for data labeling.