You're reading from Network Science with Python and NetworkX Quick Start Guide

Product type: Book

Published in: Apr 2019

Reading level: Intermediate

Publisher: Packt

ISBN-13: 9781789955316

Edition: 1st Edition

Languages

Python

Tools

NetworkX

Concepts

Data Science

Author (1)

Edward L. Platt

From Data to Networks

To analyze a system using NetworkX, that system must first be modeled as a network, and then be represented as an object within NetworkX. This chapter explains the basic process of creating network representations of data. The first section covers the part of the process that takes place in your head: modeling data as a network. The remaining sections demonstrate the part of the process that happens in code: creating a NetworkX Graph from data, using two different methods. In the first method, data is reformatted into one of the standard network formats supported by NetworkX. In the second method, for more complex data, a network is created from scratch, by using code to add nodes and edges one at a time.

In this chapter, we will cover the following topics:

Modeling data: Giving meaning to nodes and edges
Network files: Saving your networks to files
Networks...

Modeling your data

When representing data as a network, there are many decisions to make along the way. Different types of networks are helpful for understanding different types of data and for asking different types of questions. This section will take a closer look at some of the important considerations.

When creating a network from data, one of the most important questions to consider is what exactly the nodes and edges should represent. Often there are many possibilities, even for the same dataset. Any particular choice focuses on some aspects of the data, possibly discarding others. Networks are fundamentally about relationships and connections, so one way to define nodes and edges is to think about what types of relationships you're interested in. Some possibilities include the following:

Social relationships, such as friendships, romantic relationships, or even rivalries...

Reading and writing network files

NetworkX provides support for reading and writing many network file formats. Of course, if a network has been provided in one of these formats, it will be very easy to load into NetworkX! But even if you have data in another format, it is often possible to convert it to one of the supported formats without too much difficulty (I would guess that 90% of network science work is converting data between formats, and most of the rest is complaining about converting data). Spreadsheets, for instance, can often be converted to an appropriate format just by reordering columns and exporting as tab-separated values (TSV format). This section will describe several common formats, including adjacency list, edge list, GEXF, and JSON.
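As a rough illustration of the spreadsheet conversion mentioned above, the following sketch reorders the columns of a small made-up table and loads the result as a tab-separated edge list. The column names (weight, source, target) and the sample rows are hypothetical, and nx.parse_edgelist is only one of several ways to do this.

```python
import csv
import io

import networkx as nx

# Hypothetical spreadsheet export: columns are in the wrong order for an edge list.
raw = "weight,source,target\n3,alice,bob\n1,bob,carol\n"

# Reorder each row into "source<TAB>target<TAB>weight".
rows = csv.DictReader(io.StringIO(raw))
tsv_lines = ["\t".join([r["source"], r["target"], r["weight"]]) for r in rows]

# Parse the tab-separated lines as a weighted edge list.
G = nx.parse_edgelist(tsv_lines, delimiter="\t", data=[("weight", int)])

print(sorted(G.edges(data="weight")))
```

With a real spreadsheet, the same reordering could be done in the spreadsheet program itself before exporting; the code version is mainly useful when the conversion has to be repeated.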

The edge list format is a simple but useful plain-text format. It supports edge attributes, but not node attributes. Edge lists...
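To make the point about attributes concrete, here is a minimal sketch of a round trip through an edge list file. The graph, the attribute names, and the temporary file name are all made up for illustration; the calls used are nx.write_edgelist and nx.read_edgelist.

```python
import os
import tempfile

import networkx as nx

# A tiny hypothetical network with one edge attribute and one node attribute.
G = nx.Graph()
G.add_edge("a", "b", weight=2)
G.add_edge("b", "c", weight=5)
G.nodes["a"]["color"] = "red"  # an edge list has no place to store this

with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "example.edgelist")
    # Write one line per edge: "source target weight".
    nx.write_edgelist(G, path, data=["weight"])
    # Read it back, converting the third column to an integer weight.
    H = nx.read_edgelist(path, data=[("weight", int)])

print(H.edges(data=True))  # edge weights survive the round trip
print(H.nodes(data=True))  # node attributes do not
```

If node attributes matter, a richer format such as GEXF is likely a better fit than an edge list.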

Creating a network with code

So far, you've got some handy network formats in your toolbox. But, if your data is too complex or too messy to easily convert into one of the previous formats, you might have to build your network from scratch, adding edges and nodes one at a time. Luckily, the techniques you learned in Chapter 2, Working with Networks in NetworkX, are all you really need! This section walks through a practical example of building a network programmatically from a real data set.

The example in this section is a word co-occurrence network. These networks are used to understand the relationship between words in a particular set of documents. In a co-occurrence network, nodes represent words, and the weight of an edge represents how many documents the two connected words appear in together. Here, "document" could mean any collection of words: blog post, paragraph, sentence, carefully arranged...
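The idea of counting shared documents can be sketched in a few lines. This is not the book's own code: the three "documents" below are invented sentences, and real text would need proper tokenization and probably stop-word removal, which this sketch skips.

```python
from itertools import combinations

import networkx as nx

# Hypothetical "documents"; here each one is a short sentence.
documents = [
    "the monster fled the village",
    "the village feared the monster",
    "the creator fled",
]

G = nx.Graph()
for doc in documents:
    # Use a set so that a word counts only once per document.
    words = sorted(set(doc.split()))
    G.add_nodes_from(words)
    # Every pair of distinct words in the document co-occurs once.
    for u, v in combinations(words, 2):
        if G.has_edge(u, v):
            G[u][v]["weight"] += 1
        else:
            G.add_edge(u, v, weight=1)

print(G["monster"]["village"]["weight"])
```

Each edge weight ends up equal to the number of documents containing both words, which matches the definition given above; pairs that never share a document get no edge at all.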

Summary

This chapter has demonstrated the process of getting data into NetworkX for analysis. It discussed the types of questions that are important to consider when creating a network from data, and applied them to the example of Wikipedia. It also gave examples of loading networks from standard formats and building networks from scratch. The next chapter introduces affiliation networks: those with two types of nodes.

References

The following resource is a starting point for further reading:

Shelley, Mary Wollstonecraft. (1818). Frankenstein; or The Modern Prometheus. Urbana, Illinois: Project Gutenberg. Retrieved February 21, 2016, from www.gutenberg.org/ebooks/19033.

Author (1)

Edward L. Platt

Edward L. Platt creates technology for communities and communities for technology. He is currently a researcher at the University of Michigan School of Information and the Center for the Study of Complex Systems. He has published research on large-scale collective action, social networks, and online communities. He was formerly a staff researcher at the MIT Center for Civic Media. He contributes to many free/open source software projects, including tools for media analysis, network science, and cooperative organizations. He has also done research on quantum computing and fault tolerance. He has an M.Math in Applied Mathematics from the University of Waterloo, as well as B.S degrees in both Computer Science and Physics from MIT.