Join our book community on Discord
https://packt.link/EarlyAccessCommunity
In this chapter, we will look at some of the basic aspects of data manipulation in Python. pandas is the de facto standard for processing tabled data. It is used extensively in bioinformatics and so it is a good core library to become comfortable with.We will start by looking at the basics of pandas as it provides a high-level library with very broad practical applicability. We will then discuss how to join tables in pandas and perform database-like operations. Finally, we’ll briefly touch on some strategies to reduce memory usage in pandas.In this chapter, we will cover the following recipes:
- Using pandas to process vaccine-adverse events
- Dealing with the pitfalls of joining pandas DataFrames
- Reducing the memory usage of pandas DataFrames