MATCHING AND SPLITTING STRINGS IN PANDAS
Listing B.11 shows the content of shirts_str.py, which illustrates how to match a column value with an initial string and how to split a column value based on a letter.
Listing B.11: shirts_str.py
import pandas as pd
shirts = pd.read_csv("shirts2.csv")
print("shirts:")
print(shirts)
print()
print("shirts starting with xl:")
print(shirts[shirts.ssize.str.startswith('xl')])
print()
print("Exclude 'xlarge' shirts:")
print(shirts[shirts['ssize'] != 'xlarge'])
print()
print("first three letters:")
shirts['sub1'] = shirts['ssize'].str[:3]
print(shirts)
print()
print("split ssize on letter 'a':")
shirts['sub2'] = shirts['ssize'].str.split('a')
print(shirts)
print()
print("Rows 3 through 5 and column 2:")
print(shirts.iloc[2:5, 2])
print()
Listing B.11 initializes the data frame...