Parsing mmCIF files using Biopython
The mmCIF file format is probably the future. Biopython doesn’t have full functionality to work with it yet, but we will take a look at what currently exists.
Getting ready
As Bio.PDB is not able to automatically download mmCIF files, you need to get your protein file and rename it to 1tup.cif. This can be found at https://github.com/PacktPublishing/Bioinformatics-with-Python-Cookbook-third-Edition/blob/master/Datasets.py under 1TUP.cif.
You can find this content in the Chapter08/mmCIF.py Notebook file.
How to do it...
Take a look at the following steps:
- Let’s parse the file. We just use the MMCIF parser instead of the PDB parser:
from Bio import PDB parser = PDB.MMCIFParser() p53_1tup = parser.get_structure('P53', '1tup.cif') - Let’s inspect the following chains:
def describe_model(name, pdb): print() for model in p53_1tup: ...