PDBnet is a collection of Python objects intended to model and contain PDB protein data.
PDBnet Copyright (C) 2012 Christian Blouin Contributions by Alex Safatli and Jose Sergio Hleap Major Refactoring (2014) done by Alex Safatli and Jose Sergio Hleap
E-mail: cblouin@cs.dal.ca, safatli@cs.dal.ca, jshleap@dal.ca Dependencies: Scipy, BioPython, FASTAnet (contained in LabBlouinTools)
Bases: object
ATOM in a PDB protein structure.
Bases: object
A PDB chain (collection of protein residues).
Add an index for a residue to this chain. Will generate own residue information if called upon (forces lazy evaluation).
Add a residue to this chain by its index in the PDB. The residue object will automatically be constructed from the file.
Alias for acquiring a residue from this instance using [] operator.
Acquire all of the residues in this chain. Note that if this is a large file, this will force the object to load all of these residues into memory from the file.
Iteratively yield all of the residues in this chain. Note that if this is a large file, this will force the object to load all of these residues into memory from the file.
Bases: object
Return whether or on the PDB file this object represents is incredibly large.
Yield the model number, chain name, and residue number for all residues present in this PDB file, not necessarily in order.
Acquire all remarks, and the indices of all models and residues. Returns remarks, and biological source information as a tuple (remarks, organism, taxid, mutant).
Parse a residue from the PDB file and return a PDBresidue.
Bases: labblouin.PDBnet.PDBchain
A PDB model (a special kind of chain).
Acquire all of the residues in this model. Note that if this is a large file, this will force the object to load all of these residues into memory from the file.
Iteratively yield all of the residues in this model. Note that if this is a large file, this will force the object to load all of these residues into memory from the file.
A residue (collection of ATOM fields) in a PDB protein structure.
Bases: object
A PDB protein structure (a collection of chains/models).
Add a residue to a chain. Deprecated; use chain class function.
Add a residue to a model. Deprecated; use model class function.
For every chain, check to see if every residue is complete (see aa_list dictionary).
Compute the contact map of all chains or a chain.
Parameters: |
|
---|---|
Returns: | A list of tuples of indices (integers) which correspond to chains or models and their residues. |
Compute the form difference matrix (FDM) as explained in Claude 2008. It relates to the identification of the most influential residue, with respect to the overall shape/structure. If the scaled option is True, will return an scaled list (from -1 to 1) of the of lenght equal to the number of residues. Otherwise will return the raw FDM, rounded so it can be included in a PDB. The scaled version is better for vizualization. By default the FDM is computed far all chains, but a subset can be passed to the chains option.
Acquire a new chain or model corresponding to an average of all present chains or models specified.
Given a PDBchain, find 1-to-1 correspondances between it and a FASTA sequence.
Store in residues the correct index to the fasta. Requires a 1-to-1 correspondance at least a portion of the way through. Deprecated; use GetFASTAIndices().
Produce an iterator to allow one to iterate over all residues for a subset of the structure.
Map a list of values (lis), that must have a lenght equal to that of the number of residues in the PDB to be mapped (chain). If a list of list is provided, the first list will be mapped as the beta factor and the second as occupancy
Acquire the radius of the gyration of the entire, or a portion of, the PDB protein molecule.
Write the information present in this PDB between multiple chains as a Geometric Morphometric text file. This file will be formatted such that individual lines correspond to chains and semi-colons separate the (x,y,z) coordinates between all homologous residue positions. Requires a FASTA alignment. Options include using alpha-carbon positions. By default, uses centroids of residues.
Write the information present in this PDB between multiple chains as a landmark text file. This file will be formatted such that the file is partitioned in sections starting with chain names and individual lines in these correspond to homologous residue positions denoted by homologous position, residue number, and residue name tab-delimited. Requires a FASTA file.
Get the GDT score between two chains. Requires a FASTA alignment.
Acquire Geometric Morphometric data corresponding to the (x,y,z) coordinates between all homologous residue positions. Requires a FASTA alignment. Options include using alpha-carbon positions. By default, uses centroids of residues. Returns a list of labels and a list of coordinates as raw GM data. The typeof option provides an option for coordinate output; they are returned as a semicolon-delimited string (str) or as a numpy 2d array (matrix).
Get the RRMSD between chains. Requires a FASTA alignment. See Betancourt & Skolnick, “Universal Similarity Measure for Comparison Protein Structures”.
Get the TMscore between two chains. Requires a FASTA alignment and a value for the length of the native structure (e.g., for a pairwise alignment, the length of the structure used as a reference before alignment was done). The latter is computed by assuming the first of both provided chains is the native structure; otherwise, uses a provided chain name (native input).