Description
Describe the bug
The config parameter keep_hets
is currently not working. It seems keep_hets
was recently updated from bool type to a list of strings, where it contains the specific residue name of a HETATM residue such as keep_hets=["HOH"]
. However, after updating the parameter, it doesn't include the specified residues in the graph. The tutorial installed the newest version of Graphein-1.7.6 and I haven't had a chance to back test the other versions to see when the keep_het
functionality broke but will updated this ticket when I have a chance.
To Reproduce
This can be seen in the tutorial example of 3EIY
, which contains 112 waters. However, when we run:
from graphein.protein.graphs import construct_graph
config = ProteinGraphConfig(keep_hets=["HOH"])
g = construct_graph(config=config, pdb_code="3eiy")
None of the waters are included in the graph. If we print the nodes with g.nodes()
and look that the last residues we see that no waters were included:
['A:ALA:171', 'A:ASN:172', 'A:PHE:173', 'A:LYS:174', 'A:LYS:175']
Expected behavior
If I understand correctly, the expected behavior of keep_hets
would be for the waters to now be included in the graph representation.
Screenshots
Here is a screen shot of the representation of 3EIY
, where we can see only the protein residues included.
Desktop (please complete the following information):
This reproduced using the google Collab notebook with graphein-1.7.6 installed.
No other modification where made to the tutorial.