-
Notifications
You must be signed in to change notification settings - Fork 46
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Can you provide more information on the features in the predfeatures numpy file? #26
Comments
Not sure if the authors are still replying but you can find the information that it saves in
Where |
Hi, @rmwu Thanks so much for such a quick answer and pointing towards data_iteration.py and model.py. "P1": P1, However, I still find it pretty difficult to get whatever is packed into P1 and P2. cheers, |
Thanks for your interest in our work! Just a quick word to say that we are still replying but:
I have lost track of the structure of our files, but an important thing to note is that our dMaSIF model produces trained ”neural” features that do not have a clear physical meaning. P1 and P2 just contain “one vector of trained features per surface point”, where our feature extractor has been optimized with respect to a certain task (see our three papers on this: Nature methods, CVPR and MLDD). The lack of interpretability is unavoidable here, even though we’re trying to mitigate this as much as possible with our architectures. I hope that @FreyrS will be able to confirm that |
Hi @jeanfeydy! Thanks for your input! Would be great if @FreyrS can come back with the column/features list at a later time. thank you guys, cheers! |
Hi @ngszyba, What @rmwu is absolutely correct. |
Dear dMaSIF team and users,
I am using the Google Colab version of dMaSIF to get the surface predictions from the model protein pdb files.
Among the outputs of dMaSIF I found predfeatures_emb1.npy file with 34 columns and the corresponding .npy file containing the coordinates.
If I understand correctly, this is an array of surface patches with biochemical features and coordinates of each patch.
Maybe I have overlooked it, but couldn't find anywhere hints on how to decipher the columns in the file.
Right now they are numbered from 0 to 33, but can you provide a list of columns/features, so people can check e.g. what feature is predicted in the column 10 and how this changes between different patches.
It would dramatically increase the usability of these predictions.
Thanks!
The text was updated successfully, but these errors were encountered: