-
Notifications
You must be signed in to change notification settings - Fork 142
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Can you supply the instructions about how to use real-world data to train model #3
Comments
|
I am also confused about how to apply the model to the real data. |
Please give instructions on how to apply the actual dataset in the code. It is very difficult to understand what the variables represent in the code for dummy data. |
I am trying to apply the code to the real datasets. In the first step, I tried to check if I have the same parameters (number of proteins, drugs,...) for the network. The number of proteins as what has mentioned in the paper should be 19085. But, from the protein-protein network(bio-decagon-ppi), I get 19081 proteins. Has anyone tried applying the code to the real dataset? and have you got the same number of proteins for the network? Thanks. |
I am also confused about how to apply the model to the real data. Has anyone solved the problem? Thanks. |
Same problem for me, not quite sure how to apply that. |
@vidarmehr I also get 19081 proteins from the protein-protein network(bio-decagon-ppi), and 1317 side effects, not the same as mentioned in paper (1318). Is it the same with your parameters (number of proteins, drugs,...) ? Thanks. |
Any updates? Same issue here. We want to reproduce the paper's results. |
@chao1224 I was not able to reproduce the results of paper and I decided to stop working on Decagon for now. |
|
@vidarmehr I got it. Thank you so much for your reply. |
Thanks for the reply @vidarmehr. Just want to quickly clarify a number:
|
Are there any updates on this issue? I was also unable to reproduce the results in the paper. They say that they only focus on predicting the 964 polypharmacy side effects that each occurred in at least 500 drug combinations. However, the data they provide is the full TWOSIDES dataset. I don't know if they filter out some side effects in the code, but I couldn't find any evidence of this. |
@rubjim I only can get 963 side effect types which appear in more than 500 drug combinations. I think the decagon dataset is so confusing that we could not apply it in our research work. |
Was anyone ever able to reproduce the results? Or at least get it running properly? |
@Dinxin I agree with you, that's what I also get when I filter the side effects myself. However, they claim they predict for 964 which doesn't correspond to the actual numbers in the dataset. @christina-s-wang at least I wasn't able to do it. |
NO one cares for these people asking some help? I am in the same spot. |
to use this code with real data + python 3.6 try this fork: |
I viewed the whole code and found that the code only use toy dummy data to train model. So I don't really understand how you use those data to train GCN model. Can you supply the code or instructions about how to use real-world data to train model?
The text was updated successfully, but these errors were encountered: