Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Can't seem to download the dataset #4

Open
alexberlaga opened this issue Sep 6, 2024 · 5 comments
Open

Can't seem to download the dataset #4

alexberlaga opened this issue Sep 6, 2024 · 5 comments

Comments

@alexberlaga
Copy link

Hi! I tried to clone the dataset from the ModelScope repo as the README suggests, but it looks like there's no data in there. I just get another README file which is under 1KB. Am I missing something?

@atong01
Copy link

atong01 commented Sep 8, 2024

I also see no data

@sankethvedula
Copy link

sankethvedula commented Sep 12, 2024

I see that the ModelScope repository has some sample data (1ab1_A) -- the full dataset seems to still be missing. Am I missing something? Could you please also let me know where I can find a list of PDBs for which the simulations were run?

@h0ngxuanli
Copy link

Hi there! I noticed there are currently around 500 data in the repository. I was wondering if you could kindly let me know the expected timeline for completing the full dataset upload? This would help me better plan my work. Thank you in advance!

@zqcai19
Copy link
Collaborator

zqcai19 commented Dec 31, 2024

@h0ngxuanli Hi, thank you for your interest in our dataset.

Before initiating the data upload, we carefully evaluated multiple cloud storage solutions, considering both capacity and bandwidth limitations, and ultimately chose the current platform for its balance of performance and accessibility.

Due to the immense size of the dataset, we are currently able to upload data files for an average of six proteins per day. But we are doing our best to continuously upload the data.

We sincerely appreciate your patience and understanding.

@h0ngxuanli
Copy link

Thanks for your efforts! It is true that those fine-grained MD data needs great efforts to be accessible to the public! I am wondering whether there is a way to quickly access only the MD trajectories of all proteins, maybe in a xtc format? 😊

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

5 participants