-
Notifications
You must be signed in to change notification settings - Fork 8
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
For data files transferred from EGA to Collab, add EGA repo to 'File copy' section of the portal file index #460
Comments
@baminou can you please mirror all of the EGA file transfer git repositories hosted internally under http://142.1.177.124/jt-hub to the public github repo under https://github.com/icgc-dcc/? There should be 5 or 6 of them. |
The goal is to add a new fileCopy entry for files already transferred from EGA to Collaboratory. We first identify the SONG Analysis in Collab, this can be done by Analysis ID. The ID takes form of The git repos for ega transfer jobs have been mirrored from our internal server to GitHub:
The files contained needed information are:
Example files:
|
Just to give two examples here:
|
EGA indexing to be investigated in future; do not work on this until we know more about those specs, OR until we have more EGA data to transfer to collaboratory. |
Here is one such file: https://dcc.icgc.org/repositories/files/FI743257. It is originated from EGA, we transferred to Collaboratory, but the file page only shows this file exist in Collab but not in EGA.
We need a way to let the portal repo indexer know addition copy of the file exists in EGA as well. This could be as easy as detecting whether
dataBundleId
starts withEGA
, if so, there must be a copy of the file exist in EGA.We may also need additional EGA specific information for the file copy, such as
repoFileId
, in this case, we needEGAFxxxxx
ID to be populated, so will need a way to pass it to indexer.The text was updated successfully, but these errors were encountered: