[WIP] Stage 091 (rucioDatasets) #79

Closed
wants to merge 7 commits into from


mgolosova
Collaborator

Add information about datasets from Rucio to the ora2es chain (009-019-069).

This script reads the NDJSON lines of task metadata produced by Oracle2JSON.py
and converts the 'input' and 'output' dataset strings to arrays,
adding 'bytes' from Rucio.
- removed the unused 'remove_tid()' function, as we must not strip the _tid part from dataset names
- 'input' datasets are out of our scope, so only the 'output' dataset types were kept
- added a 'deleted' boolean value:
      a dataset is considered deleted if it was not found in Rucio
- a NULL value of dataset 'bytes' is set to -1
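
The processing described above could be sketched roughly as follows. This is only an illustration, not the code from datasets_processing.py: `lookup_bytes` is a hypothetical stand-in for a real Rucio query, and the comma separator in the 'output' string, the per-dataset field name 'name', and the -1 size for deleted datasets are all assumptions.

```python
import json

NOT_FOUND = object()  # sentinel: the dataset is unknown to Rucio


def process_task(task, lookup_bytes):
    """Return a copy of `task` with 'output' turned into a list of records.

    `lookup_bytes(name)` is a hypothetical callable standing in for Rucio:
    it returns the size in bytes, None if the size is NULL, or NOT_FOUND.
    """
    result = dict(task)
    raw = task.get('output') or ''
    datasets = []
    # Assumption: dataset names in 'output' are comma-separated.
    for name in (part.strip() for part in raw.split(',')):
        if not name:
            continue
        size = lookup_bytes(name)
        deleted = size is NOT_FOUND
        datasets.append({
            'name': name,  # kept as-is, including the _tid part
            # NULL bytes become -1; using -1 for deleted datasets is an assumption
            'bytes': -1 if (deleted or size is None) else size,
            'deleted': deleted,
        })
    result['output'] = datasets
    return result


def process_lines(lines, lookup_bytes):
    """Transform NDJSON lines one by one, yielding NDJSON strings."""
    for line in lines:
        if line.strip():
            yield json.dumps(process_task(json.loads(line), lookup_bytes))
```

Passing the lookup in as a callable keeps the sketch testable without a Rucio connection; a real stage would wrap the Rucio client there instead.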
Added:
- comments in datasets_processing.py script
- README file
- input / output examples files
- primary_input
- ctag
- output_formats
- index name changed from 'prodsys' to 'prodsys_rucio_ami',
as we're going to store parameters from these 3 storages

- added type 'task' as PARENT
- added type 'output_datasets' as CHILD of 'task'
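
For illustration, the parent/child layout might look like the sketch below. It assumes a pre-6.x Elasticsearch, where one index can hold several types and a child type declares `_parent`; the helper `bulk_lines` and the use of the dataset name as the child `_id` are hypothetical and not taken from this PR.

```python
import json

INDEX = 'prodsys_rucio_ami'

# Assumption: pre-6.x Elasticsearch mapping with two types in one index.
MAPPING = {
    'mappings': {
        'task': {},
        'output_datasets': {'_parent': {'type': 'task'}},
    }
}


def bulk_lines(task_id, task_doc, datasets):
    """Build bulk-API NDJSON lines for one parent task and its child datasets."""
    lines = [
        json.dumps({'index': {'_index': INDEX, '_type': 'task', '_id': task_id}}),
        json.dumps(task_doc),
    ]
    for ds in datasets:
        # Each child action names its parent so it lands on the parent's shard.
        lines.append(json.dumps({'index': {
            '_index': INDEX,
            '_type': 'output_datasets',
            '_id': ds['name'],  # hypothetical choice of child id
            '_parent': task_id,
        }}))
        lines.append(json.dumps(ds))
    return lines
```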

New fields were added:
- primary_input (prodSys)
- ctag (prodSys)
- output_formats (prodSys)
- cross_section_ref (AMI)
- cross_section (AMI)
- get_filt_eff (AMI)
- k-factor (AMI)

'input' datasets were removed and replaced with 'primary_input'.
It was decided to replace the array of 'input' datasets with a single 'primary_input' string.
@mgolosova mgolosova changed the base branch from dataflow-ora2es-rebased to master November 22, 2017 17:30
@mgolosova mgolosova mentioned this pull request Nov 24, 2017
12 tasks
@mgolosova mgolosova changed the title [WIP] Stage 091 (rucioDatasets) [WIP] Stage 091 (datasetsRucio) Nov 24, 2017
@mgolosova mgolosova changed the title [WIP] Stage 091 (datasetsRucio) [WIP] Stage 091 (rucioDatasets) Nov 24, 2017
@mgolosova
Collaborator Author

mgolosova commented Nov 24, 2017

Closed in favor of PRs #83, #84 and #85.

@mgolosova mgolosova closed this Nov 24, 2017
@mgolosova mgolosova deleted the stage-091-rucioDatasets branch November 24, 2017 13:31
@mgolosova mgolosova mentioned this pull request Nov 27, 2017
4 tasks
mgolosova added a commit that referenced this pull request Nov 28, 2017
Stage 019 transformation

Now it can accept data not only from Stage 016,
but also from 091 (see #79) -- or any other stage,
that produces data in the required format.
mgolosova added a commit that referenced this pull request Nov 29, 2017
Stage 019 transformation

Now it can accept data not only from Stage 016,
but also from 091 (see #79) -- or any other stage,
that produces data in the required format.

---
This merge overrides merge of PR #86
2 participants