Skip to content

SHARE is building a free, open, data set about research and scholarly activities across their life cycle.

Notifications You must be signed in to change notification settings

DanielSBrown/SHARE

 
 

Repository files navigation

SHAREv2

Gitter

Technical Documentation

http://share-research.readthedocs.io/en/latest/index.html

On the OSF

https://osf.io/sdxvj/

Pipeline

Harvester/Push/Curators -> Raw -> Normalization -> HoldingMaster -> Process -> Master (Versioned) -> Views (e.g., JamDB, ES, Neo4J) -> Provenance

Setup for testing

It is useful to set up a virtual environment to ensure python3 is your designated version of python and make the python requirements specific to this project.

mkvirtualenv share -p `which python3.5`
workon share

Once in the share virtual environment, install the necessary requirements.

pip install -r requirements.txt

docker-compose assumes Docker is installed and running. Finally, ./up.sh ensures everything has been installed properly.

docker-compose up -d rabbitmq postgres
./up.sh

---------------- or ----------------

pg
createuser share
psql
    CREATE DATABASE share;
python manage.py makemigrations
python manage.py maketriggermigrations
python manage.py makeprovidermigrations
python manage.py migrate
python manage.py createsuperuser

Run

Run the API server

python manage.py runserver

Run Celery

python manage.py celery worker -l DEBUG

Populate with data

This is particularly applicable to running ember-share, an interface for SHARE.

Harvest data from providers, for example

./manage.py harvest com.nature --async
./manage.py harvest io.osf --async

Pass data to elasticsearch with runbot. Rerunning this command will get the most recently harvested data. This can take a minute or two to finish.

./manage.py runbot elasticsearch

Build docs

cd docs/
pip install -r requirements.txt
make watch

About

SHARE is building a free, open, data set about research and scholarly activities across their life cycle.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 99.7%
  • Other 0.3%