Skip to content

Latest commit

 

History

History
49 lines (35 loc) · 1.36 KB

README.md

File metadata and controls

49 lines (35 loc) · 1.36 KB

postgres-docker

A docker-compose set up to run postgres on your local computer for testing - easy as that.

Quickstart

docker-compose up db
docker-compose up metrics
docker-compose run --rm client

# shutdown
docker-compose down -v

Example Data

Amazon Reviews

Here are the Docs

Set up

  1. Download the German data
wget https://s3.amazonaws.com/amazon-reviews-pds/tsv/amazon_reviews_multilingual_DE_v1_00.tsv.gz -O ./data/amazom_reviews_multilingual_DE_v1_00.tsv.gz
> --2020-08-02 04:26:02--  https://s3.amazonaws.com/amazon-reviews-pds/tsv/amazon_reviews_multilingual_DE_v1_00.tsv.gz
> Resolving s3.amazonaws.com (s3.amazonaws.com)... 52.216.25.14
> Connecting to s3.amazonaws.com (s3.amazonaws.com)|52.216.25.14|:443... connected.
> HTTP request sent, awaiting response... 200 OK
> Length: 241896005 (231M) [application/x-gzip]
> Saving to: ‘./data/amazom_reviews_multilingual_DE_v1_00.tsv.gz’
>
> ./data/amazom_revie 100%[===================>] 230.69M   622KB/s    in 97s
  1. Decompress the data
gunzip ./data/amazom_reviews_multilingual_DE_v1_00.tsv.gz
  1. Preprocess the data by escaping all literal \ characters to protect the Postgres COPY command on dbinit
# MacOS (BSD) sed
sed -i -e 's/\\/\\\\/g' ./data/amazom_reviews_multilingual_DE_v1_00.tsv