PMOD-Net: point-cloud-map-based metric scale obstacle detection by using a monocular camera

Dependencies

  • NVIDIA-Driver >=418.81.07
  • Docker >=19.03
  • NVIDIA-Docker2

Docker Image

  • pull

    docker pull shikishimatasakilab/pmod
  • build

    docker pull pytorch/pytorch:1.7.0-cuda11.0-cudnn8-devel
    ./docker/build.sh -i pytorch/pytorch:1.7.0-cuda11.0-cudnn8-devel

When Using Optuna

  • Pull the Docker image with the following command.
    docker pull mysql

Dataset Preparation

KITTI-360

  1. Convert the KITTI-360 dataset to HDF5 using "h5_kitti360" (a sketch follows this list).
  2. For the dataloader configuration, use ./config/kitti360-5class.json for training and ./config/kitti360-5class-ins.json for validation and evaluation.
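
As a rough sketch of step 1, the conversion might look like the following. Note that the entry point and flags shown here are assumptions for illustration only, not the confirmed interface of "h5_kitti360"; check its own documentation for the real usage.

    # Hypothetical invocation: the script name and the -i/-o flags are assumed
    # for illustration and may differ from the actual h5_kitti360 interface.
    python h5_kitti360.py -i path/of/KITTI-360 -o path/of/the/dataset/dir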

Other Datasets

  1. Refer to this page.

Launching the Docker Container

  1. Start the Docker container with the following command.
    ./docker/run.sh -d path/of/the/dataset/dir
    Usage: run.sh [OPTIONS...]
    OPTIONS:
        -h, --help          Show this help
        -i, --gpu-id ID     Specify the ID of the GPU
        -d, --dataset-dir   Specify the directory where datasets are stored
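
For example, to use GPU 0 and mount a dataset directory (the path is illustrative):

    ./docker/run.sh -i 0 -d /data/datasets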
    

Training

  1. Start training with the following command.

    python train.py -t TAG -tdc path/of/the/config.json \
      -td path/of/the/dataset1.hdf5 [path/of/the/dataset2.hdf5 ...] \
      -vdc path/of/the/config.json \
      -bs BLOCK_SIZE
    usage: train.py [-h] -t TAG -tdc PATH [-vdc PATH] [-bs BLOCK_SIZE]
                    -td PATH [PATH ...] [-vd [PATH [PATH ...]]] [--epochs EPOCHS]
                    [--epoch-start-count EPOCH_START_COUNT]
                    [--steps-per-epoch STEPS_PER_EPOCH] [-ppa]
                    [--tr-error-range TR_ERROR_RANGE TR_ERROR_RANGE TR_ERROR_RANGE]
                    [--rot-error-range ROT_ERROR_RANGE] [-ae] [-edc PATH]
                    [-ed [PATH [PATH ...]]] [--thresholds THRESHOLDS [THRESHOLDS ...]]
                    [--seed SEED] [-b BATCH_SIZE] [--resume PATH] [-amp]
                    [--clip-max-norm CLIP_MAX_NORM] [-op PATH]
                    [-o {adam,sgd,adabelief}] [-lp {lambda,step,plateau,cos,clr}]
                    [--l1 L1] [--seg-ce SEG_CE] [--seg-ce-aux1 SEG_CE_AUX1]
                    [--seg-ce-aux2 SEG_CE_AUX2] [--detect-anomaly]
    
    optional arguments:
      -h, --help            show this help message and exit
    
    Training:
      -t TAG, --tag TAG     Training Tag.
      -tdc PATH, --train-dl-config PATH
                            PATH of JSON file of dataloader config for training.
      -vdc PATH, --val-dl-config PATH
                            PATH of JSON file of dataloader config for validation.
                            If not specified, the same file as "--train-dl-config" will be used.
      -bs BLOCK_SIZE, --block-size BLOCK_SIZE
                            Block size of dataset.
      -td PATH [PATH ...], --train-data PATH [PATH ...]
                            PATH of training HDF5 datasets.
      -vd [PATH [PATH ...]], --val-data [PATH [PATH ...]]
                            PATH of validation HDF5 datasets. If not specified,
                            the same files as "--train-data" will be used.
      --epochs EPOCHS       Epochs
      --epoch-start-count EPOCH_START_COUNT
                            The starting epoch count
      --steps-per-epoch STEPS_PER_EPOCH
                            Number of steps per epoch. If it is greater than the total number
                            of datasets, then the total number of datasets is used.
      -ppa, --projected-position-augmentation
                            Disable Projected Position Augmentation
      --tr-error-range TR_ERROR_RANGE TR_ERROR_RANGE TR_ERROR_RANGE
                            Translation Error Range [m].
      --rot-error-range ROT_ERROR_RANGE
                            Rotation Error Range [deg].
      -ae, --auto-evaluation
                            Auto Evaluation.
      -edc PATH, --eval-dl-config PATH
                            PATH of JSON file of dataloader config.
      -ed [PATH [PATH ...]], --eval-data [PATH [PATH ...]]
                            PATH of evaluation HDF5 datasets.
      --thresholds THRESHOLDS [THRESHOLDS ...]
                            Thresholds of depth.
      --seed SEED           Random seed.
    
    Network:
      -b BATCH_SIZE, --batch-size BATCH_SIZE
                            Batch Size
      --resume PATH         PATH of checkpoint(.pth).
      -amp, --amp           Use AMP.
    
    Optimizer:
      --clip-max-norm CLIP_MAX_NORM
                            max_norm for clip_grad_norm.
      -op PATH, --optim-params PATH
                            PATH of YAML file of optimizer params.
      -o {adam,sgd,adabelief}, --optimizer {adam,sgd,adabelief}
                            Optimizer
      -lp {lambda,step,plateau,cos,clr}, --lr-policy {lambda,step,plateau,cos,clr}
                            Learning rate policy.
    
    Loss:
      --l1 L1               Weight of L1 loss.
      --seg-ce SEG_CE       Weight of Segmentation CrossEntropy Loss.
      --seg-ce-aux1 SEG_CE_AUX1
                            Weight of Segmentation Aux1 CrossEntropy Loss.
      --seg-ce-aux2 SEG_CE_AUX2
                            Weight of Segmentation Aux2 CrossEntropy Loss.
    
    Debug:
      --detect-anomaly      AnomalyMode
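
Putting several of the documented options together, a fuller invocation might look like this (the tag, paths, and hyperparameter values are illustrative, not recommended settings):

    python train.py -t kitti360-baseline \
      -tdc ./config/kitti360-5class.json \
      -vdc ./config/kitti360-5class-ins.json \
      -td path/of/the/dataset1.hdf5 \
      -bs BLOCK_SIZE \
      --epochs 50 -b 4 -o adam -lp plateau -amp \
      --l1 1.0 --seg-ce 1.0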
    
  2. Checkpoints are saved in the "./checkpoints" directory.

    checkpoints/
     ├ YYYYMMDDThhmmss-TAG/
     │ ├ config.yaml
     │ ├ 00001_PMOD.pth
     │ ├ :
     │ ├ :
     │ ├ EPOCH_PMOD.pth
     │ └ validation.xlsx
    

Evaluation

  1. Run the evaluation with the following command.

    python evaluate.py -t TAG -cp path/of/the/checkpoint.pth \
      -edc path/of/the/config.json \
      -ed path/of/the/dataset1.hdf5 [path/of/the/dataset2.hdf5 ...]
    usage: evaluate.py [-h] -t TAG -cp PATH -edc PATH [-bs BLOCK_SIZE]
                       -ed PATH [PATH ...] [--train-config PATH]
                       [--thresholds THRESHOLDS [THRESHOLDS ...]]
                       [--seed SEED] [-b BATCH_SIZE]
    
    optional arguments:
      -h, --help            show this help message and exit
    
    Evaluation:
      -t TAG, --tag TAG     Evaluation Tag.
      -cp PATH, --check-point PATH
                            PATH of checkpoint.
      -edc PATH, --eval-dl-config PATH
                            PATH of JSON file of dataloader config.
      -bs BLOCK_SIZE, --block-size BLOCK_SIZE
                            Block size of dataset.
      -ed PATH [PATH ...], --eval-data PATH [PATH ...]
                            PATH of evaluation HDF5 datasets.
      --nomap               No map input.
      --tr-error-range TR_ERROR_RANGE TR_ERROR_RANGE TR_ERROR_RANGE
                            Translation Error Range [m]. This is used when the
                            data do not contain poses.
      --rot-error-range ROT_ERROR_RANGE
                            Rotation Error Range [deg]. This is used when the data
                            do not contain poses.
      --train-config PATH   PATH of "config.yaml"
      --thresholds THRESHOLDS [THRESHOLDS ...]
                            Thresholds of depth.
      --seed SEED           Random seed.
    
    Network:
      -b BATCH_SIZE, --batch-size BATCH_SIZE
                            Batch Size
    
  2. Evaluation results are saved in the "./results" directory.

    results/
     ├ YYYYMMDDThhmmss-TRAINTAG/
     │ ├ YYYYMMDDThhmmss-TAG/
     │ │ ├ config.yaml
     │ │ ├ data.hdf5
     │ │ └ result.xlsx
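
For instance, to evaluate a specific checkpoint with the config.yaml saved at training time and custom depth thresholds (paths and threshold values are illustrative):

    python evaluate.py -t kitti360-eval \
      -cp checkpoints/YYYYMMDDThhmmss-TAG/EPOCH_PMOD.pth \
      -edc ./config/kitti360-5class-ins.json \
      -ed path/of/the/dataset1.hdf5 \
      --train-config checkpoints/YYYYMMDDThhmmss-TAG/config.yaml \
      --thresholds 10 20 30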
    

data.hdf5 → Video (.avi)

  1. Convert the "data.hdf5" included in the evaluation results to a video (.avi) with the following command.

    python data2avi.py -i path/of/the/data.hdf5
    usage: data2avi.py [-h] -i PATH [-o PATH] [-r]
    
    optional arguments:
      -h, --help            show this help message and exit
      -i PATH, --input PATH
                            Input path.
      -o PATH, --output PATH
                            Output path. Default is "[input dir]/data.avi"
      -r, --raw             Use raw codec
    
  2. The converted video is saved in the same directory as the input HDF5 file.
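
To choose the output path yourself, or to keep a raw (uncompressed) codec, combine the documented -o and -r flags (the output path is illustrative):

    python data2avi.py -i results/YYYYMMDDThhmmss-TRAINTAG/YYYYMMDDThhmmss-TAG/data.hdf5 \
      -o /tmp/result.avi -r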

Checkpoint (.pth) → Torch Script model (.pt)

Run this when you want to use a trained model with LibTorch, for example.

  1. Convert a checkpoint (.pth) to a Torch Script model (.pt) with the following command.

    python ckpt2tsm.py -c path/of/the/checkpoint.pth
    usage: ckpt2tsm.py [-h] [-c PATH] [-x WIDTH] [-y HEIGHT]
    
    optional arguments:
      -h, --help            show this help message and exit
      -c PATH, --check-point PATH
                            Path of checkpoint file.
      -x WIDTH, --width WIDTH
                            Width of input images.
      -y HEIGHT, --height HEIGHT
                            Height of input images.
    
  2. The converted Torch Script model is saved in the same directory as the input checkpoint.
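
If your deployment input resolution differs from the default, pass the width and height explicitly (512x256 is only an example value):

    python ckpt2tsm.py -c checkpoints/YYYYMMDDThhmmss-TAG/EPOCH_PMOD.pth -x 512 -y 256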

Training with Optuna

  1. Start the MySQL Docker container in a separate terminal.

    ./optuna/run-mysql.sh
  2. Start training with the following command.

    python optuna_train.py -t TAG -tdc path/of/the/config.json \
      -td path/of/the/dataset1.hdf5 [path/of/the/dataset2.hdf5 ...] -bs BLOCK_SIZE
    usage: optuna_train.py [-h] [--seed SEED] [-n N_TRIALS] -t TAG [-H HOST]
                           [-s {tpe,grid,random,cmaes}] -tdc PATH [-vdc PATH]
                           [-bs BLOCK_SIZE] -td PATH [PATH ...] [-vd [PATH [PATH ...]]]
                           [--epochs EPOCHS] [--epoch-start-count EPOCH_START_COUNT]
                           [--steps-per-epoch STEPS_PER_EPOCH] [-ppa]
                           [--tr-error-range TR_ERROR_RANGE]
                           [--rot-error-range ROT_ERROR_RANGE]
                           [-b BATCH_SIZE] [--resume PATH] [-amp]
                           [--clip-max-norm CLIP_MAX_NORM] [-op PATH]
                           [-o {adam,sgd,adabelief}] [-lp {lambda,step,plateau,cos,clr}]
                           [--l1 L1] [--seg-ce SEG_CE] [--seg-ce-aux1 SEG_CE_AUX1]
                           [--seg-ce-aux2 SEG_CE_AUX2] {multi} ...
    
    positional arguments:
      {multi}
        multi               Multi Objective Trial
    
    optional arguments:
      -h, --help            show this help message and exit
    
    Optuna:
      --seed SEED           Seed for random number generator.
      -n N_TRIALS, --n-trials N_TRIALS
                            Number of trials.
      -t TAG, --tag TAG     Optuna training tag.
      -H HOST, --host HOST  When using a MySQL server, specify the hostname.
      -s {tpe,grid,random,cmaes}, --sampler {tpe,grid,random,cmaes}
                            Optuna sampler.
    
    Training:
      -tdc PATH, --train-dl-config PATH
                            PATH of JSON file of dataloader config for training.
      -vdc PATH, --val-dl-config PATH
                            PATH of JSON file of dataloader config for validation.
                            If not specified, the same file as "--train-dl-config"
                            will be used.
      -bs BLOCK_SIZE, --block-size BLOCK_SIZE
                            Block size of dataset.
      -td PATH [PATH ...], --train-data PATH [PATH ...]
                            PATH of training HDF5 datasets.
      -vd [PATH [PATH ...]], --val-data [PATH [PATH ...]]
                            PATH of validation HDF5 datasets. If not specified,
                            the same files as "--train-data" will be used.
      --epochs EPOCHS       Epochs
      --epoch-start-count EPOCH_START_COUNT
                            The starting epoch count
      --steps-per-epoch STEPS_PER_EPOCH
                            Number of steps per epoch. If it is greater than the
                            total number of datasets, then the total number of
                            datasets is used.
      -ppa, --projected-position-augmentation
                            Disable Projected Position Augmentation
      --tr-error-range TR_ERROR_RANGE
                            Translation Error Range [m].
      --rot-error-range ROT_ERROR_RANGE
                            Rotation Error Range [deg].
    
    Network:
      -b BATCH_SIZE, --batch-size BATCH_SIZE
                            Batch Size
      --resume PATH         PATH of checkpoint(.pth).
      -amp, --amp           Use AMP.
    
    Optimizer:
      --clip-max-norm CLIP_MAX_NORM
                            max_norm for clip_grad_norm.
      -op PATH, --optim-params PATH
                            PATH of YAML file of optimizer params.
      -o {adam,sgd,adabelief}, --optimizer {adam,sgd,adabelief}
                            Optimizer
      -lp {lambda,step,plateau,cos,clr}, --lr-policy {lambda,step,plateau,cos,clr}
                            Learning rate policy.
    
    Loss:
      --l1 L1               Weight of L1 loss.
      --seg-ce SEG_CE       Weight of Segmentation CrossEntropy Loss.
      --seg-ce-aux1 SEG_CE_AUX1
                            Weight of Segmentation Aux1 CrossEntropy Loss.
      --seg-ce-aux2 SEG_CE_AUX2
                            Weight of Segmentation Aux2 CrossEntropy Loss.
    
  3. To train in parallel on other machines, use the -H option to specify the server running MySQL, as in the example below.
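
A worker on another machine could then join the same run like this (the hostname is illustrative; presumably the tag is what ties workers to a shared study, so keep it identical across machines):

    python optuna_train.py -t TAG -H mysql-host.example.com \
      -tdc path/of/the/config.json \
      -td path/of/the/dataset1.hdf5 -bs BLOCK_SIZE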

Citation

@article{pmodnet,
    author = {Junya Shikishima and Keisuke Urasaki and Tsuyoshi Tasaki},
    title = {PMOD-Net: point-cloud-map-based metric scale obstacle detection by using a monocular camera},
    journal = {Advanced Robotics},
    volume = {37},
    number = {7},
    pages = {458--466},
    year = {2023},
    publisher = {Taylor & Francis},
    doi = {10.1080/01691864.2022.2153080},
    URL = {https://doi.org/10.1080/01691864.2022.2153080},
    eprint = {https://doi.org/10.1080/01691864.2022.2153080}
}