Skip to content

Latest commit

 

History

History
146 lines (105 loc) · 7.07 KB

README.md

File metadata and controls

146 lines (105 loc) · 7.07 KB

Pepeketua ID GUI

This repository contains scripts and resources to develop a GUI that helps biologist identify individual Archey's frogs (Leiopelma archeyi).

Lior Uzan carried out this work, with the support of Bar Vinograd and Victor Anton.

Contributors Issues MIT License

App display screen

Overview

This project runs some scripts to process previously identified frog photos, then run a GUI that helps biologists identify individual frogs. The interface uses deep learning tools behind the scenes to extract ID vectors from new frogs and compares them to previously identified and extracted ID vectors using a nearest neighbor search.

Requirements

  • Docker

To start the app

  1. Create a folder called pepeketua_files.

  2. Rename the two previous capture excel files wharerino.xls and pukeokahu.xls.

  3. Place them in the pepeketua_files folder.

  4. Place the photo zips whareorino_a.zip whareorino_b.zip whareorino_c.zip whareorino_d.zip pukeokahu.zip in the same directory.

  5. Download all model directories from here and place them in the pepeketua_files directory as well.

  6. Download compose.yaml to the directory containing pepeketua_files (a.k.a ../pepeketua_files)

  7. Open a command line terminal in ../pepeketua_files, and type in and execute the command:

    docker compose run pepeketua-interface python process_previous_captures.py

    This step should take about an hour to complete.

  8. After the previous step finished running successfully, execute the command

    docker compose up -d
  9. Open http://localhost to access and use the app!

To close the app

Open a command line terminal to ../pepeketua_files and run the command:

docker compose down

Usage

  • After the dockers from the previous section finish processing the old capture data, the app will be available at this url.
  • See this presentation to learn about the app and how to use it.
  • After you update your capture excel sheets, run
    docker compose run pepeketua-interface python process_previous_captures.py
    again to update the internal database!

How it works

  • The scripts clean all frog sightings in the excel sheets, saves them to a SQL server and save corresponding pictures to a LMDB.
  • Then it extracts the id vectors from the frog images and saves those to Faiss indices, one per grid (to compare frogs only within their grids).
  • Then the Streamlit server is started (GUI) and is accessible at this url

Extra files saved to the shared folder:

  • pepeketua_files/parse_previous_captures.log Log for script which processes and saves previous captures to internal db.
  • pepeketua_files/extract_identity_vectors.log Log for script which extracts the id vectors from previous capture photos.
  • pepeketua_files/incorrect_filepaths.csv All rows where there is a mismatch between the photo path and the the "Frog ID #" column.
  • pepeketua_files/missing_photos.csv All rows that have no "filepath" column value.

Rebuilding base docker images

This project uses two base images specified by the following Dockerfiles. If you need to rebuild them here's how:

  • base_image.Dockerfile - This docker holds the Ubuntu packages needed to run the code. To rebuild and push it to my docker repository, download it and run the command
    docker build --no-cache -t ghostcow/pepeketua:base_image -f base_image.Dockerfile . && docker push ghostcow/pepeketua:base_image
  • python_env.Dockerfile - This docker is built from ghostcow/pepeketua:base_image and clones the repository to it's memory, then installs all python packages specified by requirements.txt. It provides the python environment used by the code. To rebuild and push it to my docker repository, download the file and run the command
    docker build --no-cache -t ghostcow/pepeketua:python_env -f python_env.Dockerfile . && docker push ghostcow/pepeketua:python_env

Enabling app to accept captures after 2020

Right now the app supports only captures from before the year 2020, and filters out the rest. To allow the app to accept captures after 2020-

  1. Close the app by following these instructions.
  2. Create a pull request to remove the date filter here. Remove or edit it to fit your needs, commit, and wait until it's merged.
  3. Rebuild the docker made from python_env.Dockerfile, see instructions here.
  4. Rerun the capture processing script by following step 7 from this section.
  5. Restart your app by following step 8 from this section.

E-mail me with any questions.

Citation

If you use this code or its models, please cite:

Uzan L, Vinograd B, Anton V (2023). Pepeketua ID - A frog identification app. https://github.com/wildlifeai/pepeketua_interface

Collaborations/questions

We are working to make our work available to anyone interested. Please feel free to contact us if you have any questions.