Installation

pip install -r requirements.txt

Pipeline

Run the pipeline with main.py. It performs the following steps:

  1. Search for matching books by ISBN or title
  2. Gather metadata for each book for which a match is found
  3. Download each book in the preferred file format (ideally epub, txt, or html)
  4. Extract the text from the downloaded file
  5. Extract questions from the extracted text
  6. Compute topic mixtures for the books
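
To make steps 4 and 5 more concrete, here is a minimal sketch using only the Python standard library. It is not the code in main.py: the helper names `extract_text_from_html` and `extract_questions` are hypothetical, only the html case is covered, and treating every sentence that ends in a question mark as a question is an assumed heuristic.

```python
import re
from html.parser import HTMLParser


class _TextCollector(HTMLParser):
    """Collects visible text, skipping the contents of <script> and <style>."""

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth == 0:
            self.parts.append(data)


def extract_text_from_html(html):
    """Hypothetical step 4 helper: return the visible text of an HTML string."""
    collector = _TextCollector()
    collector.feed(html)
    collector.close()
    # Join the text fragments and collapse runs of whitespace into single spaces.
    return " ".join(" ".join(collector.parts).split())


def extract_questions(text):
    """Hypothetical step 5 helper: return sentences that end with '?'."""
    # Split after '.', '!' or '?' when followed by whitespace (assumed heuristic).
    sentences = re.split(r"(?<=[.!?])\s+", text)
    return [s.strip() for s in sentences if s.strip().endswith("?")]
```

A real implementation would also need handlers for epub and txt files, and would likely need a sturdier sentence splitter, since abbreviations such as "e.g." break this one.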

The pipeline writes its output to data/library.json. Move this file to cybersym/data/library.json.
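
If you would rather script the move than do it by hand, the following sketch is one way to do it. The function name `move_library` is hypothetical and not part of the repository. The default paths are the ones given above and are resolved relative to the current working directory, which is assumed to be the repository root.

```python
import shutil
from pathlib import Path


def move_library(src="data/library.json", dst="cybersym/data/library.json"):
    """Move the generated library file to the location cybersym expects."""
    src_path = Path(src)
    dst_path = Path(dst)
    if not src_path.is_file():
        raise FileNotFoundError(f"{src_path} not found; run main.py first")
    # Create cybersym/data/ if it does not exist yet.
    dst_path.parent.mkdir(parents=True, exist_ok=True)
    shutil.move(str(src_path), str(dst_path))
    return dst_path
```

Calling `move_library()` with no arguments uses the default paths.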