
Store each uploaded My Clippings.txt file #16

Closed
mammuth opened this issue Sep 22, 2019 · 5 comments · Fixed by #38

Comments

mammuth (Owner) commented Sep 22, 2019

We should do this so that we can apply features like #5 retroactively to clippings that have already been uploaded.

Ideally, we'd create a model that keeps track of every upload, with a date and the txt file (associated with a user).
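
A minimal sketch of what such a model could look like, assuming Django (which the unique_together snippet later in this thread suggests); the model and field names are illustrative only, not a final design:

# hypothetical sketch -- model and field names are assumptions
from django.conf import settings
from django.db import models

class ClippingsUpload(models.Model):
    # keeps the raw "My Clippings.txt" of every upload, per user
    user = models.ForeignKey(settings.AUTH_USER_MODEL, on_delete=models.CASCADE,
                             related_name='clippings_uploads')
    uploaded_at = models.DateTimeField(auto_now_add=True)
    file = models.FileField(upload_to='clippings_uploads/')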

mammuth (Owner) commented Sep 22, 2019

It will also help with debugging issues caused by different formats (e.g. date formats on different Kindle devices).

mammuth added the "good first issue" label on May 6, 2021
mammuth mentioned this issue on May 13, 2021
mammuth (Owner) commented May 15, 2021

If we do this, we could also refactor the import process into more decoupled "jobs": the upload only stores the file, and subsequent distinct jobs parse it and store it in the DB, fetch a book cover via an external API, etc.
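
A rough sketch of how those decoupled jobs could be laid out (purely illustrative; the function names and the ClippingsUpload model come from the sketch above and are assumptions, not existing code):

# hypothetical sketch of decoupled import steps
def store_upload(user, uploaded_file):
    # Step 1: only persist the raw file, no parsing yet
    return ClippingsUpload.objects.create(user=user, file=uploaded_file)

def parse_upload(upload):
    # Step 2: parse the stored txt file and write clippings to the DB
    with upload.file.open('rb') as f:
        text = f.read().decode('utf-8-sig')
    for raw_clipping in text.split('=========='):
        ...  # parse, compute content_hash, skip duplicates

def fetch_book_covers(upload):
    # Step 3: look up cover images via an external API (could run async later)
    ...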

Obviously not needed, but it might be a nice opportunity to clean up the code a bit 😊

I think we even have Celery on the current hosting plan, so if we wanted to, we could actually run the jobs asynchronously (and if not, there might still be value in separating the process).
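
Purely as an illustration of what "asynchronously" could mean here (hypothetical, and moot given the edit below): with Celery, a job from the sketch above would just be wrapped as a task, e.g.

from celery import shared_task

@shared_task
def parse_upload_task(upload_id):
    # pass only the id so a small payload crosses the queue
    parse_upload(ClippingsUpload.objects.get(pk=upload_id))

and enqueued with parse_upload_task.delay(upload.id).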

Edit: Celery workers can only be added on the business plans, and those are way too expensive for this hobby project, so we won't get Celery unless we migrate to a different hosting solution. We could also ask them to give us Celery workers, and I'd expect that to work, but it would still mean additional costs, so it's probably not worth it.
(screenshot omitted)

JSerwatka (Contributor) commented

The idea of an async queue is great, but at this price it of course doesn't make sense.

I believe book-cover fetching will be the most time-costly part here, because to work properly it will probably need two requests (one to resolve the ISBN and one to fetch the cover).
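
To make the two-request shape concrete, a hypothetical lookup via the public Open Library API could look like this (the endpoints are Open Library's; whether the project actually uses that API is an assumption):

import requests

def fetch_cover_url(title, author):
    # Request 1: resolve the book to an ISBN via the search endpoint
    resp = requests.get('https://openlibrary.org/search.json',
                        params={'title': title, 'author': author, 'limit': 1})
    docs = resp.json().get('docs', [])
    isbns = docs[0].get('isbn', []) if docs else []
    if not isbns:
        return None
    # Request 2: the cover image itself sits behind a separate covers endpoint
    return f'https://covers.openlibrary.org/b/isbn/{isbns[0]}-M.jpg'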

JSerwatka (Contributor) commented

I've created a mind map to try to gather all possible solutions to our problem with new features and storing the latest data. You can find it here. My favorite solution is marked in green.

To compare clippings we have to use their hash; there is no other way here.

@mammuth, I'd love to hear your thoughts on that.

mammuth (Owner) commented Jul 30, 2021

> I've created a mind map to try to gather all possible solutions to our problem with new features and storing the latest data. You can find it here. My favorite solution is marked in green.

Sorry, I missed your Miro board somehow, but I just reviewed it. To be honest, I'm not 100% sure what problem we're trying to solve with your green path.
Is it about the use case where the content of a clipping changes? 🤔
Is this really a thing? For my reading workflow, it's not. I read books, make highlights, upload the highlights, and don't change the highlights afterwards. I actually don't even know how this tool behaves when adding comments; I think I never tried that 🙈

> To compare clippings we have to use their hash; there is no other way here.

Random note: We already store the hash in the DB and use it as a DB unique constraint to ensure that we're not importing a clipping twice:

unique_together = ('user', 'book', 'content_hash',)
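
For reference, a content hash along these lines would be enough for that constraint (hypothetical sketch; the project's actual hashing may differ):

import hashlib

def content_hash(content: str) -> str:
    # identical clipping text hashes to the same value, so re-imports collide
    return hashlib.md5(content.strip().encode('utf-8')).hexdigest()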
