
fetch() stream capable? #1

Open
mindplay-dk opened this issue Dec 11, 2023 · 4 comments

@mindplay-dk

Hi,

do you know if there's any way to add body streams from fetch calls with this library?

(e.g. downloading a list of files and compressing them on the fly.)

@hpp2334
Owner

hpp2334 commented Dec 12, 2023

Currently there are only a few simple utility functions, such as adding text files and streaming-serializing JavaScript objects into JSON strings to add them as files.
I think, if necessary, a utility API could be added along these lines:

```ts
function createFetchDataGenerator(factory: () => Promise<Response>)
```

Users would probably use it like this:

```js
zip.addFile('a.bin', createFetchDataGenerator(() => fetch('xxx')))
```
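A minimal sketch of what such a helper might look like, assuming the zip writer can consume an async generator of `Uint8Array` chunks. The name and signature follow the proposal above; none of this is the library's actual API:

```typescript
// Hypothetical sketch: wrap a fetch call and yield the response body as
// Uint8Array chunks, so the zip writer can consume it as a stream
// instead of buffering the whole file in memory.
async function* createFetchDataGenerator(
  factory: () => Promise<Response>
): AsyncGenerator<Uint8Array> {
  const resp = await factory()
  if (!resp.ok || !resp.body) {
    throw new Error(`fetch failed with status ${resp.status}`)
  }
  const reader = resp.body.getReader()
  try {
    while (true) {
      const { done, value } = await reader.read()
      if (done) break
      if (value) yield value
    }
  } finally {
    reader.releaseLock()
  }
}
```

The generator only pulls the next chunk when the zip writer asks for it, so backpressure is handled by the underlying `ReadableStream`.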

@mindplay-dk
Author

Looks reasonable. :-)

createFetchDataGenerator could internally use a promise pool with a few "threads", to keep the line busy when downloading many small files by keeping a few concurrent fetches going.

Alternatively, for more control, maybe something like createFetchDataPool(4), which would return the createFetchDataGenerator function bound to a promise pool of e.g. 4 concurrent promises?

(It's usually a good idea to give people the option: some browsers limit the total number of connections, so if your app needs 1 or 2 connections for itself, you might want 4 or 5 concurrent promises, or you might want no more than 2 or 3 to ease the server load; requirements are often different here.)
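A promise pool of the kind described here can be sketched in a few lines. The helper name `createTaskPool` is an illustration, not anything this library provides; `createFetchDataPool(4)` could then be a thin wrapper binding fetch generators to such a pool:

```typescript
// Minimal promise pool: at most `concurrent` tasks run at once; further
// tasks wait in a FIFO queue until a slot frees up.
function createTaskPool(concurrent: number) {
  let active = 0
  const waiting: Array<() => void> = []
  return async function run<T>(task: () => Promise<T>): Promise<T> {
    // re-check after waking, in case another caller took the free slot
    while (active >= concurrent) {
      await new Promise<void>((resolve) => waiting.push(resolve))
    }
    active++
    try {
      return await task()
    } finally {
      active--
      waiting.shift()?.() // wake one queued task, if any
    }
  }
}
```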

@hpp2334
Owner

hpp2334 commented Dec 12, 2023

As far as I know, the files in the zip are added one after another. For example, there is no way to add 50% of the content of a.bin first, then 60% of the content of b.bin, and then the remaining 50% of a.bin: all of a.bin must be added before any of b.bin. So concurrency doesn't actually help here; when considering "streaming", requests should be made one after the other.

If "streaming" and memory usage are not a concern (for example, the requested files are small), a third-party AsyncTaskPool can be used to request and read the whole content, then package it into the zip. For example:

```js
const asyncTaskPool = new AsyncTaskPool({ concurrent: 4 })
const zip = new StreamZip()
// fetch files and add them to the zip
toRequests.forEach(([addr, filename]) => {
    asyncTaskPool.add(() => fetch(addr).then(async (resp) => {
        // read the whole body into memory
        const buf = await resp.arrayBuffer()
        zip.add(filename, createBytesDataGenerator(new Uint8Array(buf)))
    }))
})
```

@mindplay-dk
Author

> As far as I know, the files in the zip are added one after another. For example, there is no way to add 50% of the content of a.bin first, then add 60% of the content of b.bin, and then add the remaining 50% of the content of a.bin.

Yes, that's not what I'm asking for. 🙂

But you can start opening the fetch requests ahead of time. If you're downloading a lot of smaller files, that makes sense: otherwise you wait to connect/request/open each individual file in turn, which introduces a lot of latency and is going to be very slow. Opening a few requests ahead of time, before you start adding them to the zip stream, will make this a lot faster.
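This idea can be sketched as: start all the requests up front (the browser's connection limit naturally bounds how many are actually in flight), then consume the responses strictly in the order the entries should appear in the zip. `prefetchInOrder` and its `handle` callback are hypothetical names; `fetchFn` is a parameter only so the sketch can be exercised without a network:

```typescript
// Start every request eagerly, then await the responses in entry order.
// Downloads overlap, but bodies are still consumed one after another,
// matching the sequential structure of a zip stream.
async function prefetchInOrder(
  entries: Array<[url: string, filename: string]>,
  handle: (filename: string, body: Uint8Array) => void,
  fetchFn: (url: string) => Promise<Response> = fetch
): Promise<void> {
  // kick off all fetches immediately; they resolve concurrently
  const pending = entries.map(
    ([url, filename]) => [filename, fetchFn(url)] as const
  )
  for (const [filename, respPromise] of pending) {
    const resp = await respPromise // later requests are already in flight
    const buf = new Uint8Array(await resp.arrayBuffer())
    handle(filename, buf) // e.g. zip.add(filename, createBytesDataGenerator(buf))
  }
}
```

For very large files you would still want to stream each body rather than buffer it, but the ordering idea is the same: only the *start* of each request is concurrent, not the consumption.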
