feat(js): openai chat completion streaming support #87
Conversation
// This is a streaming response
// handle the chunks and add them to the span
// First split the stream via tee
const [leftStream, rightStream] = result.tee();
Does `tee` create a two-speed scenario? For example:
- If the user cancels the stream, do we go ahead and read the whole thing, incurring additional token costs?
- What if the stream never ends?
- If a network error happens mid-stream, how does that error bubble up to the user?
I'm not fully sure about either - both are valid concerns. Right now I'm trying to avoid an extremely convoluted instrumentation. Let me add some tests and get back to you.
So if `stream.controller.abort` is called on one branch, the SSE connection is killed, which kills both streams. As long as the stream's abort controller is used, both streams technically see the same data. The real case you highlight is when, in user-land, you break out of the stream without aborting and leave the other branch to consume the rest of it. I'll look into instrumenting the `finalizeChatCompletion` method.
What if the stream never ends?
There's no real way that OpenAI would keep the stream open forever. But yeah, if the stream were kept open forever, the stream span would never terminate and would never get exported.
If network error happens mid-stream, how does that error bubble up to the user?
Technically the stream has an error event, which we could capture - I'm going to leave that part off for now because I think there's probably other instrumentation better suited for capturing that info.
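For reference, here's a minimal sketch of how the instrumentation's branch of the teed stream could record a mid-stream error and still end the span, assuming an OpenTelemetry-style span and the SDK's `Stream` type; the helper and attribute names are illustrative, not this PR's actual code:

```ts
import { SpanStatusCode, type Span } from "@opentelemetry/api";
import type { Stream } from "openai/streaming";
import type { ChatCompletionChunk } from "openai/resources/chat/completions";

// Hypothetical sketch: read the instrumentation's branch of the teed stream in
// a try/catch/finally so a mid-stream error is recorded and the span always ends.
async function recordStreamSpan(
  instrumentationStream: Stream<ChatCompletionChunk>,
  span: Span,
): Promise<void> {
  try {
    for await (const chunk of instrumentationStream) {
      // chunk handling (content accumulation, token counts, ...) goes here
      void chunk;
    }
  } catch (err) {
    // A network error mid-stream rejects the async iteration on both branches
    // of the tee, since they share a single underlying SSE connection.
    span.recordException(err as Error);
    span.setStatus({ code: SpanStatusCode.ERROR, message: String(err) });
  } finally {
    // Without this, a stream that never terminates leaves the span open
    // forever and it never gets exported -- the open question above.
    span.end();
  }
}
```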
Sounds good, thanks!
let streamResponse = "";
for await (const chunk of stream) {
  if (chunk.choices.length > 0 && chunk.choices[0].delta.content) {
    streamResponse += chunk.choices[0].delta.content;
Will need a follow-up for `function_call` and `tool_calls`, in particular the `arguments` attribute.
p.s. in which case the output value mime type would have to be JSON (instead of TEXT)
Good callout - I haven't added tool call support at all yet, so here's the ticket: #90
There's also automated function calling that I probably need to consider (https://github.com/openai/openai-node?tab=readme-ov-file#automated-function-calls), though it looks to be in beta.
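For when that follow-up lands, a rough sketch of how streamed tool calls might be accumulated, assuming the v4 `ChatCompletionChunk` shape; the helper name and the way the result gets attached to the span are hypothetical:

```ts
import type { ChatCompletionChunk } from "openai/resources/chat/completions";

// Hypothetical sketch for the #90 follow-up: accumulate streamed tool calls by
// index. OpenAI emits each tool call's `arguments` as partial JSON string
// fragments across chunks, so they must be concatenated before parsing.
type AccumulatedToolCall = { id?: string; name?: string; arguments: string };

function accumulateToolCalls(
  acc: Map<number, AccumulatedToolCall>,
  chunk: ChatCompletionChunk,
): void {
  const delta = chunk.choices[0]?.delta;
  for (const toolCall of delta?.tool_calls ?? []) {
    const entry = acc.get(toolCall.index) ?? { arguments: "" };
    if (toolCall.id) entry.id = toolCall.id;
    if (toolCall.function?.name) entry.name = toolCall.function.name;
    if (toolCall.function?.arguments) entry.arguments += toolCall.function.arguments;
    acc.set(toolCall.index, entry);
  }
}
```

Once the stream ends, each accumulated `arguments` string should parse as complete JSON, which lines up with the point above about the output value mime type being JSON instead of TEXT.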
resolves #39

Adds the ability to get LLM spans from streaming responses. It does this by splitting the stream via `.tee()` and iterating over the chunks.
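A rough end-to-end sketch of that approach, for context; the model, attribute names, and span wiring here are illustrative, not the exact code in this PR:

```ts
import OpenAI from "openai";
import { trace } from "@opentelemetry/api";

// Rough shape of the approach: tee the stream returned by the SDK, hand one
// branch back to the caller untouched, and iterate the other branch to fill
// in the LLM span.
async function main() {
  const client = new OpenAI();
  const span = trace.getTracer("openai-instrumentation-sketch").startSpan("ChatCompletion");

  const result = await client.chat.completions.create({
    model: "gpt-3.5-turbo",
    messages: [{ role: "user", content: "Hello!" }],
    stream: true,
  });

  // Split the stream: the caller iterates one branch, instrumentation the other.
  const [leftStream, rightStream] = result.tee();

  // Instrumentation branch, consumed in the background.
  const recordSpan = (async () => {
    let streamResponse = "";
    for await (const chunk of rightStream) {
      if (chunk.choices.length > 0 && chunk.choices[0].delta.content) {
        streamResponse += chunk.choices[0].delta.content;
      }
    }
    span.setAttribute("output.value", streamResponse);
    span.end();
  })();

  // "User-land" branch.
  for await (const chunk of leftStream) {
    process.stdout.write(chunk.choices[0]?.delta?.content ?? "");
  }
  await recordSpan;
}

main().catch(console.error);
```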