Adds blog on patching a Groq client instance to add tracing support + some general principles on tracing customization #135
base: main
Conversation
Preview for 7c58ae4
Overall, this is fantastic. Minor nits and comments.
[MLflow Tracing](https://mlflow.org/docs/latest/llms/tracing/index.html) is an observability tool in MLflow that captures detailed execution traces for GenAI applications and workflows. In addition to inputs, outputs, and metadata for individual calls, MLflow tracing can also capture intermediate steps such as tool calls, reasoning steps, retrieval steps, or other custom steps.
MLflow provides [built-in Tracing support](https://mlflow.org/docs/latest/llms/tracing/index.html#automatic-tracing) for many popular LLM providers and orchestration frameworks. If you are using one of these providers, you can enable tracing with a single line of code: `mlflow.<provider>.autolog()`. While MLflow's autologging capabilities cover many of the most widely-used LLM providers and orchestration frameworks, there may be times when you need to add tracing to an unsupported provider or customize tracing beyond what autologging provides. This post demonstrates how flexible and extensible MLflow Tracing can be by:
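For a supported provider, that one line looks like this (OpenAI shown as an example; the same pattern applies to other providers with autologging support):

```python
import mlflow

# Enables automatic tracing for all OpenAI calls in this session.
mlflow.openai.autolog()
```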
Might be worth mentioning that we do support adding span collection to autolog-enabled sessions (although this feature is only available in MLflow >= 2.19.0).
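For reference, a rough sketch of what that could look like, combining a manually traced function with autologging (the function name and body here are just placeholders):

```python
import mlflow

mlflow.openai.autolog()

@mlflow.trace(span_type="CHAIN")
def answer_question(question: str) -> str:
    # Autologged provider calls made inside this function are collected
    # as child spans of the manual span (MLflow >= 2.19.0, per the comment).
    ...
```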
Can you expand on this point? Not sure what this means in practice.
When adding tracing to a new provider, the main task is to map the provider's API methods to MLflow Tracing spans with appropriate span types.
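To make that concrete, here is a minimal sketch (not the post's final implementation) that maps a single chat-completion call to a chat-model span; `client` is assumed to be an instantiated Groq client:

```python
import mlflow
from mlflow.entities import SpanType

def traced_create(client, **kwargs):
    # Map the provider method to a span with an appropriate span type.
    with mlflow.start_span(
        name="chat.completions.create", span_type=SpanType.CHAT_MODEL
    ) as span:
        span.set_inputs(kwargs)
        response = client.chat.completions.create(**kwargs)
        span.set_outputs(response)
        return response
```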
3. **Structure and preserve key data:** For each operation we want to trace, we need to identify the key information we want to preserve and make sure it is captured and displayed in a useful way. For example, we may want to capture the input and configuration data that control the operation's behavior, the outputs and metadata that explain the results, errors that terminated the operation prematurely, etc. Looking at traces and tracing implementations for similar providers can provide a good starting point for how to structure and preserve these data, as sketched below.
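As a rough illustration of this point (with a hypothetical `retriever`, `query`, and `top_k`):

```python
import mlflow
from mlflow.entities import SpanType

with mlflow.start_span(name="retrieve_docs", span_type=SpanType.RETRIEVER) as span:
    span.set_inputs({"query": query, "top_k": top_k})   # inputs and configuration
    docs = retriever.search(query, top_k=top_k)         # hypothetical retriever
    span.set_attributes({"num_results": len(docs)})     # metadata about the results
    span.set_outputs(docs)                              # outputs
# An exception raised inside the block is recorded on the span,
# so operations that terminate prematurely are preserved on the trace too.
```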
Do we want to mention tagging here for things like capturing session information?
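For example, something like the following (assuming `mlflow.update_current_trace` is available, and with a hypothetical `session_id`):

```python
import mlflow

with mlflow.start_span(name="chat_turn") as span:
    # Tag the active trace so spans can be grouped by session later.
    mlflow.update_current_trace(tags={"session_id": session_id})
    # ... run the traced work here ...
```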
A few points to note:
- We are wrapping a method on a client *instance*, not a class. This is a fairly lightweight approach that does what we need it to do without requiring changes to the Groq SDK code.
Might want to use the term "patching" instead of "wrapping". We are still guerrilla patching here by overriding the existing implementation. Wrapping would be more like keeping the return value of `trace_groq_chat` as a separate method reference and calling that, rather than overriding the method on the instance.
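For illustration, a minimal sketch of the distinction (assuming `client` is a Groq client instance and `trace_groq_chat` is the decorator from the post):

```python
# Patching: override the method on the instance in place, so every
# existing call site picks up the traced version automatically.
client.chat.completions.create = trace_groq_chat(client.chat.completions.create)

# Wrapping: keep a separate traced callable that call sites use
# explicitly, leaving the client instance untouched.
traced_create = trace_groq_chat(client.chat.completions.create)
```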
### Step 3: Wrap the instance method and try it out
Now that we have a tracing decorator, we can wrap the `chat.completions.create` method on a Groq client instance and try it out.
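A minimal sketch of what trying it out might look like (the model name is just an example; `trace_groq_chat` is the decorator from the previous step):

```python
from groq import Groq

client = Groq()  # assumes GROQ_API_KEY is set in the environment

# Patch the bound method on this client instance with the traced version.
client.chat.completions.create = trace_groq_chat(client.chat.completions.create)

response = client.chat.completions.create(
    model="llama3-70b-8192",  # example model name
    messages=[{"role": "user", "content": "What is MLflow Tracing?"}],
)
```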
"apply our patch" instead of "wrap"
![Tool Calls](./5_tool_call.png)
## Orchestration: Building a tool calling
fragment?
```python
if func is None:
    raise ValueError(f"No implementation for tool: {tool_call.function.name}")

result = func(**tool_inputs)
```
We might want to warn users about executing a function locally that is non-deterministic (it's not safe to ask an LLM to generate code for execution and run that callable body within the local environment process). Deterministic functions are fine, though :)
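For instance, a sketch of limiting execution to a fixed registry of known, deterministic tools (`get_weather` and `convert_units` are hypothetical):

```python
# Dispatch only through a fixed registry of vetted, deterministic tools;
# never eval or exec code that the model itself generated.
TOOL_REGISTRY = {
    "get_weather": get_weather,      # hypothetical deterministic tool
    "convert_units": convert_units,  # hypothetical deterministic tool
}

func = TOOL_REGISTRY.get(tool_call.function.name)
if func is None:
    raise ValueError(f"No implementation for tool: {tool_call.function.name}")
result = func(**tool_inputs)
```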
cc @B-Step62 @daniellok-db for inputs on tracing messaging for this blog :D
Incorporating these changes means that we need to hold this until the 2.20 release (mid-January), but I feel we don't want to publish a blog that we know will become stale in a month.
Thanks @B-Step62! I'll hold off until January and then revise with the updated approach & a different provider.