Is your feature request related to a problem? Please describe.
I already posted a question about this in the Google Cloud community forum (https://www.googlecloudcommunity.com/gc/AI-ML/Structured-Output-in-vertexAI-BatchPredictionJob/m-p/862525), but I'll give a brief description of the problem here.
To evaluate an app, we want to process a large dataset with the batch prediction API. For comparability with the actual dev/prod pipeline, we need the output to be generated in the same way. We define a Pydantic BaseModel and pass it to the real-time API as the response schema, rather than only using it to validate the output afterwards. Unfortunately, this does not seem to be possible with batch predictions, nor can I use function calling to work around it.
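For reference, this is roughly how we enforce the schema with the real-time API today. It is a minimal sketch: the project, location, model name, and the Verdict model are illustrative, and the SDK's response_schema expects an OpenAPI-style dict, so we derive one from the Pydantic model.

```python
import vertexai
from vertexai.generative_models import GenerativeModel, GenerationConfig
from pydantic import BaseModel


class Verdict(BaseModel):
    # Illustrative evaluation output; our real schema is larger.
    label: str
    score: float


vertexai.init(project="my-project", location="europe-west4")  # placeholder values

model = GenerativeModel("gemini-1.5-pro")  # illustrative model name
response = model.generate_content(
    "Evaluate the following answer ...",
    generation_config=GenerationConfig(
        response_mime_type="application/json",
        # The SDK takes an OpenAPI-style dict; deriving it from the Pydantic
        # model works for simple schemas but may need manual adjustment for
        # keywords the schema format does not support.
        response_schema=Verdict.model_json_schema(),
    ),
)
print(Verdict.model_validate_json(response.text))
```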
Describe the solution you'd like
Ideally, it would be possible to set additional parameters such as response_schema in GenerationConfig(), or to use a function-calling mechanism when initializing the model, just as the real-time API allows (see the hypothetical sketch after the links below).
Relevant implementations in the Vertex AI GenerativeModel:
-> response_schema
https://cloud.google.com/vertex-ai/generative-ai/docs/multimodal/control-generated-output
-> function-calling
https://cloud.google.com/vertex-ai/generative-ai/docs/multimodal/function-calling#python-from-function
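A hypothetical sketch of what this could look like for batch prediction. The generation_config argument on BatchPredictionJob.submit() does not exist today; it is exactly the addition this request asks for, and the bucket URIs and schema are placeholders.

```python
from vertexai.batch_prediction import BatchPredictionJob
from vertexai.generative_models import GenerationConfig

# HYPOTHETICAL: submit() does not currently accept generation_config;
# this is the requested feature, not existing SDK behaviour.
job = BatchPredictionJob.submit(
    source_model="gemini-1.5-pro",                       # illustrative model
    input_dataset="gs://my-bucket/eval-requests.jsonl",  # placeholder URI
    output_uri_prefix="gs://my-bucket/eval-results/",    # placeholder URI
    generation_config=GenerationConfig(
        response_mime_type="application/json",
        response_schema={  # OpenAPI-style schema, as accepted by the real-time API
            "type": "object",
            "properties": {
                "label": {"type": "string"},
                "score": {"type": "number"},
            },
            "required": ["label", "score"],
        },
    ),
)
```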
Describe alternatives you've considered
Currently we use the real-time API and add a delay between consecutive calls to avoid hitting its rate limits. This is unreliable, since we still reach the limits occasionally. Moreover, there is no easy way to run the evaluation pipeline asynchronously, which would let us send batches with different configurations for evaluation without waiting for each response first. A rough sketch of the current workaround follows.
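This is only the rough shape of the workaround; the delay and retry counts are illustrative.

```python
import time

from google.api_core.exceptions import ResourceExhausted


def evaluate_serially(model, prompts, generation_config, delay_s=2.0, max_retries=3):
    """Serial real-time calls with a fixed delay and a retry on quota errors."""
    results = []
    for prompt in prompts:
        for attempt in range(max_retries):
            try:
                results.append(
                    model.generate_content(prompt, generation_config=generation_config)
                )
                break
            except ResourceExhausted:
                # We still hit rate limits occasionally despite the delay,
                # so back off a little more on each retry.
                time.sleep(delay_s * (attempt + 1))
        time.sleep(delay_s)
    return results
```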
Additional context
No response
Code of Conduct
I agree to follow this project's Code of Conduct