Specifying number of speakers in the Huggingface Inference API #1088

Jeannotisintheplace · 2022-09-21T15:40:18Z

Jeannotisintheplace
Sep 21, 2022

Hi team,

Could you please explain how to pass the known-in-advance number of speakers to the API ?

Shall "num_speaker" be specified in the Headers (or parameters?), and in which format ?

Sorry for possible trivial question but thanks in advance,

Julien

hbredin · 2022-09-22T07:10:44Z

I do not think this is supported.

I guess it should be possible as there are tasks for which options can be passed to the API:
https://huggingface.co/docs/api-inference/detailed_parameters#fill-mask-task

cc @julien-c who might be able to indicate what change should be made to add this feature to the API.

This is more or less what is run on Huggingface servers:

pipeline = Pipeline.from_pretrained("pyannote/speaker-diarization")
pipeline("audio.wav")

This is what @Jeannotisintheplace would like to run:

pipeline = Pipeline.from_pretrained("pyannote/speaker-diarization")
pipeline("audio.wav", num_speakers=2)

0 replies