Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

User stories/usecases/experiences (please add!) #626

Open
dvdsk opened this issue Oct 3, 2024 · 19 comments
Open

User stories/usecases/experiences (please add!) #626

dvdsk opened this issue Oct 3, 2024 · 19 comments

Comments

@dvdsk
Copy link
Collaborator

dvdsk commented Oct 3, 2024

A collection of user stories/usecases & experiences to be used to inform (breaking) api changes.

Please add your own!

@dvdsk dvdsk pinned this issue Oct 3, 2024
@dvdsk dvdsk changed the title User stories User stories (please add!) Oct 3, 2024
@dvdsk dvdsk changed the title User stories (please add!) User stories/applications (please add!) Oct 4, 2024
@dvdsk dvdsk changed the title User stories/applications (please add!) User stories/usecases/experiences (please add!) Oct 4, 2024
@ugochukwu-850
Copy link
Contributor

@dvdsk , should we add it here .

@dvdsk
Copy link
Collaborator Author

dvdsk commented Oct 4, 2024

please do, we can always move it to its own issue if we have questions. And thank you very much!

@dvdsk
Copy link
Collaborator Author

dvdsk commented Oct 19, 2024

Something that seems very hard/impossible with rodio now:

A user adds two songs to a queue some time after eachother. Then they enable cross-fade while the first song is almost at the end but just before adding the second song. Crossfade then works and the two songs fade into each-other.

The music plays without any stuttering while crossfade is toggled on and off.

I am trying to write an example on how to do this using the current API, maybe I am wrong and it is relatively doable. In that case an example would help out.

@bluenote10
Copy link

I'm simply adding a few use cases I had in the past -- without re-checking if they would currently be possible already.

Sample perfect scheduling

Think of a metronome app that needs to schedule its click sound dynamically (the user can control bpm e.g. via a slider), but with utmost temporal precision (you don't want any fluctuation in a metronome). This requires a to have sample/frame perfect scheduling, i.e., one would need to express start playing this sound at sample index i=xxx.

Sample perfect looping

Sometimes I had to loop sounds also without any gap in the audio output (note that waiting for the sound to finish, and re-schedule it again introduces a gap). This would require to set up a loop information that specifies same (loop_sample_index_from, loop_sample_index_upto) so that the playback can implicitly wrapped-around once the "upto" sample is reached. Ideally the playback interface would expose e.g. a loop_counter to allow users to query once a loop-wrap occurred. The most obvious use case for that are "infinite" background sounds that are constructed in a way that they loop around seamlessly, and simply setting the loop specification to (0, last_sample) would repeat the entire sound indefinitely.

Low-latency scheduling

I was once trying to implement a kind of drum synthesizer in rodio, where certain key-presses triggered certain sounds. However I noticed that the input latency was way too large for musical purposes. I would assume that mainly comes down to the large default buffer size, and #512 should improve that.

@dvdsk
Copy link
Collaborator Author

dvdsk commented Oct 21, 2024

I'm simply adding a few use cases I had in the past

Very helpful thank you!

@nednoodlehead
Copy link

My primary use case:

Rust-based music player (iced gui).

I got pretty much everything I need out of rodio, without too much hassle either. (All inside of this function).

The only thing that I am waiting on (or perhaps, I do not know how to implement. And frankly, haven't tried too much) is crossfade between songs. I do see the fade in / fade out stuff. But that applies more to a setup where you take source A, and source B, then add them into the sink or whatever, then apply it. I add my sources more 'dynamically'. Where once the playing song ends a new one is appended. So there is no point where I can apply it.

I had an idea for a potential workaround, but I have literally 0 clue how this library works under the hood (and not too much experience with this type of programming, so I don't think I could help too much). But be able to somehow change when sink.is_empty() returns true, and for example, if you choose to have a crossfade of 5 seconds, the sink would return true 5 seconds before it actually ends (and it is fading out during this time). Idk if this would create bugs or not, or if this is possible, but it is my first thought for making my idea work with my use case.

Also, weird bug, when music is playing, and I drag my scrubbing bar, the timer got reset to 0 nano seconds for a split second my comment here. Dunno if it is related at all.

@dvdsk
Copy link
Collaborator Author

dvdsk commented Oct 21, 2024

Thanks for all the feedback!

I add my sources more 'dynamically'. Where once the playing song ends a new one is appended. So there is no point where I can apply it.

This is an issue I ran into myself too, good to hear others have trouble with it too. We should make this easy/doable and add a full example.

@tramhao
Copy link

tramhao commented Nov 8, 2024

I used rodio for music playback in my project termusic: https://github.com/tramhao/termusic .
Later on, when more features needed, but rodio is not merging the PRs. I have to use the code from Rodio and merge the necessary code myself.

@dvdsk
Copy link
Collaborator Author

dvdsk commented Nov 8, 2024

I used rodio for music playback in my project termusic: https://github.com/tramhao/termusic . Later on, when more features needed, but rodio is not merging the PRs. I have to use the code from Rodio and merge the necessary code myself.

Could you specify which features you specifically need for your app?

@dvdsk
Copy link
Collaborator Author

dvdsk commented Nov 11, 2024

From #636 by callum-jones19

Hi there!

I just had a brief issue dealing with the following situation:

You create a new sink.
You open two sources from files.
Both sources are appended to the sink
During playback of source 1, you decide that you want to remove source 2, and append a new source in its place.

However, I want to do this without interrupting the flow of playback. This is for a music player, so the possibility of the next upcoming song changing (say, because of a queue change) without interrupting the current playback (stutters, moment of silence, etc) is quite important. I would also like to keep the gapless playback functionality.

Right now, the only solution I could think of is having a secondary sink which is switched out, but this seems a bit cumbersome and likely to cause one of the issues I mentioned above. The way my player works right now is there is a Vec of song file paths (my 'lightweight' queue). When you add a song to this file path queue, it checks to see if the sink contains < 4 song sources in it. If it does, then it opens the file path into an actual source, and adds it to the sink. An EmptyCallback that sits after this then runs whenever the song source ends, and opens the next song path in the file path vec into a Decoder, which is then appended to the sink.

I do this so that I can have a very large queue that is easy to control (plus has song struct info in it) with minimal open file pointers, etc, but then the sink is constantly kept at 2 songs ready to go (unless there is only 1 left to play) so gapless playback works. This works well, but like I said, if I want to change the next song in the queue during playback of the first source, I am not sure how to remove the second source from the sink and append a new one in its place.

Thanks for all the hard work on Rodio - it's been incredibly helpful, and I've learnt a lot working with the library. Hopefully when I understand it a bit more fully I can contribute back to it too :)

@IRSMsoso
Copy link

IRSMsoso commented Nov 12, 2024

Some of these are pretty specific and my fall outside the scope of this project/may be implemented I just haven't found them yet, buuut...

I would love easier ways to manage channels. Including:

  • Ways of taking certain channels of a source (for example, taking the first channel of a wave file and making it mono)
  • Customizing the way that channels from a source are mixed up and down to different numbers of channels (for example a stereo wave file mixing up to any number of device channels).
  • Ways for sources to be aware of the channel situation of the output (or maybe the next channel configuration in the chain?) (for example, a source that always has the same number of channels as the device output.
    • This is useful if, for example, you wanted to implement a source that does the sound design trick of delaying each channel of a mono non-transient audio recording and pumping that out to each and every speaker.

@dvdsk dvdsk mentioned this issue Dec 6, 2024
dvdsk added a commit that referenced this issue Dec 6, 2024
Rodio has existed for about 9 years. When it was written there where no
audio libraries for rust. Therefore rodio can do everything but not
everything perfectly. A lot has changed since then and new libraries
have popped up for specific goals such as game-audio (kira) & digital
signal processing. This is a good thing in my opinion, when developing a
library you have to make choices and they exclude some use cases.

When I started maintaining rodio I had a short exchange with the bevy
devs (1/3 of rodio downloads are from bevy) about their planned move from
rodio to kira. Note the have not migrated to kira as of this writing.
See bevyengine/bevy#9076 (comment)

I also had a short exchange with the kira dev to see if kira could
support all audio usecases, that seems to not be the case.
See: tesselode/kira#87

Over the past few weeks we have collected the use-cases for rodio, see
issue #626.

After talking to the bevy dev and kira dev I made some goals for rodio
in my head. Now that rodio is (very) actively maintained we need to
write those down and discuss them. That way we do not make contradicting
decisions in rodio.

Please let me know what you think.
@IRSMsoso
Copy link

IRSMsoso commented Dec 8, 2024

Also, I've used the crate Hodaun as an alternative to Rodio in the past for some of the features it has, mostly automation of parameters. Could be a good place to look for desirable features.

@dvdsk
Copy link
Collaborator Author

dvdsk commented Dec 8, 2024

Also, I've used the crate Hodaun as an alternative to Rodio in the past for some of the features it has, mostly automation of parameters. Could be a good place to look for desirable features.

very useful, we are definitely looking at it closely!

@geeseofbeverlyroad
Copy link

For my use case, which is a Tauri-based music player (a Rust app with a web UI), it would help a lot to have a callback assignable to source changes directly on the Sink. When one source ends and possibly another begins, it would be great if there's a callback that provides both the ended and newly-begun (if any) source as function params.

@dvdsk
Copy link
Collaborator Author

dvdsk commented Dec 22, 2024

from #349 audio processing without playback (as some sort of lighter implementation of DASP).

@roderickvd
Copy link
Contributor

I maintain two projects that use Rodio:

  • librespot - the open source Spotify library & player;
  • pleezer - the same for Deezer.

At one point in time, librespot considered moving to Rodio altogether but in the end did not, because there are also special kind of backends like PulseAudio, SDL, and GStreamer. So, direct ALSA stayed as well but Rodio is the default backend.

To accomodate all those backends, librespot does most of the heavy lifting itself:

  • decoding audio with lewton (legacy) or Symphonia (default)
  • volume attenuation
  • normalisation & dynamic limiting

As such, librespot doesn't use any of its DSP capabilities, rather, uses it as a simple abstraction over the system audio backends. At one point we tried moving to cpal directly. That was a learning experience, as we found out that more than a few sound cards on Windows support 48 but not 44.1 kHz output.

So, I learned that Rodio is nice because it takes care of that resampling. Feed it audio, boom, it works without thinking. Nice. Yes we could have added resampling to librespot, but it's where things start to get out of scope for a music streaming library: better hook it up to ALSA plughw, feed it to CamillaDSP (which uses Rubato), or rely on something else that's made for that.

In pleezer, none of those special PulseAudio / GStreamer / whatever backends scratch my itch. My goal was to have careless cross-platform audio playback on Linux, macOS and Windows. So needs were again: decoding, playback, volume attenuation, normalisation, resampling. The recent addition of try_seek was timely, so Rodio did everything I needed it to.

As I went head-first at integrating it into pleezer more tightly than in librespot (which just feeds Rodio samples) I did encounter a number of less-than-ideal situations which I'll contribute to #654.

@ChenHaolinOlym
Copy link

Hello maintainers,

Thank you for your incredible work on this crate! I've been building a music player based on Rodio's Sink. I know there's currently a plan on refactoring Sink or introducing a new Player struct, so the following is my experience and some thoughts on this. I would refer to the new struct that would take on the current role of Sink as Player for clarity.


1. Difficulty in Implementing a Playlist

I’ve encountered the same issue as described in #636 . Since it’s already on the list, I’ll just add some additional thoughts here.

The key challenges are:

  1. Playlists can be large, making it impractical to open all files at the start.
  2. For gapless playback, at least two songs need to be preloaded into Source.

If the new Player implements Source, it could remove the need to dequeue the upcoming Source object manually if there's a queue change. We could have something similar to the Queue logic currently, where the next Source is loaded when the current Source returns None.

However, we still need to preload file paths/URLs into Source. Most of the time, we could just preload the upcoming sound. But if there's a queue change at the end of a sound, gaps between sounds may still occur.


2. cpal::Stream and the !Send + !Sync Constraint

Due to cpal::Stream being !Send + !Sync, we typically end up using two mpsc channels and an enum to manage playback. I wonder if there’s a more elegant way to handle this.

If not, and if a new Player struct is introduced, would it make sense to encapsulate this logic within the struct itself, simplifying the interface for users?


3. Metadata Attached to a Source and Callbacks for Source Completion

These are features I’ve found myself wishing for:

  1. Total Duration: I'd like to be able to get the total_duration of the currently playing Source from the decoder. Adding a total_duration_of_current_source method to the current Sink, which unwraps the queue to the currently playing Source and returns result of total_duration(), should work.
  2. Metadata: I'd like to be able to read metadata, seek back to the beginning, open files to Source, and attach metadata to Source. This would save the need to reopen files for metadata retrieval.
  3. Callbacks: It would be helpful to have built-in callbacks for when a Source finishes playing. This would allow GUI updates to be triggered. While EmptyCallback can achieve this currently, I think native support in the new Player would be better.

4. Device Switching

Building on point 2, it might be beneficial to keep the OutputStream inside the new Player. Since I'm a Windows user, and cpal doesn't support following default output device change on Windows #463 , I was wondering about the possibility of manually switching devices.

I understand this may not be a common requirement for most users.


5. Code Organization

The current structure of the source folder can feel overwhelming for newcomers trying to understand Rodio’s features and functionality. Perhaps splitting it into separate folders—one for wrappers that modify a Source and another for generators that create a new Source—could improve clarity.


Thank you again for maintaining this fantastic crate and for all the work you’ve put into it. I’d love to hear more about your plans and vision for the new Player, and I’m happy to contribute to Rodio in any way I can.

@dvdsk
Copy link
Collaborator Author

dvdsk commented Jan 18, 2025

First off all thank you for contributing your thoughts and love your offer to contribute, I'm going to make use of that when we have a sketch of what Player will look like. I have a few questions but little time to I am going to ask one now and the rest well follow someday later.

  1. Playlists can be large, making it impractical to open all files at the start.
  2. For gapless playback, at least two songs need to be preloaded into Source.

If the new Player implements Source, it could remove the need to dequeue the upcoming Source object manually if there's a queue change. We could have something similar to the Queue logic currently, where the next Source is loaded when the current Source returns None.

However, we still need to preload file paths/URLs into Source. Most of the time, we could just preload the upcoming sound. But if there's a queue change at the end of a sound, gaps between sounds may still occur.

I wonder where the line should lie between rodio and your own app. I can imagine we could provide some of that but it would end up quite close to maintaining a Vec of Box<dyn Fn() -> Result<Box<dyn Source >, Box<dyn Error>>, that's not particularly easy either. It might be better to just have 3 sources in the rodio player's queue (?). It can then manage cross-fade and gap-less and the user would get a callback when the player needs a new source. Something like this:

fn get_next_source(new_source_id: rodio::PlayerQueueId, app_state: Arc<State>) -> Box<dyn Source> {
  // store the id in your app so you can use it to cancel/skip this source later.
}

let state_clone = state.clone();
let (player_handle, player_source) = Player::builder()
  .queue_depth(3)
  .need_new_source_callback(move |id| get_next_source(state))
  .crossfade(Duration::from_secs(5));
stream.mix(player_source);

// somewhere in another thread (lets make handle clone & sync & send & debug etc etc)
player_handle.skip(source_id);
player_handle.total_duration_for(another_source_id);

@PetrGlad
Copy link
Collaborator

Among sources there are several with too specialized or duplicating functionality. So that can be cleaned up also.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

10 participants