Can I use the same code to do the opposite. i.e. Remove background music then add my own audio as a background? Also does it matter who is speaking. Does it support new speakers joining in suddenly?
Unfortunately, we don’t yet support user-provided audio as replacement content. Of course, once filtered you could add an audio track over the processed video.
In the future we can try to add this functionality, basically to add any audio track on the source video. It’s an interesting idea!
For your second question: dialogues and multi-speaker videos work pretty well. Shouldn’t be an issue
Maybe I can try implementing it. Do you know any open-source models/frameworks out there for replacing background music? Can audio be logically represented as layers like that (foreground/background)?
Can I use the same code to do the opposite. i.e. Remove background music then add my own audio as a background? Also does it matter who is speaking. Does it support new speakers joining in suddenly?
Hi, thanks for the question!
Unfortunately, we don’t yet support user-provided audio as replacement content. Of course, once filtered you could add an audio track over the processed video.
In the future we can try to add this functionality, basically to add any audio track on the source video. It’s an interesting idea!
For your second question: dialogues and multi-speaker videos work pretty well. Shouldn’t be an issue
Maybe I can try implementing it. Do you know any open-source models/frameworks out there for replacing background music? Can audio be logically represented as layers like that (foreground/background)?