AudioScope: in-the-wild on-screen sound enhancement

Input video

Video clip from "MVI_0917" by jeminichronicles, license: CC-BY-SA 2.0.

Audio clip from "MVI_0917" by jeminichronicles, license: CC-BY-SA 2.0.


On-screen estimate from the input video and corresponding attention map

Video clip from "MVI_0917" by jeminichronicles with modified audio and overlaid attention map, license: CC-BY-SA 2.0.





Separated sources with input video and corresponding attention maps

Video clip from "MVI_0917" by jeminichronicles with modified audio and overlaid attention map, license: CC-BY-SA 2.0.


Video clip from "MVI_0917" by jeminichronicles with modified audio and overlaid attention map, license: CC-BY-SA 2.0.


Video clip from "MVI_0917" by jeminichronicles with modified audio and overlaid attention map, license: CC-BY-SA 2.0.


Video clip from "MVI_0917" by jeminichronicles with modified audio and overlaid attention map, license: CC-BY-SA 2.0.



Attention maps over time for on-screen estimate

Still frames from "MVI_0917" by jeminichronicles with overlaid attention maps, license: CC-BY-SA 2.0.