VoViT: Low Latency Graph-based Audio-Visual Voice Separation Transformer
Under review, 2022
AV Transformer for voice separation
Recommended citation: Juan F. Montesinos, Venkatesh S. Kadandale, Gloria Haro (2022). "VoViT: Low Latency Graph-based Audio-Visual Voice Separation Transformer." Under review. https://arxiv.org/abs/2104.09946