Publication year: | 2025 |
---|---|
Subject terms: | Computer Science - Multimedia, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Sound, Electrical Engineering |
Description: | Current vision-guided audio captioning systems frequently fail to address audiovisual misalignment in real-world scenarios, such as dubbed c |
Database: | arXiv |