Nemati AI | Script-a-Video: Deep Structured Audio-visual Captions via Factorized Streams and Relational Grounding | Nemati AI