Towards Long Video Understanding via Fine-detailed Video Story Generation

Published in TCSVT, 2024