Fr0zencr4nE
/

Cockatiel-13B

Video-Text-to-Text

Model card Files Files and versions Community

Cockatiel-13B / README.md

Fr0zencr4nE's picture

Update README.md

6ea9361 verified about 1 month ago

|

history blame contribute delete

732 Bytes

metadata

license: cc-by-4.0
library_name: transformers
pipeline_tag: video-text-to-text

A competitive and human-aligned detailed video captioner model based on VILA-v1.5-13B and described in Cockatiel: Ensembling Synthetic and Human Preferenced Training for Detailed Video Caption.

This model produces detailed captions for input video, as presented in Cockatiel: Ensembling Synthetic and Human Preferenced Training for Detailed Video Caption.

For more details, please refer to our project page: https://sais-fuxi.github.io/projects/cockatiel

Code: https://github.com/Fr0zenCrane/Cockatiel