图文识别
输入图片和音频合并关键帧视频
minicpm 视频理解
提取视频中的图片
提取视频中的音频
视频合并
视频识别自动分割场景
视频转换工具包
This model is cold. You'll get a fast response if the model is warm and already running, and a slower response if the model is cold and starting up.