Batch Multimodal Video Generation

Batch generate videos by combining any mix of audio, video, images, and text through multimodal driving.

meitu batch video-multimodal-generate

Note: Batch processing is available from meitu-cli v2.1.1 onward. Please upgrade if you're on an older version.

Usage Examples

# Config file mode (config file required)
meitu batch video-multimodal-generate \
  --config ./batch.video-multimodal-generate.yaml \
  --output-dir ./outputs \
  --json

Config File Example

version: 1
defaults:
  outputDir: ./outputs
items:
  - prompt: Generate a visualization video driven by music
    audio: ./music/song.mp3
  - prompt: Generate a new video referencing the style of a sample clip
    referenceVideo: ./refs/style.mp4

Parameter Reference

ParameterRequiredDescription
--output-dirYesOutput directory
--configYesPath to YAML/JSON config file (config file is required)
--concurrencyNoType: number; Default: 3; Number of parallel tasks (recommended 1-2 for videos)
--max-retriesNoType: number; Default: 0; Number of retries on failure
--skip-existNoSkip existing output files
--dry-runNoPreview the plan without executing
--no-progressNoDisable per-task progress logging
--jsonNoOutput results in JSON format
--json-outputNoWrite results to a specified JSON file
--skill-nameNoServer-side skill identifier