MMGDreamer is a dual-branch diffusion model for scene generation that incorporates a novel Mixed-Modality Graph, visual enhancement module, and relation predictor. Feel free to contact Zhifei Yang ...
Abstract: With the rapid rise of short video social platforms, the spread of fake news videos has become a global challenge. Short videos, which integrate multiple modalities such as text, images, and ...
The dataset used for fine-tuning the model. Code for generating the dataset. Scripts for fine-tuning the model on high-performance GPUs. Inference scripts for real-time task execution. SG_VLM utilizes ...