文章目录~ 1.Empowering Visual Creativity: A Vision-Language Assistant to Image Editing Recommendations2.Bootstrap3D: Improving 3D Content Creation with Synthetic Data3.Video-MME: The First-Ever Compreh
文章目录~ 1.Diff-BGM: A Diffusion Model for Video Background Music Generation2.Rethinking Overlooked Aspects in Vision-Language Models3.Unifying 3D Vision-Language Understanding via Promptable Queries4