Multimodal LLM benchmarks: MMIE, HumanEval-V, MixEval-X. AI video generation: MovieGen, PyramidFlow. Multimodal LLMs: Emu3, MIO, MM-1.5. DepthPro for depth perception.
Share this post
AI Research Roundup 24.10.18 - Video and…
Share this post
Multimodal LLM benchmarks: MMIE, HumanEval-V, MixEval-X. AI video generation: MovieGen, PyramidFlow. Multimodal LLMs: Emu3, MIO, MM-1.5. DepthPro for depth perception.