DeepMind’s highly capable multimodal model Gemini reaches human expert level
Multimodal large language models (MLLMs) have recently emerged as a prominent research topic, leveraging the capabilities of powerful large language models (LLMs) to perform diverse multimodal tasks. MLLMs’ notable features,…