Home
News
People
Publications
Seminar
Courses
Events
Vacancies
Contact
Pulp Fictions
Light
Dark
Automatic
multimodal large language models
Fill the GAP: A Granular Alignment Paradigm for Visual Reasoning in Multimodal Large Language Models
Visual latent reasoning lets a multimodal large language model (MLLM) create intermediate visual evidence as continuous tokens, …
Yanting Miao
,
Yutao Sun
,
Dexin Wang
,
Mengyu Zhou
,
Pascal Poupart
,
Lei Lv
,
Qi Zhao
,
Li Wang
,
Hao Li
,
Xiaoxi Jiang
,
Guanjun Jiang
PDF
Cite
Source Document
Preserving Knowledge in Large Language Model with Model-Agnostic Self-Decompression
Humans can retain old knowledge while learning new information, but Large Language Models (LLMs) often suffer from catastrophic …
Zilun Zhang
,
Yutao Sun
,
Tiancheng Zhao
,
Leigang Sha
,
Ruochen Xu
,
Kyusong Lee
,
Jianwei Yin
PDF
Cite
Source Document
Cite
×