论文速览

m0_6038887112 天前
ai·prompt·论文速览
Calibrating MLLM-as-a-judge via Multimodal Bayesian Prompt EnsemblesAuthors: Eric Slyman, Mehrab Tanjim, Kushal Kafle, Stefan Lee
m0_6038887113 天前
ai·原型模式·论文速览
Prototype-Aware Multimodal Alignment for Open-Vocabulary Visual GroundingAuthors: Jiangnan Xie, Xiaolong Zheng, Liang ZhengDeep-Dive Summary:
m0_6038887118 天前
人工智能·ai·语言模型·自然语言处理·论文速览
Delta Activations A Representation for Finetuned Large Language ModelsAuthors: Zhiqiu Xu, Amish Sethi, Mayur Naik, Ser-Nam Lim
m0_603888711 个月前
ai·论文速览
HumanPCR Probing MLLM Capabilities in Diverse Human-Centric ScenesAuthors: Keliang Li, Hongze Shen, Hao Shi, Ruibing Hou, Hong Chang, Jie Huang, Chenghao Jia, Wen Wang, Yiling Wu, Dongmei Jiang, Shiguang Shan, Xilin Chen
m0_603888711 个月前
人工智能·ai·语言模型·自然语言处理·论文速览
Infusing fine-grained visual knowledge to Vision-Language ModelsAuthors: Nikolaos-Antonios Ypsilantis, Kaifeng Chen, André Araujo, Ondřej Chum
m0_603888711 个月前
人工智能·ai·stable diffusion·论文速览
Stable Diffusion Models are Secretly Good at Visual In-Context LearningAuthors: Trevine Oorloff, Vishwanath Sindagi, Wele Gedara Chaminda Bandara, Ali Shafahi, Amin Ghiasi, Charan Prakash, Reza Ardekani
m0_603888711 个月前
人工智能·深度学习·ai·llama·论文速览
LLaMA-Adapter V2 Parameter-Efficient Visual Instruction ModelAuthors: Peng Gao, Jiaming Han, Renrui Zhang, Ziyi Lin, Shijie Geng, Aojun Zhou, Wei Zhang, Pan Lu, Conghui He, Xiangyu Yue, Hongsheng Li, Yu Qiao
我是有底线的