论文速览

m0_603888714 天前
ai·论文速览
EmbeddingRWKV State-Centric Retrieval with Reusable StatesAuthors: Haowen Hou, Jie YangDeep-Dive Summary: 以下是论文部分的中文总结:
m0_603888714 天前
人工智能·算法·机器学习·ai·论文速览
More Images, More Problems A Controlled Analysis of VLM Failure ModesAuthors: Anurag Das, Adrian Bulat, Alberto Baldrati, Ioannis Maniadis Metaxas, Bernt Schiele, Georgios Tzimiropoulos, Brais Martinez
m0_603888716 天前
人工智能·ai·语言模型·自然语言处理·论文速览
Over-Searching in Search-Augmented Large Language ModelsAuthors: Roy Xie, Deepak Gopinath, David Qiu, Dong Lin, Haitian Sun, Saloni Potdar, Bhuwan Dhingra
m0_603888718 天前
ai·去中心化·区块链·论文速览
Decentralized Autoregressive GenerationAuthors: Stepan Maschan, Haoxuan Qu, Jun LiuDeep-Dive Summary: 以下是论文部分的中文摘要:
m0_603888718 天前
人工智能·算法·ai·语言模型·论文速览
Scaling Trends for Multi-Hop Contextual Reasoning in Mid-Scale Language ModelsAuthors: Brady Steele, Micah KatzDeep-Dive Summary:
m0_6038887112 天前
ai·论文速览
TeleWorld Towards Dynamic Multimodal Synthesis with a 4D World ModelAuthors: Yabo Chen, Yuanzhi Liang, Jiepeng Wang, Tingxi Chen, Junfei Cheng, Zixiao Gu, Yuyang Huang, Zicheng Jiang, Wei Li, Tian Li, Weichen Li, Zuoxin Li, Guangce Liu, Jialun Liu, Junqi Liu, Haoyuan Wang, Qizhen Weng, Xuan’er Wu, Xunzhi Xiang, Xiaoyan Ya
m0_6038887113 天前
人工智能·ai·论文速览
RIMRULE Improving Tool-Using Language Agents via MDL-Guided Rule LearningAuthors: Xiang Gao, Yuguang Yao, Qi Zhang, Kaiwen Dong, Avinash Baidya, Ruocheng Guo, Hilaf Hasson, Kamalika Das
m0_6038887115 天前
ai·论文速览
VLN-MME Diagnosing MLLMs as Language-guided Visual Navigation agentsAuthors: Xunyi Zhao, Gengze Zhou, Qi WuDeep-Dive Summary: 以下是论文的中文摘要:
m0_603888714 个月前
ai·prompt·论文速览
Calibrating MLLM-as-a-judge via Multimodal Bayesian Prompt EnsemblesAuthors: Eric Slyman, Mehrab Tanjim, Kushal Kafle, Stefan Lee
m0_603888714 个月前
ai·原型模式·论文速览
Prototype-Aware Multimodal Alignment for Open-Vocabulary Visual GroundingAuthors: Jiangnan Xie, Xiaolong Zheng, Liang ZhengDeep-Dive Summary:
m0_603888714 个月前
人工智能·ai·语言模型·自然语言处理·论文速览
Delta Activations A Representation for Finetuned Large Language ModelsAuthors: Zhiqiu Xu, Amish Sethi, Mayur Naik, Ser-Nam Lim
m0_603888715 个月前
ai·论文速览
HumanPCR Probing MLLM Capabilities in Diverse Human-Centric ScenesAuthors: Keliang Li, Hongze Shen, Hao Shi, Ruibing Hou, Hong Chang, Jie Huang, Chenghao Jia, Wen Wang, Yiling Wu, Dongmei Jiang, Shiguang Shan, Xilin Chen
m0_603888715 个月前
人工智能·ai·语言模型·自然语言处理·论文速览
Infusing fine-grained visual knowledge to Vision-Language ModelsAuthors: Nikolaos-Antonios Ypsilantis, Kaifeng Chen, André Araujo, Ondřej Chum
m0_603888715 个月前
人工智能·ai·stable diffusion·论文速览
Stable Diffusion Models are Secretly Good at Visual In-Context LearningAuthors: Trevine Oorloff, Vishwanath Sindagi, Wele Gedara Chaminda Bandara, Ali Shafahi, Amin Ghiasi, Charan Prakash, Reza Ardekani
m0_603888715 个月前
人工智能·深度学习·ai·llama·论文速览
LLaMA-Adapter V2 Parameter-Efficient Visual Instruction ModelAuthors: Peng Gao, Jiaming Han, Renrui Zhang, Ziyi Lin, Shijie Geng, Aojun Zhou, Wei Zhang, Pan Lu, Conghui He, Xiangyu Yue, Hongsheng Li, Yu Qiao
我是有底线的