Paper Digest

m0_60388871 1 day ago
ai·prompt·paper digest
Mitigating Long-Tail Bias via Prompt-Controlled Diffusion Augmentation
Authors: Buddhi Wijenayake, Nichula Wasalathilake, Roshan Godaliyadda, Vijitha Herath, Parakrama Ekanayake, Vishal M. Patel
m0_60388871 1 day ago
artificial intelligence·ai·language models·natural language processing·paper digest
Language Models Struggle to Use Representations Learned In-Context
Authors: Michael A. Lepori, Tal Linzen, Ann Yuan, Katja Filippova
m0_60388871 1 day ago
algorithms·machine learning·ai·pruning·paper digest
POP Prefill-Only Pruning for Efficient Large Model Inference
Authors: Junhui He, Zhihui Fu, Jun Wang, Qingan Li
Deep-Dive Summary:
m0_60388871 2 days ago
ai·paper digest
A Multi-scale Linear-time Encoder for Whole-Slide Image Analysis
Authors: Jagan Mohan Reddy Dwarampudi, Joshua Wong, Hien Van Nguyen, Tania Banerjee
m0_60388871 3 days ago
artificial intelligence·machine learning·ai·language models·paper digest
Toward Cognitive Supersensing in Multimodal Large Language Model
Authors: Boyi Li, Yifan Shen, Yuanzhe Liu, Yifan Xu, Jiateng Liu, Xinzhuo Li, Zhengyuan Li, Jingyuan Zhu, Yunhan Zhong, Fangzhou Lan, Jianguo Cao, James M. Rehg, Heng Ji, Ismini Lourentzou, Xu Cao
m0_60388871 3 days ago
artificial intelligence·ai·language models·natural language processing·paper digest
VEQ Modality-Adaptive Quantization for MoE Vision-Language Models
Authors: Guangshuo Qin, Zhiteng Li, Zheng Chen, Weihang Zhang, Linghe Kong, Yulun Zhang
m0_60388871 4 days ago
artificial intelligence·deep learning·machine learning·ai·paper digest
Structured Over Scale Learning Spatial Reasoning from Educational Video
Authors: Bishoy Galoaa, Xiangyu Bai, Sarah Ostadabbas
m0_60388871 6 days ago
artificial intelligence·deep learning·machine learning·ai·paper digest
FineInstructions Scaling Synthetic Instructions to Pre-Training Scale
Authors: Ajay Patel, Colin Raffel, Chris Callison-Burch
m0_60388871 19 days ago
ai·paper digest
UR-Bench A Benchmark for Multi-Hop Reasoning over Ultra-High-Resolution Images
Authors: Siqi Li, Xinyu Cai, Jianbiao Mei, Nianchen Deng, Pinlong Cai, Licheng Wen, Yufan Shen, Xuemeng Yang, Botian Shi, Yong Liu
m0_60388871 23 days ago
ai·paper digest
EmbeddingRWKV State-Centric Retrieval with Reusable States
Authors: Haowen Hou, Jie Yang
Deep-Dive Summary: The following is a Chinese-language summary of parts of the paper:
m0_60388871 24 days ago
artificial intelligence·algorithms·machine learning·ai·paper digest
More Images, More Problems A Controlled Analysis of VLM Failure Modes
Authors: Anurag Das, Adrian Bulat, Alberto Baldrati, Ioannis Maniadis Metaxas, Bernt Schiele, Georgios Tzimiropoulos, Brais Martinez
m0_60388871 25 days ago
artificial intelligence·ai·language models·natural language processing·paper digest
Over-Searching in Search-Augmented Large Language Models
Authors: Roy Xie, Deepak Gopinath, David Qiu, Dong Lin, Haitian Sun, Saloni Potdar, Bhuwan Dhingra
m0_60388871 1 month ago
ai·decentralization·blockchain·paper digest
Decentralized Autoregressive Generation
Authors: Stepan Maschan, Haoxuan Qu, Jun Liu
Deep-Dive Summary: The following is a Chinese-language summary of parts of the paper:
m0_60388871 1 month ago
artificial intelligence·algorithms·ai·language models·paper digest
Scaling Trends for Multi-Hop Contextual Reasoning in Mid-Scale Language Models
Authors: Brady Steele, Micah Katz
Deep-Dive Summary:
m0_60388871 1 month ago
ai·paper digest
TeleWorld Towards Dynamic Multimodal Synthesis with a 4D World Model
Authors: Yabo Chen, Yuanzhi Liang, Jiepeng Wang, Tingxi Chen, Junfei Cheng, Zixiao Gu, Yuyang Huang, Zicheng Jiang, Wei Li, Tian Li, Weichen Li, Zuoxin Li, Guangce Liu, Jialun Liu, Junqi Liu, Haoyuan Wang, Qizhen Weng, Xuan’er Wu, Xunzhi Xiang, Xiaoyan Ya
m0_60388871 1 month ago
artificial intelligence·ai·paper digest
RIMRULE Improving Tool-Using Language Agents via MDL-Guided Rule Learning
Authors: Xiang Gao, Yuguang Yao, Qi Zhang, Kaiwen Dong, Avinash Baidya, Ruocheng Guo, Hilaf Hasson, Kamalika Das
m0_60388871 1 month ago
ai·paper digest
VLN-MME Diagnosing MLLMs as Language-guided Visual Navigation agents
Authors: Xunyi Zhao, Gengze Zhou, Qi Wu
Deep-Dive Summary: The following is a Chinese-language summary of the paper:
m0_60388871 5 months ago
ai·prompt·paper digest
Calibrating MLLM-as-a-judge via Multimodal Bayesian Prompt Ensembles
Authors: Eric Slyman, Mehrab Tanjim, Kushal Kafle, Stefan Lee
m0_60388871 5 months ago
ai·prototype pattern·paper digest
Prototype-Aware Multimodal Alignment for Open-Vocabulary Visual Grounding
Authors: Jiangnan Xie, Xiaolong Zheng, Liang Zheng
Deep-Dive Summary:
m0_60388871 5 months ago
artificial intelligence·ai·language models·natural language processing·paper digest
Delta Activations A Representation for Finetuned Large Language Models
Authors: Zhiqiu Xu, Amish Sethi, Mayur Naik, Ser-Nam Lim