Img-Diff: Contrastive Data Synthesis for Multimodal Large Language Models

High-performance Multimodal Large Language Models (MLLMs) rely heavily on data quality. This study introduces a novel dataset named Img-Diff, designed to enhance fine-grained image recognition in MLLMs by leveraging insights from contrastive le