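Attend-and-Refine: Interactive Keypoint Estimation and Quantitative Cervical Vertebrae Analysis for Bone Age Assessment | Literature Express: Deep Learning and AI in Medical Imaging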

Title

Attend-and-Refine: Interactive keypoint estimation and quantitative cervical vertebrae analysis for bone age assessment

Introduction

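During childhood and adolescence, accurately predicting growth potential is highly valuable for orthodontic diagnosis and treatment planning (Singh et al., 2015; Türkoz et al., 2017). For example, treating problems such as mandibular prognathism before the growth peak can lead to relapse, whereas addressing jaw underdevelopment after the growth peak may be ineffective. Accurately identifying the growth peak is therefore essential for choosing the best treatment window for pediatric orthodontic problems. Numerous studies have shown that skeletal maturity is reliably associated with the growth potential of different parts of the body, such as the jaws and stature (Brown et al., 1970; Sato et al., 2001; Tanner et al., 2001; Flores-Mir et al., 2004; Madhu et al., 2004; Flores-Mir et al., 2005; Carmichael and Sandor, 2008). Traditionally, bone age assessment from hand-wrist radiographs has been the standard way to evaluate skeletal maturity (Greulich and Pyle, 1959a; Bala et al., 2010; Beit et al., 2013; Safavi et al., 2015; Bhadana et al., 2019). The cervical vertebral maturation (CVM) method has gained attention in clinical orthodontic practice (Mito et al., 2002; Uysal et al., 2006; Beit et al., 2013; Türkoz et al., 2017). Because CVM is assessed on the lateral cephalometric radiograph, which is already part of routine orthodontic records, using it for bone age assessment spares children and adolescents, the main orthodontic patient group, additional radiation exposure. Studies have shown that CVM stages correlate with key developmental events such as the mandibular growth spurt and changes in body height (Franchi et al., 2000; Baccetti et al., 2002; Hosni et al., 2018), and that they are associated with chronological age (Mito et al., 2002; Uysal et al., 2006; Safavi et al., 2015; Singh et al., 2015). These studies underscore CVM as a reliable indicator of jaw growth and development.

Despite these insights, previous studies have often relied on qualitative assessment of cervical vertebral morphology, frequently on limited datasets. To address this, our study performs a comprehensive quantitative analysis of cervical vertebral morphology to assess the pubertal growth peak in children and adolescents more accurately. By analyzing a large-scale dataset, we aim to estimate the annual growth rate of the cervical vertebrae, determine the timing of the growth peak, and describe the associated morphological characteristics. Our approach uses cervical vertebral morphology at the growth peak as an indicator of a patient's growth potential, providing detailed diagnostic guidance that helps clinicians determine the best timing for orthodontic treatment.

A key step in this quantitative analysis is the precise annotation of keypoints on the cervical vertebrae, which can be very time-consuming when done by hand. Automated methods aim to reduce the manual workload, but they are often not accurate enough and still require verification by a clinician, which makes them as tedious as manual annotation (Kim et al., 2022). To address these challenges, we propose the Attend-and-Refine Network (ARNet), a novel interactive keypoint estimation framework designed to estimate keypoints accurately through effective user interaction. The method combines automated prediction with user expertise, significantly improving the accuracy and efficiency of the annotation process.

To the best of our knowledge, research on interactive keypoint estimation frameworks is limited. Interactive image segmentation, by contrast, is a widely studied area that improves accuracy by incorporating user feedback. It is conceptually similar to interactive keypoint estimation and provides a valuable foundation for developing such models. Interactive segmentation models such as BRS (Jang and Kim, 2019), f-BRS (Sofiiuk et al., 2020), and RITM (Sofiiuk et al., 2022) have made notable advances in accuracy and efficiency.

However, applying these models within our interactive keypoint estimation framework reveals clear limitations. User interaction information typically affects only the local region around the corrected point and does not propagate the correction to distant keypoints. In some cases, model revision even degrades the initial accuracy (see Fig. E.1 in Appendix E).

These limitations stem from the different demands of the two tasks. Interactive segmentation focuses on segmenting neighboring pixels close to the user input, whereas interactive keypoint estimation must propagate a correction to distant keypoints that are not connected to it. This highlights the need for an approach designed specifically for interactive keypoint estimation.

ARNet addresses these challenges with an interaction-guided recalibration network and a morphology-aware loss function. The former uses the user's modification to recalibrate image features so that they align with the feedback. The latter maintains consistent relationships between keypoints, so that when the user adjusts one keypoint, related keypoints are updated accordingly. ARNet-v1, proposed in our previous study (Kim et al., 2022), uses global pooling and squeeze-and-excitation layers (Hu et al., 2018) to recalibrate the backbone's image features channel-wise based on the user interaction signal, which propagates user feedback to distant keypoints.

However, ARNet-v1 applies the interaction signal uniformly across spatial locations and lacks pixel-level specificity. As Fig. 2 shows, this often leaves substantial residual errors, especially in complex cases. To overcome these limitations, ARNet-v2 introduces a cross-attention-based mechanism (Chefer et al., 2023) that selectively integrates the user interaction signal at the pixel level. Image features serve as queries, and the user interaction signal serves as keys and values, enabling globally aware, pixel-specific retrieval of relevant information. This design ensures that the user interaction signal propagates adaptively to all keypoints for precise correction. As Fig. 2 shows, a single user correction lets ARNet-v2 substantially reduce the error of all mispredicted keypoints, and its initial prediction error is also markedly lower, so it outperforms ARNet-v1.

In summary, ARNet-v2 uses a cross-attention-based mechanism for globally aware, selective retrieval of the user interaction signal, effectively propagating corrections to all relevant keypoints and improving both the accuracy and the efficiency of keypoint estimation. Extensive experiments show that ARNet-v2 consistently achieves state-of-the-art results on four datasets, outperforming ARNet-v1 and the baseline models. On the AASCE dataset, for example, ARNet-v2 reduces the failure rate by 37% relative to ARNet-v1 and by 67% relative to the closest baseline. It also needs 25% fewer clicks than ARNet-v1 and 43% fewer than the closest baseline. These findings highlight its effectiveness in achieving accurate keypoint estimation with minimal user input.

In addition, our study developed a web-based, AI-assisted growth potential assessment tool, shown in Fig. 1. This work gives clinicians a practical and easy-to-use diagnostic guide for determining the best timing for orthodontic treatment, marking an important advance in the field.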
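The channel-wise recalibration attributed to ARNet-v1 above can be illustrated with a minimal NumPy sketch of a squeeze-and-excitation style gate. This is not the authors' implementation: the function name `channel_recalibrate`, the tensor shapes, the reduction ratio, and the random weights are all assumptions made for illustration. The point it shows is that each channel is scaled by one scalar, so every spatial location receives the same adjustment.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def channel_recalibrate(image_feats, interaction_feats, w1, w2):
    """SE-style gate (illustrative only, not the paper's code).

    image_feats, interaction_feats: arrays of shape (C, H, W).
    w1: (C, C // r) and w2: (C // r, C) are hypothetical learned weights.
    """
    squeezed = interaction_feats.mean(axis=(1, 2))   # global average pool -> (C,)
    hidden = np.maximum(squeezed @ w1, 0.0)          # ReLU bottleneck -> (C // r,)
    gate = sigmoid(hidden @ w2)                      # one gate value per channel -> (C,)
    # The same scalar multiplies every pixel of a channel: no pixel-level specificity.
    return image_feats * gate[:, None, None], gate

rng = np.random.default_rng(0)
C, H, W, r = 8, 4, 4, 2                              # assumed toy dimensions
image_feats = rng.normal(size=(C, H, W))
interaction_feats = rng.normal(size=(C, H, W))
w1 = rng.normal(size=(C, C // r))
w2 = rng.normal(size=(C // r, C))

recalibrated, gate = channel_recalibrate(image_feats, interaction_feats, w1, w2)
```

Because `gate` has one entry per channel, the sketch makes the limitation discussed in the text concrete: the interaction signal cannot favor one image region over another.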

Abstract

In pediatric orthodontics, accurate estimation of growth potential is essential for developing effective treatment strategies. Our research aims to predict this potential by identifying the growth peak and analyzing cervical vertebra morphology solely through lateral cephalometric radiographs. We accomplish this by comprehensively analyzing cervical vertebral maturation (CVM) features from these radiographs. This methodology provides clinicians with a reliable and efficient tool to determine the optimal timing for orthodontic interventions, ultimately enhancing patient outcomes. A crucial aspect of this approach is the meticulous annotation of keypoints on the cervical vertebrae, a task often challenged by its labor-intensive nature. To mitigate this, we introduce the Attend-and-Refine Network (ARNet), a user-interactive, deep learning-based model designed to streamline the annotation process. ARNet features an interaction-guided recalibration network, which adaptively recalibrates image features in response to user feedback, coupled with a morphology-aware loss function that preserves the structural consistency of keypoints. This novel approach substantially reduces manual effort in keypoint identification, thereby enhancing the efficiency and accuracy of the process. Extensively validated across various datasets, ARNet demonstrates remarkable performance and exhibits wide-ranging applicability in medical imaging. In conclusion, our research offers an effective AI-assisted diagnostic tool for assessing growth potential in pediatric orthodontics, marking a significant advancement in the field.

Conclusion

The ability to accurately estimate bone age and predict the remaining growth potential during the crucial developmental stages of childhood and adolescence is of immense importance in the field of orthodontics, particularly for diagnosis and treatment planning. Our study contributes significantly to this area by enabling precise predictions of a patient's pubertal growth peak based on their current cervical vertebra morphology. By analyzing cervical vertebral maturation (CVM) feature values, we can determine the morphological characteristics at the growth peak, which are key predictors of growth potential. This approach provides invaluable diagnostic guidance and assists clinicians in identifying the optimal timing for initiating dentofacial orthopedic treatments. Furthermore, we propose a novel interactive keypoint estimation network, referred to as ARNet. This network integrates advanced components, such as the interaction-guided recalibration network and the morphology-aware loss function, to facilitate accurate keypoint revision with minimal manual intervention. Our comprehensive experiments and analyses, conducted across four medical datasets, have demonstrated the efficacy of this approach. The ability to simultaneously correct numerous keypoints accurately with minimal user input underscores the practical utility and advanced capabilities of our model in streamlining the keypoint annotation process.

Figure

Fig. 1. Growth potential assessment via interactive keypoint estimation. Our method significantly enhances the accuracy and efficiency of growth potential assessment in pediatric orthodontics. The process begins with the initial keypoint estimation phase, where the objective is to identify (a) the keypoints on the second, third, and fourth cervical vertebrae. However, the model occasionally fails to detect these keypoints accurately, (b) missing the second vertebra at the top in this example. Conventionally, rectifying such an error would necessitate manually revising all 13 keypoints, akin to starting the annotation from scratch. However, our method introduces a substantial improvement: a user needs to correct only (c) one keypoint, which automatically prompts (d) the adjustment of the remaining keypoints. Consequently, this leads to (e) a final output that accurately reflects the user's interaction and correctly detects the target keypoints, thereby demonstrating the accuracy and efficiency of our approach. The keypoints identified through this process are then used to analyze the cervical vertebrae morphology, quantified as cervical vertebral maturation (CVM) features. These features enable us to assess the patient's growth potential using standard growth curves.

Fig. 2. Prediction results of ARNet-v1 and ARNet-v2 on the public datasets. Model revision* represents the updated outputs from each model after incorporating a single user correction into the initial predictions. The mean radial error (MRE) for each prediction is denoted.
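Since the captions and tables report mean radial error (MRE) and failure rates at target MRE thresholds, a small sketch may help make the two metrics concrete. MRE is the mean Euclidean distance between predicted and ground-truth keypoints. The failure-rate definition used here (the fraction of images whose MRE exceeds the threshold) is an assumption about how the paper counts failures, and the coordinates are made-up toy values.

```python
import numpy as np

def mean_radial_error(pred, gt):
    """Mean Euclidean distance between predicted and ground-truth keypoints.

    pred, gt: arrays of shape (num_keypoints, 2), in pixels.
    """
    pred = np.asarray(pred, dtype=float)
    gt = np.asarray(gt, dtype=float)
    return float(np.linalg.norm(pred - gt, axis=1).mean())

def failure_rate(mre_per_image, threshold):
    """Fraction of images whose MRE exceeds the threshold (assumed definition)."""
    mre = np.asarray(mre_per_image, dtype=float)
    return float((mre > threshold).mean())

# Toy example: three keypoints with radial errors of 5, 0, and 6 pixels.
gt = np.array([[10.0, 10.0], [20.0, 10.0], [30.0, 10.0]])
pred = np.array([[13.0, 14.0], [20.0, 10.0], [30.0, 16.0]])
mre = mean_radial_error(pred, gt)                      # (5 + 0 + 6) / 3

# Toy per-image MREs: two of four exceed a 2-pixel threshold.
rate = failure_rate([0.5, 1.5, 2.5, 3.5], threshold=2.0)
```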

Fig. 3. Interaction-guided recalibration network. (a) ARNet-v1. (b) ARNet-v2. (c) Details of the cross-attention layer in ARNet-v2. User interaction information is used as value (V) and key (K), and image features are used as query (Q). Notations are summarized in Table A.1 in Appendix A.
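The query/key/value roles described in this caption can be sketched with a single-head scaled dot-product cross-attention in NumPy. This is a generic illustration rather than the ARNet-v2 layer itself: the token counts, the projection matrices `w_q`, `w_k`, `w_v`, and the residual connection at the end are assumptions, and the real layer may differ in normalization, number of heads, and how interaction tokens are built.

```python
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)   # subtract the max for numerical stability
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def cross_attention(image_feats, interaction_feats, w_q, w_k, w_v):
    """Image pixels query the user-interaction tokens.

    image_feats: (H*W, C) flattened image features -> queries.
    interaction_feats: (N, C) interaction tokens -> keys and values.
    """
    q = image_feats @ w_q                      # (H*W, d)
    k = interaction_feats @ w_k                # (N, d)
    v = interaction_feats @ w_v                # (N, d)
    attn = softmax(q @ k.T / np.sqrt(q.shape[-1]), axis=-1)  # (H*W, N)
    return attn @ v, attn                      # a different mixture for every pixel

rng = np.random.default_rng(0)
H, W, C, N = 4, 4, 8, 3                        # assumed toy dimensions
image_feats = rng.normal(size=(H * W, C))
interaction_feats = rng.normal(size=(N, C))
w_q, w_k, w_v = (rng.normal(size=(C, C)) for _ in range(3))

retrieved, attn = cross_attention(image_feats, interaction_feats, w_q, w_k, w_v)
recalibrated = image_feats + retrieved         # residual update (an assumption)
```

Each row of `attn` is a separate distribution over the interaction tokens, which is what allows pixel-specific retrieval instead of one gate per channel.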

Fig. 4. Morphology-aware loss. A set of predefined keypoints, say, p_n, p_m, p_l, is used to regularize the model to preserve consistent inter-keypoint relationships, focusing on (a) the distance and (b) the angle between keypoints.
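One way to read this caption is as a penalty on differences between predicted and ground-truth inter-keypoint distances and angles. The sketch below follows that reading. The exact formulation in the paper is not given here, so the L1 form, the equal weighting of the two terms, and the names `morphology_loss`, `pairs`, and `triplets` are all assumptions.

```python
import numpy as np

def pair_distance(p, n, m):
    """Distance between keypoints p[n] and p[m]."""
    return np.linalg.norm(p[n] - p[m])

def triplet_angle(p, n, m, l):
    """Angle (radians) at p[m] formed by p[n] and p[l]."""
    a, b = p[n] - p[m], p[l] - p[m]
    cos = np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b))
    return np.arccos(np.clip(cos, -1.0, 1.0))

def morphology_loss(pred, gt, pairs, triplets):
    """L1 gap between predicted and ground-truth distances and angles (assumed form)."""
    d_term = np.mean([abs(pair_distance(pred, n, m) - pair_distance(gt, n, m))
                      for n, m in pairs])
    a_term = np.mean([abs(triplet_angle(pred, *t) - triplet_angle(gt, *t))
                      for t in triplets])
    return float(d_term + a_term)

gt = np.array([[0.0, 0.0], [4.0, 0.0], [4.0, 3.0]])   # toy right-angle shape
pairs = [(0, 1), (1, 2)]                               # hypothetical predefined pairs
triplets = [(0, 1, 2)]                                 # hypothetical predefined triplet

shifted = gt + np.array([5.0, -2.0])                   # same shape, different place
loss_shifted = morphology_loss(shifted, gt, pairs, triplets)

distorted = gt.copy()
distorted[2] = [4.0, 6.0]                              # stretches one edge from 3 to 6
loss_distorted = morphology_loss(distorted, gt, pairs, triplets)
```

A pure translation gives zero loss, so a term like this only constrains shape and would need to be combined with a keypoint localization loss; the stretched example is penalized through the distance term while the angle stays at 90 degrees.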

Fig. 5. Attend-and-Refine Network (ARNet). Our model is designed to process radiographs alongside user interactions (User Inter.) and its previous predictions (Prev. Pred.). The model generates a heatmap of keypoint locations, which is dynamically adjusted to reflect user feedback.

Fig. 6. Keypoints on the cervical vertebrae and CVM features. (a) Lateral cephalometric radiograph. (b) Thirteen keypoints on cervical vertebrae: upper posterior (UP), upper anterior (UA), lower posterior (LP), lower middle (LM), and lower anterior (LA). (c) Up to five vertex points per vertebra. (d-f) Measurements for CVM estimation: concavity/width ratio (c/w ratio), length/width ratio (l/w ratio), and height/width ratio (h/w ratio).
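To show how a ratio feature could be derived from the keypoints named in this caption, the sketch below computes a concavity/width ratio from the three lower-border points. The geometric definitions are assumptions, since the caption does not spell them out: width is taken as the LP to LA distance, and concavity as the perpendicular distance from LM to the line through LP and LA. The coordinates are made-up toy values.

```python
import numpy as np

def concavity_width_ratio(lp, lm, la):
    """c/w ratio from the lower posterior, lower middle, and lower anterior points.

    Assumed definitions (not taken from the paper):
      width     = distance from LP to LA
      concavity = perpendicular distance from LM to the LP-LA line
    """
    lp, lm, la = (np.asarray(p, dtype=float) for p in (lp, lm, la))
    base = la - lp
    width = np.linalg.norm(base)
    rel = lm - lp
    # The magnitude of the 2D cross product equals width * perpendicular distance.
    concavity = abs(base[0] * rel[1] - base[1] * rel[0]) / width
    return float(concavity / width)

# Toy vertebra: lower border 10 px wide with a 1 px deep concavity.
ratio = concavity_width_ratio(lp=(0.0, 0.0), lm=(5.0, 1.0), la=(10.0, 0.0))
```

Being a ratio of two lengths, the value does not change with image scale, which is one reason ratio features are convenient for comparing radiographs.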

Fig. 7. Analysis of pairwise relationships between chronological age and CVM features: concavity/width (c/w) ratio of C2, C3, and C4; length/width (l/w) ratio of C3 and C4; and height/width (h/w) ratio of C3 and C4. This analysis focuses on patients aged 6-18 years to examine the relationships during the growth phase. (a) Pairwise scatter plots visualize relationships between variables in the off-diagonal elements, while diagonal elements show individual variable distributions. (b) Spearman correlation matrices quantify the monotonic correlation between variables.

Fig. 8. Standard growth curves for C4 height/width ratio. Each box plot illustrates the interquartile range, marked by the first and third quartiles, with the median value indicated by a horizontal line inside the box. The whiskers extending from each box denote the minimum and maximum values.

Fig. 9. Comparison with state-of-the-art models on the AASCE dataset. (a) Keypoint prediction errors for an increasing number of user interactions, compared with existing baseline models. (b) Number of user interactions for an increasing target MRE, compared with existing baseline models.

Fig. 10. Error refinement results of ARNet-v2 after a single user correction. Prediction errors are visualized by lines connecting each predicted keypoint to its corresponding ground-truth location. The length of these lines indicates the magnitude of the error, with shorter lines representing lower errors.

Fig. 11. Annual growth rates based on standard growth curves. The results highlight the growth peak for each cervical vertebra. The complete results are available in Figs. F.1 and F.2 in Appendix F. The left y-axis indicates the CVM feature value, while the right y-axis represents the annual growth rate.
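The step from a standard growth curve to an annual growth rate and a growth peak can be illustrated with a simple finite-difference sketch. The numbers below are invented placeholders, not data from the paper, and taking the year-over-year difference of the median curve is an assumption about how the rate is computed.

```python
import numpy as np

# Hypothetical median CVM feature values by age (illustrative numbers only).
ages = np.arange(8, 17)                                   # 8, 9, ..., 16 years
median_ratio = np.array([0.60, 0.62, 0.65, 0.69, 0.76,
                         0.86, 0.92, 0.95, 0.96])

# Annual growth rate: change in the median feature value per year.
annual_rate = np.diff(median_ratio) / np.diff(ages)

# Growth peak: the one-year interval with the largest increase.
peak_idx = int(np.argmax(annual_rate))
peak_interval = (int(ages[peak_idx]), int(ages[peak_idx + 1]))
peak_value_range = (float(median_ratio[peak_idx]), float(median_ratio[peak_idx + 1]))
```

With these placeholder values the largest increase falls between ages 12 and 13, and `peak_value_range` gives the feature values bracketing that interval, which is the kind of morphology-at-peak summary the paper reports in Table 8.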

Fig. 12. Correlation between the skeletal maturity index (SMI) and the C3 and C4 length/width ratios.

Fig. 13. Individual growth rates alongside the median growth rate. For each sex, the growth rate of each patient is depicted in the same color for C3 and C4.

Fig. D.1. A screenshot of our AI-assisted growth potential estimation tool, showcasing its interface and functionality. This tool enables clinicians to (a) upload lateral cephalometric radiographs, (b) perform annotations, and (c) obtain growth potential estimations. This highlights its utility in facilitating orthodontic and orthopedic planning with enhanced precision and ease of use.

Fig. E.1. Qualitative comparison of ARNet-v1 and the baseline models on the AASCE dataset. Initial prediction errors and errors after a single user modification are compared for each model. Prediction errors are visualized as lines connecting each predicted keypoint to its corresponding ground-truth location. The length of these lines represents the magnitude of the prediction error: shorter lines indicate lower errors, while longer lines reflect greater errors.

Fig. F.1. Comprehensive annual growth rates based on standard growth curves of concavity/width ratio, demonstrating the growth peak for each cervical vertebra. The left y-axis indicates the CVM feature value, while the right y-axis represents the annual growth rate.

Fig. F.2. Comprehensive annual growth rates based on standard growth curves of length/width and height/width ratios, demonstrating the growth peak for each cervical vertebra. The left y-axis indicates the CVM feature value, while the right y-axis represents the annual growth rate.

Table

Table 1. Baseline characteristics of the enrolled patients used in the growth peak analysis study. stdev indicates standard deviation.

Table 2. Performance comparison of interactive keypoint estimation on the AASCE, BUU-AP, and BUU-LA datasets.

Table 3. Performance comparison of interactive keypoint estimation on the Lateral cephalometric radiograph dataset.

Table 4. Performance comparison of mean radial error across four datasets. We compare manual revision (Manual) and model revision (Model). For Dai et al., model revision results are not provided, as no interactive module was proposed.

Table 5. Ablation study using ARNet-v2 on the AASCE and BUU-AP datasets.

Table 6. Sensitivity analysis of the interaction-guided recalibration network and the morphology-aware loss using ARNet-v2 on the AASCE and BUU-AP datasets. We apply different criteria for d and a. For low t_d, t_a, the morphology-aware loss targets keypoint sets with the lowest variance, while high t_d, t_a targets those with the highest variance. The adjacency criterion focuses on keypoint sets forming an edge or internal angle of each vertebra.

Table 7. Sensitivity analysis of the keypoint subset size for the morphology-aware loss on the AASCE dataset using ARNet-v1.

Table 8. Cervical vertebrae morphology at growth peak. It outlines detailed ratios indicating peak growth stages.

Table 9. Comparison of failure rates of keypoint estimation models in impacting the growth peak analysis results. Failure rates are measured for target MRE thresholds ranging from 1 pixel to 2 pixels, based on initial keypoint estimation on the Lateral cephalometric radiograph dataset.

Table A.1. Summary of notations.
