advice

你的英文可以这样说：

Will our study be too similar to this paper? How can we learn from it, or even outperform it, without looking like a copy?

你这个担心非常必要。如果你们继续沿着"痴呆风险预测工具 + LASSO 选变量 + 8 个机器学习模型比较 + SHAP + 网页计算器"走，确实会很像。 这不是"借鉴"，这是"换一批变量重做一遍"。审稿人不傻，尤其你们又都是 HMACS 相关数据，雷同风险更高。

1. 先判断：哪里会雷同，哪里不会？

这篇 Cog-Free 的核心是：

不用认知测试、不用生物标志物，仅用常规体检/问卷数据，开发一个社区痴呆风险预测工具。

它用了 38 个 LASSO 筛选变量、最优机器学习算法是 Logistic Regression，内部 bootstrap 验证，外部用 CLHLS 验证，还和 CogDrisk、ANU-ADRI、Modified-LIBRA 比较，AUC 0.86，并开发了中英文网页工具。fileciteturn5file0

所以如果你们也写成：

Development and validation of a machine-learning dementia risk prediction tool in community older adults

那就危险。

高雷同区：不要碰或者弱化

模块	雷同风险	原因
"认知测试-free / Cog-Free"定位	极高	这是它的核心卖点
38 个常规健康变量 + LASSO	高	它已经完整做过
8 个 ML 模型横向比较	高	LR、RF、XGB、LGBM、SVM、KNN、DT、NNET 全都做了
和 ANU-ADRI、CogDrisk、LIBRA 比较	高	它的主要竞争性验证
网页风险计算器作为主创新	高	它已经明确做了 web-based calculator
单纯预测 incident dementia	中高	研究终点高度相似

低雷同区：这是你们应该主攻的地方

方向	雷同风险	为什么安全
客观嗅觉测试	低	它只有 routine self-report 里的 sense of smell，你们是 16 项嗅棒
外周血 AD/神经损伤标志物	低	它明确不依赖 biomarkers，你们正好相反
五大认知分域 Z 分数	低	它主要做 dementia 发生风险，你们可以做 domain-specific cognition
中介/交互/机制解释	低	它主要是预测工具，不是机制模型
嗅觉 × NFL 双重打击模型	很低	这是你们已有的独特发现
嗅觉增量价值分析	低	它没有回答"嗅觉在血液标志物之外是否有增量价值"
生物学风险分层	低	它是 community screening，你们是 biomarker-informed stratification

2. 你们不要"模仿它的题目"，要模仿它的研究闭环

你真正应该学它的是这个闭环：

text 复制代码

变量筛选
→ 模型开发
→ 模型评估
→ 可解释性
→ 工具化展示

但你们的科学问题必须换掉。

Cog-Free 的问题是：

text 复制代码

常规健康数据能否预测痴呆？

你们的问题应该是：

text 复制代码

客观嗅觉功能是否在血管代谢负担和血液神经退行性标志物之外，
提供独立且增量的认知风险信息？

这是完全不同的 scientific question。

你们原始课题已经有 16 项嗅棒得分、Aβ42/Aβ40、pTau181、pTau217、NFL、GFAP，以及总体认知、记忆、语言、执行功能、注意力五个认知域 Z 分数。fileciteturn5file9

这个数据结构本身就和 Cog-Free 不同。你们必须围绕这个差异做文章。

3. 最安全、最有辨识度的主线：不要叫"预测工具"，叫"增量价值与生物风险分层"

我建议你们的主标题不要写：

Development and validation of a dementia risk prediction tool...

这太像。

更好的标题是：

Objective olfactory function provides incremental value beyond plasma biomarkers and vascular-metabolic burden for cognitive risk stratification in older adults

中文：

客观嗅觉功能在外周血标志物和血管代谢负担之外对老年人认知风险分层的增量价值研究

这个标题和 Cog-Free 的差异非常明显：

Cog-Free	你们
cognitive-testing-free tool	biomarker-informed risk stratification
routine health examination data	objective olfactory testing + plasma biomarkers
all-cause dementia incidence	domain-specific cognitive performance / cognitive vulnerability
web calculator	mechanistic and incremental-value framework
broad screening	biological stratification

4. 怎么"模仿甚至超越"但不雷同？

第一层：模仿它的严谨度

它强在：

变量筛选有流程；
多模型比较；
ROC、calibration、DCA 都有；
SHAP 解释；
外部验证；
和已有工具比较；
网页工具落地。fileciteturn5file4

你们可以学这些方法学标准，但不要照抄它的主题。

你们可以做：

text 复制代码

Model A: Age + Sex + Education + GDS
Model B: Model A + vascular-metabolic burden
Model C: Model B + plasma biomarkers
Model D: Model C + objective olfactory score

然后比较：

ΔAdjusted R²；
ΔAUC；
RMSE；
calibration；
DCA；
NRI / IDI；
SHAP。

这叫 incremental value analysis 。

这个和 Cog-Free 的"多模型排行榜"不是一回事。

第二层：用"客观嗅觉"打它的弱点

Cog-Free 里面的 smell 只是一个自报变量。表格里它把 "Sense of smell" 分为 no change、slight dullness、significant dullness、loss，而且组间差异并不显著，p=0.993。fileciteturn5file0

这恰恰是你们的突破口。

你们可以在 Introduction / Discussion 里写：

既往社区风险工具多依赖自报嗅觉状态，而自报嗅觉受主观感知、文化差异和健康意识影响，可能低估早期嗅觉退化的风险信号。本研究采用标准化 16 项嗅棒测试，能够更客观地捕捉嗅觉识别功能下降。

这不是抄它，这是指出它的局限并补上。

第三层：用"血液标志物"打它的另一个弱点

Cog-Free 的定位是不用 biomarkers，适合基层大规模筛查。它的优势是便宜，但缺点是生物机制弱。摘要也明确说它是 cognitive-testing-free，用 routine health examination data，不依赖 cognitive testing 或 biomarkers。fileciteturn5file0

你们应该反过来定位：

Cog-Free 适合第一层粗筛；我们的模型适合第二层精准分层。

这就不会冲突了。

可以这样设计临床路径：

text 复制代码

第一层：Cog-Free / 常规体检变量
用于社区粗筛

第二层：客观嗅觉测试 + 血液标志物
用于高风险人群的精准分层

第三层：完整神经心理评估 / 影像 / 专科诊断
用于临床确诊

这叫 tiered screening framework，比单一工具更高级。

5. 你们真正可以"超越它"的地方

不要幻想 AUC 超过它。它 n=2962，还有外部 CLHLS 验证，你们现在 n=226，硬拼 AUC 是找死。

你们应该在这几个方面超越：

5.1 机制深度超过它

Cog-Free 是：

text 复制代码

变量 → 痴呆风险

你们可以做：

text 复制代码

嗅觉 → 认知
嗅觉 × NFL → 认知
嗅觉 × 血管代谢负担 → 认知
嗅觉 + 生物标志物 + 血管风险 → 认知分层

而且你们已有核心结果：中介分析显示嗅觉与认知的关联没有被 Aβ、pTau217、GFAP、NFL 介导，说明嗅觉可能提供独立增量信息；同时嗅觉与 NFL 存在显著交互，P=0.038，提示"嗅觉减退 + 高 NFL"的双重打击模型。fileciteturn5file16

这比单纯预测工具有机制感。

5.2 认知分域深度超过它

Cog-Free 主要预测 all-cause dementia。你们可以分析：

总体认知；
记忆；
语言；
执行功能；
注意力。

这五个认知域在你们研究计划里已经定义了。fileciteturn5file9

你们可以回答一个更细的问题：

嗅觉下降更偏向关联记忆损害，还是执行功能/注意力损害？

如果发现嗅觉对某个认知域更敏感，这就是比 Cog-Free 更细的神经心理学贡献。

5.3 增量价值超过它

Cog-Free 是做一个完整预测器。

你们要回答的是：

在已有临床风险、血管风险和血液标志物之后，嗅觉还值不值得测？

这很关键。

如果 Model D 加入嗅觉后：

AUC 提升；
RMSE 降低；
calibration 改善；
DCA net benefit 增加；
SHAP 排名靠前；

你们就能写：

Objective olfactory testing contributes non-redundant information beyond plasma biomarkers.

这句话很有含金量。

6. 具体怎么避免雷同？给你一套"避雷清单"

不能这样写

不要写：

text 复制代码

We developed and validated a dementia risk prediction tool using machine learning and routine health examination data.

这就是 Cog-Free 的翻版。

应该这样写

写成：

text 复制代码

We investigated whether objective olfactory function provides incremental and non-redundant information beyond vascular-metabolic burden and plasma neurodegenerative biomarkers for domain-specific cognitive risk stratification in older adults.

中文：

本研究旨在评估客观嗅觉功能是否能够在血管代谢负担和外周血神经退行性标志物之外，为老年人认知分域风险分层提供独立且非冗余的信息。

这个定位安全得多。

7. 方法上怎么改，才不像？

Cog-Free 的路线

text 复制代码

548 variables
→ remove missing/free text/duplicates
→ LASSO selects 38 variables
→ 8 ML algorithms
→ LR best
→ compare with existing tools
→ web calculator

你们的路线

text 复制代码

Objective olfactory score + plasma biomarkers + vascular-metabolic burden
→ primary association with five cognitive domains
→ mediation analysis
→ interaction analysis: olfaction × NFL / olfaction × vascular burden
→ nested incremental models
→ interpretable ML only as supportive analysis
→ double-hit / triple-hit risk stratification

这就不是一个套路了。

8. 图表也要避开它

Cog-Free 的 Figure 3 包括 calibration、DCA、ROC、和现有工具比较、SHAP、网页界面。fileciteturn5file4

你们可以保留 calibration / DCA / SHAP，但不要用同样的图组逻辑。

你们的核心图应该是：

图	目的
Figure 1	研究流程 + 嗅觉-血液标志物-认知机制框架
Figure 2	嗅觉与五个认知域的森林图
Figure 3	嗅觉评分与总体认知的 RCS 剂量反应
Figure 4	中介分析：证明血液标志物不是主要中介
Figure 5	嗅觉 × NFL 交互图 / simple slope
Figure 6	双重打击风险热图：低嗅觉 + 高 NFL
Figure 7	嵌套模型增量价值：Model A-D
Supplement	SHAP、calibration、DCA、敏感性分析

注意：

把 SHAP / ML 放在后面做支撑，不要放在主图第一核心位置。

否则会像 Cog-Free。

9. 最好的"论文定位差异化"方案

我建议你们把论文定位成：

不是 prediction tool paper

而是：

biomarker-informed olfactory-cognitive risk stratification paper

也就是：

生物标志物背景下的嗅觉-认知风险分层研究。

这个定位比 Cog-Free 更"医学机制"，更贴合你们数据。

10. 一句话区分你们和它

你们可以在 Introduction 结尾写：

Unlike existing cognitive-testing-free tools that rely primarily on routinely collected self-reported and physical health variables, our study focuses on objective olfactory testing and plasma neurodegenerative biomarkers to determine whether olfactory dysfunction provides independent and incremental information for domain-specific cognitive vulnerability.

中文：

不同于既往主要依赖自报和常规体检变量的无认知测试风险工具，本研究聚焦客观嗅觉测试与外周血神经退行性标志物，旨在评估嗅觉功能减退是否能为认知分域脆弱性提供独立且增量的信息。

这句话能直接和 Cog-Free 拉开距离。

11. 你们可以"借鉴"的边界

可以借鉴

LASSO / Elastic Net 筛选；
多模型比较；
SHAP 可解释性；
calibration；
DCA；
bootstrap 内部验证；
网页 Demo 作为转化展示。

不能照搬

cognitive-testing-free 叙事；
routine health examination tool 叙事；
38-variable calculator 形式；
和 CogDrisk / ANU-ADRI / LIBRA 的主对比；
"开发并验证一个痴呆风险预测工具"作为标题；
网页工具作为论文主创新。

12. 最终建议：你们的文章分成两个层次

主文章：机制 + 增量价值

题目：

Objective Olfactory Function Adds Incremental Value Beyond Plasma Biomarkers and Vascular-Metabolic Burden for Cognitive Risk Stratification in Older Adults

核心：

嗅觉与五个认知域；
中介不显著；
嗅觉独立信息；
嗅觉 × NFL 交互；
嵌套模型证明增量价值。

附加应用：网页 + OCR + 风险报告

题目不要放主文章，放大论文或 Supplement：

Prototype implementation of an olfaction-biomarker cognitive risk report system

这只是工程转化，不抢主线。

最后给你一句硬话

你们不能靠"也做一个 ML 网页工具"超越它。那是在它的赛道上做弱化版，而且样本量还小。

你们真正的胜负手是：

把 Cog-Free 解决不了的"生物学解释和精准分层"拿下来。

它回答的是：

谁可能得痴呆？

你们要回答的是：

为什么嗅觉差的人更容易出现认知脆弱？这种风险是否独立于血液标志物？在 NFL 升高或血管代谢负担加重时，嗅觉是否能识别双重打击高危人群？

这才是不雷同、而且有机会更高级的路。

你的英文可以这样说：

Could you help me deeply expand the core methodological design rather than the add-on parts? Please search for relevant models, methods, and formulas, and then write a pseudo-paper-style draft so I can see how the upgraded study would look.

你这句中文里"核心的（不是添头这些东西）上 "可以更学术地说成："在核心科学问题与方法学设计层面" 。

"添头"口语味太重，论文语境里要换成 ancillary components / non-core extensions。

先定调：你们不能做"另一个 Cog-Free"，要做"Biomarker-informed olfactory risk stratification"

Cog-Free 这篇文章的核心是：用常规体检和问卷变量，不依赖认知测试和生物标志物，建立一个 cognitive-testing-free dementia risk tool；它从 548 个变量筛到 38 个 LASSO 变量，比较 8 个机器学习算法，最终 logistic regression 表现最好，内部 AUC 0.86，并开发了网页计算器。

你们不能在这个路径上硬拼。

你们的真正突破口是它没有的东西：客观嗅觉测试 + 外周血 AD/神经损伤标志物 + 血管代谢负担 + 认知分域。

更狠一点说：

如果你们最后写成"我们也用机器学习预测痴呆风险"，那就是弱化版 Cog-Free。

如果写成：

客观嗅觉功能是否在血管代谢负担和外周血神经退行性标志物之外，为老年人认知脆弱性提供独立、增量、可解释的风险信息？

这就不是雷同，而是另一个层级的问题。

一、核心科学问题重新设计

你们的主问题应该改成 3 层

Question 1：嗅觉是否是独立信息源？

不是简单问：

text 复制代码

嗅觉差 → 认知差？

而是问：

text 复制代码

在年龄、教育、GDS、血管代谢负担、Aβ、pTau、NFL、GFAP都进入模型后，
客观嗅觉评分是否仍然解释认知表现？

这叫：

non-redundant incremental information

也就是"非冗余增量信息"。

Question 2：嗅觉是否和神经损伤背景协同？

你们已有方向里最值钱的是：

text 复制代码

Olfaction × NFL interaction

应该从"一个交互项"升级成机制模型：

text 复制代码

低嗅觉 + 高 NFL = double-hit phenotype
低嗅觉 + 高 NFL + 高血管代谢负担 = triple-hit phenotype

这比 Cog-Free 的变量重要性排序更有机制感。

Question 3：能否识别认知风险表型？

不要只做一个全局预测器。

你们可以做：

text 复制代码

老年认知风险表型发现
→ 表型间认知域差异
→ 表型关键驱动因子解释
→ 表型特异性风险分层

这能从"prediction tool"升级为：

biological risk stratification

二、核心方法学框架：我建议你们做"三轨证据链"

轨道 1：传统统计，证明主效应稳定

1.1 线性模型：认知 Z 分数作为连续结局

设：

text 复制代码

Y_i = global cognition Z-score 或 memory/executive/language/attention Z-score
O_i = 16项嗅棒总分
V_i = 血管代谢负担
B_i = 血液标志物向量
C_i = 基础协变量

基础模型：

text 复制代码

Y_i = β0 + β1 O_i + β2 C_i + ε_i

完整模型：

text 复制代码

Y_i = β0 + β1 O_i + β2 V_i + β3 B_i + β4 C_i + ε_i

其中：

text 复制代码

C_i = age + sex + education + GDS
V_i = hypertension + diabetes + hyperlipidemia + CHD + smoking
B_i = Aβ42/Aβ40 + pTau181 + pTau217 + NFL + GFAP

你们的原始方案已经包含嗅棒 16 项总分、Aβ42/Aβ40、pTau181、pTau217、NFL、GFAP、五个认知分域 Z 分数和上述协变量，这个数据结构本身足够支撑这条主线。

1.2 Logistic 模型：构造认知受损二分类

如果要做分类：

text 复制代码

Cognitive impairment_i = 1 if global cognition Z-score < -1

模型：

text 复制代码

logit[P(Y_i = 1)] =
β0 + β1O_i + β2V_i + β3B_i + β4C_i

这里不要过度追求 AUC。

你们样本量太小，AUC 只是辅助，不是主战场。

轨道 2：嵌套模型，证明"嗅觉增量价值"

这是你们和 Cog-Free 拉开距离的核心。

设计 4 个 nested models：

模型	输入变量	目的
Model A	age + sex + education + GDS	基础人口学/情绪模型
Model B	Model A + VMBS	加入血管代谢负担
Model C	Model B + plasma biomarkers	加入生物标志物
Model D	Model C + objective olfactory score	检验嗅觉增量价值

重点不是"哪个模型 AUC 最高"，而是：

text 复制代码

Model D - Model C 是否显著提升？

2.1 ΔAUC

text 复制代码

ΔAUC = AUC(Model D) - AUC(Model C)

2.2 ΔAdjusted R²

text 复制代码

ΔR²_adj = R²_adj(Model D) - R²_adj(Model C)

2.3 Likelihood ratio test

text 复制代码

χ² = -2[logL(Model C) - logL(Model D)]

如果显著，说明嗅觉不是装饰变量。

2.4 NRI

text 复制代码

NRI =
[P(up | event) - P(down | event)]
+
[P(down | non-event) - P(up | non-event)]

解释：

患病者是否被重新分到更高风险？非患病者是否被重新分到更低风险？

2.5 IDI

text 复制代码

IDI =
[mean(P_D,event) - mean(P_D,non-event)]
-
[mean(P_C,event) - mean(P_C,non-event)]

解释：

Model D 是否扩大了事件组与非事件组的平均预测风险差距？

2.6 Decision Curve Analysis

text 复制代码

Net Benefit =
TP / N - FP / N × pt / (1 - pt)

其中 pt 是 threshold probability。

DCA 比 AUC 更临床，因为它回答：

在某个干预阈值下，这个模型是否真的有临床净收益？

临床预测模型现在写作上要对齐 TRIPOD+AI，风险偏倚评估则要考虑 PROBAST+AI；这两个标准都强调透明报告、验证、适用性和偏倚控制，而不是只报一个 AUC。( $BMJ$ $1$ )

轨道 3：交互与表型，证明"生物学分层"

3.1 嗅觉 × NFL 交互模型

text 复制代码

Y_i =
β0 + β1O_i + β2NFL_i + β3(O_i × NFL_i)
+ β4V_i + β5B'_i + β6C_i + ε_i

其中：

text 复制代码

B'_i = Aβ42/Aβ40 + pTau181 + pTau217 + GFAP

关键看：

text 复制代码

β3 是否显著

如果：

text 复制代码

β3 < 0

可以解释为：

在 NFL 升高背景下，嗅觉评分下降对认知损害的影响更强。

这就形成：

text 复制代码

low olfaction + high NFL = double-hit cognitive vulnerability

3.2 三重打击模型：Olfaction × NFL × VMBS

构造血管代谢负担评分：

text 复制代码

VMBS_i =
I(hypertension) + I(diabetes) + I(hyperlipidemia)
+ I(CHD) + I(current smoking)

如果有连续变量，可以升级：

text 复制代码

VMBS_i =
z(SBP) + z(HbA1c) + z(LDL-C) + z(BMI) + I(smoking)

三重交互：

text 复制代码

Y_i =
β0 + β1O_i + β2NFL_i + β3VMBS_i
+ β4(O_i × NFL_i)
+ β5(O_i × VMBS_i)
+ β6(NFL_i × VMBS_i)
+ β7(O_i × NFL_i × VMBS_i)
+ β8C_i + ε_i

但注意：n=226 的情况下，三重交互只能探索，不能当主结论。

主结论放二重交互，三重交互放 supplement。

3.3 Double-hit / Triple-hit 分层

把变量离散化：

text 复制代码

Low olfaction = O_i ≤ Q1 或 clinical cutoff
High NFL = NFL_i ≥ median / upper tertile
High VMBS = VMBS_i ≥ 2

分组：

组别	嗅觉	NFL	VMBS	解释
G1	好	低	低	reference
G2	差	低	低/中	olfactory-only risk
G3	好	高	中/高	injury-background risk
G4	差	高	中/高	double-hit
G5	差	高	高	triple-hit

模型：

text 复制代码

Y_i = β0 + Σ βk Group_ki + C_i + ε_i

图上直接画：

text 复制代码

认知 Z 分数逐级下降

这比 Cog-Free 的 SHAP 排名更有故事。

三、机器学习不是主菜，但要用得锋利

3.1 小样本首选 Elastic Net，而不是乱上深度学习

Elastic Net：

text 复制代码

min_β {
1/(2n) Σ_i (y_i - x_i^Tβ)^2
+ λ [ α ||β||_1 + (1-α)/2 ||β||_2^2 ]
}

Logistic Elastic Net：

text 复制代码

min_β {
-1/n Σ_i [ y_i log(p_i) + (1-y_i)log(1-p_i) ]
+ λ [ α ||β||_1 + (1-α)/2 ||β||_2^2 ]
}

为什么适合你们？

样本小；
变量有相关性；
可解释；
可筛选；
医学审稿人能接受。

3.2 XGBoost / LightGBM 作为非线性对照

它们可以捕捉：

text 复制代码

嗅觉与 NFL 的非线性
NFL 与年龄的非线性
VMBS 与执行功能的非线性

但要严格交叉验证：

text 复制代码

Repeated 5-fold CV
或
Bootstrap optimism correction

不要一堆模型排行榜。

Cog-Free 已经做了 8 个算法比较，包括 LR、KNN、DT、RF、XGB、LGBM、SVM、NNET，而且 LR 最优。

你们不要照搬"8 algorithms benchmarking"。你们可以只做：

text 复制代码

Clinical baseline LR
Elastic Net
XGBoost
TabPFN exploratory

这就够了。

3.3 TabPFN 可以做探索，但别当主线

TabPFN 是 tabular foundation model，Nature 2025 文章强调它在小于 10,000 样本的数据集上表现很强，并且推理速度快；它本质上是预训练的表格模型，不是你从 226 个样本里硬训练一个深度网络。( $Nature$ $2$ )

你们可以写：

As an exploratory analysis, we evaluated a pretrained tabular foundation model to assess whether prior-data-fitted inference improves small-sample prediction.

别写：

我们提出深度学习模型。

那是自欺。

3.4 SHAP 用于解释，不用于证明因果

SHAP 的定义是基于博弈论的特征归因方法，用来解释模型输出，每个特征获得一个 Shapley value。( $shap.readthedocs.io$ $3$ )

公式：

text 复制代码

φ_j =
Σ_{S⊆F\{j}} [ |S|!(M-|S|-1)! / M! ]
[ f_x(S ∪ {j}) - f_x(S) ]

其中：

φ_j：第 j 个特征的贡献；
S：不包含 j 的特征子集；
F：全部特征集合；
M：特征数量；
f_x(S)：只使用特征子集 S 时的模型输出。

你们的 SHAP 图应该服务于 3 个结论：

嗅觉评分是否排在前列；
NFL 是否排在前列；
嗅觉低 + NFL 高是否对应更高风险区域。

四、最核心的"超越 Cog-Free"的设计

Cog-Free 的 smell 变量是弱的

Cog-Free 里的 smell 是 self-reported sense of smell，分为 no change、slight dullness、significant dullness、loss；而且基线表里 sense of smell 在 dementia-free 和 dementia 之间差异不显著，p=0.993。

这就是你们的突破点：

它用的是主观嗅觉自报，你们用的是标准化客观嗅棒测试。

你们可以写：

text 复制代码

Unlike self-reported olfactory change, objective odor identification testing may capture subtle olfactory dysfunction that is not consciously perceived by older adults.

这句很关键。

Cog-Free 的优势是 low-cost screening，你们的优势是 biological precision

Cog-Free 明确不使用 laboratory biomarkers or cognitive testing。

所以你们不要跟它争"低成本"。你们要争：

text 复制代码

精细分层
生物学解释
神经退行性背景
嗅觉独立增量价值

定位应该是：

text 复制代码

Cog-Free = 第一层社区粗筛
你们模型 = 第二层生物标志物辅助精准分层

这叫：

tiered dementia risk screening

五、最终论文主线建议

六、伪论文示范：Methods + Results + Discussion 风格

下面这段不是让你直接投，是给你看论文长什么样。真正写作时要替换成真实数据。

Objective Olfactory Function Adds Incremental Value Beyond Plasma Biomarkers and Vascular-Metabolic Burden for Cognitive Risk Stratification in Older Adults

Abstract

Background: Olfactory dysfunction has been associated with cognitive decline in older adults, but whether objective olfactory testing provides information beyond vascular-metabolic risk factors and plasma neurodegenerative biomarkers remains unclear. Existing community dementia risk tools primarily rely on routinely collected self-reported health variables and may not capture subtle sensory-neurodegenerative vulnerability.

Objective: We aimed to determine whether objective olfactory function provides independent and incremental value for domain-specific cognitive risk stratification beyond vascular-metabolic burden and plasma biomarkers, and to explore whether olfactory impairment synergizes with plasma neurofilament light chain in identifying high-risk cognitive phenotypes.

Methods: We included community-dwelling older adults aged 60 years or above who completed a 16-item olfactory identification test, plasma biomarker assessment, and neuropsychological evaluation. Cognitive outcomes included global cognition, memory, language, executive function, and attention domain Z-scores. Plasma biomarkers included Aβ42/Aβ40, pTau181, pTau217, neurofilament light chain, and GFAP. A vascular-metabolic burden score was derived from hypertension, diabetes, hyperlipidemia, coronary heart disease, and smoking status. Multivariable linear regression was used to examine associations between olfactory score and cognitive domains. Nested models were constructed to quantify the incremental value of olfactory function beyond demographics, vascular-metabolic burden, and plasma biomarkers. Interaction analyses tested whether plasma neurofilament light chain modified the association between olfactory function and cognition. Penalized regression and gradient-boosting models were used as supportive predictive analyses, with SHAP-based interpretation to identify key contributors.

Results: Lower olfactory performance was associated with worse global cognition and multiple cognitive domains after adjustment for demographic factors and vascular-metabolic burden. The association remained after further adjustment for plasma biomarkers, suggesting that olfactory function captured non-redundant cognitive risk information. In nested model comparisons, adding objective olfactory score to demographic, vascular-metabolic, and biomarker models improved model fit and prediction performance. Interaction analysis suggested that the cognitive disadvantage associated with olfactory dysfunction was amplified among individuals with elevated plasma neurofilament light chain. Double-hit stratification showed that participants with both low olfactory score and high neurofilament light chain had the lowest cognitive performance, particularly in global cognition, memory, and executive function. Explainable machine learning further supported olfactory score, neurofilament light chain, vascular-metabolic burden, age, and education as leading contributors to cognitive risk.

Conclusions: Objective olfactory testing provides incremental and biologically meaningful information beyond vascular-metabolic risk factors and plasma biomarkers for cognitive risk stratification in older adults. The combination of olfactory impairment and elevated neurofilament light chain may define a high-risk double-hit phenotype of cognitive vulnerability. These findings support the use of objective olfactory assessment as a low-burden but non-redundant component in biomarker-informed cognitive risk stratification.

Methods: Statistical Analysis

Olfactory function was analysed both as a continuous standardized score and as quartile-based groups. Plasma biomarkers with skewed distributions were log-transformed and standardized. The vascular-metabolic burden score was calculated as the sum of hypertension, diabetes, hyperlipidemia, coronary heart disease, and current smoking status.

For each cognitive domain, we fitted sequential multivariable models:

Model A included age, sex, education, and depressive symptoms.

Model B additionally included vascular-metabolic burden.

Model C additionally included plasma biomarkers, including Aβ42/Aβ40, pTau181, pTau217, neurofilament light chain, and GFAP.

Model D additionally included objective olfactory score.

The incremental value of olfactory testing was evaluated by changes in adjusted R², root mean squared error, likelihood ratio statistics, discrimination metrics for binary cognitive impairment, calibration, and decision curve analysis. To test biological synergy, multiplicative interaction terms were introduced between olfactory score and neurofilament light chain. Participants were further stratified into double-hit groups according to olfactory impairment and elevated neurofilament light chain. Penalized regression and gradient-boosting models were used as supportive analyses. Model interpretation was performed using SHAP values to quantify global and individual-level feature contributions.

Discussion

This study advances previous dementia risk prediction work in three ways. First, it shifts the focus from routine self-reported risk estimation to biomarker-informed cognitive vulnerability stratification. Second, it demonstrates that objective olfactory testing may provide information that is not fully captured by established plasma biomarkers or vascular-metabolic risk factors. Third, it identifies a potential double-hit phenotype in which olfactory impairment and elevated neurofilament light chain jointly mark individuals with substantially worse cognitive performance.

The lack of strong mediation by plasma biomarkers would not necessarily weaken the role of olfaction. Instead, it may indicate that olfactory dysfunction reflects local or network-level neurodegenerative processes that are partly independent of systemic plasma biomarker changes. In this context, objective olfactory testing should be viewed not as a substitute for biomarkers, but as a complementary sensory-neural marker that improves risk stratification when combined with biomarker and vascular profiles.

Compared with cognitive-testing-free community screening tools, the proposed framework serves a different clinical purpose. Routine-data tools are suitable for broad first-line screening, whereas objective olfactory and plasma biomarker assessment may be more appropriate for second-line risk refinement among older adults with suspected cognitive vulnerability. This tiered strategy may improve the balance between accessibility, biological specificity, and clinical utility.

七、你们最终应该做哪些图表？

主文图 6 张就够了

Figure 1：研究框架图

text 复制代码

Demographics
+ Vascular-metabolic burden
+ Plasma biomarkers
+ Objective olfaction
→ Cognitive domains

Figure 2：主效应森林图

展示：

text 复制代码

Olfactory score → global cognition
Olfactory score → memory
Olfactory score → language
Olfactory score → executive function
Olfactory score → attention

每个认知域展示 Model A-D 的 β 和 95% CI。

Figure 3：RCS 剂量反应曲线

展示：

text 复制代码

嗅觉评分越低，认知风险越高

如果非线性不显著，也别硬说非线性。

可以写：

text 复制代码

The association appeared approximately linear.

Figure 4：嵌套模型增量价值

展示：

text 复制代码

Model A → B → C → D
Adjusted R² ↑
RMSE ↓
AUC ↑
DCA net benefit ↑

这张图是证明嗅觉有价值的核心。

Figure 5：嗅觉 × NFL 交互

展示 simple slope：

text 复制代码

High NFL：嗅觉下降与认知下降关系强
Low NFL：关系弱

Figure 6：Double-hit heatmap

2×2：

	NFL 低	NFL 高
嗅觉好	低风险	中风险
嗅觉差	中风险	高风险

这张图最有传播力。

Supplementary 图

SHAP summary；
SHAP dependence；
calibration；
DCA；
bootstrap stability；
missingness map；
sensitivity analysis；
FDR 校正结果。

八、深度学习怎么放才不蠢？

不建议

text 复制代码

n=226 训练 MLP / TabNet / Transformer

这在统计上很难自圆其说。

可以探索

text 复制代码

TabPFN as pretrained tabular foundation model

理由：它不是你用小样本从零训练，而是基于 prior-data fitted learning 做小样本推断。( $Nature$ $2$ )

最合理写法

text 复制代码

Given the limited sample size, deep learning models trained from scratch were not used as primary models. As an exploratory analysis, we evaluated a pretrained tabular foundation model to assess whether prior-data-fitted inference could improve small-sample prediction.

这很稳。

九、最后给你一个明确执行方案

必做核心

Model A-D 嵌套模型
嗅觉 × NFL 交互
double-hit 分层
嗅觉增量价值
SHAP 作为解释支持
bootstrap + calibration + DCA
敏感性分析：连续嗅觉 vs Q1-Q4，log biomarker，FDR，排除鼻部疾病/抑郁高分

可做但不要抢主线

TabPFN；
网页；
OCR；
LLM 报告；
多模型排行榜。

最硬的判断

你们想超越 Cog-Free，不是靠更多模型，而是靠更强的问题定义。

Cog-Free 问的是：

能不能用常规健康变量预测痴呆？

你们应该问：

客观嗅觉是否是血液标志物和血管风险之外的独立神经退行性风险信号？它是否与 NFL 共同定义一个双重打击型认知脆弱表型？

这个问题更窄，但更深。

论文质量很多时候不是"大"，而是锋利。

1. 先判断：哪里会雷同，哪里不会？

高雷同区：不要碰或者弱化

低雷同区：这是你们应该主攻的地方

2. 你们不要"模仿它的题目"，要模仿它的研究闭环

3. 最安全、最有辨识度的主线：不要叫"预测工具"，叫"增量价值与生物风险分层"

4. 怎么"模仿甚至超越"但不雷同？

第一层：模仿它的严谨度

第二层：用"客观嗅觉"打它的弱点

第三层：用"血液标志物"打它的另一个弱点

5. 你们真正可以"超越它"的地方

5.1 机制深度超过它

5.2 认知分域深度超过它

5.3 增量价值超过它

6. 具体怎么避免雷同？给你一套"避雷清单"

不能这样写

应该这样写

7. 方法上怎么改，才不像？

Cog-Free 的路线

你们的路线

8. 图表也要避开它

9. 最好的"论文定位差异化"方案

不是 prediction tool paper

10. 一句话区分你们和它

11. 你们可以"借鉴"的边界

可以借鉴

不能照搬

12. 最终建议：你们的文章分成两个层次

主文章：机制 + 增量价值

附加应用：网页 + OCR + 风险报告

最后给你一句硬话

先定调：你们不能做"另一个 Cog-Free"，要做"Biomarker-informed olfactory risk stratification"

一、核心科学问题重新设计

你们的主问题应该改成 3 层

Question 1：嗅觉是否是独立信息源？

Question 2：嗅觉是否和神经损伤背景协同？

Question 3：能否识别认知风险表型？

二、核心方法学框架：我建议你们做"三轨证据链"

轨道 1：传统统计，证明主效应稳定

1.1 线性模型：认知 Z 分数作为连续结局

1.2 Logistic 模型：构造认知受损二分类

轨道 2：嵌套模型，证明"嗅觉增量价值"

2.1 ΔAUC

2.2 ΔAdjusted R²

2.3 Likelihood ratio test

2.4 NRI

2.5 IDI

2.6 Decision Curve Analysis

轨道 3：交互与表型，证明"生物学分层"

3.1 嗅觉 × NFL 交互模型

3.2 三重打击模型：Olfaction × NFL × VMBS

3.3 Double-hit / Triple-hit 分层

三、机器学习不是主菜，但要用得锋利

3.1 小样本首选 Elastic Net，而不是乱上深度学习

3.2 XGBoost / LightGBM 作为非线性对照

3.3 TabPFN 可以做探索，但别当主线

3.4 SHAP 用于解释，不用于证明因果

四、最核心的"超越 Cog-Free"的设计

Cog-Free 的 smell 变量是弱的

Cog-Free 的优势是 low-cost screening，你们的优势是 biological precision

五、最终论文主线建议

推荐标题

推荐摘要逻辑

六、伪论文示范：Methods + Results + Discussion 风格

Objective Olfactory Function Adds Incremental Value Beyond Plasma Biomarkers and Vascular-Metabolic Burden for Cognitive Risk Stratification in Older Adults

Abstract

Methods: Statistical Analysis

Discussion

七、你们最终应该做哪些图表？

主文图 6 张就够了

Figure 1：研究框架图

Figure 2：主效应森林图

Figure 3：RCS 剂量反应曲线

Figure 4：嵌套模型增量价值

Figure 5：嗅觉 × NFL 交互

Figure 6：Double-hit heatmap

Supplementary 图

八、深度学习怎么放才不蠢？

不建议

可以探索