Hook order in Lightning

Result

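setup: Called before the training loop starts, to prepare the data loaders and the model.

configure_optimizers: Sets up the optimizer and the learning-rate scheduler.

on_fit_start: The fit process begins.

on_train_start: Training begins.

on_train_epoch_start: Start of each training epoch.

on_train_batch_start: Start of each training batch.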

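on_before_zero_grad: Before the gradients are zeroed. In recent Lightning releases with automatic optimization, this runs after training_step and before the backward pass.

on_before_backward: Before the backward pass.

on_after_backward: After the backward pass.

on_before_optimizer_step: Before the optimizer step.

LightningModule defines no on_after_zero_grad or on_after_optimizer_step hook, so the methods with those names in the test code are never called. The exact order can differ between Lightning versions; run the test code to confirm it for your installation.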

on_train_batch_end: End of each training batch.

on_train_epoch_end: End of each training epoch.

on_train_end: Training ends.

on_fit_end: The fit process ends.
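
The nesting of these hooks (fit, then train, then epoch, then batch) can be sketched in plain Python. This is only an illustration of the call structure, not Lightning's implementation: `simulate_fit` is a hypothetical helper, and the per-batch hooks around backward, zero_grad, and the optimizer step are left out.

```python
from collections import Counter


def simulate_fit(num_epochs, num_batches):
    """Yield the outer hook names in call order (illustrative only)."""
    yield "on_fit_start"
    yield "on_train_start"
    for _ in range(num_epochs):
        yield "on_train_epoch_start"
        for _ in range(num_batches):
            yield "on_train_batch_start"
            # The backward / zero_grad / optimizer-step hooks would fire here.
            yield "on_train_batch_end"
        yield "on_train_epoch_end"
    yield "on_train_end"
    yield "on_fit_end"


# Same shape as the test code: 4 samples with batch_size=2 and max_epochs=2.
calls = list(simulate_fit(num_epochs=2, num_batches=2))
counts = Counter(calls)
print(counts["on_train_epoch_start"], counts["on_train_batch_start"])
```

With two epochs of two batches each, the epoch-level hooks fire twice and the batch-level hooks four times, while the fit-level and train-level hooks fire once.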

Test code

import torch
from torch.utils.data import DataLoader, TensorDataset
from pytorch_lightning import LightningModule, Trainer

# Define a simple linear regression model
class LinearRegression(LightningModule):
    def __init__(self):
        super().__init__()
        self.linear = torch.nn.Linear(1, 1)

    def forward(self, x):
        return self.linear(x)

    def training_step(self, batch, batch_idx):
        x, y = batch
        y_hat = self(x)
        loss = torch.nn.functional.mse_loss(y_hat, y)
        return loss
    
    def on_after_backward(self, *args, **kwargs):
        print("After backward is called!", args, kwargs)
        return super().on_after_backward(*args, **kwargs)
    
    def on_before_zero_grad(self, *args, **kwargs):
        print("Before zero grad is called!", args, kwargs)
        return super().on_before_zero_grad(*args, **kwargs)
    
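    # Note: LightningModule defines no on_after_zero_grad hook, so the Trainer
    # never calls this method; it is kept only to show that nothing is printed.
    def on_after_zero_grad(self, *args, **kwargs):
        print("After zero grad is called!", args, kwargs)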
    
    def on_before_backward(self, *args, **kwargs):
        print("Before backward is called!", args, kwargs)
        return super().on_before_backward(*args, **kwargs)
    
    def on_before_optimizer_step(self, *args, **kwargs):
        print("Before optimizer step is called!", args, kwargs)
        return super().on_before_optimizer_step(*args, **kwargs)
    
    # Note: LightningModule defines no on_after_optimizer_step hook either,
    # so this method is never called by the Trainer.
    def on_after_optimizer_step(self, *args, **kwargs):
        print("After optimizer step is called!", args, kwargs)
    
    def on_fit_start(self, *args, **kwargs):
        print("Fit is starting!", args, kwargs)
        return super().on_fit_start(*args, **kwargs)
    
    def on_fit_end(self, *args, **kwargs):
        print("Fit is ending!", args, kwargs)
        return super().on_fit_end(*args, **kwargs)
    
    def setup(self, *args, **kwargs):
        print("Setup is called!", args, kwargs)
        return super().setup(*args, **kwargs)
    
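    def configure_optimizers(self):
        print("Configure Optimizers is called!")
        # The base implementation returns None, which leaves the fit without an
        # optimizer, so return one explicitly. SGD and lr=0.01 are arbitrary choices.
        return torch.optim.SGD(self.parameters(), lr=0.01)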
    
    def on_train_start(self, *args, **kwargs):
        print("Training is starting!", args, kwargs)
        return super().on_train_start(*args, **kwargs)
    
    def on_train_end(self, *args, **kwargs):
        print("Training is ending!", args, kwargs)
        return super().on_train_end(*args, **kwargs)
    
    def on_train_batch_start(self, *args, **kwargs):
        print("Training batch is starting!", args, kwargs)
        return super().on_train_batch_start(*args, **kwargs)

    def on_train_batch_end(self, *args, **kwargs):
        print("Training batch is ending!", args, kwargs)
        return super().on_train_batch_end(*args, **kwargs)

    def on_train_epoch_start(self, *args, **kwargs):
        print("Training epoch is starting!", args, kwargs)
        return super().on_train_epoch_start(*args, **kwargs)

    def on_train_epoch_end(self, *args, **kwargs):
        print("Training epoch is ending!", args, kwargs)
        return super().on_train_epoch_end(*args, **kwargs)
    
    
# Create the dataset
x_train = torch.tensor([[1.0], [2.0], [3.0], [4.0]], dtype=torch.float)
y_train = torch.tensor([[2.0], [4.0], [6.0], [8.0]], dtype=torch.float)
train_dataset = TensorDataset(x_train, y_train)
train_loader = DataLoader(train_dataset, batch_size=2)

# Create the model and the trainer
model = LinearRegression()
trainer = Trainer(max_epochs=2)

# Start training
trainer.fit(model, train_loader)
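
The test code above repeats the same print-and-call-super pattern for every hook. One possible way to cut the repetition is to generate the overrides from a list of names. The sketch below is an assumption-laden illustration: `with_logged_hooks` and `FakeModule` are hypothetical names, and `FakeModule` is a stand-in so the example runs without Lightning installed. It has not been verified against a real LightningModule here.

```python
def with_logged_hooks(base_cls, hook_names, log):
    """Return a subclass of base_cls whose named hooks append their name to log."""
    subclass = type(f"Logged{base_cls.__name__}", (base_cls,), {})

    def make_hook(name):
        def hook(self, *args, **kwargs):
            log.append(name)
            # Delegate to the base implementation, as the test code does.
            return getattr(super(subclass, self), name)(*args, **kwargs)

        hook.__name__ = name
        return hook

    for name in hook_names:
        # Reject names the base class does not define, so a misspelled or
        # nonexistent hook is caught early instead of silently never firing.
        if not hasattr(base_cls, name):
            raise AttributeError(f"{base_cls.__name__} has no hook named {name!r}")
        setattr(subclass, name, make_hook(name))
    return subclass


class FakeModule:
    """Hypothetical stand-in for a module exposing a few hooks."""

    def on_train_start(self):
        pass

    def on_train_batch_start(self, batch, batch_idx):
        pass

    def on_train_end(self):
        pass


log = []
hooks = ["on_train_start", "on_train_batch_start", "on_train_end"]
LoggedModule = with_logged_hooks(FakeModule, hooks, log)

module = LoggedModule()
module.on_train_start()
module.on_train_batch_start(batch=None, batch_idx=0)
module.on_train_end()
print(log)
```

Appending to a list instead of printing makes the observed order easy to compare against an expected sequence after the run.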