clickhouse Checksum doesn‘t match: 问题修复

  1. 问题描述

SQL 错误 [40] [07000]: Code: 40. DB::Exception: Checksum doesn't match: corrupted data. Reference: 50e8c1efa78de2881b725d44b04be1fe. Actual: 161c99eb681ec36b83540ecdd65ad8c9. Size of compressed block: 32846. The mismatch is caused by single bit flip in data block at byte 10059, bit 6. This is most likely due to hardware failure. If you receive broken data over network and the error does not repeat every time, this can be caused by bad RAM on network interface controller or bad controller itself or bad RAM on network switches or bad CPU on network switches (look at the logs on related network switches; note that TCP checksums don't help) or bad RAM on host (look at dmesg or kern.log for enormous amount of EDAC errors, ECC-related reports, Machine Check Exceptions, mcelog; note that ECC memory can fail if the number of errors is huge) or bad CPU on host. If you read data from disk, this can be caused by disk bit rot. This exception protects ClickHouse from data corruption due to hardware failures.: (while reading column id): (while reading from part /data/clickhouse/store/979/9795066e-6ea5-4550-8361-6c35d8ed9dca/7-20230727_1282473_1283947_9/ from mark 2232 with max_rows_to_read = 33022): While executing MergeTreeThread. (CHECKSUM_DOESNT_MATCH) (version 23.3.1.2823 (official build))

, server ClickHouseNode [uri={socket_timeout=300000,use_server_time_zone=false,use_time_zone=false}]@-1484630932

2.问题解决

https://github.com/marliotto/clickhouse-bitflip

使用clickhouse-bitflip修复损坏的clickhouse数据,将代码下载下来,进行build,然后修复出问题的文件。 像上面的报错,需要修复的文件是

/data/clickhouse/store/979/9795066e-6ea5-4550-8361-6c35d8ed9dca/7-20230727_1282473_1283947_9/id.bin

修复命令

./clickhouse-bitflip /data/clickhouse/store/979/9795066e-6ea5-4550-8361-6c35d8ed9dca/7-20230727_1282473_1283947_9/id.bin

相关推荐
梦想与想象-广州大智汇4 天前
MySQL 同步数据到 ClickHouse 方案对比分析
数据库·mysql·clickhouse
Smile_2542204185 天前
clickhouse日志疯涨问题
linux·运维·服务器·clickhouse
计算机魔术师5 天前
【技术硬核 | 存储】ClickHouse 原理与 Langfuse 存储实践:当 LLM Trace 爆炸时,PG 还扛得住吗?
人工智能·clickhouse·工程实践·sbti·职场焦虑
fire-flyer8 天前
ClickHouse系列(九):慢查询、内存 OOM 与稳定性治理
android·clickhouse
fire-flyer8 天前
ClickHouse系列(十):生产架构与最佳实践总结
clickhouse·架构
fire-flyer8 天前
ClickHouse系列(八):ClickHouse 的 UPDATE / DELETE 正确姿势
大数据·数据库·clickhouse
fire-flyer8 天前
ClickHouse系列(七):Materialized View 与多分辨率 Rollup 设计
大数据·数据库·clickhouse·架构
fire-flyer10 天前
ClickHouse系列(二):MergeTree 家族详解
大数据·数据库·clickhouse
fire-flyer10 天前
ClickHouse系列(四):压缩不是为了省磁盘,而是为了更快的查询
数据库·clickhouse
l1t10 天前
测试clickhouse 26.3的新功能
数据库·clickhouse