Common SQL for Doris and ClickHouse

I. Common Doris SQL

1. Total size of each Doris database (in MB)

SELECT 
    table_schema AS database_name,
    ROUND(SUM(data_length) / 1024 / 1024, 2) AS database_size_MB
FROM information_schema.tables
WHERE table_schema NOT IN ('information_schema', 'sys', 'mysql')
GROUP BY table_schema;

2. All Doris databases ranked by size (largest first)

SELECT 
    table_schema AS database_name,
    ROUND(SUM(data_length) / 1024 / 1024, 2) AS database_size_MB
FROM information_schema.tables
GROUP BY table_schema
ORDER BY database_size_MB DESC;

3. Table name, row count, and size (MB) for every table in a database

SELECT 
    table_name,
    table_rows AS row_count,
    ROUND(data_length / 1024 / 1024, 2) AS table_size_MB
FROM information_schema.tables
WHERE table_schema = 'your_database'
ORDER BY table_size_MB DESC;

4. Total row count and total size of a whole database

SELECT 
    table_schema AS database_name,
    SUM(table_rows) AS total_rows,
    ROUND(SUM(data_length) / 1024 / 1024, 2) AS total_size_MB
FROM information_schema.tables
WHERE table_schema = 'your_database'
GROUP BY table_schema;
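
As a cross-check on the information_schema figures above, Doris also has a built-in SHOW DATA statement. The sketch below assumes the same placeholder database name `your_database`; `your_table` is a hypothetical table name, and the exact output columns depend on the Doris version.

```sql
-- Switch to the database to inspect (placeholder name).
USE your_database;

-- Data size summary for the tables of the current database.
SHOW DATA;

-- Data size for a single table (your_table is a hypothetical name).
SHOW DATA FROM your_database.your_table;
```

If the two sources disagree, the table_rows and data_length values in information_schema may simply be lagging behind recent loads.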

5. Check whether a database contains AGGREGATE-model tables

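-- Caution: create_statement is not a standard information_schema.tables column
-- and may be missing in your Doris version; if this query fails, inspect each
-- table with SHOW CREATE TABLE instead.
SELECT
    table_schema,
    table_name,
    engine
FROM information_schema.tables
WHERE engine = 'OLAP'
  AND create_statement LIKE '%AGGREGATE KEY%'
  AND table_schema = 'your_database';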
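
A per-table way to confirm the data model is to read the table's DDL directly. This is a minimal sketch; `your_table` is a hypothetical table name, and you would repeat the statement for each table you care about.

```sql
-- Print the full CREATE TABLE statement of one table (placeholder names).
-- An aggregate-model table shows an AGGREGATE KEY(...) clause in the output.
SHOW CREATE TABLE your_database.your_table;
```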

II. Common ClickHouse SQL

1. Raw (uncompressed) data size of every table

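SELECT
    database,
    table AS table_name,
    -- data_uncompressed_bytes is the raw size; bytes_on_disk would be the compressed size
    formatReadableSize(sum(data_uncompressed_bytes)) AS raw_size
FROM system.parts
WHERE active  -- count only active data parts
  AND database = 'your_database_name'  -- filter by database (optional)
GROUP BY database, table_name
ORDER BY sum(data_uncompressed_bytes) DESC;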

2. Compressed on-disk size of every table

SELECT
    database,
    table AS table_name,  -- in system.parts, name is the part name; table is the table name
    formatReadableSize(sum(data_compressed_bytes)) AS compressed_size,
    formatReadableSize(sum(data_uncompressed_bytes)) AS uncompressed_size,
    -- compressed size as a percentage of the uncompressed size
    round((sum(data_compressed_bytes) / sum(data_uncompressed_bytes)) * 100, 2) AS compression_ratio
FROM system.parts
WHERE active  -- count only active data parts
  AND database = 'your_database_name'  -- filter by database (optional)
GROUP BY database, table_name
ORDER BY sum(data_compressed_bytes) DESC;

3. Row count and compressed size of every table

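SELECT
    database,
    table AS table_name,  -- in system.parts, name is the part name; table is the table name
    formatReadableSize(sum(bytes_on_disk)) AS disk_size,
    sum(rows) AS total_rows,
    formatReadableSize(sum(data_compressed_bytes)) AS compressed_size,
    formatReadableSize(sum(data_uncompressed_bytes)) AS uncompressed_size,
    -- compute the ratio from the raw byte sums; the formatted sizes are strings
    round((sum(data_compressed_bytes) / sum(data_uncompressed_bytes)) * 100, 2) AS compression_ratio
FROM system.parts
WHERE active  -- count only active data parts (skips outdated or dropped parts)
  AND database = 'your_database_name'  -- filter by database (optional)
GROUP BY database, table_name
ORDER BY sum(data_compressed_bytes) DESC;  -- sort by bytes, not by the formatted string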
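
When a quick overview is enough, a lighter alternative is to read the per-table totals that ClickHouse keeps in system.tables rather than aggregating system.parts. This is only a sketch: it reuses the placeholder database name from the queries above, and total_rows and total_bytes can be NULL for engines that cannot report them cheaply.

```sql
SELECT
    database,
    name AS table_name,                           -- in system.tables, name is the table name
    total_rows,                                   -- may be NULL for some engines
    formatReadableSize(total_bytes) AS disk_size  -- may be NULL for some engines
FROM system.tables
WHERE database = 'your_database_name'  -- placeholder database name
ORDER BY total_bytes DESC;
```

The system.parts queries remain the better choice when you need the compressed and uncompressed sizes side by side.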