sql中select查询大数据量表时很慢

不恋水的雨2024-05-15 2:31

有一张表table1数据量大概130万，查询sql如下：

select count(*) from(

select distinct a.example_id,a.plan_id,a.trustor_id from table1 a

where DEL_STATUS=0 and (example_id,1,2) in(...)

and plan_id in(...)

)

查询花了将近40秒钟，结果集的数据量就有50万，而且多加一个where条件，都会导致查询耗时增加，就算加索引也没有效果，因为我这里必须要加几个where条件，而这些条件又无法过滤掉大部分的数据，所以只会白白增加耗时。

经过一段时间对sql的修改调整，调整为如下的sql：

select count(*) from(

select a.example_id,a.plan_id,a.trustor_id,count(*) cnt from table1 a

where DEL_STATUS=0 and (example_id,1,2) in(...)

group by a.example_id,a.plan_id,a.trustor_id

) where plan_id in()

不使用distinct，而是使用count + group by来去重，然后在外层过滤where条件，发现查询只要5秒多，速度一下子快了太多，可能是数据库中group by有着天然的优势吧