mysql中count(1)与count(*)比较
sql调优,主要是考虑降低:consistent gets和physical reads的数量.
count(1)与count(*)比较:
如果你的数据表没有主键,那么count(1)比count(*)快,如果有主键的话,那主键(联合主键)作为count的条件也比count(*)要快,如果你的表只有一个字段的话那count(*)就是最快的啦,count(*) count(1) 两者比较,主要还是要count(1)所相对应的数据字段.
如果count(1)是聚索引,id,那肯定是count(1)快。但是差的很小的,因为count(*),自动会优化指定到那一个字段,所以没必要去count(?),用count(*),sql会帮你完成优化的.
count详解:count(*)将返回表格中所有存在的行的总数包括值为null的行,然而count(列名)将返回表格中除去null以外的所有行的总数,有默认值的列也会被计入.
distinct 列名,得到的结果将是除去值为null和重复数据后的结果
总结三条经验:
1.任何情况下SELECT COUNT(*) FROM tablename是最优选择;
2.尽量减少SELECT COUNT(*) FROM tablename WHERE COL = 'value’这种查询;
3.杜绝SELECT COUNT(COL) FROM tablename的出现。
国个找一文章不懂英文没译,代码如下:
- COUNT(*)vsCOUNT(col)
- LookingathowpeopleareusingCOUNT(*)andCOUNT(col)itlookslikemostofthemthinktheyaresynonymsandjustusingwhattheyhappentolike,whilethereissubstantialdifferenceinperformanceandevenqueryresult.
- Letslookatthefollowingseriesofexamples:
- CREATETABLE`fact`(
- `i`int(10)unsignedNOTNULL,
- `val`int(11)defaultNULL,
- `val2`int(10)unsignedNOTNULL,
- KEY`i`(`i`)
- )ENGINE=MyISAMDEFAULTCHARSET=latin1
- mysql>selectcount(*)fromfact;
- +———-+
- |count(*)|
- +———-+
- |7340032|
- +———-+
- 1rowinset(0.00sec)
- mysql>selectcount(val)fromfact;
- +————+
- |count(val)|
- +————+
- |7216582|
- +————+
- 1rowinset(1.17sec)
- mysql>selectcount(val2)fromfact;
- +————-+
- |count(val2)|
- +————-+
- |7340032|
- +————-+
- 1rowinset(0.00sec)
As this is MYISAM table MySQL has cached number of rows in this table. This is why it is able to instantly answer COUNT(*) and
COUNT(val2) queries, but not COUNT(val). Why ? Because val column is not defined as NOT NULL there can be some NULL values in it and so MySQL have to perform table scan to find out. This is also why result is different for the second query.
So COUNT(*) and COUNT(col) queries not only could have substantial performance performance differences but also ask different question.
MySQL Optimizer does good job in this case doing full table scan only if it is needed because column can be NULL.
Now lets try few more queries,代码如下:
- mysql>selectcount(*)fromfactwherei<10000;
- +———-+
- |count(*)|
- +———-+
- |733444|
- +———-+
- 1rowinset(0.40sec)
- mysql>explainselectcount(*)fromfactwherei<10000G
- ***************************1.row***************************
- id:1
- select_type:SIMPLE
- table:fact
- type:range
- possible_keys:i
- key:i
- key_len:4
- ref:NULL
- rows:691619
- Extra:Usingwhere;Usingindex
- 1rowinset(0.00sec)
- mysql>selectcount(val)fromfactwherei<10000;
- +————+
- |count(val)|
- +————+
- |720934|
- +————+
- 1rowinset(1.29sec)
- mysql>explainselectcount(val)fromfactwherei<10000G
- ***************************1.row***************************
- id:1
- select_type:SIMPLE
- table:fact
- type:range
- possible_keys:i
- key:i
- key_len:4
- ref:NULL
- rows:691619
- Extra:Usingwhere
- 1rowinset(0.00sec)
- mysql>selectcount(val2)fromfactwherei<10000;
- +————-+
- |count(val2)|
- +————-+
- |733444|
- +————-+
- 1rowinset(1.30sec)
- mysql>explainselectcount(val2)fromfactwherei<10000G
- ***************************1.row***************************
- id:1//phpfensi.com
- select_type:SIMPLE
- table:fact
- type:range
- possible_keys:i
- key:i
- key_len:4
- ref:NULL
- rows:691619
- Extra:Usingwhere
- 1rowinset(0.00sec)
As you can see even if you have where clause performance for count(*) and count(col) can be significantly different. In fact this example shows just 3 times performance difference because all data fits in memory, for IO bound workloads you frequently can see 10 and even 100 times performance difference in this case.
The thing is count(*) query can use covering index even while count(col) can’t. Of course you can extend index to be (i,val) and get query to be index covered again but I would use this workaround only if you can’t change the query (ie it is third party application) or in case column name is in the query for reason, and you really need count of non-NULL values.
It is worth to note in this case MySQL Optimizer does not do too good job optimizing the query. One could notice (val2) column is not null so count(val2) is same as count(*) and so the query could be run as index covered query. It does not and both queries have to perform row reads in this case.代码如下:
- mysql>altertablefactdropkeyi,addkey(i,val);
- QueryOK,7340032rowsaffected(37.15sec)
- Records:7340032Duplicates:0Warnings:0
- mysql>selectcount(val)fromfactwherei<10000;
- +————+
- |count(val)|
- +————+
- |720934|
- +————+
- 1rowinset(0.78sec)
热门评论