Mysql - How to optimize retrival time in a table

Question

I have query like this! which has 200 million Records in a single table.. I am using BTree Indexes in my table...

mysql> select COUNT(DISTINCT id) from [tablename] where [columname] >=3;
+------------------------------+
| COUNT(DISTINCT id) |
+------------------------------+
| 8242063
+------------------------------+
1 row in set (3 min 23.53 sec)

I am not satisfy with this timing ..! how can I reduce the result time less than 30sec. Kindly give me any suggessions! It will be more helpful to me!

thanking you!

You could use the explain keyword in front of your statement, you'll get a bit of information of what the query executes
is the id unique across the table? Why do you do 'COUNT(DISTINCT(id))' instead of 'COUNT(*)'?
I also tried Explain keyword.. which give only table rows count..
@newtover no, id is not unique in my table.. bcz of tat I m using distinct id

newtover · Answer 1 · 2013-03-11 07:01:55Z

Assuming you have an index on 'columnname' from the condition, and you really need 'COUNT(DISTINCT(id))', instead of just 'COUNT(*)' (that is the id is not UNIQUE or can be NULL), there is hardly anything you can do. What you request is basically an index scan starting from a given value '3', which is rather hard to express in other SQL terms.

ravnur · Answer 2 · 2013-03-11 08:30:13Z

If you are using InnoDB engine you can try partitions over you column (list and range preferable to test).

Another option:

add column is_more_than_2
is_more_than_2 = if(col > 2, 1, 0)
create index over this field and use it

But it will require some workaround to push this changes to production. Options:

long downtime
you can try do it without downtime but it will require a lot of workaround with additional table, triggers and replacing table at the end;

Hopes it helps

PS. Pure indexes will not helps you. Also you can think about re-implementing logic.

Aiias · Answer 3 · 2013-03-11 06:16:58Z

up vote 0 down vote

Maybe GROUP BY can help. See this MySQL reference on DISTINCT optimization

SELECT COUNT(*)
FROM (
  SELECT 1
  FROM [tablename]
  WHERE [columnname] >= 3
  GROUP BY id
) q

answered Mar 11 at 6:16

Aiias

	The following query gives me individual counts of each id.. I need whole table distince id count! thank you! – Zameer Ahmed Mar 11 at 6:48
	Edited answer and wrapped in outer `SELECT`. – Aiias Mar 11 at 13:40
	The given query was very helpful to me.. using this code I am getting the result within 1min.7sec.. thanks you! Do u have any other idea to optimize further to get a result within 30sec.. thank in advance! – Zameer Ahmed Mar 12 at 9:40

asked	2 months ago
viewed	18 times
active	28 days ago

Mysql - How to optimize retrival time in a table

migrated from stackoverflow.com Mar 31 at 1:04

3 Answers

Your Answer

Community Bulletin

Mysql - How to optimize retrival time in a table

migrated from stackoverflow.com Mar 31 at 1:04

3 Answers

Your Answer

Sign up or log in

Post as a guest

Community Bulletin

Related