
I have this simple query in PostgreSQL:

EXPLAIN ANALYZE 
select * from email_events 
where act_owner_id = 500
order by date desc
limit 500

The first execution of the query takes a very long time, about 7 seconds:

"Limit  (cost=0.43..8792.83 rows=500 width=2311) (actual time=3.064..7282.497 rows=500 loops=1)"
"  ->  Index Scan Backward using email_events_idx_date on email_events  (cost=0.43..233667.36 rows=13288 width=2311) (actual time=3.059..7282.094 rows=500 loops=1)"
"        Filter: (act_owner_id = 500)"
"        Rows Removed by Filter: 1053020"
"Total runtime: 7282.818 ms"

After the first execution the query is cached, I guess, and runs in 20-30 ms.

Why is the LIMIT so slow when there is no cache? How can I fix this?

The table has 2.5 mil rows –  Vasil Atanasov Feb 21 at 19:48
Do you have an index on (act_owner_id, date)? – DRapp Feb 21 at 19:53
Yes I do; even if I order by act_owner_id the result is the same. – Vasil Atanasov Feb 21 at 20:06
Try a composite index on act_owner_id + date (in this order). –  kordirko Feb 21 at 20:12
Approximately how big is your database? What does the load on your system look like? (A really huge database with limited resources might be lagging on I/O on the first attempt.) – frlan Feb 21 at 21:04

2 Answers

PostgreSQL thinks it will be faster to scan the date-ordered index backwards (i.e. in DESC order), reading every row and throwing away the rows that don't have the right act_owner_id. It's having to do 1053020 random reads to do this, and backward index scans aren't very fast either.

Try creating an index on email_events(date DESC, act_owner_id). I think Pg will be able to do a forward index scan on that and then use the second index term to filter rows, so it shouldn't have to do a heap lookup. Test with EXPLAIN and see.
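For reference, a sketch of what that could look like (the index name below is just an example, not something from the question):

CREATE INDEX email_events_date_owner_idx
    ON email_events (date DESC, act_owner_id);

-- Re-run the plan to check whether the new index is actually used
EXPLAIN ANALYZE
select * from email_events
where act_owner_id = 500
order by date desc
limit 500;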

Not helping... The DB was filled with dummy data that I generated, loaded table by table in bulk... I ran CLUSTER on one smaller table, and the problem seems to be fixed for that table. After clearing the cache, the first execution of the query takes less than 500 ms; before CLUSTER it used to go up to 20 sec. I will run CLUSTER on the biggest table to see if that fixes the problem. – Vasil Atanasov Feb 22 at 14:20

CLUSTER table ON index seems to fix the problem. It seems that after bulk data loading the data is scattered all over the hard drive. CLUSTER will re-order the data on disk to match the index order.
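A minimal sketch of the command, assuming you cluster on the date index shown in the plan above (clustering on the composite (act_owner_id, date) index suggested in the comments is another option):

-- Physically rewrites the table in index order; this takes an exclusive
-- lock, and the ordering is not maintained for new rows, so it may need
-- to be repeated after further bulk loads.
CLUSTER email_events USING email_events_idx_date;

-- Refresh planner statistics afterwards
ANALYZE email_events;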

Nice to see CLUSTERing on an index being useful for once. –  Craig Ringer Feb 24 at 8:08
