Dynamic Programming— Variable Width Bin (Equi-Depth) Histogram

Question

Given some data, and a fixed number of bins (k)-- How can I design a Dynamic Programming algorithm that minimizes the largest difference between bin sizes?

In other words, with a set number of bins (k), I am looking to adjust the width to make them all as equal as possible.

I have read this paper, however, it doesn't really handle what I am trying to accomplish. http://arxiv.org/pdf/1207.5578v3.pdf

Can I somehow modify their algorithm for my purposes or is there another approach I should look at?

John Moeller · Answer 1 · 2013-03-09 10:10:19Z

up vote 0 down vote

EDIT: It looks like I misunderstood the question, so after the discussion settles down I'll delete this answer.

Hint: The key to designing this algorithm is observing where the bin boundaries lie. Start with the smallest possible bins that will cover the data with $k$ bins (i.e., the width of the data divided by k).

By increasing the width, can you move a point from a bin to a lighter bin?
By shifting the first bin's starting point lower, can you do the same?

These are the recursion choices you can make, and then use the techniques for other dynamic programs to write down an algorithm.

edited Mar 9 at 10:10

answered Mar 9 at 5:16

John Moeller
9121311

	I don't follow. I thought for dynamic programming solutions you build an optimal solution at n and use that for n+1? Also, what do you mean by:"shifting the first bin lower" how can I shift a bin lower that starts at my first element? – quannabe Mar 9 at 9:41
	All k of your bins are the same width, correct? – John Moeller Mar 9 at 9:56
	Fixed k. No, they do not need to be equal width – quannabe Mar 9 at 10:01
	Ah, ok, I misunderstood. It sounds as though you want close to, if not the same, number of points in each bin? If that's the case, you'll want to ignore my answer and look up "quantiles." If k = 4 that sounds like quartiles. I don't know that you'd need dynamic programming for that though. – John Moeller Mar 9 at 10:05
	In fact, I think you can do this in $O(n \log k)$ by recursively running $k$-select. – John Moeller Mar 9 at 10:12

asked	2 months ago
viewed	53 times
active	2 months ago

Dynamic Programming— Variable Width Bin (Equi-Depth) Histogram

1 Answer

Your Answer

Not the answer you're looking for? Browse other questions tagged homework recursive-algorithms dynamic-programming or ask your own question.

Dynamic Programming— Variable Width Bin (Equi-Depth) Histogram

1 Answer

Your Answer

Sign up or log in

Post as a guest

Not the answer you're looking for? Browse other questions tagged homework recursive-algorithms dynamic-programming or ask your own question.

Related