Comparing values in two numpy arrays with 'if'

Question

Im fairly new to numpy arrays and have encountered a problem when comparing one array with another.

I have two arrays, such that:

a = np.array([1,2,3,4,5])
b = np.array([2,4,3,5,2])

I want to do something like the following:

if b > a:
    c = b
else:
    c = a

so that I end up with an array c = np.array([2,4,3,5,5]).

This can be otherwise thought of as taking the max value for each element of the two arrays.

However, I am running into the error

ValueError: The truth value of an array with more than one element is ambiguous. 
Use a.any() or a.all().

I have tried using these but Im not sure that the are right for what I want.

Is someone able to offer some advice in solving this?

The fundamental thing to understand here is that it is ambiguous to compare 2 arrays as the error suggests, are you wanting to compare all values, what if one value is higher? what if all but one is higher? So you have to specify the criteria for the comparison, for what you are attempting np.maximum does what you want, but there are also any(), all() attributes also that spell out your criteria so you avoid the error — EdChum, Sep 10 '14 at 8:12

Nras · Accepted Answer · 2014-09-10 08:03:44Z

up vote 7 down vote accepted

You are looking for the function np.fmax. It takes the element-wise maximum of the two arrays, ignoring NaNs.

import numpy as np
a = np.array([1,2,3,4,5])
b = np.array([2,4,3,5,2])
c = np.fmax(a,b)

The output is

array([2, 4, 3, 5, 5])

answered Sep 10 '14 at 8:03

Nras
2,06015

by far the best answer here.. – user3012759 Sep 10 '14 at 8:04

add a comment |

abarnert · Answer 2 · 2014-09-10 08:15:47Z

As with almost everything else in numpy, comparisons are done element-wise, returning a whole array:

>>> b > a
array([ True,  True, False,  True, False], dtype=bool)

So, is that true or false? What should an if statement do with it?

Numpy's answer is that it shouldn't try to guess, it should just raise an exception.

If you want to consider it true because at least one value is true, use any:

>>> if np.any(b > a): print('Yes!')
Yes!

If you want to consider it false because not all values are true, use all:

>>> if np.all(b > a): print('Yes!')

But I'm pretty sure you don't want either of these. You want to broadcast the whole if/else over the array.

You could of course wrap the if/else logic for a single value in a function, then explicitly vectorize it and call it:

>>> def mymax(a, b):
...     if b > a:
...         return b
...     else:
...         return a
>>> vmymax = np.vectorize(mymax)
>>> vmymax(a, b)
array([2, 4, 3, 5, 5])

This is worth knowing how to do… but very rarely worth doing. There's usually a more indirect way to do it using natively-vectorized functions—and often a more direct way, too.

One way to do it indirectly is by using the fact that True and False are numerical 1 and 0:

>>> (b>a)*b + (b<=a)*a
array([2, 4, 3, 5, 5])

This will add the 1*b[i] + 0*a[i] when b>a, and 0*b[i] + 1*a[i] when b<=a. A bit ugly, but not too hard to understand. There are clearer, but more verbose, ways to write this.

But let's look for an even better, direct solution.

First, notice that your mymax function will do exactly the same as Python's built-in max, for 2 values:

>>> vmymax = np.vectorize(max)
>>> vmymax(a, b)
array([2, 4, 3, 5, 5])

Then consider that for something so useful, numpy probably already has it. And a quick search will turn up maximum:

>>> np.maximum(a, b)
array([2, 4, 3, 5, 5])

Great explanation of what I was doing and why it wouldn't work. I can see that I need to treat the array as a whole and not break it up by elements. — Nathan Thomas, Sep 10 '14 at 8:16
@NathanThomas: Exactly. That's the only tricky bit about numpy: most things just work the obvious way you'd expect, but every once in a while you want to do something that's obviously iterative in your hear, and you have to figure out how to translate it into something element-wise. (There's always vectorize—or, when worst comes to worst, fromiter around a generator expression—which you can't, but it should rarely if ever come to that.) — abarnert, Sep 10 '14 at 8:29

skyuuka · Answer 3 · 2014-09-10 08:33:47Z

up vote 0 down vote

The following methods also work:

Use numpy.maximum

>>> np.maximum(a, b)
Use numpy.max and numpy.vstack

>>> np.max(np.vstack(a, b), axis = 0)

edited Sep 10 '14 at 8:33

answered Sep 10 '14 at 8:22

skyuuka
158112

add a comment |

Ashoka Lella · Answer 4 · 2014-09-10 08:39:55Z

up vote 0 down vote

Here's an other way of achieving this

c = np.array([y if y>z else z for y,z in zip(a,b)])

edited Sep 10 '14 at 8:39

answered Sep 10 '14 at 8:03

Ashoka Lella
3,2561625

One problem of this solution is the result is not of type numpy.ndarray as a and b. – skyuuka Sep 10 '14 at 8:32

@skyuuka, thanks, fixed it – Ashoka Lella Sep 10 '14 at 8:40

add a comment |

asked	6 months ago
viewed	93 times
active	6 months ago

current community

your communities

more stack exchange communities

Comparing values in two numpy arrays with 'if'

4 Answers 4

Your Answer

Not the answer you're looking for? Browse other questions tagged python arrays numpy or ask your own question.

Hot Network Questions

current community

your communities

more stack exchange communities

Comparing values in two numpy arrays with 'if'

4 Answers 4

Your Answer

Sign up or log in

Post as a guest

Not the answer you're looking for? Browse other questions tagged python arrays numpy or ask your own question.

Related

Hot Network Questions