Traversing binary tree

Question

I was given the following question to answer as a part of the hiring test using Codility. I would appreciate if any of you can give me the right answer for this problem.

Here is the problem description:

A binary tree is either an empty pointer or a node that consists of an integer value and two sub trees. A binary tree T is given. A node of that tree containing value V is described as visible if the path from the root of the tree to the node does not contain a node with any value exceeding V. In particular the root node is always visible and nodes with values lower than that of the root node are never visible.

Assume the following declarations are given
Class Tree 
{
  public int x;
  public Tree l;
  public Tree r; 
}
Write a function:
Class Solution { int solution(Tree T);}
that given a binary tree T consisting of N nodes, returns the number of visible nodes.

Assume that:

.N is an integer within the range of [0....1,000,000] .each element of tree T is an integer with in the range [-1,000,000...1,000,000]

For example, given tree T that has the following structure
        5
       / \      
      3   10
     / \  / 
   20  21 1
the function should return 4, because there are 4 visible nodes, namely those with values 5,10,20 and 21. The node with the value 1 is not visible because there is a node with a value 10 on the path from the root to that node and 10 exceeds 1. The node with value 3 is not visible because its value is lower than that of the root node, which has a value of 5.

Given the Tree T has the following structure:
      8
      / \
     2   6
    / \       
   8   7
the function should return 2 since the only visible nodes are with value 8.

As far as i remember the complexity should be O(N)... I am not sure. Although I did come up with a solution, I was not considered for the next round: so I am hoping for some insight into how this can be solved in an optimal manner.

The following is the solution that I came up with for this problem.

class Program
{
    static int count = 0;
    static int MainTreeNodeValue = 0;

    static void Main(string[] args)
    {
        Tree mainT = new Tree();

        mainT.x = 8;
        mainT.l = new Tree();
        mainT.r = new Tree();

        mainT.l.x = 9;
        mainT.r.x = 6;

        mainT.l.l = new Tree();
        mainT.l.l.x = 9;

        mainT.l.r = new Tree();
        mainT.l.r.x = 9;

        int finalCount = solution(mainT);

        Console.WriteLine(finalCount);
        Console.ReadLine();

    }

    public static int solution(Tree T)
    {

                if (MainTreeNodeValue == 0)
                    MainTreeNodeValue = T.x;

                if (T.x >= MainTreeNodeValue)
                        count++;

                if (T.l != null && T.l.x >= T.x && T.l.x >= MainTreeNodeValue)
                    count++;

                if (T.r != null && T.r.x >= T.x && T.r.x >= MainTreeNodeValue)
                    count++;

                if (T.l != null && T.l.l != null && !(T.l.x >= MainTreeNodeValue))
                   solution(T.l);


                if (T.l != null && T.l.r != null && !(T.r.x >= MainTreeNodeValue))
                    solution(T.r);


        return count;
    }
}

class Tree
{
    public int x;
    public Tree l;
    public Tree r;
}

You ask as if your solution does not work... does it work? If it does not work then this question is off-topic for CodeReview. See the help center — rolfl, 2 days ago

Mat's Mug · Answer 1 · 2014-03-03 19:24:55Z

Naming

The naming in the specs is awful.

For one thing, the Tree class:

class Tree
{
    public int x;
    public Tree l;
    public Tree r;
}

This class is bad for several reasons:

It's exposing its private fields;
Publicly accessible members have cryptic names;
The names chosen don't follow naming conventions for public members, which is PascalCasing.

The solution method:

Also has a bad name - method names should start with a verb that says what the method does;
Takes a T parameter of type Tree; T is typically ok as a name for a type parameter name in a generic class. Parameters should be camelCase, so a better name would have been tree.
Because of the bad naming, the entire content of the method is barely readable.

Nitpicks

Couple more points:

Scopes are not explicitly defined. It's always best to include the braces for an if block, even if it's a single line of code.
The solution could have been encapsulated in its own class, and made non-static.
Each if is redundantly checking if T.l != null or T.r != null; you could have regrouped the cases where [T.l == null | T.r == null] and arranged the conditionals in such a way that you wouldn't need to check again something you've checked already.

Creational Patterns

I think most of the code in the Main method belongs in a Builder or Factory class whose role is to create a Tree. All this manual setup looks like a missed opportunity at implementing a Builder pattern.

I... sorry I kinda skipped the "specs" part - the names in the Tree class seem to have been part of the spec.. however the specs also ask for a class Solution with a solution method - you haven't written that class and made it a static method instead. — Mat's Mug, 2 days ago
Hello..Thanks for taking time for your feed back. I understand that the class is badly designed etc. But the reason I posted this was if there was better way to write the method "solution" which calculates the number of nodes which are visible. — user3375390, 2 days ago
@user3375390 I've written a quick review on my lunch hour, CR answers can address any aspect of the code; by posting this answer I've addressed the aspects I could address in the limited amount of time I had, and other answers can complete with reviewing other aspects. Cheers! — Mat's Mug, 2 days ago

Ryan · Answer 2 · 2014-03-03 22:05:13Z

I believe your MainTreeNodeValue is throwing you off a bit. For starters, you only want to set it when you start traversing the tree - ie, you only want to set it once. A case where your code won't work is when the root value is 0. This will cause your if statement to set MainTreeNodeValue to 0, which will cause the if statement to be hit again on a lower level, and your MainTreeNodeValue will eventually be set to some non-zero value, which you don't want because your tree's root value is in fact 0.

Typically, in cases like this, there is a helper function that helps initialize things along with kicking off the recursion. Here's an example:

int MainTreeNodeValue; // This should not be a static variable!
public int solution(Tree T) 
{
    MainTreeNodeValue = T.x;
    return solutionHelper(T); // returns your final 'count'
}

private int solutionHelper(Tree T) // private to hide visibility.
{ 
    // Your code here 
}

Also, your algorithm won't work because it doesn't evaluate all nodes. Regardless of whether the current node is visible or not is not important, you must evaluate both children regardless.

I won't put the answer here, but your solution should have this basic layout:

// (All of this in solutionHelper(Tree t), as seen above)

if (T == null)
{
    // We want to break out of here if T is null, 
    //so we don't call T.x, T.l, or T.r on a null pointer.
    return 0; 
}

int visibility = isVisible(T) ? 1 : 0; // I'll leave it up to you to implement this.

// Always evaluate T.l and T.r!
return visibility + solutionHelper(T.l) + solutionHelper(T.r);

The tricky part is determining if the current node is visible. You will most likely need to maintain a stack or some other data structure, but I'll leave that as an exercise for the reader. You should note, however, that I don't use the MainTreeNodeValue or count anywhere.

Edit 1:

There may be some confusion on what the problem is asking. The problem says:

A node of that tree containing value V is described as visible if the path from the root of the tree to the node does not contain a node with any value exceeding V.

I read that like this: the current node we are looking at has the value V. As in, T.x == V. The node T is visible if the path from the root of the tree to T does not contain a value greater than T.x. Take this tree for example(right sides are ignored for simplicity):

I would say that 5 and 6 are visible. 6 is visible because none of the previous nodes exceed its value(including the root node). Take this tree for example, though:

Only 5 is visible, because 3 is exceeded by the root node. However, and perhaps most importantly, you're missing this case(which doesn't exist in your problem's setup either):

All nodes here are visible nodes. 5 is visible because it's the rood node, 10 is visible because 5 does not exceed it, 20 is visible because neither 5 nor 10 exceed it, and 40 is visible because 5, 10, and 20 don't exceed it.

In the examples given, all of the intermediate values between the root and the visible nodes happen to be less than the root node's value, but that is not a requirement for a node to be visible. A node is visible if and only if all values preceding it - including the root node's - are lower.

Ryan, Thanks for your inputs. Very much appreciated. As far as your comments about "Also, your algorithm won't work because it doesn't evaluate all nodes. Regardless of whether the current node is visible or not is not important, you must evaluate both children regardless." As per the problem, a child node can be considered visible only if all the parent nodes in its path are less than the root nodes value. So once I encounter a child node with a value greater than or equal to the root node then I do not see a reason to traverse the remaining child nodes. But thanks again for your input ! — user3375390, 2 days ago
Thanks for taking time reply back Ryan. I see your point. Looks like I misread it. — user3375390, yesterday

ChrisW · Answer 3 · 2014-03-03 23:24:55Z

Functional problems

You don't run the software using the test data given in the problem definition; so the reader doesn't know whether your solution returns the correct result for those two test cases.

Recursion is the obvious way to solve this problem; however it won't work: because when N is 1,000,000 as specified in the problem description, that much recursion is likely to cause a stack overflow. Therefore you should demonstrate that you know how (e.g. by using local heap-based stack variable) to unroll the recursion: how to iterate all the nodes of a tree without using recursion.

Stylistic problems

Stylistic problems are important too:

You're writing source code for other programmers to read: not just writing for the compiler
People (e.g. hiring managers) are IMO likely to judge you and your code based on its appearance as well as on its behaviour

So:

The formatting is poor:
- Too much indentation
- Too many blank lines
Too many global (i.e. static) variables; count and MainTreeNodeValue could be instance data members of a Solution class, or passed as parameters into the static solution method.
Inconsistent capitalization of variable names: sometimes they're lower-case (e.g. count) and sometimes upper-case (e.g. MainTreeNodeValue and T).
Poor choice of variable names; e.g. mainT has an abbreviation: why not mainTree or root instead?
The Console.ReadLine(); at the end is uneccesary: it's for debugging only (so you can inspect the results before the console window auto-closes when you run a debugging session); instead, for debugging put a break-point on the closing curly brace of Main, and don't leave debug-only statements which impair the run-time behaviour.
Running the solution method has a (strange) side-effect on global/static data ...
```
if (MainTreeNodeValue == 0)
    MainTreeNodeValue = T.x;
```

Can you give me an example of using a heap-based stack variable?

It's a neat technique, that I saw recently on this site.

When I said "heap-based" I meant that the local Stack variable stores its contents on the heap instead of on the program stack.

The first results of https://www.google.com/search?q=tree+recursion+stack are about traversing trees without recursion. It's something like:

class Solution
{
    public int solution(Tree root)
    {
        if (root == null)
            return 0;
        int rootValue = root.x;
        Stack<Tree> stack = new Stack<Tree>();
        stack.Push(root);
        int count = 0;
        while (stack.Count != 0)
        {
            Tree found = stack.Pop();
            if (found.x >= rootValue)
                ++count;
            if (found.l != null)
                stack.Push(found.l);
            if (found.r != null)
                stack.Push(found.r);
        }
        return count;
    }
}

Here's the same, modified for the requirement which is that:

The descent should search the ever node in the tree (including invisible children)
Nodes are counted (i.e. "visible") if they are at east as large as their largest ancestor.

using Pair = System.Tuple<Tree, int>;

class Solution
{
    public int solution(Tree root)
    {
        if (root == null)
            return 0;
        // A node of that tree containing value V is described as visible
        // if the path from the root of the tree to the node does not contain
        // a node with any value exceeding V.
        // So, store each Tree node together with an 'int' which represents
        // the highest V value found along that branch.
        Stack<Pair> stack = new Stack<Pair>();
        stack.Push(Tuple.Create(root, root.x));
        int count = 0;
        while (stack.Count != 0)
        {
            Pair tuple = stack.Pop();
            Tree found = tuple.Item1;
            int maxV = tuple.Item2;
            if (found.x >= maxV)
            {
                ++count;
                maxV = found.x;
            }
            if (found.l != null)
                stack.Push(Tuple.Create(found.l, maxV));
            if (found.r != null)
                stack.Push(Tuple.Create(found.r, maxV);
        }
        return count;
    }
}

Chris.. thanks for taking time to review this code. Can you give me an example of using a heap-based stack variable ? Thanks for your valuable input again. — user3375390, 2 days ago
The dept of a balanced binary tree with 1,000,000 nodes is 19, you have absolutely no reason to use a heap allocated stack instead of doing it the right way. On the other hand, the spec doesn't state if the tree is balanced. That's the problem with those stupid interview tests: You never know if they assume a balanced tree and laugh at you for using a heap allocated stack, or if the whole rationale for this test is to see if you are able to spot the potential stack overflow. God I hate those tests. — thomasch, yesterday
@thomasch Why is using recursion, instead of a local stack variable, "the right way"? — ChrisW, 21 hours ago

hocho · Answer 4 · 2014-03-03 21:53:54Z

up vote 1 down vote

Taking some pointers from other responses and adding my own 2 cents, a functional programming approach to this problem could look like this.

public class Node
{
    public int Value;
    public Node Left;
    public Node Right;

    public static int GetSolution(Node node, int maxValue = Int32.MinValue)
    {
        return
            node == null
                ? 0
                : node.Value >= maxValue
                        ?   (GetSolution(node.Left, node.Value)
                                + GetSolution(node.Right, node.Value)
                                + 1)
                        :   (GetSolution(node.Left, maxValue)
                                + GetSolution(node.Right, maxValue));

    }
}

edited 2 days ago

answered 2 days ago

hocho
27413

2

Hmm.. Isn't that slightly abusing ternary operators? – Mat's Mug 2 days ago

1

2 levels of ternary operators does not count as abuse to me. Replacing with an 'if', will make the solution imperative (with mutating variables) rather than functional (immutable values). Definitely can be rewritten in a style more comforting to the developer. – hocho 2 days ago

With up to 1,000,000 nodes in the tree, you will blow the stack with the recursion – Robert Slaney 2 days ago

Correct, I missed that requirement. Yes, a heap based stack would be a required for a large unbalanced tree. If the tree is balanced, the depth will be approximately 20. (2 ^ 20 > 1,000,000) and a recursive solution would work. – hocho yesterday

add comment

Jamal · Answer 5 · 2014-03-05 16:02:32Z

up vote 0 down vote

This code uses recursive and a very simple way to counting the partial order of a tree. It uses traditional tree travel. I believe this is a good point to help the others to think about the new solution.

public class Tree {
public int v;
public Tree l;
public Tree r;

public int travelNode(Tree root) {
    int counter = 0;

    if (root != null && root.l != null){
        if (root.v <= root.l.v)
            counter +=1;
        counter +=travelNode(root.l);
    }
    if (root != null && root.r != null) {
        if (root.v <=root.r.v)
            counter +=1;
        counter +=travelNode(root.r);
    }
    return counter;
   }

   Tree(int v, Tree l, Tree r) {
       this.l = l;
       this.r = r;
       this.v =v;
   }
}

The tree can be built as

Tree root = new Tree(5, new Tree(3, new Tree(20, null, null),
    new Tree(21, null,null)), new Tree(10, new Tree(1, null, null), null));

The number of visible nodes are root.travelNode(root) + 1;.

edited 16 hours ago

Jamal♦
6,74033165

answered 16 hours ago

daniel
11

Could you please explain how this improves the OP's code? Code-only answers are discouraged and are subject to deletion. – Jamal♦ 16 hours ago

this code uses recursive and a very simple way to counting the partial order of a tree. it uses traditional tree travel. I believe this is a good point to help the others to think about the new solution. does it mean to be deleted? – daniel 16 hours ago

That should be added to the post. – Jamal♦ 16 hours ago

Thanks Jamal. It looks better! – daniel 16 hours ago

add comment

asked	2 days ago
viewed	387 times
active	today

current community

your communities

more stack exchange communities

Traversing binary tree

5 Answers

Naming

Nitpicks

Creational Patterns

Edit 1:

Functional problems

Stylistic problems

Your Answer

Not the answer you're looking for? Browse other questions tagged c# algorithm codility or ask your own question.

Community Bulletin

Hot Network Questions

current community

your communities

more stack exchange communities

Traversing binary tree

5 Answers

Naming

Nitpicks

Creational Patterns

Edit 1:

Functional problems

Stylistic problems

Your Answer

Sign up or log in

Post as a guest

Not the answer you're looking for? Browse other questions tagged c# algorithm codility or ask your own question.

Community Bulletin

Related

Hot Network Questions