Merge Sort algorithm

Question

I just read and experimented with C++ implementation of the merge sort algorithm from the Wikipedia page. During the experiments I have slightly modified the source code and would like to know how much the algorithm is improved (if it is).

//! \brief Performs a recursive merge sort on the given vector
//! \param vec The vector to be sorted using the merge sort
//! \return The sorted resultant vector after merge sort is
//! complete.
vector<int> merge_sort(vector<int>& vec)
{
    // Termination condition: List is completely sorted if it
    // only contains a single element.
    if(vec.size() == 1)
    {
        return vec;
    }

    // Determine the location of the middle element in the vector
    std::vector<int>::iterator middle = vec.begin() + (vec.size() / 2);

    vector<int> left(vec.begin(), middle);
    vector<int> right(middle, vec.end());

    // Perform a merge sort on the two smaller vectors
    left = merge_sort(left);
    right = merge_sort(right);

    return merge(vec,left, right);
}
//! \brief Merges two sorted vectors into one sorted vector
//! \param left A sorted vector of integers
//! \param right A sorted vector of integers
//! \return A sorted vector that is the result of merging two sorted
//! vectors.
vector<int> merge(vector<int> &vec,const vector<int>& left, const vector<int>& right)
{
    // Fill the resultant vector with sorted results from both vectors
    vector<int> result;
    unsigned left_it = 0, right_it = 0;

    while(left_it < left.size() && right_it < right.size())
    {
        // If the left value is smaller than the right it goes next
        // into the resultant vector
        if(left[left_it] < right[right_it])
        {
            result.push_back(left[left_it]);
            left_it++;
        }
        else
        {
            result.push_back(right[right_it]);
            right_it++;
        }
    }

    // Push the remaining data from both vectors onto the resultant
    while(left_it < left.size())
    {
        result.push_back(left[left_it]);
        left_it++;
    }

    while(right_it < right.size())
    {
        result.push_back(right[right_it]);
        right_it++;
    }
    //show merge process..
      int i;
      for(i=0;i<result.size();i++)
         {                                
         cout<<result[i]<<" ";
         }
    // break each line for display purposes..
        cout<<"***********"<<endl; 

    //take a source vector and parse the result to it. then return it.  
    vec = result;               
    return vec;
}

The following is my code:

  5 typedef std::vector<int> int_v;
  6 
  7 int_v Merge(int_v& vec, int_v& l, int_v& r) {
  8   int_v res;
  9   res.reserve(l.size() + r.size());
 10   unsigned int li = 0;
 11   unsigned int ri = 0;
 12   if (l.size() > 1 && (l[l.size()-1] < r[0])) {
 13     res.insert(res.end(), l.begin(), l.end());
 14     res.insert(res.end(), r.begin(), r.end());
 15     return res;
 16   }
 17   while (li < l.size() && ri < r.size()) {
 18     if (l[li] < r[ri]) {
 19       res.push_back(l[li++]);
 20     } else {
 21       res.push_back(r[ri++]);
 22     }
 23   }
 24   while (li < l.size()) {
 25       res.push_back(l[li++]);
 26   }
 27   while (ri < r.size()) {
 28       res.push_back(r[ri++]);
 29   }
 30   vec = res;
 31   return vec;
 32 }
 33 
 34 int_v MergeSort(int_v& v) {
 35   if (1 == v.size()) {
 36     return v;
 37   }
 38   int_v::iterator m = v.begin() + v.size()/2;
 39   int_v l(v.begin(), m);
 40   int_v r(m, v.end());
 41 
 42   l = MergeSort(l);
 43   r = MergeSort(r);
 44   return Merge(v, l, r);
 45 }

The modifications are:

 9   res.reserve(l.size() + r.size());

This will help to avoid relocation of the vector each time when there will need to double the size to be able to store the next element.

 12   if (l.size() > 1 && (l[l.size()-1] < r[0])) {
 13     res.insert(res.end(), l.begin(), l.end());
 14     res.insert(res.end(), r.begin(), r.end());
 15     return res;
 16   }

I think this will help to avoid the loops at lines 17 - 29 when just concatenation of two sides is enough.

Running both versions Wikipedia's one and mine for five million randomly generated integers produces the following differences:

Wikipedia's example:

time ./a.out 
real    0m26.278s
user    0m26.205s
sys 0m0.064s

Mine:

time ./a.out 
real    0m22.129s
user    0m22.026s
sys 0m0.092s

Loki Astari · Answer 1 · 2015-05-11 20:49:09Z

Overall comment.

It would have been better if you had used iterators to implement the sort. Its a lot more versatile than using a specific container.

Memory Allocation

Your implementation requires a lot of dynamic memory allocation.

38 int_v::iterator m = v.begin() + v.size()/2; 39 int_v l(v.begin(), m); 40 int_v r(m, v.end());

Each recursive call make a copy of the data to be merged. So we get this progression.

    n + n/2 + n/4 + n/8 + n/16 ....
    =>  n(1 + 1/2 + 1/4 + 1/8 + 1/16.....)
    =>  2n

So you are using 2n size of data just through calling MergeSort and it is being allocated and deallocated as the std::vector is being used then destroyed on return.

In Addition you are doing another memory allocation in Merge()

int_v res;
res.reserve(l.size() + r.size());

Note:

By using iterators rather than containers. It becomes easy to avoid some of this copying. You just pass the ranges of the containers and do it in-place.

Prefer to move rather than copy.

30   vec = res;

Here you are copying the content of the array. Arrays can be moved (this means it swaps a couple of pointers rather than copying all the data). Since you are not using res after this point you should move it.

30   vec = std::move(res);

User defined types usually start with a capitol

typedef std::vector<int> int_v;

To distinguish user defined types and objects. Proceed types with an initial capitol letter.

Personal

I hated that you posted line numbers into the question.
That kind of things makes it hard to cut and paste your code into our editors for review.

Also if SE sites decide to add line numbers to their code your extra line numbers will become an eye sore.

My Interface would have been:

template<typename I>
void mergeSort(I begin, I end)
{
    auto size = std::distance(begin, end);
    auto mid  = begin;
    std::advance(mid, size/2);

    mergeSort(begin, mid);
    mergeSort(mid, end);
    merge(begin, mid, end);
}
template<typename C>
void mergeSort(C& cont)
{
    mergeSort(std::begin(cont), std::end(cont));
}

SpiderPig · Answer 2 · 2015-05-11 22:44:28Z

If your goal is speed then simple arrays will be fastest. Also it's sufficient to create a new array only once at the start of the algorithm. Here is some C code that's reasonably fast. Needs only 0.5s for 5 million numbers.

time ./a.out

real  0m0.528s
user  0m0.466s
sys   0m0.061s

#include <time.h>
#include <stdio.h>
#include <stdlib.h>
#include <string.h>

void bubbleSort(int *a, int start, int end) {
    int i, x, y;
    end--;
    while(end > start) {
        for(i = start; i < end; i++) {
            x = a[i], y = a[i+1];
            if(x > y) {a[i] = y; a[i+1] = x;}
        }
        end--;
    }
}

void merge(int *a, int start, int *a2, int start2, int mid, int end) {
    int i, i1, i2;
    i = start, i1 = start2, i2 = mid;
    while(i1 < mid && i2 < end) {
        if(a2[i1] <= a2[i2]) a[i++] = a2[i1++];
        else a[i++] = a2[i2++];
    }
    while(i1 < mid) {
        a[i++] = a2[i1++];
    }
    while(i2 < end) {
        a[i++] = a2[i2++];
    }
};

void _mergeSort(int *a1, int *a2, int start, int end) {
    int mid;
    if(end - start < 2) return;
    if(end - start < 8) { bubbleSort(a1, start, end); return; }
    mid = (start + end)/2;
    _mergeSort(a2, a1, start, mid);
    _mergeSort(a2, a1, mid, end);
    merge(a1, start, a2, start, mid, end);
};

void mergeSort(int *a, int length) {
    int *a2 = malloc(sizeof(int) * length);
    memcpy(a2, a, sizeof(int) * length);
    _mergeSort(a, a2, 0, length);
    free(a2);
};

void fillRandNumbers(int *a, int start, int end, int low, int high) {
  int i;
  for(i = start; i < end; i++) {
    a[i] = rand() % (high - low) + low;
  }
}

void main() {
  int i, N = 5000000;
  long sum;
  int *nums = malloc(sizeof(int) * N);
  srand(time(NULL));
  fillRandNumbers(nums, 0, N, 1, N + 1);

  mergeSort(nums, N);

  free(nums);
}

This answer has me confused: It's not C++ but C. Now, I know these languages are related, but your answer comes over as a code dump, not a review, and a code-dump-in-a-different language. Answers like this are typically deleted.... I'm going the down-vote route. — rolfl♦, 2 days ago

asked	3 days ago
viewed	67 times
active	2 days ago

current community

your communities

more stack exchange communities

Merge Sort algorithm

2 Answers 2

Overall comment.

Memory Allocation

Note:

Prefer to move rather than copy.

User defined types usually start with a capitol

Personal

My Interface would have been:

Your Answer

Not the answer you're looking for? Browse other questions tagged c++ algorithm sorting mergesort or ask your own question.

Hot Network Questions

current community

your communities

more stack exchange communities

Merge Sort algorithm

2 Answers 2

Overall comment.

Memory Allocation

Note:

Prefer to move rather than copy.

User defined types usually start with a capitol

Personal

My Interface would have been:

Your Answer

Sign up or log in

Post as a guest

Not the answer you're looking for? Browse other questions tagged c++ algorithm sorting mergesort or ask your own question.

Related

Hot Network Questions