Project Euler #3 - largest prime factor

Question

I was going through the Project Euler problem #3 and made a program to solve it. The problem is as follows:

The prime factors of 13195 are 5, 7, 13 and 29. What is the largest prime factor of the number 600851475143 ?

My code is as follows:

#include<iostream>
#include<vector>
using std::cout;
using std::endl;
using std::vector;
typedef long long int llint;

vector<llint> computeFactors(llint);
vector<llint> computePrimeFactors(vector<llint>&);
llint getLargestPrimeFactor(const llint);
bool isPrime(const llint);

int main() {
   llint num = 600851475143;
   llint prime;
   cout<<"The number to factorize is : "<<num<<endl;
   //call the function getLargestPrimeFactor to get the factor
   prime = getLargestPrimeFactor(num);
   cout<<"The largest prime factor is : "<<prime<<endl;
   return 0;
}

//////////////////////////////////////////////////////////////
////////// GET THE LARGEST PRIME FACTOR BY MAGIC /////////////
//////////////////////////////////////////////////////////////

llint getLargestPrimeFactor(const llint n) {
   vector<llint> list, primeList;

   list = computeFactors(n);
   //if n itself is prime then return n;
   if(list.size()==2) {
      return *(primeList.end());
   } else {
      primeList = computePrimeFactors(list);
   }
   for(vector<llint>::iterator i=primeList.begin(); i<primeList.end(); i++) {
      cout<<*i<<endl;
   }
   vector<llint>::iterator i = primeList.end() - 1;
   return *i;
}

//////////////////////////////////////////////////////////////
/////////// GET THE FACTORS  /////////////////////////////////
//////////////////////////////////////////////////////////////

vector<llint> computeFactors(llint n) {
   vector<llint> list;
   //iterate through every number in the range
   for(llint i=1; i<=n; i++) {
      //find out whether any of the number divides n
      if(n%i==0) {
         //if it does push it into the vector
         list.push_back(i);
         n=n/i;
      }
   }
   return list;
}

//////////////////////////////////////////////////////////////
/////////// GET THE PRIME FACTORS ////////////////////////////
//////////////////////////////////////////////////////////////

vector<llint> computePrimeFactors(vector<llint>& factorList) {
   vector<llint> prime;
   //iterate through every number in the vector
   for(vector<llint>:: iterator i=factorList.begin();
                               i<factorList.end();
                               i++)
   {
      //check whether each of the element is prime
      if(isPrime(*i)) {
         prime.push_back(*i);
      }
   }
   return prime;
}

//////////////////////////////////////////////////////////////
/////////// WHETHER A PRIME NUMBER ///////////////////////////
//////////////////////////////////////////////////////////////

bool isPrime(const llint n) {
   int count = 0;
   for(llint i=1; i<=n; i++) {
      if(n%i==0) {
         count++;
      }
   }
   if(count == 2) {
      return true;
   } else {
      return false;
   }
}

My doubts start from here:

I called a single function from main() that gives me the result.
Now that function calls other functions from itself rather than in main().
Those sub-functions call other functions to achieve the task.

So now while designing the program, I just had to think about a single function that gave me the result. Now I decompose the function into different parts.

So when I call the getLargestPrimeFactor(), others are implicitly called.

But what I could have done is that I could have called each function explicitly in main().

Is there a difference in between the two approaches ? Does it really matter ? In what cases thinking like this can be harmful ? I have studied a bit about cohesion and coupling. I just want to get the habits right from the beginning.

what do you know about all of the factors you are testing above the square root of n? — Grady Player, 18 hours ago
This code is very inefficient. Try it on 600851475177 instead and it takes forever. — SpiderPig, 13 hours ago
@SpiderPig - I am sorry but I am quite new to programming. Can you please give me a hint for a faster algorithm ? :) — user2040026, 11 hours ago

Nobody · Accepted Answer · 2014-04-11 13:52:38Z

Let me start off by general reviewing and end with your specific question:

Documentation

It is good that you thought about documenting with comments. However, the ones you made can be improved.

Function comments document the function interface and as such should be located where the interface is declared (so at the beginning of your file).

They also should not document the obvious ("GET THE LARGEST PRIME FACTOR" for getLargestPrimeFactor) or irrelevant ("my magic").

They should state pre- and postconditions of the function and what the function does (if there is more to it than the function's name).

Think about adopting the use of computer readable documentation systems like doxygen to let IDE's and other tools aid you with correct documentation.

Maximize constness

For example in vector<llint> computePrimeFactors(vector<llint>& factorList) you don't write to the factorList so it should really be defined as const vector<llint>& factorList to indicate that. This makes the code easier to reason about.

Readability

If it is possible try to use C++11 which gives some nice features that improve readability. For example this:

for(vector<llint>:: iterator i=factorList.begin();
                               i<factorList.end();
                               i++)
   {
      //check whether each of the element is prime
      if(isPrime(*i)) {
         prime.push_back(*i);
      }
   }

can become this in C++11:

for(auto const &i : factorList)
{
  //check whether each of the element is prime
  if(isPrime(i)) {
     prime.push_back(i);
  }
}

You can access the first and last element of most containers without using iterators by front() and back() so you could write:

return primeList.back();

instead of:

vector<llint>::iterator i = primeList.end() - 1;
return *i;

Efficiency

At first you calculate some of the divisors (in getFactors) of the number and store them in a list. Then you process this list and pick out the primes with isPrime (in getPrimeFactors) by trying to divide them by all numbers below them.

But actually you could have obtained all prime factors in the first function (getFactors) if you had divided as long as it was possible:

if(n % i == 0) {
     //if it does push it into the vector
     list.push_back(i);
     //remove this prime factor from the number
     while(n % i == 0)
         n /= i;
}

Thereby you would avoid the necessity to later check if the factors are prime.

Naming

Your function computePrimeFactors is misnamed. Actually it should be called something like filterPrimes because it gets a list of numbers and returns the primes inside this list. The better approach for this name would be to get the number as parameter and return the prime factors while internally calling computeFactors:

vector<llint> computePrimeFactors(llint n) {
   vector<llint> factors = computeFactors(n);
   vector<llint> primeFactors;
   // c++11!
   std::copy_if(factors.begin(), factors.end(),
                std::back_inserter(primeFactors), 
                &isPrime);
   return prime;
}

I have also replaced the copy code by a stdlib function, which reduces the risk of programming errors while (usually) increasing readability.

Your question

Finally I will answer your question: In general it is good to break up logic units into functions that define clear boundaries and give a name to the unit. As such you would even further reduce the main function by making it's body another function that gets called from main. This allows reuse of the code later on.

However, in your code it is unlikely to be needed again, so it is not necessary to do this. I would however state, that you should not put more into main.

Excellent answer. Thank you so much. :) – user2040026 yesterday

BenVlodgi · Answer 2 · 2014-04-11 14:40:08Z

As far as the methods you've created and are calling I see no problem with (as per your doubts). Functions calling functions is a good thing. Functions should pull all of their own weight, calling other functions when necessary, the coder shouldn't have to call those functions beforehand and pass in the parameters, unless of course there is a situation where you (the coder) do at times need to specify specific running values.
Your main looks good, however as far as aesthetics of your code goes, I would add spaces between your operators and symbols...
```
//this
cout<<"The largest prime factor is : "<<prime<<endl;

//becomes
cout << "The largest prime factor is : " << prime << endl;
```
This lack of spacing is in other parts of the code, but I'm not going to point out everywhere.

Aesthetically this block of code could be re-factored a few different ways.

//Use one liners and no ugly brackets.
if(list.size() == 2) 
   return *(primeList.end());
else
    primeList = computePrimeFactors(list);

//Take advantage of the fact that you don't need an else after a return.
if(list.size() == 2) 
    return *(primeList.end());
primeList = computePrimeFactors(list);

Use shortcut operators
```
n /= i;
```
Don't do extra work. There are two things to change about the code below.
```
if(count == 2) {
    return true;
} else {
    return false;
}
```
Kill the brackets as I did earlier. (opinion, and optional)
```
if(count == 2)
    return true;
else 
    return false;
```
Now that we can see this code clearly, we can realize what it is actually doing. count == 2 yields a Boolean value which is then passed to the special if function, which then asks if the Boolean passed in was true, if so.. then return true, else false (oversimplification of what is happening on the low level, but you get the gist) count == 2 is a Boolean, so instead of using that in an if... just return the Boolean you got from using that operator.
```
return count == 2;
```

"Kill the brackets" is against many guidelines (most?). Many reviews here would have said instead "add brackets" in case there weren't already. — Morwenn, yesterday
@Morwenn it an opinion based and there is no definitive answer on that, however I highly recommend it, as it clears up the code from pointless fluff and allows you to more simply see what is going on. Most will agree that you shouldn't mix these two approaches within the context of a single if-else, so not to have brackets for half and not for the second, in this particular case, the brackets were serving one purpose... to get in the way of anyone trying to read the code. — BenVlodgi, yesterday
On the other hand, you can later add statements to the body of your if or else without risking to forget the braces, which is safer. — Morwenn, yesterday

Streppel · Answer 3 · 2014-04-11 19:12:41Z

This is absolutely fine. By calling only one function in main and letting it handle what it has been asked for, you are achieving the concept of abstraction, leaving out the unnecessary crunchy-details and just saying what you want, which is the largest prime factor of a given number. The user of this method does not need to know anything about how it works internally.

Abstraction is one of the key concepts of OOP, and since you are programming in C++, you'll start to see the benefits of using such patterns soon, when you start to use classes that can grow out bigger and more complex than expected. Then, you'll need to use encapsulation, which is achieved through abstraction. So, the earlier your adopt this kind of pattern, the easier it will be in the future for you to resolve other issues like these.

Now, regarding your code: one thing that I'd definitely do is drop these /* magic stuff happens here */ comments, which do more harm than good. They do not explain anything and just yell that this code probably smells. Also, you should pay more attention to your one-character-name variables, like n, which does not mean much and is hard (actually impossible, depending on the size of your code) to find through CTRLF if needed.

If you have access to C++11, you could also take a peek on it's features to improve your code. For example, loops such as

for(vector<llint>:: iterator i=factorList.begin();
                               i<factorList.end();
                               i++) {}

could become ranged-for loops

for(auto i: factorList) {}

And I had this major confusion with whether to use one lettered variable names or not. You cleared it up. — user2040026, yesterday

paercebal · Answer 4 · 2014-04-11 13:44:16Z

up vote 2 down vote

Naming

What's the name of your function? "getLargestPrimeFactor"? So it must do all the work to solve the problem.

More generally, you want to make your code easy to use, and, if the user doesn't care about how it is done, providing one function with a correct name to do all the work is a welcome sight.

Customization

In one hand, you must provide a robust, bug-resistant, easy way for the user to get the result (here, the largest prime factor).
In the other, you could want to provide a way for the users to call the functions as it suits them. This should be optional, and this can be done different ways, depending on the type of customization you want to provide.

your priority is to provide the first part: A user should be able to easy have the expected result, the same way starting your car is just turning the key (or pressing the button).

If you really want to provide customization, then you must think carefully about the kind of customization you can, and thus, common idioms will apply.

For example, one naive solution would be to provide one class with all customization points being virtual methods. The users can then override the method they want to customize, while leaving the others as is. This needs research to find the right solution to your specific problem.

edited yesterday

answered yesterday

paercebal
1916

Could you explain what you mean in your naming paragraph? I feel the getLargestPrimeFactor does what it is supposed to do. Also, it looks as if customization points are a bit overkill for some simple arithmetic calculations. What would you customize in this special instance (could you give a code example)? – Nobody yesterday

@Nobody : My point with the "naming" part is that if one function says it will do something, then it must do it. If someone needs a getTheJobDone function, then providing doFirstPart, doSecondPart and doThirdPart instead, needing the caller to call the three functions one after the other is perhaps misguided... About the customization, yes, it is overkill, but this is how I understood the question: In a more general problem, this is one way to provide customization. I'll think about a code example as soon as I am able to. – paercebal yesterday

If these functions are there to solve subproblems, then why not? The getLargestPrimeFactor function does to the user exactly what is needed: It returns the largest prime factor of a number. If you mean computePrimeFactors then I would be on your side :) This one should actually call computeFactors inside and only get the number as input. – Nobody yesterday

add comment

AdmiralAdama · Answer 5 · 2014-04-11 16:15:41Z

I think it's actually a really good habit to call functions from within functions. This is a very important concept in Object-Oriented Programming. If you wrote this source code using OOP, you might structure it like this...

Class Main
- main()
Class PrimeFactors
- public getLargestPrimeFactor()
- private computeFactors()
- private computePrimeFactors()
- private isPrime()

As you can see, in this class any method called from Main is declared as public, and any method only called from within the class is declared as private.

Structuring this program as a class instead of a series of functions makes it easy to re-use the code. It makes Class PrimeFactors like a library that you can easily include into any main that you want.

I agree with the others about commenting less. I was reading Clean Code by Robert C. Martin the other day, and he argues that comments should be minimal and pragmatic. Ideally you want to choose very succinct names for your variables and functions/methods, and you want to make as many chunks of code into functions/methods as possible. And what this does is it names everything, which helps to avoid you needing to make as many comments in the first place. I think you're already on the right track with this because you wrote multiple functions and you tried to give them all good names.

asked	yesterday
viewed	698 times
active	today

current community

your communities

more stack exchange communities

Project Euler #3 - largest prime factor

5 Answers

Documentation

Maximize constness

Readability

Efficiency

Naming

Your question

Naming

Customization

Your Answer

Not the answer you're looking for? Browse other questions tagged c++ design-patterns primes functions programming-challenge or ask your own question.

Hot Network Questions

current community

your communities

more stack exchange communities

Project Euler #3 - largest prime factor

5 Answers

Documentation

Maximize constness

Readability

Efficiency

Naming

Your question

Naming

Customization

Your Answer

Sign up or log in

Post as a guest

Not the answer you're looking for? Browse other questions tagged c++ design-patterns primes functions programming-challenge or ask your own question.

Related

Hot Network Questions