See the tag entry for "gpu".
109
votes
16 answers
6k views
Why aren't we programming on the GPU? [closed]
So I finally took the time to learn CUDA and get it installed and configured on my computer and I have to say, I'm quite impressed!
Here's how it performs when rendering the Mandelbrot set at 1280 x 678 ...
29
votes
5 answers
4k views
How to use GPU for mathematics
I am looking at utilising the GPU for crunching some equations but cannot figure out how I can access it from C#. I know that the XNA and DirectX frameworks allow you to use shaders in order to ...
15
votes
7 answers
11k views
How do you get around the maximum CUDA run-time?
I've noticed that CUDA applications tend to have a rough maximum run-time of 5-15 seconds before they will fail and exit out. I realize it's ideal to not have a CUDA application run that long but ...
12
votes
4 answers
6k views
Run C# code on GPU
I have no knowledge of GPU programming concepts and APIs. I have a few questions:
Is it possible to write a piece of managed C# code and compile/translate it to some kind of module, which can be ...
10
votes
9 answers
2k views
How do I test OpenCL on GPU when logged in remotely on Mac?
My OpenCL program can find the GPU device when I am logged in at the console, but not when I am logged in remotely with ssh. Further, if I run the program as root in the ssh session, the program can ...
9
votes
5 answers
1k views
Is it possible to write GPU applications using CUDA in F#?
I am interested in using F# for numerical computation. How can I access the GPU using NVIDIA's CUDA standard from F#?
9
votes
3 answers
5k views
OpenCL and GPU programming Roadmap
I would like to start by stating that I know nothing about OpenCL/GPU programming, but I am an advanced C# (general .NET) programmer without fear of C++, and I would like to learn OpenCL/GPU programming... my ...
9
votes
2 answers
336 views
Good books and resources on data parallel programming and algorithms
I've read the following and most of the NVIDIA manuals and other content. I was also at GTC last year for the papers and talks.
CUDA by Example: An Introduction to General-Purpose GPU Programming
...
7
votes
2 answers
284 views
Looking for an API to employ the wonderful powers of the GPU from .NET
I'm looking for a nice API to use for implementing algorithms on the GPU. I would prefer something relatively simple and flexible. I'm not looking for the most hardcore performance, but for something ...
6
votes
1 answer
478 views
Is restrict(amp) more restrictive than CUDA kernel code?
In C++ AMP, kernel functions or lambdas are marked with restrict(amp), which imposes severe restrictions on the allowed subset of C++ (listed here). Does CUDA allow any more freedom on the subset of C ...
6
votes
3 answers
1k views
Getting started with GPU programming on Linux [closed]
I want to get started with GPU programming on Linux and have several questions:
What graphics cards are recommended? I'd like a fairly inexpensive one that supports CUDA and/or OpenCL. ...
6
votes
3 answers
3k views
How can I use the GPU as a second processor in .Net?
The question says it all, really. I'm hoping that I don't have to write the code in a C++ DLL and then call it from managed code.
6
votes
1 answer
796 views
Is there an algorithm for sorting an array of strings on the GPU?
The array to sort has approximately one million strings, where each string can be up to one million characters long.
I am looking for any implementation of a sorting algorithm for the GPU.
I have a block ...
5
votes
4 answers
6k views
Can I run CUDA on Intel?
I have a very simple Toshiba laptop with an i3 processor and no expensive graphics card. In the display settings, I see Intel(HD) Graphics as the display adapter. I am planning to learn ...
5
votes
3 answers
885 views
GPU Programming?
I'm new to the GPU programming world. I've tried reading Wikipedia and Googling, but I still have several questions:
I downloaded some GPU examples for CUDA; there were some .cu files and some ...
5
votes
4 answers
340 views
GPU reads from CPU or CPU writes to the GPU?
I am a beginner in parallel programming. I have a question that might seem silly, but I didn't get a definitive answer when I googled it.
In GPU computing there is a device, i.e. the GPU, and ...
5
votes
3 answers
4k views
CUDA Matrix multiplication breaks for large matrices
I have the following matrix multiplication code, implemented using CUDA 3.2 and VS 2008. I am running on Windows Server 2008 R2 Enterprise with an NVIDIA GTX 480. The following code works fine ...
5
votes
5 answers
3k views
OpenCL FFT lib for GPUs?
Is there a general FFT library available for running on the GPU using OpenCL? As far as I know, Apple's sample code for power-of-two OpenCL FFT is the only such code available.
Does any such ...
5
votes
1 answer
2k views
RAR password recovery on GPU using ATI Stream processor
I'm a newbie in GPU programming, and I am working on brute-force RAR password recovery on an ATI Stream processor using the Brook+ language, but I see that a kernel written in Brook+ doesn't allow any ...
4
votes
3 answers
15k views
printf inside CUDA __global__ function
I am currently writing a matrix multiplication on a GPU and would like to debug my code, but since I cannot use printf inside a device function, is there something else I can do to see what is going ...
4
votes
3 answers
6k views
libcudart.so.4: cannot find, Ubuntu 10.04
Hey, I'm trying to get my CUDA SDK samples running, but I get this error:
./bandwidthTest: error while loading shared libraries: libcudart.so.4: cannot open shared object file: No such file or ...
4
votes
4 answers
2k views
Java on GPU: complete method directly on GPU in plain Java
Firstly: is it possible to use Java and let it (partly) run on or use GPUs?
And if it is possible, can I use normal Java syntax rather than special CUDA or OpenCL syntax?
I just want ...
4
votes
2 answers
1k views
CUDA memcheck address - how to determine location in code?
cuda-memcheck is reporting this information for a release mode CUDA kernel:
========= Error: process didn't terminate successfully
========= Invalid __global__ read of size 4
========= at ...
4
votes
6 answers
4k views
Can we benchmark how fast CUDA or OpenCL is compared to CPU performance?
How much faster can an algorithm run as CUDA or OpenCL code compared to a single general-purpose processor core (assuming the algorithm is written and optimized for both the CPU and the GPU target)?
I know ...
4
votes
4 answers
479 views
Which Java code can be moved to the GPU?
With the Rootbeer framework, GPU programming in Java is possible.
Which Java code should be run through Rootbeer, and which code is better left running in the Java VM itself?
Or, put another way: which code produces ...
4
votes
2 answers
997 views
What are the programming languages for GPUs?
I read an article stating that GPUs are the future of supercomputing. I would like to know which programming languages are used for programming GPUs.
4
votes
2 answers
1k views
Is there really a timeout for kernels on NVIDIA GPUs?
While searching for answers as to why my kernels produce strange error messages or "0"-only results, I found this answer on SO that mentions a timeout of 5 s for kernels running on NVIDIA GPUs. I ...
4
votes
4 answers
931 views
Suggestions for a project in C++ / distributed systems / networks
I'd like to work on a 2-3 month long project (full time) that involves coding in C++ and is related to networks (protocol stacks). I was considering writing my own network stack but that doesn't seem ...
4
votes
3 answers
602 views
CUDA/OpenCL: rewriting branches as non-branching expressions
Most of the time a branch is required in a CUDA or OpenCL program, like:
    for (int i=0; i<width; i++)
    {
        if( i % threadIdx.x == 0)
            quantity += i*i;
    }
The code can always (or at least, most ...
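One way to picture the rewrite this question asks about is the host-side C++ sketch below. It is not taken from the question or its answers. The parameter tid is a hypothetical stand-in for threadIdx.x and is assumed to be at least 1, since i % 0 is undefined; the function names are made up for illustration. The second function replaces the if with a 0/1 mask built from the comparison result. Whether a compiler actually emits a branch for either form is up to the compiler, so this only shows the source-level transformation.

```cpp
#include <cstdint>

// Branching form, mirroring the loop in the question.
// tid is a hypothetical stand-in for threadIdx.x and must be >= 1.
std::int64_t sum_with_branch(int width, int tid) {
    std::int64_t quantity = 0;
    for (int i = 0; i < width; i++) {
        if (i % tid == 0)
            quantity += static_cast<std::int64_t>(i) * i;
    }
    return quantity;
}

// Non-branching form: the comparison evaluates to 0 or 1,
// and that value masks the term added in each iteration.
std::int64_t sum_without_branch(int width, int tid) {
    std::int64_t quantity = 0;
    for (int i = 0; i < width; i++) {
        const std::int64_t keep = (i % tid == 0);
        quantity += keep * static_cast<std::int64_t>(i) * i;
    }
    return quantity;
}
```

Both functions return the same value for any width and any tid >= 1; the masked version does the multiplication in every iteration instead of skipping it.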
4
votes
1 answer
151 views
Addition Assignment Operator in CUDA C
I'm experiencing a problem with the addition assignment operator in CUDA C. I'm getting the following error:
kernel.cu(5): error: expression must have integral or enum type
My code is:
import ...