The study of solving computational problems by exploiting the use of multiple processors simultaneously.

learn more… | top users | synonyms

2
votes
2answers
37 views

Speedup prediction of rotating mask filter

I am trying to do the speed up analysis of the rotating mask filter (section 4.2.3). Let $N^2$ be the pixels in the image and let $m^2$ be the neighborhood of a given pixel, what I have for my ...
1
vote
0answers
52 views

CUDA Mandelbrot Set effective bandwidth and optimization

I was reading through this article (very good article and excellent blog BTW) to do some measurements in my (very simple) implementation of the Mandelbrot Set. I'm using a Quadro 2000D card which has ...
1
vote
0answers
40 views

Is there MPI_Star instead of MPI_Win (window) for MPI_Get?

In FEM applications, each processor needs only data from neighboring processors, which gives a stencil or star pattern of communication. In MPI_Win for MPI_Get, one needs to setup a group of ...
-1
votes
0answers
67 views

Why are Computational Fluid Dynamics Problems Difficult to Solve in Parallel Processing? [closed]

I've heard this in distributed computing circles. Can you tell me why this is so? Are there any solutions? P.S: can I use the Finite Element method for a solution?
4
votes
2answers
80 views

Under which conditions is better OMP_NUM_THREADS = 4 or 2 for multithread two-core CPU?

I am running my Fortran 95 numerical codes, whose structure is most times easily parallelized by little more than simply adding the corresponding OpenMP instruction before some critical DO sentences, ...
1
vote
1answer
76 views

Memory management in time-domain simulation

I develop a parallel time-domain DAE simulation software using OpenMP and Fortran 2003. The main algorithm is: ...
3
votes
0answers
79 views

Best choise of solver for a large sparse symmetric (but not positive definite) system

I am presently working on solving very large symmetric (but not positive definite) systems, generated by some certain algorithms. These matrices have a nice block sparsity which can be used for ...
7
votes
3answers
148 views

Log-log parallel scaling/efficiency plots

A lot of my own work revolves around making algorithms scale better, and one of the preferred ways of showing parallel scaling and/or parallel efficiency is to plot the performance of an ...
4
votes
1answer
76 views

Parallel computation of big covariance matrices

We need to compute covariance matrices with sizes ranging from 10000x10000 to 100000x100000. We have access to GPUs and clusters, we wonder what is the best parallel approach for speeding up these ...
8
votes
4answers
153 views

Parallel (GPU) algorithms for asynchronous cellular automata

I have a collection of computational models that could be described as asynchronous cellular automata. These models resemble the Ising model, but are slightly more complicated. It seems as if such ...
1
vote
1answer
58 views

Multi-node jobs with Sun Grid Engine without MPI

I am trying to figure out how to write a proper job file for the Sun Grid Engine so I can run a parallel computation using Mathematica on multiple nodes. Unfortunately almost all tutorials on the ...
4
votes
1answer
91 views

MPI-based Implementations of FFT

In a numerical computation, I am required to take a multi-dimensional FFT on a distributed-memory cluster. The data is currently distributed using a distributed array in PETSc (DMDA). I initial ...
2
votes
0answers
13 views

An asynchronous version of the Covariance Matrix Adaption (CMA)?

Does anyone know of a variant of the CMA algorithm that is suitable for an asynchronous parallel implementation? The conventional version of the algorithm allows one to evaluate the fitness function ...
4
votes
1answer
119 views

Threaded OpenBlas benchmark slower than system blas?

I'm not sure if I did something wrong or if I just didn't understand the concept of an optimized BLAS. I'm a FEM engineer trying to optimize my setup on a small cluster computer (six nodes). I'm ...
0
votes
0answers
89 views

Determining Computing Requirements

I am looking at building/buying myself a workstation for scientific computing. I will be doing molecular dynamics simulations that are memory intensive (data on a large number of particles and a ...

1 2 3 4 5
15 30 50 per page