Discussion related to cluster mechanisms.
0
votes
0answers
59 views
Microcomputer cluster for stress testing
I'm looking to build a microcomputer cluster at work for stress testing one of our applications, and while I have a little financial leeway, I would like to keep the cost low and the computing ...
0
votes
0answers
33 views
Powerful Workstation
I heard about ILM's (Industrial Light and Magic) workstations, on which they create models and scenes for a lot of different movies. They have workstations with more than 1024 GB of RAM and a lot of ...
0
votes
0answers
17 views
SGE to PBS/Torque qsub conversion
Hi, I am setting up a software pipeline (an extensive bioinformatics annotation tool) that was originally written to submit SGE jobs, but I am on a PBS/Torque cluster. Any ideas about a way to convert ...
2
votes
2answers
34 views
Gather job usage information on cluster
I am running simulations on a Linux cluster shared by hundreds of other users. There is a job scheduler installed on the cluster that determines who runs where, and one of the scheduler commands ...
34
votes
1answer
2k views
How do I distribute a large download over multiple computers?
I need to download a large file (1GB). I also have access to multiple computers running Linux, but each is limited to a 50kB/s download speed by an admin policy.
How do I distribute downloading this ...
2
votes
1answer
34 views
How much space for /home on a Hadoop cluster? [closed]
What is a reasonable size to provide for a /home partition for 100 users on a Hadoop cluster?
Assume that a landing zone has been provided to store files/data for ingestion into the cluster, so the ...
0
votes
1answer
72 views
Running shell script on cluster in parallel
I thought dsh (Distributed Shell) would be a good option, but when I ran my shell script across my nodes, I didn't get the expected output:
dsh -aM -c bash /home/cloudera/bash_script.sh
[email protected]: ...
0
votes
0answers
27 views
MySQL Cluster data nodes not started
I am working on MySQL Cluster in Fedora. I have three Fedora virtual machines. One is the management node and the other two are data nodes.
My management node started, but my data nodes did not start.
When ...
2
votes
1answer
54 views
SED and PDSH: adding a line to a file on a slave node in a cluster
I am trying to add a new line containing GATEWAY=10.0.10.1 to the /etc/sysconfig/network file on each of 32 slave nodes in a cluster.
I read this post on how to insert a line with sed.
This works for ...
3
votes
1answer
60 views
Configuration management, shared files, etc with a Laptop
I have a couple computers sitting around in my home network. They all run the same version of Ubuntu, and I would like to unify their configuration to some extent.
At my university, you can log in to ...
6
votes
3answers
334 views
Multiple tar processes writing to the same archive file at once
I am running many tasks on a Linux cluster. Each task creates many output files. When all tasks are finished, I run something like tar cf foo.tar output_files/ to create a tar archive. This is a very ...
2
votes
0answers
55 views
Takeover delay in Heartbeat HA cluster
I have set up a two-node heartbeat HA cluster, and both servers have the same ha.cf configuration:
#cat /etc/ha.cf
logfile /var/log/ha-log
autojoin none
bcast eth1
warntime 20ms
deadtime 40ms
...
0
votes
0answers
19 views
Is it possible to install RHEV on an el6 clustered computer?
Is it possible to use a Linux (el6) clustered computer as a hypervisor for RHEV 3.3/oVirt 3.3?
Because we cannot combine RAM/CPU from two or more hypervisors to be considered as one ...
2
votes
1answer
2k views
/usr/bin/xterm Xt error: Can't open display: /usr/bin/xterm: DISPLAY is not set?
I'm trying to submit a job to a school server (HPC) with:
#!/bin/bash
#$ -S /bin/bash
#$ -cwd
#$ -o ./out_$JOB_ID.txt
#$ -e ./err_$JOB_ID.txt
#$ -notify
#$ -pe orte 1
date
pwd
...
3
votes
1answer
35 views
NFS file with same name but different content depending on host
I have a cluster using NFS with a program, myprogram, which writes to a fixed location, e.g. /home/sharedfs/somedir/somefile.txt.
This means that if you run myprogram from each node, they all write to the ...
2
votes
1answer
57 views
CentOS VM 3972 Segmentation Fault
I'm trying to install MATLAB 2012 on CentOS 5.8 running on a VM cluster. I was able to mount the .iso file, but when I ran ./install, it was unable to write to /tmp, giving the error, "no space on ...
1
vote
0answers
32 views
HA Cluster in Linux [closed]
My question is:
Are there any other solutions for high-availability clustering in Linux, like Red Hat Cluster Suite?
Thanks.
2
votes
0answers
21 views
Heartbeat stops after reviving primary node
I followed these steps correctly, and everything worked well until I revived heartbeat on node01 (at the last steps of the tutorial).
When I ran /etc/init.d/heartbeat start, it started correctly but ...
0
votes
0answers
112 views
What software should I use to create an HPC (high-performance computing) cluster with CentOS 6.5 VMs locally?
I've read about the Torque and Slurm resource managers and about Maui, Open Grid Scheduler and Son of Grid Engine, but I do not know which is more appropriate to use in a local environment. I've also read that I need ...
0
votes
0answers
142 views
Red Hat/CentOS Cluster HA Web Server Issue “rgmanager[30244]: [script] Executing /etc/rc.d/init.d/httpd stop”
I'm trying to deploy a Red Hat cluster to set up an HA web server.
I followed the documentation below, at page 81:
...
3
votes
1answer
154 views
Fencing VM nodes
I'm using VMware Workstation as my hypervisor. I want to create a cluster with two nodes and shared storage on FreeNAS.
My questions are around fencing.
Is it possible to do it in this case?
...
1
vote
1answer
46 views
Will having dozens of inactive logins hurt performance?
I work on a cluster shared with ~20 users, many of them operating remotely. It's quite common to open the login node, type users and see ~100 users listed, mostly the same user logged in multiple ...
1
vote
1answer
325 views
What is the difference between qsub and ./
Can anyone tell me the difference between the following ways of submitting a script:
$ qsub script_name.sh
and
./script_name.sh
What are the differences between the above two ways of submitting ...
0
votes
0answers
52 views
Check/change reserve policy on a virtual disk
I need to check whether a virtual disk has this policy enabled; if so, I need to set reserve_policy=no_reserve. I need to create a DB2 database cluster.
I cannot find a Linux command for that.
I do know if this ...
1
vote
3answers
3k views
NFS Automatic Fail-over or Load-balanced or Clustering?
I've been using NFS (the only network file system I know well) for a few years, but I still have not been able to find out:
How to set up NFS servers in a clustered (or load-balanced?) design
Or at ...
1
vote
2answers
100 views
Can I tell useradd to execute a script? Maybe in /etc/default/useradd?
Plenty of people have already answered the question "how to execute useradd properly in a script?"
My question is: "is it possible to tell useradd to execute a script when it is ...
3
votes
1answer
534 views
Recommendations for replacing a GFS cluster?
I have a couple of CentOS GFS-clusters (GFS as in Global File System) using a shared disk in a Fibre Channel SAN. They are mature now, and the time has come to start planning for their replacement.
...
3
votes
0answers
257 views
Corosync/Pacemaker cluster loss of synchronization
I'm facing a problem with the corosync and pacemaker cluster software.
I've two virtual nodes (Scientific Linux 6.4) and corosync/pacemaker installed on them:
corosync-1.4.1-17.el6.x86_64
...
0
votes
1answer
585 views
Script for running sequential job
I recently started learning Linux. I want to write a bash script to run a serial job on a cluster. I have been searching for hints and instructions on how to write such a script. I ...
3
votes
1answer
280 views
How does ZFS grow/scale?
I am wondering how one would design a ZFS system to build something that is massively scalable, along the lines of your favorite cloud storage, but with all the neat benefits that come with ZFS.
A ...
1
vote
1answer
119 views
Strategies for building distributed computing system with hybrid Mac and Linux systems
I want to build a distributed computing system to run Matlab, C and other programming languages for scientific computing. I now have several old Mac machines with Mac OS X Lion installed, acting as web ...
4
votes
2answers
255 views
Set up Linux cluster like Windows with CentOS or Ubuntu? [closed]
Along the lines of this MSDN article, how would you set up a pair of Ubuntu or CentOS (or RHEL) servers in a cluster so they appear to hosted applications as a single server, but will continue to work ...
2
votes
2answers
619 views
How can I mount vxfs FS to two or more Solaris servers?
I want to have two Solaris servers share the same SAN vxfs filesystem, though only one would be accessing the share at a time. This is to allow for a quick failover in case the primary server ...
2
votes
1answer
112 views
mpd daemon prematurely ending jobs
I am a little out of my depth here so bear with me. I am trying to configure mpirun and mpiexec to run software called Materials Studio on a 1 node, 2 processor, 12 core cluster. The submission scheme ...
1
vote
1answer
36 views
Oracle 11gR2 grid silent installation: error for privateinterconnects
I am installing Oracle Grid 11gR2 using a silent install with a response file. I am getting an error after "Collecting interface information for node 1" in the logs.
I have a feeling that this could be due to ...
1
vote
1answer
190 views
HAProxy should not forward an incoming request to down instance
In my scenario, I have two servers, both communicating with HAProxy. When a request comes in, HAProxy splits the request and sends it to both servers, following a round-robin method. If in any case ...
-1
votes
1answer
117 views
Zero downtime on server [duplicate]
The scenario: I have two web servers, both running nginx, and I have configured HAProxy after the LB for both servers. I have tried heartbeat, but had no luck making downtime ...
9
votes
2answers
1k views
How to build a personal cluster?
I have a Debian server running some services (mail, web, git, mysql...). But it is alone, meaning that if it breaks, I have to repair it to access my services.
I'd like to add a second server so that ...
0
votes
3answers
946 views
Kill nohup command on cluster
Last night I launched a command via nohup on a cluster and then closed the connection. Today I would like to kill this job (I know it's still running because output is being generated in an output ...
2
votes
2answers
682 views
comm fails on bash variable input
I have a script that's supposed to get the list of files of two directories, get differences and execute some code for certain files.
These are the commands to get the file lists:
list_in=$(find ...
6
votes
1answer
570 views
Running parallel bash jobs on a HPC cluster using GNU parallel
On an HPC cluster I am trying to run multiple bash scripts (permute2.sh) from 1 bash script using GNU parallel, however it doesn't complete every job. It randomly completes one job, while it is stuck ...
8
votes
3answers
2k views
How to set up “poor man's” Linux desktop cluster?
There are several small Linux/Android devices on the market with a 1 GHz ARM processor, 512 MB of RAM and approx. 4 GB of storage for $35-$40. If I buy five of them, it's only around $200.
Let's say, we have ...
3
votes
0answers
223 views
Should I investigate Rocks or Ubuntu/Mosix for building a cluster?
I work at a college doing system administration. Someone in their lab is looking to build a cluster for running complex Octave and C++ projects.
They have brand new modern machines. My question is ...
1
vote
1answer
2k views
EMC storage + powercf command + powerpath
When I run powercf -q on my Solaris machine, what changes does this command make on the EMC storage, or on the Solaris file system?
From manual page:
DESCRIPTION
During system boot on Solaris hosts, ...
4
votes
3answers
2k views
On Ubuntu Linux, is it normal for mandb to run continuously (apparently in the background)?
I have access to an Ubuntu Linux node at my institution. The nodes are shared among the group, but typically I am the only person who uses this particular node.
I am running a calculation in parallel ...
2
votes
1answer
5k views
How do I download and launch a JNLP file in CentOS?
I'm using Amazon's GPU cluster to experiment with some HPC Java code on CentOS.
The Java application is in JNLP format and has a GUI.
I've followed Amazon's getting started video, but I'm not ...
0
votes
1answer
199 views
Samba on a Beowulf Cluster
Is this possible? I have only been able to locate resources related to running scientific applications. Any resources would be helpful.
Regards
1
vote
0answers
148 views
Getting from proficient to expert [closed]
I've got my vacation coming up and thought I might use it for something useful. Essentially, I've been a Linux user since around 1995, when I installed Slackware 2.0 on my computer, and have been ...
20
votes
1answer
23k views
How does tcp-keepalive work in ssh?
I am trying to write a shell script that uses an SSH connection for doing "heartbeats". I want to terminate the client and server sides of that connection after a certain timeout (after the connection ...
1
vote
4answers
1k views
How to find which machines are idle on a cluster of many nodes
I run measurements on a cluster consisting of 32 nodes/machines. I do not require all nodes, only four, for example. The problem is that most of the time, the nodes are busy with other people running their ...