Tagged Questions
2
votes
1answer
732 views
How fast can one submit consecutive and independent jobs with qsub?
This question is related to pbs job no output when busy. i.e Some of the jobs I submit produce no output when PBS/Torque is 'busy'. I imagine that it is busier when many jobs are being submitted one ...
2
votes
7answers
6k views
Python compute cluster
Would it be possible to make a python cluster, by writing a telnet server, then telnet-ing the commands and output back-and-forth? Has anyone got a better idea for a python compute cluster?
PS. ...
8
votes
2answers
667 views
How to tell Condor to dispatch jobs only to machines on the cluster, that have “numpy” installed on them?
I just figured out how to send jobs to be processed on machines on the cluster by using Condor. Since we have a lot of machines and not each of those machines are configured the same, I was wondering:
...
1
vote
2answers
145 views
How can I cluster a list of a list of tuple (tag, probability)? - python
I have a bunch of text and they are classified into categories and then each document is tagged 0, 1 or 2 with a probability for each tag.
[ "this is a foo bar",
"bar bar black sheep",
"sheep is ...
1
vote
2answers
313 views
how can Python see 12 cpus on a cluster where I got allocated 4 cores by LSF?
I access a Linux cluster where resources are allocated using LSF, which I think is a common tool and comes from Scali (http://www.scali.com/workload-management/high-performance-computing). In an ...
0
votes
1answer
196 views
Hadoop and Python: Disable Sorting
I've realized that when running Hadoop with Python code, either the mapper or reducer (not sure which) is sorting my output before it's printed out by reducer.py. Currently it seems to be sorted ...
0
votes
1answer
111 views
Hadoop: Output file has double output
I am running a Hadoop program and have the following as my input file, input.txt:
1
2
mapper.py:
import sys
for line in sys.stdin:
print line,
print "Test"
reducer.py:
import sys
for line ...