Hadoop is an Apache open-source project that provides software for reliable and scaleable distributed computing. The project itself includes a variety of other complimentary additions.
0
votes
1answer
14 views
Why is Select Count(*) slower than Select * in hive
When i am running queries in VirtualBox Sandbox with hive. I feel Select count(*) is too much slower than the Select *.
Can anyone explain what is going on behind?
And why this delay is happening?
0
votes
0answers
8 views
Hadoop datanode blocks storing information
I want to find out how many blocks are stored in particular data nade in hadoop cluster. And which file those blocks are belongs to.. I have only two node cluster..
0
votes
0answers
6 views
Mahout Canopy Clustering, K-means Clustering : Java Heap Space - out of memory
I'm running Mahout 0.7 on an my cluster which has 30 nodes(each has 8 cores 16G memory) trying to canopy-cluster 250000 SparseVector(300000).
If I canopy-cluster finding small number of canopy ...
0
votes
0answers
13 views
How to load SQL data into the Hortonworks Sandbox?
I have Installed Hartonworks SandBox in my pc . also tried with a CSV file and its getting in a table structerd manner its ok(Hive + hadoop), nw i want to migrate my current sql Databse into ...
-1
votes
0answers
6 views
w/o superuser privilege, how can I limit the number of nodes participating a map-reduce task in hadoop? [closed]
I am trying to run a hadoop scalability experiment. I want to limit that the nodes participating in a map-reduce job are limited within a rack instead of the whole cluster? How can I do that per job ...
0
votes
1answer
11 views
Control the split size with Avro Input Format in Hadoop
I must read Avro record serialized in avro files in HDFS. To do that, I use the AvroKeyInputFormat, so my mapper is able to work with the read records as keys.
My question is, how can I control the ...
0
votes
0answers
10 views
Pig: Force one mapper per input line/row
I have a Pig Streaming job where the number of mappers should equal the number of rows/lines in the input file. I know that setting
set mapred.min.split.size 16
set mapred.max.split.size 16
set ...
-3
votes
0answers
14 views
represent graph matrix by adjacency list using mapreduce [closed]
I want to convert my graph represented by a dense matrix into adjacency list before I do some matrix computations using mapreduce. I want to represent it in the form ColumnID : RowID : Value. Please ...
0
votes
1answer
23 views
The type Mapper is not generic; it cannot be parameterized with arguments <LongWritable, Text, Text, IntWritable>
What could be the reason for the error? I am able to successfully compile on mac eclipse. But not on hadoop server which is running CDH4.
root@hadoop]# javac -cp ...
0
votes
2answers
38 views
NoClassDefFoundError in wordcount program
I'm running hadoop wordcount program. But it is giving me error like "NoClassDefFoundError"
command for running :
hadoop -jar /home/user/Pradeep/sample.jar hdp_java.WordCount /user/hduser/ana.txt ...
-3
votes
0answers
14 views
where are the web log files of adventureWorks databse located [closed]
can anyone please let me know where one can get the year wise web log files of AdventureWorks Database in csv or any other format which has to be used in Hadoop HDInsight for business analytics?
0
votes
0answers
15 views
Hadoop & Hbase Installation and configuration Issues in a single node cluster on window
I have recently started studying about NoSql and Big Data and decided to continue working on them. From couples of days I am trying to install and configure Hadoop and Hbase on my win2008 R2 64 bit ...
0
votes
0answers
21 views
Apache Hbase installation
I am trying to install Hbase(hbase-0.94.8) in ubuntu 12.04 enviroment.
I followed exactly the same steps given in this page
http://hbase.apache.org/book/quickstart.html
I am able to start Hbase and ...
0
votes
0answers
21 views
Datanode Dies After a Few Seconds
I am running Apache Hadoop version 1.0.4.
I followed a tutorial here: http://www.michael-noll.com/tutorials/running-hadoop-on-ubuntu-linux-single-node-cluster/ with some tweaks to set up Hadoop.
...
0
votes
1answer
25 views
Hive Insertion showing error
i have created a table in hive .
create table demo(no int, name string)
INSERT demo values (1,'haris')
But when iam inserting values in that table it showing the following error .
FAILED: Parse ...