Tagged Questions
HBase is the Hadoop database. Use it when you need random, realtime read/write access to your Big Data. This project's goal is the hosting of very large tables -- billions of rows X millions of columns -- atop clusters of commodity hardware.
9
votes
0answers
811 views
scans with asynchbase
I've started using the asynchbase library from StumbleUpon and ran into some problems with continued asynchronous scans. I wrote this code, based on my understanding of the library's principles:
public class ...
4
votes
0answers
196 views
Garbage Collection duration in Hadoop CDH5
We have a four-datanode cluster running CDH5.0.2, installed through Cloudera Manager parcels.
In order to import 13M users' rows into HBase, we wrote a simple Python script and used hadoop-streaming ...
3
votes
0answers
479 views
Not able to connect to remote EC2-hbase cluster from EMR hive cluster
Before I ask my question, let me first explain the scenario. I set up an HBase cluster on EC2 with 3 instances:
i-xxxxxxx-- master, zookeeper1, regionserver1
i-xxxxxxx-- slave1, zookeeper2, ...
2
votes
0answers
71 views
Manual fix of HBase table overlap (multiple regions have the same start key)
I was inserting data into HBase through the Java client when the region server suddenly crashed. I restarted HBase, but after that the HMaster was not running. When I run the ...
2
votes
0answers
102 views
Querying HBase Stargate for a hexadecimal rowkey via Ruby
I have an HBase table that (in part) uses hexadecimal bytes to construct its rowkeys. I'm able to query from the HBase shell just fine as follows:
get 'my_table', ...
2
votes
0answers
101 views
How to import CSV files from an FTP server into Hadoop (HDFS/Hive/HBase)?
My source files are located on an FTP server in CSV format. I want to import those files into HDFS for further processing. I don't know how to fetch those CSV files.
Can anyone help me with this?
2
votes
0answers
108 views
Pig, Oozie, and HBase - java.io.IOException: No FileSystem for scheme: hbase
My Pig script works fine on its own, until I put it in an Oozie workflow, where I receive the following error:
ERROR 2043: Unexpected error during execution.
...
2
votes
0answers
499 views
Unable to find region for hello_world
Versions: Hadoop 2.2, HBase 0.96.1, Pig 0.12
Whenever I run this Pig script
raw_data = LOAD 'sample_data.csv' USING PigStorage( ',' ) AS (
listing_id: chararray, fname: chararray, lname: ...
2
votes
0answers
2k views
(HBase) Error: JAVA_HOME is not set and Java could not be found
I have read some comments about setting JAVA_HOME. As I am a beginner at this, I did not edit anything without knowing what I was doing. I did not want to mess things up any further.
So if ...
2
votes
0answers
106 views
Updating an HBase table using Pig
I am aware that different columns of a row in HBase can be updated using Put. I am using Pig + HBase with HBaseStorage. I'm able to store and load in HBase without any issues with Pig expression where ...
2
votes
0answers
371 views
HBase NoServerForRegionException with region server working fine
I'm trying to run an HBase MapReduce job in pseudo-distributed mode and the following NoServerForRegionException occurred. I've checked the status of the master node on port 60010 and the region server on ...
2
votes
0answers
905 views
HBase - java.lang.NoClassDefFoundError in Java
I'm just trying to interact with HBase using Java:
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.hbase.*;
import org.apache.hadoop.hbase.client.*;
import ...
2
votes
0answers
331 views
HBase distributed log-splitting keeps failing because unable to get a lease
We used up all the free space on our test HDFS cluster so HBase crashed. After cleaning up some space, we were able to restart HBase, but after the startup a distributed log split job keeps failing.
...
2
votes
0answers
228 views
Java: Massive Class Collision
I am working on a web service that talks to Apache HBase. The HBase Maven artifact contains all sorts of web libraries, such as jersey-core, but I am using Apache CXF. When I compile and run my app, ...
2
votes
0answers
116 views
Is there any way to create an autoscaling Hadoop/HBase cluster?
I use AWS autoscaling for my web server and Apache Whirr for my Hadoop/HBase cluster.
Does Apache Whirr have an autoscaling property?
2
votes
0answers
181 views
cygbin: hostname nor servname provided, or not known
I installed Hadoop 1.0.3 successfully. Now I am running the HBase service (the start-hbase.sh command),
having made changes in hbase-env.sh and hbase-site.xml.
But this problem arises:
Please suggest how to resolve this ...
2
votes
0answers
2k views
Why is the NameNode not working in the given Hadoop setup?
I want to set up an HBase cluster of 2 nodes. For that, I first set up Hadoop. It works fine. NameNode, Secondary NameNode, DataNode, JobTracker, and TaskTracker are all working, but when I ...
2
votes
0answers
258 views
How to configure HBase using Cygwin?
I have configured Hadoop in Cygwin, and now I am trying to configure HBase.
I have made some changes in hbase-env.sh as follows:
export JAVA_HOME=C:\\java\\jre
export ...
2
votes
0answers
270 views
HBase ImportTsv java.lang.IllegalArgumentException: Can't read partitions file error
I have a distributed cluster, and when I run a bulk upload command using importtsv it gives me the java.lang.IllegalArgumentException: Can't read partitions file error. Moreover, it has a local job ...
2
votes
0answers
1k views
Not All Meta Regions Online Exception in HBase
I have a 4-node Hadoop/HBase cluster, of which 1 node serves as both Hadoop master and HBase master.
The other three serve as datanode, regionserver, and ZooKeeper.
Recently one of my machines running the ...
2
votes
0answers
535 views
AWS DynamoDB vs. HBase
I have been using HBase for the past six months and I came to know about DynamoDB by Amazon. Maintenance-wise, DynamoDB looks easier to handle since it's taken care of by Amazon. But whether to switch to ...
2
votes
0answers
334 views
What is the desirable log format for Flume's hbase() sink?
The log format I use is JSON and it has 15 key-value pairs:
...
2
votes
0answers
663 views
Pig accessing HBase using Spring Data Hadoop
Has anyone got experience of using Spring Data Hadoop to run a Pig script that connects to HBase using Elephant Bird's HBaseLoader?
I'm new to all of the above, but need to take some existing Pig ...
2
votes
0answers
210 views
HBase client thread stuck waiting on HBaseClient.call()
I have a standalone instance of HBase (single instance, on localhost, no Hadoop).
After reading a few thousand records using a scanner my thread gets stuck waiting, always on the same record.
...
1
vote
0answers
18 views
graceful_stop.sh not found in HDP2.1 HBase
I was reading the Hortonworks documentation on removing a regionserver from any host of the cluster ...
1
vote
0answers
29 views
How do I do Capacity Planning for HBase?
I am planning to use HBase for a social application. The starting user base would be around 5k, and it would keep growing by around 3k per month.
Which approach would increase the ...
1
vote
0answers
20 views
My Node.js HBase code doesn't retrieve data
I work with HBase and Node.js. I wrote this code:
var hbase = require('hbase');
http = require('http');
http.createServer(function (request, response) {
hbase()
.getRow('DatosBicing', '1*')
...
1
vote
0answers
103 views
HBase 0.96 with Spark v 1.0+
This combination of HBase / Spark versions appears to be pretty toxic. I have spent hours trying to find various MergeStrategy settings that would work, but to no avail.
Here is the core of the present ...
1
vote
0answers
32 views
HBase lookup performance
I have an HBase instance hosting around 80G of data (around 110 million rows). Each row has a variable number of columns. I want to use this HBase instance primarily for key lookups (around 10 million lookups) while ...
1
vote
0answers
15 views
difference between org.apache.hbase -hbase and -hbase-client
I'm working with a project that uses HBase and need an API for working with both data (Put and other classes) and schema (HTable, HColumn etc). I found two Maven dependencies to work with it:
...
1
vote
0answers
43 views
Connecting to HBase in KNIME
I am using KNIME 2.9.4 and I have HBase installed (version 0.94.8) on a remote Linux server. With the host IP, I am trying to connect to HBase.
Here is what I did.
Added the "hbase-0.94.8.jar" in ...
1
vote
0answers
41 views
Compatibility issue between HBase 0.94.2 and Apache Nutch dependency
I am trying to install Apache Nutch 2.2.1 and have successfully built it after making the required changes in the configuration files by following ...
1
vote
0answers
42 views
HBase custom filter not working
I'm trying to create a custom filter on HBase 0.98.1 in standalone mode on Ubuntu 14.04.
I created a class extending FilterBase. I put the jar in HBASE_HOME/lib. Looking in the logs, I see that my ...
1
vote
0answers
67 views
Hadoop and HBase rebalancing after node additions
I have a fundamental question about the load balancer. I just finished adding new nodes to our Hadoop (2.3) cluster, which also has HBase v0.98. After the addition and having all nodes online in Hadoop and ...
1
vote
0answers
40 views
How to create a custom HBase command in JRuby?
I am new to Hadoop and to IT as a whole. I want to know whether I can create a custom HBase command similar to the already available scan and put commands. I have a sample JRuby script, client.rb, that ...
1
vote
0answers
113 views
Apache Nutch Error
I am completely new to Nutch, Solr, and HBase. I have installed the following:
apache-nutch-2.2.1,
hbase-0.9.4, and
solr-3.6.2.
I am currently running OS X Mavericks and I have Java 1.7 installed
...
1
vote
0answers
14 views
Not finding an existing table in HBase with data stored by a WebSphere 8 application
I'm experiencing an issue when I attempt to start up an application on WebSphere which stores its data in HBase. I've checked HBase and found that the database is intact and no inconsistencies were ...
1
vote
0answers
18 views
Hadoop FileFormat for Null fields
I was thinking of storing the CSV file below in any format available in HDFS.
H1 H2 H3 H4
Row1 1 NA 8.6 65
Row2 2 NA NA 56
Row3 4.6 7.5 NA 65
I know it's very easy we ...
1
vote
0answers
41 views
Efficient search in nested HBase entities
If I follow Ian Varley's HBase design practice and store a bunch of nested entities in the same HBase entity (to benefit from HBase's single row ACID properties), would it be possible to efficiently ...
1
vote
0answers
325 views
Hive query with where clause not working
I am querying an external HBase table from Hive.
When I do a simple query
select * from Document_Table_Hive
the query works and I get the records stored in the table.
But when I do a query with ...
1
vote
0answers
45 views
HBase configuration without using HDFS
How can I make an hbase-site.xml?
What is its format?
How does my client connect to the HBase DB?
My format is:
<?xml version="1.0"?>
<?xml-stylesheet type="text/xsl" ...
1
vote
0answers
22 views
Is it possible to specify which tasktrackers to use in a MapReduce job?
We have two types of jobs in our Hadoop cluster. One job uses MapReduce HBase scanning, the other one is just pure manipulation of raw files in HDFS. Within our HDFS cluster, part of the datanodes are ...
1
vote
0answers
161 views
java.io.IOException:java.lang.ClassCastException: org.apache.hadoop.hbase.client.Result cannot be cast to org.apache.hadoop.io.Writable
I tried a sample of handling an HBase table from Hive.
The CREATE EXTERNAL TABLE command was successful, but the select statement gives a class cast exception
ENV:
hive 0.12.0, hbase 0.96.1, ...
1
vote
0answers
52 views
How to perform GET using hbase rest with hashed rowkey
I have the following row key: \xFE\xA1\x52\x272014-02-14T00:00:00Z
I am trying to use hbase rest to perform a GET.
Performing the following, I get 404 Not Found:
curl -v -X GET http://{server ...
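One thing worth checking for the row key above is whether its binary bytes are percent-encoded in the request URL. The sketch below assumes that the `\xNN` sequences in the key denote raw bytes and that the REST gateway accepts a percent-encoded row key in the path; the host, port, and table name are hypothetical placeholders, not values from the question.

```python
from urllib.parse import quote


def encode_row_key(raw_key: bytes) -> str:
    """Percent-encode every byte of the row key that is not URL-safe."""
    # safe="" also forces "/" and ":" to be encoded.
    return quote(raw_key, safe="")


# Row key from the question: four binary bytes followed by an ISO timestamp.
row_key = b"\xFE\xA1\x52\x27" + b"2014-02-14T00:00:00Z"
encoded = encode_row_key(row_key)

# Hypothetical REST endpoint and table name, for illustration only.
url = "http://rest-host.example:8080/my_table/" + encoded
print(url)
```

Note that `\x52` is the printable character `R`, so it passes through unchanged, while `\x27` (a single quote) and the colons in the timestamp are escaped.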
1
vote
0answers
172 views
HBase: adding Phoenix JARs under Cloudera
I'm looking for a way to add the Phoenix Project (http://phoenix.incubator.apache.org/) JARs to an HBase region server running under Cloudera CDH 4.5.
There are several mentions in the googlecanon ...
1
vote
0answers
58 views
Generate HFileOutputFormat after the Reducer
I have written a MapReduce job that needs both the Mapper and the Reducer parts. After the Reducer (which generates the Puts) I would like to save the result into HFileOutputFormat to do an incremental ...
1
vote
0answers
137 views
HBase/Zookeeper Class Not Found Exception
I'm running a standalone instance of HBase on Windows using Cygwin. When I try to start HBase I get the exception below. I have HBASE_CLASSPATH set in my hbase-env.sh file ( export ...
1
vote
0answers
27 views
Nutch updatedb WrongRegionException
I ran generate-fetch-parse, and after the update I got this exception.
2014-01-23 13:40:56,905 ERROR store.HBaseStore - Failed 747 actions: WrongRegionException: 747 times, servers with issues: ...
1
vote
0answers
43 views
HBase with a distributed file system?
Well, it's quite clear that HBase is a database that saves its files in HDFS. Can HBase also be integrated with other distributed file systems? If yes, then what would be the underlying approaches? For ...
1
vote
0answers
125 views
HBase scan and retrieve single columns
I am trying to understand the data model that HBase uses in storing its data, in order to plan a more effective organization of application data. In particular I am trying to understand whether it is ...