HBase is the Hadoop database. Use it when you need random, realtime read/write access to your Big Data. This project's goal is the hosting of very large tables -- billions of rows X millions of columns -- atop clusters of commodity hardware.

learn more… | top users | synonyms

9
votes
0answers
811 views

scans with asynchbase

I'm started to use asynchbase library from StumbleUpon and got some problems with continued asynchronous scans. I wrote this code, based on my understanding of library principles: public class ...
4
votes
0answers
196 views

Garbage Collection duration in Hadoop CDH5

We have a four-datanodes-cluster running CDH5.0.2, installed through Cloudera Manager parcels. In order to import 13M users' rows into HBase, we wrote a simple Python script and used hadoop-streaming ...
3
votes
0answers
479 views

Not able to connect to remote EC2-hbase cluster from EMR hive cluster

Before i ask my question let me first explain the scenario. I had setup the hbase cluster on ec2 with 3 instances: i-xxxxxxx-- master, zookeeper1, regionserver1 i-xxxxxxx-- slave1, zookeeper2, ...
2
votes
0answers
71 views

Manual Fix of Hbase table Overlap (Multi region has same start key)

I was inserting the data into Hbase through the java client. But, suddenly the Region server crashed at a point. So i restarted the Hbase, which after that the Hmaster was not running. When i run the ...
2
votes
0answers
102 views

Querying HBase Stargate for a hexadecimal rowkey via Ruby

I have an HBase table that (in part) utilizes hexadecimal bytes to construct its rowkeys. I'm able to query from the Hbase Shell just fine as follows get 'my_table', ...
2
votes
0answers
101 views

How to import csv files from ftp server to hadoop(HDFS/HIVE/HBASE)?

My source files are located in ftp server in csv format. I want to import those files to HDFS for further processing. I am not aware about how to get those csv files. Can anyone help me on this?
2
votes
0answers
108 views

Pig, Oozie, and HBase - java.io.IOException: No FileSystem for scheme: hbase

My Pig script works fine on its own, until I put it in an Oozie workflow, where I receive the following error: ERROR 2043: Unexpected error during execution. ...
2
votes
0answers
499 views

Unable to find region for hello_world

Versions: Hadoop 2.2, Hbase 0.96.1, Pig 0.12 Whenever I run this pig script raw_data = LOAD 'sample_data.csv' USING PigStorage( ',' ) AS ( listing_id: chararray, fname: chararray, lname: ...
2
votes
0answers
2k views

(HBase) Error: JAVA_HOME is not set and Java could not be found

I have read some comments about setting JAVA_HOME. As I am a beginner in this,I did not edit anything without any knowledge of what I was doing. I did not want to mess things up more than this. So if ...
2
votes
0answers
106 views

updating Hbase table using pig

I am aware that different columns of a row in HBase can be updated using Put. I am using Pig + HBase with HBaseStorage. I'm able to store and load in HBase without any issues with Pig expression where ...
2
votes
0answers
371 views

HBase NoServerForRegionException with region server working fine

I'm trying to run a HBase MapReduce job in the pseudo-distributed mode and the following NoServerForRegionException occurred. I've checked the status of master node on port 60010 and region server on ...
2
votes
0answers
905 views

HBase - java.lang.NoClassDefFoundError in java

Im just trying to interact with Hbase using Java import org.apache.hadoop.conf.Configuration; import org.apache.hadoop.hbase.*; import org.apache.hadoop.hbase.client.*; import ...
2
votes
0answers
331 views

HBase distributed log-splitting keeps failing because unable to get a lease

We used up all the free space on our test HDFS cluster so HBase crashed. After cleaning up some space, we were able to restart HBase, but after the startup a distributed log split job keeps failing. ...
2
votes
0answers
228 views

Java: Massive Class Collision

I am working on a web service that talks to Apache HBase. The HBase Maven artifact contains all sorts of web libraries, such as jersey-core, but I am using Apache CXF. When I compile and run my app, ...
2
votes
0answers
116 views

Is there any way to create an autoscaling Hadoop Hbase cluster

I used aws autoscaling for my webserver and Apache whirr for hadoop habse cluster. Do Apache whirr has an autoscaling property.
2
votes
0answers
181 views

cygbin: hostname nor servname provided, or not known

Installing hadoop1.0.3 successfully. Now run Hbase service(start-hbase.sh command) Having changes in hbaseenv.sh and hbasesite.xml But the problem arises: Please suggest how to resolve this ...
2
votes
0answers
2k views

Why Namenode is not working in Hadoop given setup

I want to establish HBase cluster of 2 nodes. For that I first establish Hadoop setup. It works fine. Namenode, Secondary namenode, datanode, Jobtracker, Tasktracker, all are working, but when I ...
2
votes
0answers
258 views

How to configure hbase using cygwin?

i have configured my Hadoop in cygwin now i m trying to configure Hbase i have made some changes in hbase-evn.sh as: export JAVA_HOME=C:\\java\\jre export ...
2
votes
0answers
270 views

Hbase Importtsv java.lang.IllegalArgumentException: Can't read partitions file error

I have a distributed cluster and when a run a bulk upload command using importtsv it give me the java.lang.IllegalArgumentException: Can't read partitions file error and moreover it has a local job ...
2
votes
0answers
1k views

Not All Meta Regions Online Exception in hbase

I have a 4 node hadoop,hbase cluster of which 1 serves as hadoopmaster & hbasemaster. The rest three server as datanode & regionserver & zookeeper. Recently one of my machine running the ...
2
votes
0answers
535 views

AWS DynamoDB VS HBase

I have been using HBase for the past six months and I came to know about DynamoDB by Amazon. Maintenance wise dynamo db looks easier to handle since its taken care by Amazon. But whether to switch to ...
2
votes
0answers
334 views

What is the desirable log format for Flume's hbase() sink?

The log format I use is JSON and it has 15 key-value pairs: ...
2
votes
0answers
663 views

Pig accessing HBase using Spring Data Hadoop

Has anyone got experience of using Spring Data Hadoop to run a Pig script that connects to HBase using Elephant Bird's HBaseLoader? I'm new to all of the above, but need to take some existing Pig ...
2
votes
0answers
210 views

HBase client thread stuck waiting on HBaseClient.call()

I have a standalone instance of HBase (single instance, on localhost, no Hadoop). After reading a few thousand records using a scanner my thread gets stuck waiting, always on the same record. ...
1
vote
0answers
18 views

graceful_stop.sh not found in HDP2.1 Hbase

I was reading Hortonworks documenrtation to remove regionserver from any host of cluster ...
1
vote
0answers
29 views

How do I do Capacity Planning for HBase?

I am planning to use HBase for a social application. The starting user base would be around 5k and monthly it would keep increasing on around some 3k per month. Which approach would increase the ...
1
vote
0answers
20 views

My node.js hbase code don't retrieve data

I work with hbase and node.js. I write my code: var hbase = require('hbase'); http = require('http'); http.createServer(function (request, response) { hbase() .getRow('DatosBicing', '1*') ...
1
vote
0answers
103 views

Hbase 0.96 with Spark v 1.0+

This combination of Hbase / Spark versions appears to be pretty toxic. I have spent hours trying to find various MergeStrategy's that would work but to no avail. Here is the core of the present ...
1
vote
0answers
32 views

hbase lookup performance

I have an hbase hosting around 80G of data(around 110 million rows). Each row has variable number of columns. I want to primarily use this hbase for key lookups(around 10 million lookups) while ...
1
vote
0answers
15 views

difference between org.apache.hbase -hbase and -hbase-client

I'm working with project that uses HBase and need to use some API for both working with data (Put and other classes) and schema(HTable, HColumn etc). I found two maven dependencies to work with it: ...
1
vote
0answers
43 views

Connecting to HBase in KNIME

I am using KNIME 2.9.4 and I have HBase installed (version - 0.94.8) in a remote Linux server. With the host IP, I am trying to connect to HBase. Here is what I did. Added the "hbase-0.94.8.jar" in ...
1
vote
0answers
41 views

Compatibility issue between Hbase 0.94.2 and apache nutch dependency

I am trying to install apache nutch 2.2.1 and have successfully build it after making the required changes in the configuration files by following ...
1
vote
0answers
42 views

hbase custom filter not working

I'm trying to create a custom filter on hbase 0.98.1 in standalone mode on ubuntu 14.04. I created a class extending FilterBase. I put the jar in HBASE_HOME/lib. Looking in the logs, I see that my ...
1
vote
0answers
67 views

hadoop and hbase rebalancing after node additions

I have a fundamental question about load balancer. I just finished adding new nodes to our hadoop(2.3) cluster which also has hbase v0.98. After the addition and having all nodes online in hadoop and ...
1
vote
0answers
40 views

How to create a hbase custom command on jruby?

I am new to Hadoop and whole IT itself. I want to know whether I can create a custom hbase command similar to already available scan, put commands.. I have a sample jruby script, client.rb that ...
1
vote
0answers
113 views

Apache Nutch Error

I am completely new to Nutch Solr, and Hbase. I have installed the following apache-nutch-2.2.1, hbase-0.9.4, and solr-3.6.2. I am currently running OS X Mavericks and I have Java 1.7 installed ...
1
vote
0answers
14 views

Not finding an Existing table in hbase with data stored by a Websphere 8 Application

I'm experiencing an issue when i attempt to start up an application on Websphere which stores its data in HBASE. I've checked HBASE and found that the database is intact and no inconsistencies were ...
1
vote
0answers
18 views

Hadoop FileFormat for Null fields

I was thinking of storing below CSV file in any format available in HDFS. H1 H2 H3 H4 Row1 1 NA 8.6 65 Row2 2 NA NA 56 Row3 4.6 7.5 NA 65 I know it's very easy we ...
1
vote
0answers
41 views

Efficient search in nested HBase entities

If I follow Ian Varley's HBase design practice and store a bunch of nested entities in the same HBase entity (to benefit from HBase's single row ACID properties), would it be possible to efficiently ...
1
vote
0answers
325 views

Hive query with where clause not working

I am querying an external Hbase table from Hive. when i do a simple query select * from Document_Table_Hive The query works and I get the records stored in the table. but when I do a query with ...
1
vote
0answers
45 views

Hbase configuraion without using hdfs

How can I make an hbase-site.xml? How is its format? how does my client connect to HbaseDB ? my format is: <?xml version="1.0"?> <?xml-stylesheet type="text/xsl" ...
1
vote
0answers
22 views

Is it possible to specify which takstrackers to use in a MapReduce job?

We have two types of jobs in our Hadoop cluster. One job uses MapReduce HBase scanning, the other one is just pure manipulation of raw files in HDFS. Within our HDFS cluster, part of the datanodes are ...
1
vote
0answers
161 views

java.io.IOException:java.lang.ClassCastException: org.apache.hadoop.hbase.client.Result cannot be cast to org.apache.hadoop.io.Writable

Tried one sample of handling a table in hbase from hive. The CREATE EXTERNAL TABLE command was successful, but the select statement gives a class cast exception ENV: hive 0.12.0, hbase 0.96.1, ...
1
vote
0answers
52 views

How to perform GET using hbase rest with hashed rowkey

I have the following row key: \xFE\xA1\x52\x272014-02-14T00:00:00Z I am trying to use hbase rest to perform a GET. Performing the following, I get 404 Not Found: curl -v -X GET http://{server ...
1
vote
0answers
172 views

hbase adding phoenix jars under cloudera

I'm looking for a way to add the Phoenix Project (http://phoenix.incubator.apache.org/) JAR's to an HBase region server running under CDH 4.5 Cloudera. There are several mentions in the googlecanon ...
1
vote
0answers
58 views

Generate HFileOutputFormat after the Reducer

I have written a MapReduce-Job, which needs the Mapper and the Reducer part. After the Reducer (which generates the Puts) I would like to save the result into HFileOutputFormat to do an incremental ...
1
vote
0answers
137 views

HBase/Zookeeper Class Not Found Exception

I'm running a stand alone instance of HBase on Windows using Cygwin. When a try to start Hbase a get the below exception. I have HBASE_CLASSPATH set in my hbase-env.sh file( export ...
1
vote
0answers
27 views

Nutch updatedb WrongRegionException

I run generate-fetch-parse and after update I got this exception. 2014-01-23 13:40:56,905 ERROR store.HBaseStore - Failed 747 actions: WrongRegionException: 747 times, servers with issues: ...
1
vote
0answers
43 views

HBASE with Distributed File System?

Well its quite clear that HBASE is database that save its file in HDFS. Can HBASE even be integrated with other distributed file system? If yes, then what should be the underlying approaches?For ...
1
vote
0answers
125 views

HBase scan and retrieve single columns

I am trying to understand the data model that HBase uses in storing its data, in order to plan a more effective organization of application data. In particular I am trying to understand whether it is ...