Please help, I have the following sample data:
-21.33,45.677,1234,1245,1267,1290,1212,1111,10000,1902
-21.34,45.677,1264,1645,1266,1260,1612,1611,16000,1602
-21.35,45.677,1244,1445,1467,1240,1242,1211,11000,1912
-21.36,45.677,1231,1215,1217,1210,1212,1111,10010,1902
I want that my hadoop mapreduce code should consider the first two float entries as the key(-21.33,45.677) and the remaining integer entries as value (1234,1245,1267,1290,1212,1111,10000,1902).
I am not sure if it can be done with the existing FileInputFormats. So how shall I go about this knowing that the value should be used as array not text.
Also how should I change the inputSplit such that I am able to get multiple records at the same time in the map for computation.