Input File formats In HDFS
In general the input file format will play key role Hadoop MapR Programming
because of the output is generated in three stages
primary data---> IN HDFS----> Mapper______> Map output-------------> Reducer Input/Output
HDFS data Represented in Text,Sequential and Binary Format
By default It Will Represent Text Input Format
if u have submitted the input data to the HDFS
it will organise the data like KEY , VALUE pairs
key is the BYTEOFFSET ie address value of the Line
Value is the Individual String of line in your file system
In general the input file format will play key role Hadoop MapR Programming
because of the output is generated in three stages
primary data---> IN HDFS----> Mapper______> Map output-------------> Reducer Input/Output
HDFS data Represented in Text,Sequential and Binary Format
By default It Will Represent Text Input Format
if u have submitted the input data to the HDFS
it will organise the data like KEY , VALUE pairs
key is the BYTEOFFSET ie address value of the Line
Value is the Individual String of line in your file system
No comments:
Post a Comment