MC	Which statement is NOT CORRECT?	Reducer workers in Hadoop will start once all mapper workers have finished.	correct	A MapReduce pipeline in Hadoop can include an optional Sorter to sort the final output.	incorrect	A reducer in Hadoop reduces a collection of elements to one or more output elements.	incorrect	A mapper in Hadoop maps each element in a collection to one or more output elements.	incorrect	
MC	Which components does the base Hadoop stack include?	NDFS, MapReduce, and YARN.	incorrect	HDFS, Spark and YARN.	incorrect	HDFS, Map and Reduce.	incorrect	HDFS, MapReduce and YARN.	correct	
MC	Which statement is CORRECT?	DataNodes in HDFS store a registry of metadata.	incorrect	HDFS is composed of a NameNode, DataNodes, and an optional SecondaryNameNode.	correct	Both the SecondaryNameNode and primary NameNode can simultaneously handle requests from clients.	incorrect	The HDFS NameNode sends regular heartbeat messages to its DataNodes.	incorrect	
MC	Which statement is CORRECT?	DataNodes in HDFS store a registry of metadata.	incorrect	The HDFS NameNode sends regular heartbeat messages to its DataNodes.	incorrect	Both the SecondaryNameNode and primary NameNode can simultaneously handle requests from clients.	incorrect	HDFS is composed of a NameNode, DataNodes, and an optional SecondaryNameNode.	correct	
MC	Which statement is NOT CORRECT?	A traditional RDBMS applies a 'schema on write' approach.	incorrect	Variety in Big Data refers to data "in many forms".	incorrect	Both the SecondaryNameNode and primary NameNode can simultaneously handle requests from clients.	correct	Spark SQL can be used from within Java, Python, Scala and R.	incorrect	
MC	Which statement is NOT CORRECT?	Hive offers an SQL engine to query Hadoop data.	incorrect	Hive offers a JDBC interface.	incorrect	Hive queries run much faster than hand-written MapReduce programs.	correct	Hive's query language is not as feature complete as the full SQL standard.	incorrect	
MC	Which of the following commands is not part of HBase?	Put	incorrect	Get	incorrect	Place	correct	Describe	incorrect	
MC	Which statement is NOT CORRECT?	Hive offers an SQL engine to query Hadoop data.	incorrect	Hive offers a JDBC interface.	incorrect	Hive's query language is not as feature complete as the full SQL standard.	incorrect	Hive queries run much faster than hand-written MapReduce programs.	correct	
MC	Which of the following commands is not part of HBase?	Get	incorrect	Describe	incorrect	Place	correct	Put	incorrect	
MC	Which statement is CORRECT?	One of the disadvantages of Spark is that it has no way to deal with graph based data.	incorrect	MLlib is based on the MapReduce pipeline.	incorrect	Variety in Big Data refers to data "in many forms".	correct	One of the disadvantages of Spark is that it does not support streaming data.	incorrect