MC Which of the following is not one of the reasons why Spark programs are generally faster than MapReduce operations? Because Mesos can be used as a resource manager instead of YARN. correct Because Spark uses a directed acyclic graph instead of MapReduce. incorrect Because RDD transformations are "lazily" applied. incorrect Because Spark tries to keep its RDDs in memory as long as possible. incorrect MC Pig is... A project offering a programming language to provide more user-friendliness compared to MapReduce programs. correct A database which runs on Hadoop. incorrect A programming language that can be used to query HDFS data. incorrect A SQL engine which runs on top of Hadoop. incorrect MC Which of the following schema handling methods does Hive apply? Schema on write incorrect Schema on query incorrect Schema on read correct Schema on load incorrect MC Which statement is CORRECT? HDFS is composed of a NameNode, DataNodes, and an optional SecondaryNameNode. correct DataNodes in HDFS store a registry of metadata. incorrect The HDFS NameNode sends regular heartbeat messages to its DataNodes. incorrect Both the SecondaryNameNode and primary NameNode can simultaneously handle requests from clients. incorrect MC Which components does the base Hadoop stack include? HDFS, Map and Reduce. incorrect HDFS, MapReduce and YARN. correct HDFS, Spark and YARN. incorrect NDFS, MapReduce, and YARN. incorrect MC Which statement is NOT CORRECT? Hive offers an SQL engine to query Hadoop data. incorrect Hive's query language is not as feature complete as the full SQL standard. incorrect Hive offers a JDBC interface. incorrect Hive queries run much faster than hand-written MapReduce programs. correct MC Pig is... A project offering a programming language to provide more user-friendliness compared to MapReduce programs. correct A SQL engine which runs on top of Hadoop. incorrect A programming language that can be used to query HDFS data. incorrect A database which runs on Hadoop. incorrect MC Which statement is NOT CORRECT? Hive offers a JDBC interface. incorrect Hive queries run much faster than hand-written MapReduce programs. correct Hive's query language is not as feature complete as the full SQL standard. incorrect Hive offers an SQL engine to query Hadoop data. incorrect MC Which statement is CORRECT? The HDFS NameNode sends regular heartbeat messages to its DataNodes. incorrect HDFS is composed of a NameNode, DataNodes, and an optional SecondaryNameNode. correct DataNodes in HDFS store a registry of metadata. incorrect Both the SecondaryNameNode and primary NameNode can simultaneously handle requests from clients. incorrect MC Which statement is NOT CORRECT? MapReduce programs can be automatically parallelized and executed across a cluster of different computers. incorrect DStream is internally represented as a sequence of RDD fragments. incorrect HBase offers a SQL engine to query its data. correct Pig is slower than a modern RDBMS, especially for reasonably sized, structured data. incorrect