MC Which statement is NOT CORRECT? A mapper in Hadoop maps each element in a collection to one or more output elements. incorrect A MapReduce pipeline in Hadoop can include an optional Sorter to sort the final output. incorrect Reducer workers in Hadoop will start once all mapper workers have fished. correct A reducer in Hadoop reduces a collection of elements to one or more output elements. incorrect MC Which statement is CORRECT? Reducer workers in Hadoop will start once all mapper workers have fished. incorrect The Hive executer takes the MapReduce stages and sends these to Hadoop Common. incorrect MLlib is originally built on Spark SQL's DataFrames-based API. correct The original idea behind Big Data technologies is to provide extensive query facilities. incorrect MC Which statement is NOT CORRECT? A mapper in Hadoop maps each element in a collection to one or more output elements. incorrect A reducer in Hadoop reduces a collection of elements to one or more output elements. incorrect A MapReduce pipeline in Hadoop can include an optional Sorter to sort the final output. incorrect Reducer workers in Hadoop will start once all mapper workers have fished. correct MC Pig is... A database which runs on Hadoop. incorrect A project offering a programming language to provide more user-friendliness compared to MapReduce programs. correct A programming language that can be used to query HDFS data. incorrect A SQL engine which runs on top of Hadoop. incorrect MC What do the 5 V's of Big Data stand for? Volume, Visualization, Velocity, Variety, Value. incorrect Volume, Variety, Velocity, Variability, Value. incorrect Volume, Variety, Velocity, Veracity, Value. correct Volume, Versatile, Velocity, Visualization, Value. incorrect MC Which statement is NOT CORRECT? NodeManagers in YARN are responsible for setting up containers on the node hosting a particular (sub)task. incorrect Spark Streaming performs streaming analytics. incorrect A MapReduce program can be implemented in an easy, straightforward manner. correct HBase can be considered as a NoSQL database. incorrect MC What do the 5 V's of Big Data stand for? Volume, Variety, Velocity, Variability, Value. incorrect Volume, Versatile, Velocity, Visualization, Value. incorrect Volume, Variety, Velocity, Veracity, Value. correct Volume, Visualization, Velocity, Variety, Value. incorrect MC Which of the following schema handling methods does Hive apply? Schema on read correct Schema on write incorrect Schema on query incorrect Schema on load incorrect MC Which statement is NOT CORRECT? Reducer workers in Hadoop will start once all mapper workers have fished. correct A mapper in Hadoop maps each element in a collection to one or more output elements. incorrect A MapReduce pipeline in Hadoop can include an optional Sorter to sort the final output. incorrect A reducer in Hadoop reduces a collection of elements to one or more output elements. incorrect MC Which of the following is not one of the reasons why Spark programs are generally faster than MapReduce operations? Because Spark uses a directed acyclic graph instead of MapReduce. incorrect Because Mesos can be used as a resource manager instead of YARN. correct Because Spark tries to keep its RDDs in memory as long as possible. incorrect Because RDD transformations are "lazily" applied. incorrect