MC Which statement is NOT CORRECT?
Variety in Big Data refers to data "in many forms". incorrect
Velocity in Big Data refers to data "in movement". incorrect
Veracity in Big Data refers to data "in change". correct
Volume in Big Data refers to data "at rest". incorrect

MC Which statement is CORRECT?
One of the disadvantages of Spark is that it does not support streaming data. incorrect
One of the disadvantages of Spark is that it has no way to deal with graph-based data. incorrect
One of the disadvantages of Spark is that its streaming API does not allow joining multiple streams. incorrect
One of the disadvantages of Spark is that its streaming and machine learning APIs are still mostly RDD-based. correct

MC Which of the following commands is not part of HBase?
Get incorrect
Place correct
Describe incorrect
Put incorrect

MC Which statement is NOT CORRECT?
The YARN ApplicationMaster contains a scheduler which will hold submitted jobs in a queue until they are deemed ready to start. correct
NodeManagers in YARN are responsible for setting up containers on the node hosting a particular (sub)task. incorrect
YARN's JobHistoryServer keeps a log of all finished jobs. incorrect
Apart from handling MapReduce programs, YARN can also be used to manage other types of applications. incorrect

MC Which of the following schema handling methods does Hive apply?
Schema on read correct
Schema on query incorrect
Schema on load incorrect
Schema on write incorrect

MC Pig is...
A SQL engine which runs on top of Hadoop. incorrect
A programming language that can be used to query HDFS data. incorrect
A project offering a programming language that is more user-friendly than writing MapReduce programs. correct
A database which runs on Hadoop. incorrect

MC What do the 5 V's of Big Data stand for?
Volume, Versatile, Velocity, Visualization, Value. incorrect
Volume, Variety, Velocity, Variability, Value. incorrect
Volume, Variety, Velocity, Veracity, Value. correct
Volume, Visualization, Velocity, Variety, Value. incorrect

MC Which statement is NOT CORRECT?
RDDs represent an abstract, immutable data structure. incorrect
RDDs offer failure protection by tracking the lineage of operations that are applied on them. incorrect
RDDs allow for two forms of operations: transformations and actions. incorrect
RDDs are structured and represent a collection of columnar objects. correct

MC Which statement is CORRECT?
MapReduce programs cannot be used with HBase. Data is accessed using simple put and get commands instead. incorrect
HBase can be considered as a NoSQL database. correct
HBase offers a SQL engine to query its data. incorrect
HBase works well on large clusters as well as on small ones with only a few nodes. incorrect

MC Which statement is CORRECT?
HBase offers a SQL engine to query its data. incorrect
Hive's main advantage lies in its query speed and performance. incorrect
Hive applies a 'schema on query' approach. incorrect
YARN's JobHistoryServer keeps a log of all finished jobs. correct