MC Pig is...
A SQL engine which runs on top of Hadoop. incorrect
A programming language that can be used to query HDFS data. incorrect
A database which runs on Hadoop. incorrect
A project offering a programming language to provide more user-friendliness compared to MapReduce programs. correct

MC Which statement is NOT CORRECT?
Hive queries run much faster than hand-written MapReduce programs. correct
Hive offers an SQL engine to query Hadoop data. incorrect
Hive's query language is not as feature complete as the full SQL standard. incorrect
Hive offers a JDBC interface. incorrect

MC Which of the following commands is not part of HBase?
Describe incorrect
Put incorrect
Place correct
Get incorrect

MC Which statement is CORRECT?
HiveQL supports complex subqueries. incorrect
DataNodes in HDFS store a registry of metadata. incorrect
HDFS's high emphasis on fault tolerance results in data replication. correct
Hive applies a 'schema on write' approach. incorrect

MC Which of the following is not one of the reasons why Spark programs are generally faster than MapReduce operations?
Because Spark uses a directed acyclic graph instead of MapReduce. incorrect
Because Spark tries to keep its RDDs in memory as long as possible. incorrect
Because Mesos can be used as a resource manager instead of YARN. correct
Because RDD transformations are "lazily" applied. incorrect

MC Which statement is NOT CORRECT?
Spark SQL DataFrames need to be created by loading a file. correct
Spark SQL exposes DataFrame and Dataset APIs which internally use RDDs together with a performant SQL query engine. incorrect
Spark SQL can be used from within Java, Python, Scala and R. incorrect
Spark SQL can be used through ODBC and JDBC interfaces. incorrect

MC Which statement is CORRECT?
HBase works well on large clusters as well as small ones having a few nodes. incorrect
HBase can be considered as a NoSQL database. correct
HBase offers a SQL engine to query its data. incorrect
MapReduce programs cannot be used with HBase. Data is accessed using simple put and get commands instead. incorrect

MC Pig is...
A database which runs on Hadoop. incorrect
A project offering a programming language to provide more user-friendliness compared to MapReduce programs. correct
A SQL engine which runs on top of Hadoop. incorrect
A programming language that can be used to query HDFS data. incorrect

MC Which statement is NOT CORRECT?
Hive offers a JDBC interface. incorrect
Hive queries run much faster than hand-written MapReduce programs. correct
Hive's query language is not as feature complete as the full SQL standard. incorrect
Hive offers an SQL engine to query Hadoop data. incorrect

MC Which statement is NOT CORRECT?
Velocity in Big Data refers to data "in movement". incorrect
Veracity in Big Data refers to data "in change". correct
Variety in Big Data refers to data "in many forms". incorrect
Volume in Big Data refers to data "at rest". incorrect