[Free] 2018(Jan) Dumps4cert Testking IBM C2090-102 Dumps with VCE and PDF 71-80

January 30, 2018

2018 Jan IBM Official New Released C2090-102

IBM Big Data Architect

Question No: 71

You are designing storage for a new Hadoop cluster. Which of the following statements is TRUE regarding the usage of SAN or NAS?

  A. SAN or NAS should not be used to set up HDFS

  B. SAN or NAS must be used, if available, to provide backup capabilities

  C. SAN or NAS can be used to support retention policies

  D. SAN or NAS cannot be used if your Hadoop cluster spans 2 sites

Answer: A

References: http://www-01.ibm.com/software/data/infosphere/hadoop/hdfs/

Question No: 72

Company A has decided to implement a new data system to support its rapidly growing business. It has 20 TB of existing raw data, with an expected incoming rate of 50 GB of new raw data per week. The data is mostly text-based and unstructured. A typical query can involve pulling in 10 GB of data. Performance has historically been an issue and now needs to be addressed. Which of the following would you suggest to support these requirements?

  A. Set up a Hadoop system with commodity HW for scalability

  B. Utilize de-duplication and compression technology

  C. Use a mixture of different disk-types to provide hot/cold storage

  D. Create range partitions for the data

Answer: A
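
The sizing figures in Question 72 (20 TB existing, 50 GB per week incoming) can be turned into a rough capacity estimate. The sketch below is only an illustration: the function names are hypothetical, decimal units (1 TB = 1000 GB) are assumed, and the replication factor of 3 is an assumption based on the usual HDFS default rather than anything stated in the question.

```python
# Rough capacity sketch for the scenario in Question 72.
# Assumptions (not from the question): decimal units and 3x block replication.
GB_PER_TB = 1000


def raw_tb_after(weeks, initial_tb=20.0, weekly_gb=50.0):
    """Raw data volume in TB after the given number of weeks of ingest."""
    return initial_tb + weeks * weekly_gb / GB_PER_TB


def replicated_tb(raw_tb, replication=3):
    """Disk space consumed when every block is stored `replication` times."""
    return raw_tb * replication


for years in (1, 3, 5):
    raw = raw_tb_after(52 * years)
    print(f"year {years}: raw {raw:.1f} TB, replicated {replicated_tb(raw):.1f} TB")
```

Under these assumptions the raw volume grows by about 2.6 TB per year, so the existing 20 TB dominates the sizing for the first several years.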

Question No: 73

Which of the following is NOT a valid Service Level Agreement (SLA) metric?

  A. Mean time between failures

  B. Mean time to repair

  C. Identification to responsible party

  D. Identification of failing component

Answer: D

References: https://en.wikipedia.org/wiki/Service-level_agreement

Question No: 74

A media company collects customer behavior data, such as how frequently they tune in, specific viewing habits, and peak usage in real time, in order to improve their services. The company would like to segment its customers for advertisers by correlating viewing habits with public data, such as voter registration, in order to launch highly targeted campaigns to specific demographics. What technology should their Data Architect consider?

  A. InfoSphere Streams, BigInsights, and PureData for Analytics (PDA)

  B. BigInsights and PureData for Operational Analytics (PDOA)

  C. InfoSphere Streams, Spark, and BigR

  D. PureData for Analytics and SPSS

Answer: D

Reference: http://www.ibm.com/software/data/puredata/analytics/nztechnology/analytics.html

Question No: 75

A large retailer (online and "brick & mortar") processes data for analyzing marketing campaigns for their loyalty club members. The current process takes weeks to process only 10% of the social data. What is the most cost-effective platform for processing and analyzing campaign results from social data on a daily basis using 100% of the dataset?

  A. Enterprise Data Warehouse

  B. BigInsights Open Data Platform

  C. High Speed Mainframe Processing

  D. In Memory Computing

Answer: B

References: http://www.ibm.com/developerworks/data/library/techarticle/dm-1110biginsightsintro/

Question No: 76

Which of the following statements is TRUE regarding cloud computing solutions?

  A. Cloud security is planned, developed, and layered on top of an application after the application development process is complete

  B. Stateless applications are better candidates for cloud services than applications that maintain state

  C. Cloud solutions rely on scale-up (vertical) scaling rather than scale-out (horizontal) scaling

  D. Server virtualization is a requirement in a cloud implementation

Answer: D

Question No: 77

A telecommunications company needs a Big Data solution that can store and analyze multiple years' worth of call detail records (CDRs, approx. 17 billion events per day) containing switch, billing, and network event data for its millions of subscribers. Which of the following would you recommend for these requirements?

  A. InfoSphere DataStage

  B. DB2

  C. PureData System for Analytics

  D. SPSS

Answer: C

Question No: 78

Which of the following approaches can an organization take to address the impact on performance and capacity caused by the variety of data types?

  A. Define a data catalog in a traditional data warehouse

  B. Create different solutions to handle every kind of data

  C. Store a wide range of data formats on the same platform

  D. Define a comprehensive taxonomy and constantly review

Answer: D

References: http://www.redbooks.ibm.com/redpapers/pdfs/redp5070.pdf Page: 15

Question No: 79

In BigSheets you can add sheets to workbooks to progressively edit and explore your data. Which of the following is a CORRECT list of sheet types that BigSheets provides and that can contain predefined logic for analyzing data?

  A. Copy, Formula, Update

  B. Filter, Insert, Union

  C. Distinct, Join, Stored Procedure

  D. Complement, Limit, Group

Answer: D

References: https://developer.ibm.com/hadoop/blog/2014/08/18/use-bigsheets-analytics/

Question No: 80

Which data format stores all of the data in a binary format, making the files more compact, and even adds markers to help MapReduce jobs determine where to break large files for more efficient processing?

  A. Parquet

  B. Avro

  C. ORC

  D. Sequence File

Answer: B
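
The idea of markers that tell a job where a large binary file can be broken, as described in Question 80, can be illustrated with a toy container. This sketch is not the on-disk layout of Avro or any other real format: `write_blocks`, `next_split_point`, and the 16-byte marker length are hypothetical choices made only to show how a reader starting at an arbitrary byte offset can find the next safe block boundary.

```python
import os

SYNC_SIZE = 16  # toy marker length, chosen for illustration only


def write_blocks(blocks, sync):
    """Concatenate binary blocks, writing the sync marker after each one."""
    return b"".join(block + sync for block in blocks)


def next_split_point(data, sync, offset):
    """Return the position just past the first sync marker found at or
    after `offset`, or len(data) if no marker remains."""
    pos = data.find(sync, offset)
    return len(data) if pos == -1 else pos + len(sync)


# A random marker makes an accidental match inside the payload unlikely.
sync = os.urandom(SYNC_SIZE)
data = write_blocks([b"record-block-1", b"record-block-2", b"record-block-3"], sync)

# A task handed an arbitrary byte offset skips ahead to the next boundary.
start = next_split_point(data, sync, offset=5)
print(f"task assigned offset 5 starts reading at byte {start}")
```

Because every block ends with the same marker, a file can be cut at any byte offset and each task can still begin on a whole block, which is what makes such a file splittable.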
