Juan Taborda
Quiz by , created more than 1 year ago

Examen 1 de certificacion

425
1
0
Juan Taborda
Created by Juan Taborda almost 8 years ago
Close

Examen Fundamental Big Data

Question 1 of 200

1

Big Data

Select one of the following:

  • A Not-only SQL (NoSQL) database is a non-relational database that can be use to store it

  • is an open-source framework for large-scale data storage and data processing that is mor or less run on commodity hardware

  • are capable of providing highly scalable, on-demand IT resources that can be leased via pay-as-you-go models

  • Is a field dedicated to the analysis, processing and storage of large collections of data that frequenty originate from disparate sources

Explanation

Question 2 of 200

1

Big Data Solutions

Select one of the following:

  • queries can take several minutes or even longer, depending on the complexity of the query and the number of records queried

  • is a measured for gauging sucess within a particular context

  • Examples can include EDI, e-mails, spreadcheets, RSS feeds, rss feeds and sensor data

  • are typically requiered when traditional data analysis, processing and storage technologies and techniques are insufficient

Explanation

Question 3 of 200

1

Big Data Addresses

Select one of the following:

  • Arrives at such fast speeds that enormous datasets can accumulate within very shorts periods of time

  • does not conform to a data model or data schema

  • Data adquired such as via online customer registrations, usually contains less noise

  • distinct requierements, such as the combining of multiple unrelated datasets, processing of large ammounts of unstructured data and harvesting of hidden information, in a time-sensitive manner

Explanation

Question 4 of 200

1

Using Big Data Solutions

Select one of the following:

  • are closesly liked with an enterprise's strategic objectives

  • further use databases that store historical data in multidimensional arrays and can answer complex queries based on multiple dimensions of the data

  • multiple formats and types of data that need to be supported by Big Data Solutions

  • complex analysis tasks can be carried out to arrive at deeply meaningful and insightful analysis results for the benefit of the business

Explanation

Question 5 of 200

1

Big Data Solutions

Select one of the following:

  • Some streams are public. Other streams go to vendors and business directly

  • Analytics and Data Science

  • are relevant to big data in that they can serve as both a datas source as well as an data sink that is capable of receiving data

  • can process massive quantities of data that arrive at varying speeds, may be of many different varieties and have numerous incompatibilities

Explanation

Question 6 of 200

1

Data within Big Data

Select one of the following:

  • is the process of gaining insights into the workings of an enterprise to improve decision-making by analyzing external data and data generated by its business processes

  • can have multiple data marts

  • is a process of loading data from a source system into a target system, the source system can be a database, a flat file or an application, similarly, the target system can be a database or some other information system

  • accumulates from being amassed within the enterprise (via applications) or from external sources that are then stored by the big datat solution

Explanation

Question 7 of 200

1

Data processed by Big Data

Select one of the following:

  • does generally require special or customized logic when it comes to pre-processing and storage

  • Data adquired such as blog posting, usually contains more noise

  • store historical data that is aggregated and denormalized to support fast reporting capability

  • can be used by enterprise applications directly, or fed into a data warehouse to enrich existing data.This data is typically analyzed and subjected to analytics

Explanation

Question 8 of 200

1

Processed data and analysis results

Select one of the following:

  • are closesly liked with an enterprise's strategic objectives

  • represents the main operation through which data warehouses are fed data

  • does often have special pre-processing and storage requierements, especially if the underline format is not text-based

  • are commonly used for meaningful and complex reporting and assessment task and can also be fed back into applications to enhance their behavior (such as when product recommendations are displayed online)

Explanation

Question 9 of 200

1

Data processed by Big Data

Select one of the following:

  • Analytics and Data Science

  • actionable intelligence

  • operational optimization

  • can be human-generated or machine generated, although it is ultimately the responsibility of machines to generate the processing results

Explanation

Question 10 of 200

1

Human-generated data

Select one of the following:

  • is a subset of the data stored in a data warehouse, that typically belongs to a department, division or specific line of business

  • each technology is uniquely relevant to modern-day Big Data Solutions and ecosystems

  • used to identify problem areas in order to take corrective actions

  • is the result of human interaction with systems, such as online services and digital devices (Ex. Social media, micro blogging, e-mails, photo sharing and messaging)

Explanation

Question 11 of 200

1

Machine-generated data

Select one of the following:

  • represents the main operation through which data warehouses are fed data

  • With periodic data imports from accross the enterprise, the amount of data contained will continue to increase. Query response times for data analysis task performed as part of BI can suffer as a result

  • defined as the usefulness of data for an enterprise

  • is the result of the automated, event-driven generation of data by software programs or hardware devices (Ex. Web logs, sensor data, telemetry data, smart meter data and appliance usage data

Explanation

Question 12 of 200

1

BDS processing results

Select one or more of the following:

  • is the process of gaining insights into the workings of an enterprise to improve decision-making by analyzing external data and data generated by its business processes

  • scientific and research data (large Hadron Collider, Atacama Large Milimeter/Submilimeter Array Telescope)

  • operational optimization

  • actionable intelligence

Explanation

Question 13 of 200

1

BDS processing results

Select one or more of the following:

  • is crucial to big data processing storage and analysis

  • With periodic data imports from accross the enterprise, the amount of data contained will continue to increase. Query response times for data analysis task performed as part of BI can suffer as a result

  • identification of new markets

  • accurate predictions

Explanation

Question 14 of 200

1

BDS processing results

Select one or more of the following:

  • is directly related to the veracity characteristic

  • The required data is first obtained from the sources, after which the extracts are modified by applying rules

  • fault and fraud detection

  • more detailed records

Explanation

Question 15 of 200

1

BDS processing results

Select one or more of the following:

  • related to collecting and processing large quantities of diverse data has become increasingly affordable

  • simple insert, delete and update operations with sub-second response times

  • improved decision-making

  • scientific discoveries

Explanation

Question 16 of 200

1

Datasets

Select one of the following:

  • improved decision-making

  • representing a common source of structured analytics input

  • The anticipated volume of data that is processed by Big Data solutions is substantial and usually ever-growing

  • Collections or groups of related data (Ex. Tweets stored in a flat file, collection of image files, extract of rows stored in a table, historical weather observations that are stored as XML Files)

Explanation

Question 17 of 200

1

Datum

Select one of the following:

  • Shares the same set of attributes as others in the same dataset

  • Are the data analysis results being accurately communicated to the appropriate decision-makers?

  • The anticipated volume of data that is processed by Big Data solutions is substantial and usually ever-growing

  • is based on a quantifiable indicator that is identified and agreed upon beforehand

Explanation

Question 18 of 200

1

Data analysis

Select one or more of the following:

  • either exists in textual or binary form

  • is the result of human interaction with systems, such as online services and digital devices (Ex. Social media, micro blogging, e-mails, photo sharing and messaging)

  • is the process of examining data to find facts, relationships, patterns, insights and/or trends. The eventual goal is to support decision-making

  • helps establish patterns and relationships amog the data being analyzed

Explanation

Question 19 of 200

1

Analytics

Select one or more of the following:

  • semi-structured data

  • Can exist as a separate DBMS, as in the case of an OLAP database

  • is the discipline of gaininng an understanding of data by analyzing it via a multitude of scientific techniques and automated tools, with a focus on locating hidden patterns and correlations

  • is usually applied using highly scalable distributed technologies and frameworks for analyzing large volumes of data from different sources

Explanation

Question 20 of 200

1

Analytics

Select one of the following:

  • generally involves sifting through large amounts of raw, unstructured data to extract meaningful information that can serve as an input for identifying patterns, enriching existing enterprise data, or performing large-scale searches

  • may not always be high. For Example, MRI scan images are usually not generated as frequently as log entries form a high-traffic Web Server

  • Shares the same set of attributes as others in the same dataset

  • attributes providing the file size and resolution of a digital photograph

Explanation

Question 21 of 200

1

in the business-oriented environments analytics results can lower operational costs and facilitate strategic decision-making?

Select one of the following:

  • True
  • False

Explanation

Question 22 of 200

1

scientific domain

Select one of the following:

  • does often have special pre-processing and storage requierements, especially if the underline format is not text-based

  • is also dependent on how long data processing takes, time are inversely proportional to each other

  • is a data analysis technique that focuses on quantifying the patterns and correlations found in the data

  • analytics can help identify the cause of a phenomenon to improve the accuracy of predictions

Explanation

Question 23 of 200

1

services-based environments

Select one of the following:

  • are relevant to big data in that they can serve as both a datas source as well as an data sink that is capable of receiving data

  • each technology is uniquely relevant to modern-day Big Data Solutions and ecosystems

  • are commonly used for meaningful and complex reporting and assessment task and can also be fed back into applications to enhance their behavior (such as when product recommendations are displayed online)

  • analytics can help strengthen the focus on delivering high quality services by driving down cost

Explanation

Question 24 of 200

1

Analytics

Select one of the following:

  • are closesly liked with an enterprise's strategic objectives

  • Shares the same set of attributes as others in the same dataset

  • generally makes up 80% of the data within an enterprise, and has a faster growth rate than structured data

  • enables data-driven decision-making with scientific backing, so that decisions can be based on a factual data and not on past experience or intuition alone

Explanation

Question 25 of 200

1

Business Intelligence

Select one or more of the following:

  • generally involves sifting through large amounts of raw, unstructured data to extract meaningful information that can serve as an input for identifying patterns, enriching existing enterprise data, or performing large-scale searches

  • can be used as an ETL engine, or as an analytics engine for processing large amounts of structured, semi-structured and unstructured data

  • is the process of gaining insights into the workings of an enterprise to improve decision-making by analyzing external data and data generated by its business processes

  • applyes analytics to large amounts of data across the enterprise

Explanation

Question 26 of 200

1

Business Intelligence

Select one of the following:

  • store historical data that is aggregated and denormalized to support fast reporting capability

  • is the process of examining data to find facts, relationships, patterns, insights and/or trends. The eventual goal is to support decision-making

  • The anticipated volume of data that is processed by Big Data solutions is substantial and usually ever-growing

  • can be further utilize the consolidated data contained in data warehouses to run analytical queries

Explanation

Question 27 of 200

1

KPI

Select one or more of the following:

  • is crucial to big data processing storage and analysis

  • is mostly machine-generated and automatically appended to the data

  • is a measured for gauging sucess within a particular context

  • are closesly liked with an enterprise's strategic objectives

Explanation

Question 28 of 200

1

KPI

Select one or more of the following:

  • Shares the same set of attributes as others in the same dataset

  • ticket reservation systems and banking and POS transactions

  • used to identify problem areas in order to take corrective actions

  • used to achieve regulatory compliance

Explanation

Question 29 of 200

1

KPI

Select one or more of the following:

  • more detailed records

  • big data solutions particularly rely on it when processing semi-structured and unstructured data

  • act as quick reference points for measuring the overall performance of the business

  • is based on a quantifiable indicator that is identified and agreed upon beforehand

Explanation

Question 30 of 200

1

primary business and technology drivers

Select one or more of the following:

  • the relational data is stored as denormalized data in the form of cubes, this allows the data to be queried during any data analysis task that are performed later

  • XML tags providing the author and creation date of a document

  • Analytics and Data Science

  • Digitization

Explanation

Question 31 of 200

1

primary business and technology drivers

Select one or more of the following:

  • A Not-only SQL (NoSQL) database is a non-relational database that can be use to store it

  • are capable of providing highly scalable, on-demand IT resources that can be leased via pay-as-you-go models

  • Affordable Technology & Commodity Hardware

  • Social Media

Explanation

Question 32 of 200

1

primary business and technology drivers

Select one or more of the following:

  • does often have special pre-processing and storage requierements, especially if the underline format is not text-based

  • is directly related to the veracity characteristic

  • Hyper-Connected Communities & Devices

  • Cloud Computing

Explanation

Question 33 of 200

1

Analytics & Data Science

Select one of the following:

  • generally makes up 80% of the data within an enterprise, and has a faster growth rate than structured data

  • more detailed records

  • fault and fraud detection

  • The maturity of these fields of practice inspired and enabled much of the core functionality expected from contemporary Big Data solutions and tools

Explanation

Question 34 of 200

1

Digitized data

Select one of the following:

  • How well has the data been stored?

  • is always fed with data from multiple OLTP systems using regular batch processing jobs

  • The longer it takes for data to be turned into meaninful information, the less potential it may have for the business

  • Leads to an opportunity to collect further "secondary" data, such as when individuals carry out searches or complete surveys

Explanation

Question 35 of 200

1

Colecting secondary data

Select one of the following:

  • accurate predictions

  • Extract Transform Load (ETL)

  • data bearing value leading to meaningful information

  • can be important to businesses. Mining this data may allow for customized marketing, automated recomendations and the development of optimized product features

Explanation

Question 36 of 200

1

Affordable Technology

Select one of the following:

  • Hyper-Connected Communities & Devices

  • is usually applied using highly scalable distributed technologies and frameworks for analyzing large volumes of data from different sources

  • are relevant to big data in that they can serve as both a datas source as well as an data sink that is capable of receiving data

  • related to collecting and processing large quantities of diverse data has become increasingly affordable

Explanation

Question 37 of 200

1

Tipical Big Data solutions

Select one of the following:

  • is typically stored in relational databases and frequently generated by custom enterprise applications, ERP systems amd CRM systems

  • The longer it takes for data to be turned into meaninful information, the less potential it may have for the business

  • operational optimization

  • are based on open-source software that requires little more than commodity hardware

Explanation

Question 38 of 200

1

commodity hardware

Select one of the following:

  • How well has the data been stored?

  • Hyper-Connected Communities & Devices

  • fault and fraud detection

  • makes the adoption of big data solutions accessible to businesses without large capital investments

Explanation

Question 39 of 200

1

Social Media

Select one or more of the following:

  • does not conform to a data model or data schema

  • store historical data that is aggregated and denormalized to support fast reporting capability

  • provide feedback in near-realtime via open and public mediums

  • business are storing increasing amounts of data on customer interaction and from social media avenues in an attempt to harvest this data to increase sales, enable targeted marketing and create new products and service

Explanation

Question 40 of 200

1

Social Media

Select one of the following:

  • may not always be high. For Example, MRI scan images are usually not generated as frequently as log entries form a high-traffic Web Server

  • Are the data analysis results being accurately communicated to the appropriate decision-makers?

  • operational optimization

  • business are also increasingly interested in incorporating publicly avaliable datasets from social media and other external data source

Explanation

Question 41 of 200

1

Hyper-Connected Communities & Devices

Select one or more of the following:

  • Examples can include EDI, e-mails, spreadcheets, RSS feeds, rss feeds and sensor data

  • is the process of examining data to find facts, relationships, patterns, insights and/or trends. The eventual goal is to support decision-making

  • The broadening coverage of the internet and the proliferation of cellular and Wi-Fi networks has enabled more people to be continuously active in virtual communities

  • This is either directly through online interaction on indirectly through the usage of connected devices, this has resulted in massive data streams

Explanation

Question 42 of 200

1

Hyper-Connected Communities & Devices

Select one of the following:

  • is an open-source framework for large-scale data storage and data processing that is mor or less run on commodity hardware

  • can be important to businesses. Mining this data may allow for customized marketing, automated recomendations and the development of optimized product features

  • can also be fed back into OLTPs

  • Some streams are public. Other streams go to vendors and business directly

Explanation

Question 43 of 200

1

Cloud Computing

Select one or more of the following:

  • is the process of gaining insights into the workings of an enterprise to improve decision-making by analyzing external data and data generated by its business processes

  • attributes providing the file size and resolution of a digital photograph

  • have led to the creation of remote environments

  • are capable of providing highly scalable, on-demand IT resources that can be leased via pay-as-you-go models

Explanation

Question 44 of 200

1

Cloud Computing

Select one or more of the following:

  • multiple formats and types of data that need to be supported by Big Data Solutions

  • applyes analytics to large amounts of data across the enterprise

  • Business have the opportunity to leverage the infraestructure, storage and processing capabilities provided by these environments in order to build large scale Big Data Solutions

  • Can be leveraged for its scaling capabilities to perform Big Data Processing task

Explanation

Question 45 of 200

1

Cloud Computing

Select one of the following:

  • either exists in textual or binary form

  • actionable intelligence

  • have a greater noise-to-signal ratio

  • can be leased dramatically reduces the requiered up-front investment of big data projects

Explanation

Question 46 of 200

1

Technologies Related to Big Data

Select one or more of the following:

  • It also periodically pulls data from other sources for consolidation into a dataset (such as from OLTP, ERP, CRM, and SCM systems).

  • This is either directly through online interaction on indirectly through the usage of connected devices, this has resulted in massive data streams

  • Online Transaction Processing (OLTP)

  • Online Analytical Processing (OLAP)

Explanation

Question 47 of 200

1

Technologies Related to Big Data

Select one or more of the following:

  • each technology is uniquely relevant to modern-day Big Data Solutions and ecosystems

  • represents the main operation through which data warehouses are fed data

  • Extract Transform Load (ETL)

  • Data Warehouses

Explanation

Question 48 of 200

1

Technologies Related to Big Data

Select one of the following:

  • are capable of providing highly scalable, on-demand IT resources that can be leased via pay-as-you-go models

  • is the discipline of gaininng an understanding of data by analyzing it via a multitude of scientific techniques and automated tools, with a focus on locating hidden patterns and correlations

  • is crucial to big data processing storage and analysis

  • Hadoop

Explanation

Question 49 of 200

1

OLTP

Select one or more of the following:

  • further use databases that store historical data in multidimensional arrays and can answer complex queries based on multiple dimensions of the data

  • is a process of loading data from a source system into a target system, the source system can be a database, a flat file or an application, similarly, the target system can be a database or some other information system

  • store operational data that is fully normalized

  • is a software system that processes transaction-oriented data

Explanation

Question 50 of 200

1

Online Transaction

Select one of the following:

  • operational optimization

  • A Not-only SQL (NoSQL) database is a non-relational database that can be use to store it

  • Collections or groups of related data (Ex. Tweets stored in a flat file, collection of image files, extract of rows stored in a table, historical weather observations that are stored as XML Files)

  • the completion on an activity in realtime and not batch-processed

Explanation

Question 51 of 200

1

OLTP

Select one of the following:

  • representing a common source of structured analytics input

  • generally involves sifting through large amounts of raw, unstructured data to extract meaningful information that can serve as an input for identifying patterns, enriching existing enterprise data, or performing large-scale searches

  • require automated data cleansing and data verification when carrying out ETL processes

  • are closesly liked with an enterprise's strategic objectives

Explanation

Question 52 of 200

1

Big Data Analysis Results

Select one of the following:

  • used to identify problem areas in order to take corrective actions

  • either exists in textual or binary form

  • enables data-driven decision-making with scientific backing, so that decisions can be based on a factual data and not on past experience or intuition alone

  • can also be fed back into OLTPs

Explanation

Question 53 of 200

1

Queries Supported by OLTP

Select one of the following:

  • mostly exist in textual form such as XML or JSON files.

  • data bearing value leading to meaningful information

  • The broadening coverage of the internet and the proliferation of cellular and Wi-Fi networks has enabled more people to be continuously active in virtual communities

  • simple insert, delete and update operations with sub-second response times

Explanation

Question 54 of 200

1

Examples of OLTP

Select one of the following:

  • Data Warehouses

  • big data solutions particularly rely on it when processing semi-structured and unstructured data

  • structured data

  • ticket reservation systems and banking and POS transactions

Explanation

Question 55 of 200

1

OLAP

Select one or more of the following:

  • related to collecting and processing large quantities of diverse data has become increasingly affordable

  • XML tags providing the author and creation date of a document

  • is a system used for processing data analysis queries

  • form an integral part of business intelligence, data mining and machine learning processes

Explanation

Question 56 of 200

1

OLAP

Select one or more of the following:

  • Collections or groups of related data (Ex. Tweets stored in a flat file, collection of image files, extract of rows stored in a table, historical weather observations that are stored as XML Files)

  • store historical data that is aggregated and denormalized to support fast reporting capability

  • are relevant to big data in that they can serve as both a datas source as well as an data sink that is capable of receiving data

  • are using in diagnostic, predictive and prescriptive analysis

Explanation

Question 57 of 200

1

OLAP

Select one or more of the following:

  • Social Media

  • Sensor Data (RFID, Smart meters, GPS sensors)

  • further use databases that store historical data in multidimensional arrays and can answer complex queries based on multiple dimensions of the data

  • is always fed with data from multiple OLTP systems using regular batch processing jobs

Explanation

Question 58 of 200

1

OLAP

Select one or more of the following:

  • have a less noise-to-signal ratio

  • Are the right types of question being asked during data analysis?

  • queries can take several minutes or even longer, depending on the complexity of the query and the number of records queried

  • the relational data is stored as denormalized data in the form of cubes, this allows the data to be queried during any data analysis task that are performed later

Explanation

Question 59 of 200

1

ETL

Select one or more of the following:

  • either exists in textual or binary form

  • generally involves sifting through large amounts of raw, unstructured data to extract meaningful information that can serve as an input for identifying patterns, enriching existing enterprise data, or performing large-scale searches

  • is a process of loading data from a source system into a target system, the source system can be a database, a flat file or an application, similarly, the target system can be a database or some other information system

  • represents the main operation through which data warehouses are fed data

Explanation

Question 60 of 200

1

ETL

Select one or more of the following:

  • online transactions (point-of-scale, banking)

  • act as quick reference points for measuring the overall performance of the business

  • A big data solution encompasses this tool feature-set for converting data of different types

  • The required data is first obtained from the sources, after which the extracts are modified by applying rules

Explanation

Question 61 of 200

1

ETL

Select one of the following:

  • analytics results can lower operational costs and facilitate strategic decision-making

  • Collections or groups of related data (Ex. Tweets stored in a flat file, collection of image files, extract of rows stored in a table, historical weather observations that are stored as XML Files)

  • generally involves sifting through large amounts of raw, unstructured data to extract meaningful information that can serve as an input for identifying patterns, enriching existing enterprise data, or performing large-scale searches

  • The data is inserted into a target system

Explanation

Question 62 of 200

1

Data Warehouse

Select one or more of the following:

  • impose distinct data storage and processing demands, as well as management ans access processes

  • is based on a quantifiable indicator that is identified and agreed upon beforehand

  • is a central, enterprise-wide repository, consisting of historical and current data

  • are heavily used by BI to run various analytical queries

Explanation

Question 63 of 200

1

Data Warehouse

Select one or more of the following:

  • The required data is first obtained from the sources, after which the extracts are modified by applying rules

  • analytics can help identify the cause of a phenomenon to improve the accuracy of predictions

  • usually interface with an OLAP system to support analytical queries

  • It also periodically pulls data from other sources for consolidation into a dataset (such as from OLTP, ERP, CRM, and SCM systems).

Explanation

Question 64 of 200

1

Data Warehouse

Select one or more of the following:

  • This is either directly through online interaction on indirectly through the usage of connected devices, this has resulted in massive data streams

  • conforms to a data model or schema

  • Data pertaining to multiple business entities from different operational systems is periodically extracted, validated, transformed an consolidated into a single database

  • With periodic data imports from accross the enterprise, the amount of data contained will continue to increase. Query response times for data analysis task performed as part of BI can suffer as a result

Explanation

Question 65 of 200

1

Data Warehouse

Select one of the following:

  • can also be fed back into OLTPs

  • helps establish patterns and relationships amog the data being analyzed

  • the relational data is stored as denormalized data in the form of cubes, this allows the data to be queried during any data analysis task that are performed later

  • Usually contain optimized databases called analytical database to handle reporting and data analysis tasks

Explanation

Question 66 of 200

1

Analytical Database

Select one of the following:

  • This is either directly through online interaction on indirectly through the usage of connected devices, this has resulted in massive data streams

  • Brings challenges for enterprises in terms of data integration, transformation, processing and storage

  • does not conform to a data model or data schema

  • Can exist as a separate DBMS, as in the case of an OLAP database

Explanation

Question 67 of 200

1

Data Mart

Select one of the following:

  • act as quick reference points for measuring the overall performance of the business

  • online transactions (point-of-scale, banking)

  • can also be fed back into OLTPs

  • is a subset of the data stored in a data warehouse, that typically belongs to a department, division or specific line of business

Explanation

Question 68 of 200

1

Data Warehouse

Select one or more of the following:

  • does generally require special or customized logic when it comes to pre-processing and storage

  • is directly related to the veracity characteristic

  • can have multiple data marts

  • single version of "truth" is based on cleansed data, which is a prerequisite for accurate and error-free reports

Explanation

Question 69 of 200

1

Hadoop

Select one or more of the following:

  • further use databases that store historical data in multidimensional arrays and can answer complex queries based on multiple dimensions of the data

  • identification of new markets

  • is an open-source framework for large-scale data storage and data processing that is mor or less run on commodity hardware

  • has established itself as a de facto industry platform for contemporary Big Data Solutions

Explanation

Question 70 of 200

1

Hadoop

Select one of the following:

  • analytics can help strengthen the focus on delivering high quality services by driving down cost

  • have led to the creation of remote environments

  • are closesly liked with an enterprise's strategic objectives

  • can be used as an ETL engine, or as an analytics engine for processing large amounts of structured, semi-structured and unstructured data

Explanation

Question 71 of 200

1

Data Characteristics

Select one of the following:

  • does not conform to a data model or data schema

  • Are the data analysis results being accurately communicated to the appropriate decision-makers?

  • is the process of examining data to find facts, relationships, patterns, insights and/or trends. The eventual goal is to support decision-making

  • Volume, Velocity, Variety, Veracity & Value

Explanation

Question 72 of 200

1

Volume

Select one or more of the following:

  • scientific and research data (large Hadron Collider, Atacama Large Milimeter/Submilimeter Array Telescope)

  • is the process of gaining insights into the workings of an enterprise to improve decision-making by analyzing external data and data generated by its business processes

  • The anticipated volume of data that is processed by Big Data solutions is substantial and usually ever-growing

  • impose distinct data storage and processing demands, as well as management ans access processes

Explanation

Question 73 of 200

1

Volume

Select one or more of the following:

  • Leads to an opportunity to collect further "secondary" data, such as when individuals carry out searches or complete surveys

  • Digitization

  • online transactions (point-of-scale, banking)

  • Sensor Data (RFID, Smart meters, GPS sensors)

Explanation

Question 74 of 200

1

Volume

Select one of the following:

  • is crucial to big data processing storage and analysis

  • can be leased dramatically reduces the requiered up-front investment of big data projects

  • impose distinct data storage and processing demands, as well as management ans access processes

  • Social Media (Facebook, Tweeter)

Explanation

Question 75 of 200

1

Velocity

Select one or more of the following:

  • can be human-generated or machine generated, although it is ultimately the responsibility of machines to generate the processing results

  • analytics can help strengthen the focus on delivering high quality services by driving down cost

  • Arrives at such fast speeds that enormous datasets can accumulate within very shorts periods of time

  • translates into the amount of time it takes for the data to be processed once it enters the enterprise perimeter

Explanation

Question 76 of 200

1

Velocity

Select one or more of the following:

  • Examples can include EDI, e-mails, spreadcheets, RSS feeds, rss feeds and sensor data

  • is a measured for gauging sucess within a particular context

  • Coping with the fast inflow of data requires the enterprise to design highly elastic and avaliable processing solutions and corresponding data storage capabilities

  • may not always be high. For Example, MRI scan images are usually not generated as frequently as log entries form a high-traffic Web Server

Explanation

Question 77 of 200

1

Variety

Select one or more of the following:

  • data bearing value leading to meaningful information

  • big data solutions particularly rely on it when processing semi-structured and unstructured data

  • multiple formats and types of data that need to be supported by Big Data Solutions

  • Brings challenges for enterprises in terms of data integration, transformation, processing and storage

Explanation

Question 78 of 200

1

Veracity

Select one of the following:

  • Online Transaction Processing (OLTP)

  • Shares the same set of attributes as others in the same dataset

  • generally makes up 80% of the data within an enterprise, and has a faster growth rate than structured data

  • refers to the quality or fidelity of data

Explanation

Question 79 of 200

1

Noise

Select one of the following:

  • has a defined level of structure and consistency, but cannot be relational in nature

  • The anticipated volume of data that is processed by Big Data solutions is substantial and usually ever-growing

  • Coping with the fast inflow of data requires the enterprise to design highly elastic and avaliable processing solutions and corresponding data storage capabilities

  • data carrying no value

Explanation

Question 80 of 200

1

Signal

Select one of the following:

  • is a subset of the data stored in a data warehouse, that typically belongs to a department, division or specific line of business

  • provide feedback in near-realtime via open and public mediums

  • A Not-only SQL (NoSQL) database is a non-relational database that can be use to store it

  • data bearing value leading to meaningful information

Explanation

Question 81 of 200

1

controlled source

Select one of the following:

  • are heavily used by BI to run various analytical queries

  • Examples can include EDI, e-mails, spreadcheets, RSS feeds, rss feeds and sensor data

  • makes the adoption of big data solutions accessible to businesses without large capital investments

  • Data adquired such as via online customer registrations, usually contains less noise

Explanation

Question 82 of 200

1

uncontrolled source

Select one of the following:

  • business are also increasingly interested in incorporating publicly avaliable datasets from social media and other external data source

  • accurate predictions

  • Business have the opportunity to leverage the infraestructure, storage and processing capabilities provided by these environments in order to build large scale Big Data Solutions

  • Data adquired such as blog posting, usually contains more noise

Explanation

Question 83 of 200

1

Degree of noise

Select one of the following:

  • is a measured for gauging sucess within a particular context

  • act as quick reference points for measuring the overall performance of the business

  • analytics results can lower operational costs and facilitate strategic decision-making

  • Depends on the type of data present

Explanation

Question 84 of 200

1

Value

Select one or more of the following:

  • store historical data that is aggregated and denormalized to support fast reporting capability

  • is an open-source framework for large-scale data storage and data processing that is mor or less run on commodity hardware

  • defined as the usefulness of data for an enterprise

  • is directly related to the veracity characteristic

Explanation

Question 85 of 200

1

Value

Select one or more of the following:

  • is the process of gaining insights into the workings of an enterprise to improve decision-making by analyzing external data and data generated by its business processes

  • Brings challenges for enterprises in terms of data integration, transformation, processing and storage

  • is also dependent on how long data processing takes, time are inversely proportional to each other

  • The longer it takes for data to be turned into meaninful information, the less potential it may have for the business

Explanation

Question 86 of 200

1

Value Considerations

Select one or more of the following:

  • scientific discoveries

  • ticket reservation systems and banking and POS transactions

  • How well has the data been stored?

  • Has the data been stripped of any valuable attributes?

Explanation

Question 87 of 200

1

Value Considerations

Select one or more of the following:

  • have a less noise-to-signal ratio

  • attributes providing the file size and resolution of a digital photograph

  • Are the right types of question being asked during data analysis?

  • Are the data analysis results being accurately communicated to the appropriate decision-makers?

Explanation

Question 88 of 200

1

Data Types

Select one or more of the following:

  • improved decision-making

  • does not conform to a data model or data schema

  • structured data

  • unstructured data

Explanation

Question 89 of 200

1

Data Types

Select one of the following:

  • translates into the amount of time it takes for the data to be processed once it enters the enterprise perimeter

  • have led to the creation of remote environments

  • The anticipated volume of data that is processed by Big Data solutions is substantial and usually ever-growing

  • semi-structured data

Explanation

Question 90 of 200

1

structured data

Select one or more of the following:

  • Are the right types of question being asked during data analysis?

  • makes the adoption of big data solutions accessible to businesses without large capital investments

  • conforms to a data model or schema

  • is stored in a tabular form

Explanation

Question 91 of 200

1

structured data

Select one or more of the following:

  • is crucial to big data processing storage and analysis

  • is the process of examining data to find facts, relationships, patterns, insights and/or trends. The eventual goal is to support decision-making

  • can be relational

  • is typically stored in relational databases and frequently generated by custom enterprise applications, ERP systems amd CRM systems

Explanation

Question 92 of 200

1

structured data

Select one of the following:

  • can be important to businesses. Mining this data may allow for customized marketing, automated recomendations and the development of optimized product features

  • analytics can help strengthen the focus on delivering high quality services by driving down cost

  • The anticipated volume of data that is processed by Big Data solutions is substantial and usually ever-growing

  • does not generally have any special pre-processing or storage requirements. Examples include banking transactions, OLTP system records and customer records

Explanation

Question 93 of 200

1

unstructured data

Select one or more of the following:

  • qualitative analysis

  • enables data-driven decision-making with scientific backing, so that decisions can be based on a factual data and not on past experience or intuition alone

  • does not conform to a data model or data schema

  • is generally inconsistent and non-relational

Explanation

Question 94 of 200

1

unstructured data

Select one or more of the following:

  • simple insert, delete and update operations with sub-second response times

  • Shares the same set of attributes as others in the same dataset

  • either exists in textual or binary form

  • generally makes up 80% of the data within an enterprise, and has a faster growth rate than structured data

Explanation

Question 95 of 200

1

unstructured data

Select one or more of the following:

  • is mostly machine-generated and automatically appended to the data

  • Shares the same set of attributes as others in the same dataset

  • does generally require special or customized logic when it comes to pre-processing and storage

  • cannot be inherently processed or queried using SQL or traditional programming features and is usually an awkward fit with relational databases

Explanation

Question 96 of 200

1

unstructured data

Select one of the following:

  • has a defined level of structure and consistency, but cannot be relational in nature

  • are relevant to big data in that they can serve as both a datas source as well as an data sink that is capable of receiving data

  • A big data solution encompasses this tool feature-set for converting data of different types

  • A Not-only SQL (NoSQL) database is a non-relational database that can be use to store it

Explanation

Question 97 of 200

1

semi-structured data

Select one or more of the following:

  • Are the right types of question being asked during data analysis?

  • How well has the data been stored?

  • has a defined level of structure and consistency, but cannot be relational in nature

  • mostly exist in textual form such as XML or JSON files.

Explanation

Question 98 of 200

1

semi-structured data

Select one or more of the following:

  • defined as the usefulness of data for an enterprise

  • may not always be high. For Example, MRI scan images are usually not generated as frequently as log entries form a high-traffic Web Server

  • Examples can include EDI, e-mails, spreadcheets, RSS feeds, rss feeds and sensor data

  • does often have special pre-processing and storage requierements, especially if the underline format is not text-based

Explanation

Question 99 of 200

1

metadata

Select one or more of the following:

  • is the process of gaining insights into the workings of an enterprise to improve decision-making by analyzing external data and data generated by its business processes

  • require automated data cleansing and data verification when carrying out ETL processes

  • provide information about dataset's characteristics and structure

  • is mostly machine-generated and automatically appended to the data

Explanation

Question 100 of 200

1

metadata

Select one or more of the following:

  • refers to the quality or fidelity of data

  • Has the data been stripped of any valuable attributes?

  • XML tags providing the author and creation date of a document

  • attributes providing the file size and resolution of a digital photograph

Explanation

Question 101 of 200

1

metadata

Select one of the following:

  • The data is inserted into a target system

  • semi-structured data

  • single version of "truth" is based on cleansed data, which is a prerequisite for accurate and error-free reports

  • big data solutions particularly rely on it when processing semi-structured and unstructured data

Explanation

Question 102 of 200

1

structured data

Select one of the following:

  • data carrying no value

  • can also be fed back into OLTPs

  • quantitative analysis

  • have a less noise-to-signal ratio

Explanation

Question 103 of 200

1

semi-structured data and unstructured data

Select one of the following:

  • identification of new markets

  • Are the data analysis results being accurately communicated to the appropriate decision-makers?

  • improved decision-making

  • have a greater noise-to-signal ratio

Explanation

Question 104 of 200

1

Noise

Select one of the following:

  • A big data solution encompasses this tool feature-set for converting data of different types

  • structured data

  • Brings challenges for enterprises in terms of data integration, transformation, processing and storage

  • require automated data cleansing and data verification when carrying out ETL processes

Explanation

Question 105 of 200

1

Types of data analysis

Select one or more of the following:

  • is a measured for gauging sucess within a particular context

  • Shares the same set of attributes as others in the same dataset

  • quantitative analysis

  • qualitative analysis

Explanation

Question 106 of 200

1

Types of data analysis

Select one of the following:

  • does not generally have any special pre-processing or storage requirements. Examples include banking transactions, OLTP system records and customer records

  • online transactions (point-of-scale, banking)

  • is a central, enterprise-wide repository, consisting of historical and current data

  • data mining

Explanation

Question 107 of 200

1

quantitative analysis

Select one of the following:

  • The longer it takes for data to be turned into meaninful information, the less potential it may have for the business

  • queries can take several minutes or even longer, depending on the complexity of the query and the number of records queried

  • translates into the amount of time it takes for the data to be processed once it enters the enterprise perimeter

  • is a data analysis technique that focuses on quantifying the patterns and correlations found in the data

Explanation

Question 108 of 200

1

quantitative analysis

Select one or more of the following:

  • cannot be inherently processed or queried using SQL or traditional programming features and is usually an awkward fit with relational databases

  • refers to the quality or fidelity of data

  • this technique involves analyzing a large number of observations from a dataset

  • since the sample size is large, the results can be applied in a generalized manner to the entire dataset

Explanation

Question 109 of 200

1

quantitative analysis

Select one of the following:

  • defined as the usefulness of data for an enterprise

  • provide more value than any other type of analytics and correspondingly require the most advance skillset, as well as specialized software and tools

  • single version of "truth" is based on cleansed data, which is a prerequisite for accurate and error-free reports

  • are absolute in nature and can therefore be used for numerical comparisons

Explanation

Question 110 of 200

1

qualitative analysis

Select one or more of the following:

  • Data pertaining to multiple business entities from different operational systems is periodically extracted, validated, transformed an consolidated into a single database

  • can also be fed back into OLTPs

  • is a data analysis technique that focuses on describing various data qualities using words

  • involves analyzing a smaller sample in greater depth compared to quantitative data analysis

Explanation

Question 111 of 200

1

qualitative analysis

Select one or more of the following:

  • accurate predictions

  • the information is generated at periodic intervals in realtime or near realtime

  • theses analysis results cannot be generalized to an entire dataset due to the small sample size

  • they also cannot be measured numerically or used for numerical comparisons

Explanation

Question 112 of 200

1

data mining

Select one or more of the following:

  • policies for data privacy and data anonymization

  • aim to determine the cause of a phenomenon that occuried in the past, using questions that focus on the reason behind the event

  • also known as data discovery, is a specialized form of data analysis that targets large datasets

  • refers to automated, sofware-based techniques that sift through massive datasets to identify patterns and trends

Explanation

Question 113 of 200

1

data mining

Select one or more of the following:

  • is typically stored in relational databases and frequently generated by custom enterprise applications, ERP systems amd CRM systems

  • actionable intelligence

  • involves extracting hidden or unknown patterns in the data with the intention of identifying previously unknown patterns

  • forms the basis for predictive analytics and business intelligence (BI)

Explanation

Question 114 of 200

1

Analysis & Analitycs

Select one of the following:

  • based on the input data, the algorithm develops an understanding of which data belongs to which category

  • data carrying no value

  • act as quick reference points for measuring the overall performance of the business

  • These techniques may not provide accurate findings in a timely manner because of the data's volume, velocity and/or variety

Explanation

Question 115 of 200

1

Analytics tools

Select one of the following:

  • enables multiple outcomes to be visualized by enabling related factors to be dynamically changed

  • are often carried out via ad-hoc reporting or dashboards

  • some realtime data analysis solutions that do exist are proprietary

  • can automate data analyses through the use of highly scalable computational technologies that apply automated statistical quantitative analysis, data mining an machine learning techniques

Explanation

Question 116 of 200

1

Types of Analytics

Select one or more of the following:

  • the adoption of a big data environment may necessitate that some or all of that environment be hosted witin a cloud

  • Are the right types of question being asked during data analysis?

  • descriptive analytics

  • diagnostic analytics

Explanation

Question 117 of 200

1

Types of Analytics

Select one or more of the following:

  • involves analyzing a smaller sample in greater depth compared to quantitative data analysis

  • also known as data discovery, is a specialized form of data analysis that targets large datasets

  • predictive analytics

  • prescriptive analytics

Explanation

Question 118 of 200

1

Types of Analytics

Select one of the following:

  • does not generally have any special pre-processing or storage requirements. Examples include banking transactions, OLTP system records and customer records

  • policies for data cleansing and filtering

  • can be important to businesses. Mining this data may allow for customized marketing, automated recomendations and the development of optimized product features

  • Value and complexity increase as we move from descriptive to prescriptive analytics

Explanation

Question 119 of 200

1

descriptive analytics

Select one or more of the following:

  • is generally inconsistent and non-relational

  • This involves identifying patterns in the training data and classifying new or unseen data based on known patterns

  • is carried out to answer questions about events that have already occurred

  • Arround 80% of analytics are ________ in nature

Explanation

Question 120 of 200

1

descriptive analytics

Select one or more of the following:

  • refers to the information about the source of the data that helps determine its authenticity and quality. It also used for auditing purposes

  • This is either directly through online interaction on indirectly through the usage of connected devices, this has resulted in massive data streams

  • provides the least value and requires a relatively basic skillset

  • are often carried out via ad-hoc reporting or dashboards

Explanation

Question 121 of 200

1

descriptive analytics

Select one or more of the following:

  • is directly related to the veracity characteristic

  • Business have the opportunity to leverage the infraestructure, storage and processing capabilities provided by these environments in order to build large scale Big Data Solutions

  • The reports are generally static in nature and display historical data that is presented in the form of data grids or charts

  • Queries are executed on the OLTP systems or data obtained from various other information systems, such as CRMs and ERPs

Explanation

Question 122 of 200

1

diagnostic analytics

Select one or more of the following:

  • aim to determine the cause of a phenomenon that occuried in the past, using questions that focus on the reason behind the event

  • are considered to provide more value than descriptive analysis, requiring a more advanced skillset

  • data bearing value leading to meaningful information

  • The data is inserted into a target system

Explanation

Question 123 of 200

1

diagnostic analytics

Select one or more of the following:

  • single version of "truth" is based on cleansed data, which is a prerequisite for accurate and error-free reports

  • a substancial budget may still be required to obtain external data

  • usually require collecting data from multiple sources and storing it in a structure that lends itself to performing drill-downs and roll-ups

  • analytics results are viewed via interactive visualization tools that enable users to identify trends and patterns

Explanation

Question 124 of 200

1

diagnostic analytics

Select one of the following:

  • can join structured and unstructured data that is kept in memory for fast data access

  • impose distinct data storage and processing demands, as well as management ans access processes

  • will be required to control how data flows in and out of big data solutions and how feedback loops can be established to enable the processed data to undergo repeated refinements

  • the executed queries are more complex compared to descriptive analytics, and are performed on multi-dimensional data held in OLAP systems

Explanation

Question 125 of 200

1

predictive analytics

Select one or more of the following:

  • the adoption of a big data environment may necessitate that some or all of that environment be hosted witin a cloud

  • is also dependent on how long data processing takes, time are inversely proportional to each other

  • are carried out to attempt to determine the outcome of an event that might occur in the future

  • try to predict the event outcome and predictions are made based on patterns, trends and exceptions found in historical and current data

Explanation

Question 126 of 200

1

predictive analytics

Select one or more of the following:

  • as big data initiatives are inherently business-driven, there needs to be a clear business case for adopting a big data solution to ensure that it is justified and that expectations are met

  • Graphically representing data can make it easier to understand reports, view trends and identify patterns

  • This can lead to the identification of risk and opportunities

  • involve the use of large datasets (comprised of both internal and external data), statistical techniques, quantitative analysis, machine learning and data mining techniques

Explanation

Question 127 of 200

1

predictive analytics

Select one or more of the following:

  • may employ machine learning algorithms, such as unsupervised learning to extract previously unknown attributes

  • is considered to provide more value and required more advance skillset than both descriptive and diagnostic analytics

  • tool generally abstract underlying statistical intricacies by providing user-friendly front-end interfaces

  • enables a detailed view of the data of interest by focusing in on a data subset from the summarized view

Explanation

Question 128 of 200

1

prescriptive analytics

Select one or more of the following:

  • is the process of teaching computers to learn from existing data and apply the adquired knowledge to formulate predictions about unknown data

  • incorporate predictive and prescriptive data analytics and data transformation features

  • build upon the results of predictive analytics by prescribing actions that should be taken. The focus is on which prescribed options to follow, and why and when it should be followed, to gain an advantage or mitigate a risk

  • provide more value than any other type of analytics and correspondingly require the most advance skillset, as well as specialized software and tools

Explanation

Question 129 of 200

1

prescriptive analytics

Select one or more of the following:

  • rely on BI and data warehouses as core components of big data environments and ecosystems

  • risk associated with collecting accurate and relevant data, and with integrating the big data environment itself, need to be identified and quantified

  • various outcomes are calculated, and the best course of action for each outcome is suggested

  • The approach shifts form explanatory to advisory and can include the simulation of various scenarios

Explanation

Question 130 of 200

1

prescriptive analytics

Select one or more of the following:

  • helps establish patterns and relationships amog the data being analyzed

  • unstructured data

  • incorporate internal data (current and historical sales data, customer information, product data, business rules) and external data (social media data, weather data, demographic data)

  • involve the use of business rules and large amounts of internal and/or external data to simulate outcomes and prescribe the best course of action

Explanation

Question 131 of 200

1

machine learning

Select one or more of the following:

  • coupling a traditional data warehouse with these new technologies results in a hybrid data warehouse

  • various outcomes are calculated, and the best course of action for each outcome is suggested

  • is the process of teaching computers to learn from existing data and apply the adquired knowledge to formulate predictions about unknown data

  • This involves identifying patterns in the training data and classifying new or unseen data based on known patterns

Explanation

Question 132 of 200

1

machine learning types

Select one or more of the following:

  • even analyzing separate datasets that contain seemingly benign can reveal private information when the datasets are analyzed jointly

  • scientific discoveries

  • supervised learning

  • unsupervised learning

Explanation

Question 133 of 200

1

supervised learning

Select one or more of the following:

  • distinct requierements, such as the combining of multiple unrelated datasets, processing of large ammounts of unstructured data and harvesting of hidden information, in a time-sensitive manner

  • theses analysis results cannot be generalized to an entire dataset due to the small sample size

  • algorithm is first fed sample data where the data categories are already known

  • based on the input data, the algorithm develops an understanding of which data belongs to which category

Explanation

Question 134 of 200

1

supervised learning

Select one of the following:

  • refers to the quality or fidelity of data

  • usually require collecting data from multiple sources and storing it in a structure that lends itself to performing drill-downs and roll-ups

  • the information is generated at periodic intervals in realtime or near realtime

  • having developed an understanding, the algorithm can then apply the learned behavior to categorize unknown data

Explanation

Question 135 of 200

1

unsupervised learning

Select one or more of the following:

  • identification of new markets

  • try to predict the event outcome and predictions are made based on patterns, trends and exceptions found in historical and current data

  • data categories are unknown and no sample data is fed

  • Instead, the algorithm attemps to categorize data by grouping data with similar attributes together

Explanation

Question 136 of 200

1

data mining

Select one or more of the following:

  • is directly related to the veracity characteristic

  • Online Transaction Processing (OLTP)

  • unearths hidden patterns and relationships based on previously unknown attributes of data

  • may employ machine learning algorithms, such as unsupervised learning to extract previously unknown attributes

Explanation

Question 137 of 200

1

machine learning

Select one or more of the following:

  • This can lead to the identification of risk and opportunities

  • is not "intelligent" as such because it only provides answers to correctly formulated questions

  • makes predictions by categorizing data based on known patterns

  • can use the output from data mining (identified patterns) for further data classification through supervised learning

Explanation

Question 138 of 200

1

data mining

Select one or more of the following:

  • provide a holistic view of key business areas

  • Due to the volumes of data that some big data solutions are required to process, performance can sometimes become a concern

  • may employ machine learning algorithms, such as unsupervised learning to extract previously unknown attributes

  • this is accomplished by categorizing data which leads to the identification of patterns

Explanation

Question 139 of 200

1

Big Data Solutions

Select one or more of the following:

  • is stored in a tabular form

  • aim to determine the cause of a phenomenon that occuried in the past, using questions that focus on the reason behind the event

  • rely on BI and data warehouses as core components of big data environments and ecosystems

  • has advance BI and data warehouses technologies and practices to a point where a new generation of these platforms has emerged

Explanation

Question 140 of 200

1

Traditional BI

Select one or more of the following:

  • queries and statistical formulae can then be applied as part of various data analysis tasks for viewing data in a user-friendly format, such as on a dashboard

  • more detailed records

  • utilizes descriptive and diagnostic analysis to provide information on historical and current events

  • is not "intelligent" as such because it only provides answers to correctly formulated questions

Explanation

Question 141 of 200

1

Traditional BI

Select one of the following:

  • can also be fed back into OLTPs

  • is mostly machine-generated and automatically appended to the data

  • they also cannot be measured numerically or used for numerical comparisons

  • correctly formulating questions requires an understanding of business problems and issues, and of the data itself

Explanation

Question 142 of 200

1

BI reports on KPI

Select one or more of the following:

  • Sensor Data (RFID, Smart meters, GPS sensors)

  • tool generally abstract underlying statistical intricacies by providing user-friendly front-end interfaces

  • ad-hoc reports

  • dashboards

Explanation

Question 143 of 200

1

ad-hoc reporting

Select one or more of the following:

  • are commonly used for meaningful and complex reporting and assessment task and can also be fed back into applications to enhance their behavior (such as when product recommendations are displayed online)

  • Online Analytical Processing (OLAP)

  • is a process that involves manually processing data to produce custom-made reports

  • the focus is usually on a specific area of the business, such as its marketing or supply chain management.

Explanation

Question 144 of 200

1

ad-hoc reporting

Select one of the following:

  • Data adquired such as via online customer registrations, usually contains less noise

  • policies for data privacy and data anonymization

  • makes the adoption of big data solutions accessible to businesses without large capital investments

  • the generated custom reports are detailed and often tabular in nature

Explanation

Question 145 of 200

1

OLAP and OLTP data sources

Select one of the following:

  • Instead, the algorithm attemps to categorize data by grouping data with similar attributes together

  • each iteration can then help fine-tune processing steps, algorithms and data models to improve the accuracy of the result and deliver greater value to the business

  • Big data solutions require tools that can seamlessly connect to structured, semi-structured and unstructured data sources and are further capable of handling millions of data records

  • can be used by BI tools for both ad-hoc reporting and dashboards

Explanation

Question 146 of 200

1

dashboards

Select one or more of the following:

  • analytics results are viewed via interactive visualization tools that enable users to identify trends and patterns

  • in-house hardware resources are inadequate

  • provide a holistic view of key business areas

  • the information is generated at periodic intervals in realtime or near realtime

Explanation

Question 147 of 200

1

dashboards

Select one of the following:

  • are not turn-key solutions

  • does often have special pre-processing and storage requierements, especially if the underline format is not text-based

  • performing analytics on datasets can reveal confidential information about organizations or individuals

  • the presentation of data is graphical in nature, such as column charts, pie charts and gauges

Explanation

Question 148 of 200

1

OLAP and OLTP

Select one of the following:

  • The longer it takes for data to be turned into meaninful information, the less potential it may have for the business

  • datasets that need to be processed reside in a cloud

  • provide feedback in near-realtime via open and public mediums

  • BI tools use to display the information on dashboards

Explanation

Question 149 of 200

1

data warehouse and data marts

Select one of the following:

  • is carried out to answer questions about events that have already occurred

  • either exists in textual or binary form

  • can have multiple data marts

  • contain consolidated and validated information about enterprise-wide business entities

Explanation

Question 150 of 200

1

Traditional BI

Select one or more of the following:

  • policies that regulate the kind of external data that can be adquired

  • does often have special pre-processing and storage requierements, especially if the underline format is not text-based

  • cannot function effectively without data marts because they contain the optimized and segregated data requires for reporting purposes

  • without data marts, data needs to be extracted from the data warehouse via an ETL process on an ad-hoc basis whenever a query needs to be run

Explanation

Question 151 of 200

1

Traditional BI

Select one of the following:

  • can be used as an ETL engine, or as an analytics engine for processing large amounts of structured, semi-structured and unstructured data

  • accumulates from being amassed within the enterprise (via applications) or from external sources that are then stored by the big datat solution

  • Near-realtime data processing can be archieved by processing transactional data as it arrives and combining it with already summarized batch-processed data

  • uses datawarehouses and data marts for reporting and data analysis, because they allow complex data analysis queries with multiple joins and aggregations to be issued

Explanation

Question 152 of 200

1

Big Data BI

Select one or more of the following:

  • each feedback cycle may reveal the need for existing steps to be modified, or new steps, such as pre-processing for data cleasing, to be added

  • policies for data archiving data sources and analysis results

  • builds upon BI by acting on the cleansed, consolidated enterprise-wide data in the data warehouse and combining it with semi-structured and unstructured data sources

  • comprises both predictive and prescriptive analysis to facilitate the development of an enterprise-wide understanding of the way a business works

Explanation

Question 153 of 200

1

Big Data BI

Select one of the following:

  • The broadening coverage of the internet and the proliferation of cellular and Wi-Fi networks has enabled more people to be continuously active in virtual communities

  • they also cannot be measured numerically or used for numerical comparisons

  • sound processes and sufficient skillsets for those who will be responsible for implementing, customizing, populating and using big data solutions are also necessary

  • analyses focus on multiple business processes simultaneously

Explanation

Question 154 of 200

1

Traditional BI

Select one of the following:

  • analyses generally focus on individual business processes

  • Depends on the type of data present

  • as big data initiatives are inherently business-driven, there needs to be a clear business case for adopting a big data solution to ensure that it is justified and that expectations are met

  • refers to the information about the source of the data that helps determine its authenticity and quality. It also used for auditing purposes

Explanation

Question 155 of 200

1

Big Data BI

Select one or more of the following:

  • it is important to accept that big data solutions are not necessary for all business

  • business are also increasingly interested in incorporating publicly avaliable datasets from social media and other external data source

  • This helps reveal patterns and anomalies across a broader scope within the enterprise

  • It also leads to data discovery by identifying insights and information that may have been previously absent or unknown

Explanation

Question 156 of 200

1

Big Data BI

Select one or more of the following:

  • distinct requierements, such as the combining of multiple unrelated datasets, processing of large ammounts of unstructured data and harvesting of hidden information, in a time-sensitive manner

  • generally involves sifting through large amounts of raw, unstructured data to extract meaningful information that can serve as an input for identifying patterns, enriching existing enterprise data, or performing large-scale searches

  • requires the analysis of unstructured, semi-structured and structured data residing in the enterprise data warehouse

  • requires a "next-generation" data warehouse that use new features and technologies to store cleansed data originating from a variety of sources in a single uniform data format

Explanation

Question 157 of 200

1

Big Data BI

Select one or more of the following:

  • has advance BI and data warehouses technologies and practices to a point where a new generation of these platforms has emerged

  • Volume, Velocity, Variety, Veracity & Value

  • coupling a traditional data warehouse with these new technologies results in a hybrid data warehouse

  • this type of data warehouse acts as a uniform and central repository of structured, semi-structured and unstructured data that can provide tools with all of the data they require

Explanation

Question 158 of 200

1

Big Data BI

Select one or more of the following:

  • Arround 80% of analytics are ________ in nature

  • is directly related to the veracity characteristic

  • this eliminates the need for tools to have to connect to multiple data sources to retrieve or access data

  • A next-generation data warehouse establishes a standarized data access layer accross a range of data sources

Explanation

Question 159 of 200

1

Data Visualization

Select one or more of the following:

  • conforms to a data model or schema

  • is based on a quantifiable indicator that is identified and agreed upon beforehand

  • is a technique whereby analytical results are graphically communicated using elements like charts, maps, data grids, infographics and alerts

  • Graphically representing data can make it easier to understand reports, view trends and identify patterns

Explanation

Question 160 of 200

1

Traditional Data Visualization

Select one or more of the following:

  • contain consolidated and validated information about enterprise-wide business entities

  • the nature of the business may make external data very valuable. The greater the volume and variety of data, the higher the chances of finding hidden insights from patterns

  • provided mostly static charts and graphs in reports and dashboards

  • query data from relational databases, OLAP systems, data warehouses and spreadsheets to present both descriptive and diagnostic analytics results

Explanation

Question 161 of 200

1

contemporary data visualization

Select one of the following:

  • unearths hidden patterns and relationships based on previously unknown attributes of data

  • can be human-generated or machine generated, although it is ultimately the responsibility of machines to generate the processing results

  • can be used by enterprise applications directly, or fed into a data warehouse to enrich existing data.This data is typically analyzed and subjected to analytics

  • are interactive and can provide both summarized and detailed views of data

Explanation

Question 162 of 200

1

Data Visualization

Select one or more of the following:

  • analyses focus on multiple business processes simultaneously

  • semi-structured data

  • they are designed to help people who lack statistical and/or mathematical skills to better understand analytical results, without having to resort to spreadsheets

  • Big data solutions require tools that can seamlessly connect to structured, semi-structured and unstructured data sources and are further capable of handling millions of data records

Explanation

Question 163 of 200

1

Data Visualization

Select one or more of the following:

  • has advance BI and data warehouses technologies and practices to a point where a new generation of these platforms has emerged

  • policies for data archiving data sources and analysis results

  • generally use in-memory analytical technologies that reduce the latency normally attributed to traditional, disk-based tools

  • Big data solutions require tools that can seamlessly connect to structured, semi-structured and unstructured data sources and are further capable of handling millions of data records

Explanation

Question 164 of 200

1

Data Visualization Features

Select one or more of the following:

  • does not generally have any special pre-processing or storage requirements. Examples include banking transactions, OLTP system records and customer records

  • each technology is uniquely relevant to modern-day Big Data Solutions and ecosystems

  • Aggregation

  • Drill-Down

Explanation

Question 165 of 200

1

Data Visualization Features

Select one or more of the following:

  • also known as data discovery, is a specialized form of data analysis that targets large datasets

  • this type of data warehouse acts as a uniform and central repository of structured, semi-structured and unstructured data that can provide tools with all of the data they require

  • Filtering

  • Roll-Up

Explanation

Question 166 of 200

1

Data Visualization Features

Select one of the following:

  • are closesly liked with an enterprise's strategic objectives

  • Filtering

  • used to achieve regulatory compliance

  • What-if Analysis

Explanation

Question 167 of 200

1

Aggregation

Select one of the following:

  • in-house hardware resources are inadequate

  • distinct requierements, such as the combining of multiple unrelated datasets, processing of large ammounts of unstructured data and harvesting of hidden information, in a time-sensitive manner

  • involves extracting hidden or unknown patterns in the data with the intention of identifying previously unknown patterns

  • provides a holistic and sumerized view of data across multiple contexts

Explanation

Question 168 of 200

1

Drill-Down

Select one of the following:

  • Big data solutions access data and generate data, all of which become assets of the business

  • forms the basis for predictive analytics and business intelligence (BI)

  • since the sample size is large, the results can be applied in a generalized manner to the entire dataset

  • enables a detailed view of the data of interest by focusing in on a data subset from the summarized view

Explanation

Question 169 of 200

1

Filtering

Select one of the following:

  • Value and complexity increase as we move from descriptive to prescriptive analytics

  • provides a holistic and sumerized view of data across multiple contexts

  • is a data analysis technique that focuses on describing various data qualities using words

  • helps focus on a particular set of data by filtering away the data that is not of immediate interest

Explanation

Question 170 of 200

1

Roll-Up

Select one of the following:

  • qualitative analysis

  • structured data

  • queries can take several minutes or even longer, depending on the complexity of the query and the number of records queried

  • groups data across multiple categories to show subtotals and totals

Explanation

Question 171 of 200

1

What-if Analysis

Select one of the following:

  • adressing concerns can require the annotation of data with source information and other metadata, when it is generated or as it arrives

  • scientific discoveries

  • also, the quality of the data targeted for processing by big data solutions needs to be assessed

  • enables multiple outcomes to be visualized by enabling related factors to be dynamically changed

Explanation

Question 172 of 200

1

advance visualization tools

Select one or more of the following:

  • is stored in a tabular form

  • These techniques may not provide accurate findings in a timely manner because of the data's volume, velocity and/or variety

  • incorporate predictive and prescriptive data analytics and data transformation features

  • these tools eliminate the need for data pre-processing methods (such as ETL) and provide the ability to directly connect to structured, semi-structured and unstructured data sources

Explanation

Question 173 of 200

1

advance visualization tools

Select one or more of the following:

  • based on the input data, the algorithm develops an understanding of which data belongs to which category

  • can join structured and unstructured data that is kept in memory for fast data access

  • queries and statistical formulae can then be applied as part of various data analysis tasks for viewing data in a user-friendly format, such as on a dashboard

  • correctly formulating questions requires an understanding of business problems and issues, and of the data itself

Explanation

Question 174 of 200

1

business justification

Select one or more of the following:

  • this eliminates the need for tools to have to connect to multiple data sources to retrieve or access data

  • It also leads to data discovery by identifying insights and information that may have been previously absent or unknown

  • as big data initiatives are inherently business-driven, there needs to be a clear business case for adopting a big data solution to ensure that it is justified and that expectations are met

  • clear goals regarding the measurable business value of an enterprise's big data solution need to be set

Explanation

Question 175 of 200

1

business justification

Select one or more of the following:

  • algorithm is first fed sample data where the data categories are already known

  • Leads to an opportunity to collect further "secondary" data, such as when individuals carry out searches or complete surveys

  • anticipated benefits need to be weighed against risk and investments

  • risk associated with collecting accurate and relevant data, and with integrating the big data environment itself, need to be identified and quantified

Explanation

Question 176 of 200

1

business justification

Select one of the following:

  • refers to the quality or fidelity of data

  • a substancial budget may still be required to obtain external data

  • distinct requierements, such as the combining of multiple unrelated datasets, processing of large ammounts of unstructured data and harvesting of hidden information, in a time-sensitive manner

  • it is important to accept that big data solutions are not necessary for all business

Explanation

Question 177 of 200

1

big data frameworks

Select one of the following:

  • based on the input data, the algorithm develops an understanding of which data belongs to which category

  • provides a holistic and sumerized view of data across multiple contexts

  • are interactive and can provide both summarized and detailed views of data

  • are not turn-key solutions

Explanation

Question 178 of 200

1

organizational prerequisites

Select one or more of the following:

  • prescriptive analytics

  • enables multiple outcomes to be visualized by enabling related factors to be dynamically changed

  • in order for data analysis and analytics to be successful and offer value, enterprise need to have data management and big data governance frameworks

  • sound processes and sufficient skillsets for those who will be responsible for implementing, customizing, populating and using big data solutions are also necessary

Explanation

Question 179 of 200

1

organizational prerequisites

Select one or more of the following:

  • is mostly machine-generated and automatically appended to the data

  • Big data solutions require tools that can seamlessly connect to structured, semi-structured and unstructured data sources and are further capable of handling millions of data records

  • also, the quality of the data targeted for processing by big data solutions needs to be assessed

  • outdated, invalid or poorly identified data will result in low-quality input which, regardless of how good the big data solution is, will continue to produce low-quality output

Explanation

Question 180 of 200

1

organizational prerequisites

Select one or more of the following:

  • refers to automated, sofware-based techniques that sift through massive datasets to identify patterns and trends

  • makes predictions by categorizing data based on known patterns

  • the longevity of the big data environment also needs to be planned for

  • a roadmap needs to be defined to ensure that any necessary expansion or augmentation of the environment is planned out to stay in sinc with the requirements of the enterprise

Explanation

Question 181 of 200

1

data procurement

Select one or more of the following:

  • can be important to businesses. Mining this data may allow for customized marketing, automated recomendations and the development of optimized product features

  • Hadoop

  • the adquisition of big data solutions themselves can be economical, due to open-source platform availability and opportunities to leverage commodity hardware

  • a substancial budget may still be required to obtain external data

Explanation

Question 182 of 200

1

data procurement

Select one or more of the following:

  • build upon the results of predictive analytics by prescribing actions that should be taken. The focus is on which prescribed options to follow, and why and when it should be followed, to gain an advantage or mitigate a risk

  • they are designed to help people who lack statistical and/or mathematical skills to better understand analytical results, without having to resort to spreadsheets

  • the nature of the business may make external data very valuable. The greater the volume and variety of data, the higher the chances of finding hidden insights from patterns

  • external data sources include data markets and the government. Government-provided data, like geo-spatial data may be free

Explanation

Question 183 of 200

1

data procurement

Select one of the following:

  • predictive analytics

  • can process massive quantities of data that arrive at varying speeds, may be of many different varieties and have numerous incompatibilities

  • Value and complexity increase as we move from descriptive to prescriptive analytics

  • most commercially relevant data will need to be purchased. Such an investment may be on-going in order to obtain updated versions of the datasets

Explanation

Question 184 of 200

1

privacy

Select one or more of the following:

  • store operational data that is fully normalized

  • Coping with the fast inflow of data requires the enterprise to design highly elastic and avaliable processing solutions and corresponding data storage capabilities

  • performing analytics on datasets can reveal confidential information about organizations or individuals

  • even analyzing separate datasets that contain seemingly benign can reveal private information when the datasets are analyzed jointly

Explanation

Question 185 of 200

1

privacy

Select one or more of the following:

  • predictive analytics

  • descriptive analytics

  • this can lead to intentional or inadvertent breaches of privacy

  • adressing these privacy concerns requires an undestanding of the nature of data being accumulated and relevant data privacy regulations, as well as special techniques for data tagging and anonymization

Explanation

Question 186 of 200

1

privacy

Select one or more of the following:

  • big data security further involves establishing data access levels for different categories of users

  • The maturity of these fields of practice inspired and enabled much of the core functionality expected from contemporary Big Data solutions and tools

  • some of the components of big data solutions lack the robustness of traditional enterprise solution environments when it comes to access control and data security

  • securing big data involves ensuring that data networks provide access to repositories that are sufficiently secured, via custom authentication and autorization mechanisms

Explanation

Question 187 of 200

1

provenance

Select one or more of the following:

  • provide a holistic view of key business areas

  • without data marts, data needs to be extracted from the data warehouse via an ETL process on an ad-hoc basis whenever a query needs to be run

  • refers to the information about the source of the data that helps determine its authenticity and quality. It also used for auditing purposes

  • maintaining as large volumes of data are adquired, combined and put through multiple processing stages can be a complex task

Explanation

Question 188 of 200

1

provenance

Select one or more of the following:

  • provide a holistic view of key business areas

  • store historical data that is aggregated and denormalized to support fast reporting capability

  • adressing concerns can require the annotation of data with source information and other metadata, when it is generated or as it arrives

  • data may also need to be annotated with the source dataset attributes and processing steps details as it passes through the data transformation steps

Explanation

Question 189 of 200

1

Limited Realtime Support

Select one or more of the following:

  • is stored in a tabular form

  • performing analytics on datasets can reveal confidential information about organizations or individuals

  • Dashboards and other applications that require streaming data and alerts often demand realtime or near-realtime data transmissions

  • Many contemporary open-source big data solutions and tools are batch-oriented meaning support for streaming data analysis may either be limited or non-existent

Explanation

Question 190 of 200

1

Limited Realtime Support

Select one or more of the following:

  • algorithm is first fed sample data where the data categories are already known

  • they are designed to help people who lack statistical and/or mathematical skills to better understand analytical results, without having to resort to spreadsheets

  • some realtime data analysis solutions that do exist are proprietary

  • Near-realtime data processing can be archieved by processing transactional data as it arrives and combining it with already summarized batch-processed data

Explanation

Question 191 of 200

1

Distinct performance challenges

Select one of the following:

  • refers to automated, sofware-based techniques that sift through massive datasets to identify patterns and trends

  • anticipated benefits need to be weighed against risk and investments

  • queries can take several minutes or even longer, depending on the complexity of the query and the number of records queried

  • Due to the volumes of data that some big data solutions are required to process, performance can sometimes become a concern

Explanation

Question 192 of 200

1

Distinct governance requirements

Select one of the following:

  • having developed an understanding, the algorithm can then apply the learned behavior to categorize unknown data

  • are considered to provide more value than descriptive analysis, requiring a more advanced skillset

  • the relational data is stored as denormalized data in the form of cubes, this allows the data to be queried during any data analysis task that are performed later

  • Big data solutions access data and generate data, all of which become assets of the business

Explanation

Question 193 of 200

1

governance framework

Select one of the following:

  • can use the output from data mining (identified patterns) for further data classification through supervised learning

  • business are storing increasing amounts of data on customer interaction and from social media avenues in an attempt to harvest this data to increase sales, enable targeted marketing and create new products and service

  • analyses focus on multiple business processes simultaneously

  • is required to ensure that the data and the solution environment itself are regulated, standarized and evolved in a controlled manner

Explanation

Question 194 of 200

1

what a big data governance framework would encompass

Select one or more of the following:

  • does not conform to a data model or data schema

  • big data solutions particularly rely on it when processing semi-structured and unstructured data

  • standardizing how data is tagged and the metadata used for tagging

  • policies that regulate the kind of external data that can be adquired

Explanation

Question 195 of 200

1

what a big data governance framework would encompass

Select one or more of the following:

  • policies for data cleansing and filtering

  • can also be fed back into OLTPs

  • policies for data privacy and data anonymization

  • policies for data archiving data sources and analysis results

Explanation

Question 196 of 200

1

Distinct methodology

Select one or more of the following:

  • upfront capital investment is not available

  • simple insert, delete and update operations with sub-second response times

  • will be required to control how data flows in and out of big data solutions and how feedback loops can be established to enable the processed data to undergo repeated refinements

  • each feedback cycle may reveal the need for existing steps to be modified, or new steps, such as pre-processing for data cleasing, to be added

Explanation

Question 197 of 200

1

Distinct methodology

Select one of the following:

  • the focus is usually on a specific area of the business, such as its marketing or supply chain management.

  • A Not-only SQL (NoSQL) database is a non-relational database that can be use to store it

  • Extract Transform Load (ETL)

  • each iteration can then help fine-tune processing steps, algorithms and data models to improve the accuracy of the result and deliver greater value to the business

Explanation

Question 198 of 200

1

Cloud Computing

Select one or more of the following:

  • generally makes up 80% of the data within an enterprise, and has a faster growth rate than structured data

  • A next-generation data warehouse establishes a standarized data access layer accross a range of data sources

  • introduces remote environments that can host IT infrastructure for, among other things, large-scale storage and processing

  • the adoption of a big data environment may necessitate that some or all of that environment be hosted witin a cloud

Explanation

Question 199 of 200

1

Cloud Computing

Select one or more of the following:

  • Collections or groups of related data (Ex. Tweets stored in a flat file, collection of image files, extract of rows stored in a table, historical weather observations that are stored as XML Files)

  • as big data initiatives are inherently business-driven, there needs to be a clear business case for adopting a big data solution to ensure that it is justified and that expectations are met

  • upfront capital investment is not available

  • the project is to be isolated from the rest of the business so that existing business processes are not impacted

Explanation

Question 200 of 200

1

Cloud Computing

Select one or more of the following:

  • the limits of available computing and storage resources used by an in-house Big Data solution are being reached

  • is typically stored in relational databases and frequently generated by custom enterprise applications, ERP systems amd CRM systems

  • the big data initiative is a proof of concept

  • datasets that need to be processed reside in a cloud

Explanation