Pregunta 1
Pregunta
Big Data dedicated to
Pregunta 2
Pregunta
Requeriments of big data
Pregunta 3
Pregunta
Bit data addresses
Pregunta 4
Respuesta
-
machine generate
-
Only human generate
-
hidden data
Pregunta 5
Pregunta
Benefits Big Data
Pregunta 6
Pregunta
characteristics of data in big data
Respuesta
-
complex, variety, volumen, veracity, value
-
variety, volumen, velocity, veracity, value
-
incompatibilities, value,velocity,volumen, veracity, many data
Pregunta 7
Pregunta
characteristics only of data in BigData
Pregunta 8
Pregunta 9
Pregunta
Human-generated data examples
Respuesta
-
micro bloggin
-
web log
-
sensor data
Pregunta 10
Respuesta
-
more value more veracity, more value more time
-
more value less veracity, more value less time
-
more value more veracity, more value less time
Pregunta 11
Respuesta
-
operational optimization, noise information
-
scientific discoveries, actionable intelligence
-
accurate predictions, shipped cloud computing
Pregunta 12
Respuesta
-
collections or groups of related data, shares the different set of attributes
-
collections or groups of related data, shares the same set of attributes
-
discipline of gaining an understanding of data
Pregunta 13
Respuesta
-
discipline of gaining an understanding of data
-
process of examining data to find facts, relationships, patterns
-
collections or groups of related data, shares the same set of attributes
Pregunta 14
Respuesta
-
process of gaining an understanding of data
-
discipline of gaining an undestanding of data by analizing it via multitude techniques
-
collections or groups of related data
Pregunta 15
Pregunta
in business-oriented, analytics
Respuesta
-
results can lower operational costs and facilitate strategic decison-making
-
help identify the cause of a phenomenon to improve the accuracyof predictions
-
help strengthen the focus on delivering high quality services by driving down costs
Pregunta 16
Pregunta
in the scientific domain, analytics
Respuesta
-
help strengthen the focus on delivering high quality services by driving down costs
-
results can lower operational costs and facilitate strategic decison-making
-
help identify the cause of a phenomenon to improve the accuracyof predictions
Pregunta 17
Pregunta
in services-based enviroments, analytics
Respuesta
-
results can lower operational costs and facilitate strategic decison-making
-
help strengthen the focus on delivering high quality services by driving down costs
-
help identify the cause of a phenomenon to improve the accuracyof predictions
Pregunta 18
Pregunta
Business intelligent
Respuesta
-
process of gaining an understanding of data
-
process of gaining insigths into the the workings of an enterprise
-
discipline of gaining an undestanding of data by analizing it via multitude techniques
Pregunta 19
Pregunta
KPI (Key performance indicators)
Respuesta
-
utilize the consolidated data contained in data warehouses to run analytical queries
-
is a measure for gauging success within a particular context
-
discipline of gaining an undestanding of data by analizing it via multitude techniques
Pregunta 20
Respuesta
-
achieve regulatory compliance
-
help identify the cause of a phenomenon to improve the accuracyof predictions
-
help strengthen the focus on delivering high quality services by driving down costs
Pregunta 21
Pregunta
measure in Big data
Respuesta
-
kilometer, megabyte, gigabyte, terabyte, petabyte, exabyte, settabytte, yottabyte
-
kilobyte, megabyte, gigabyte, terabyte, petabyte, exabyte, settabytte, yottabyte
-
kilobyte, megabyte, gigabyte, terabyte, petabyte, exabyte, settabytte, youtube
Pregunta 22
Pregunta
big data emerged from a combination of business needs and technology innovations
Pregunta 23
Pregunta
analitcs & data science
Respuesta
-
For many business, digital mediums have replaced physical mediums as the de facto
-
Based in opens sorce software tha requires little more than commodity hardware
-
machine learning algoritms, statistical techniques and data warehousing
Pregunta 24
Respuesta
-
Lead to an opportunity to collect further secondary data
-
Collecting and storing more data to potentially find new insigths and gain a competitive edge
-
collecting and processing large quantities of diverse data has become increasingly affordable
Pregunta 25
Pregunta
affordable technology & commodity hardware
Respuesta
-
the madurity of these fields of practice inspired and enabled much of the core functionality
-
use of commodity hardware makes the adoption of Big Data solutions accessible to business
-
some examples include on-demand TV and streaming video
Pregunta 26
Respuesta
-
Has empowered customers to provide feedback in near-realtime via open and public mediums
-
Has resulted in massive data streams
-
are capable of providing highly scalable, on-demand IT resources that can be leased
Pregunta 27
Pregunta
hyper-connected communities & devices
Respuesta
-
A an result, business are storing increasing amounts of data on customer interaction
-
leverage the infrastructure, storage and processing capabilities provided by this enviroments
-
the broadening coverage of internet and the proliferation of cellar and WI-FI networks .
Pregunta 28
Respuesta
-
Businesses are also increasingly interested in incorporating publicly available datasets
-
Is either directly through online interaction or indirectly through the usage of connected devices
-
Can be leased dramatically reduces the required up-front investment of Big Data projects
Pregunta 29
Pregunta
Online Transaction Processing (OLTP)
Respuesta
-
software system that processes transaction-oriented data
-
Is a system used for processing data analysis queries
-
process of loading data from a source system into a target system
Pregunta 30
Pregunta 31
Pregunta
data fully normalized
Pregunta 32
Pregunta
C R U D with subsecond response times
Pregunta 33
Pregunta
Online Analytical Processing (OLAP)
Respuesta
-
software system that processes transaction-oriented data
-
Store operational data that is fully normalized
-
Is a system used for processing data analysis queries
Pregunta 34
Pregunta
data mining and machine learning processes
Pregunta 35
Respuesta
-
Representing a common source of structured analytics input
-
Can serve as both a data source as well as a data sink that capable of receiving data
-
Big data analysis results can also be fed back
Pregunta 36
Respuesta
-
Representing a common source of structured analytics input
-
are used in diagnostic, predictive and prescriptive analytics
-
An example ticket reservation systems and banking and POS transactions
Pregunta 37
Pregunta
data that is aggregated and denormalized
Pregunta 38
Pregunta
OLAP use databases
Respuesta
-
that store historical data in multidimensional arrays and can answer complex queries
-
are comprised of simple insert, delete and update operations
-
that processes transaction-oriented data
Pregunta 39
Pregunta
An OLAP system is always
Pregunta 40
Pregunta
OLAP: denormalized data in the form of cubes
Pregunta 41
Pregunta
Extract-transform-load (ETL)
Respuesta
-
allows the data to be queried during any data analysis tasks that are performes later
-
Is a process of loading data from a source system into a target system
-
queries can take several minutes or even longer, depending on the complexity of query.
Pregunta 42
Pregunta
Extract-transform-load (ETL) source
Respuesta
-
database, flat file or an application
-
on-demand TV and streaming video
-
digitalization and social media
Pregunta 43
Respuesta
-
Represents the main operation through wich data warehouses are fed data
-
Represents the main operation through wich datasets are fed data
-
Represents de main operation through wich database are fed data
Pregunta 44
Respuesta
-
Extract load transform
-
Extract transform load
-
Extract transform leave
Pregunta 45
Pregunta 46
Pregunta
Data Warehouses EDWH has historical data?
Pregunta 47
Pregunta
Data Warehouses EDWH
Respuesta
-
is a subset of the data, that typically belongs to a deparment
-
is a framework open source
-
usually interface with an OLAP sYstem to support analytical queries
Pregunta 48
Respuesta
-
this allows the data to be queried during any data analysis
-
Heavily used by BI to run various analytical queries
-
software system that processes transaction-orientes data
Pregunta 49
Respuesta
-
social media, facebook twitter
-
OLTP, ERP, CRM and SCM systems
-
OLAP, ERP, CRM and SCM systems
Pregunta 50
Respuesta
-
For the amount data contained will continue to increase. The anlysis BI can suffer.
-
Is a process of loading data from a source system into a target system
-
software system that processes transaction-orientes data
Pregunta 51
Respuesta
-
Has established itself as a de facto industry platform for contemporary Big Data solutions
-
usually contain optimized databases, called analytical databases to handle reporting and data analysis
-
Represents de main operation through wich database are fed data
Pregunta 52
Pregunta
EDWH: analytical database can´t exist as separate DBMS
Pregunta 53
Respuesta
-
is a subset of the data, that typically belongs to a deparment
-
can have multiple EDWH
-
based on cleansed data, which is a prerequisite for accurate and error-free reports
Pregunta 54
Pregunta
hadoop is open source framework for
Respuesta
-
large data storage
-
data processing
-
diagnostic, predictive and prescriptive
-
run on commodity hardware
-
denormalized data in the form of cubes
Pregunta 55
Respuesta
-
has established itself as a de facto industry platform for contemporary Big Data solutions
-
is a central, enterprise-wide repository
-
is always fed with data from multiple OLTP system using regular batch processing jobs
Pregunta 56
Pregunta
hadoop can be used as engine of
Pregunta 57
Pregunta
hadoop can process large amounts of structured, semi-structured and unstructured data
Pregunta 58
Pregunta
volumen refers to
Respuesta
-
insert data
-
process data
-
velocity processing
Pregunta 59
Pregunta
Data volumes can include
Pregunta 60
Respuesta
-
multiple types of data that need to be supported by Big Data solutions
-
data translates into the amount of time it takes for the data to be processed
-
data is processed by Big Data solutions is substantial and usually ever growing
Pregunta 61
Pregunta
Depending on the data source, velocity may not always be high
Pregunta 62
Respuesta
-
Quality or fidelity of data
-
usefulness of data for an enterprise
-
refers to the multiple formats and types of data that need to be supported by Big Data Solutions
Pregunta 63
Respuesta
-
The appropriate form of data storage
-
Refers to the quality or fidelity of data
-
Refers ti the usefulness of data for an enterprise
Pregunta 64
Pregunta
Noise and SIgnal refers to
Pregunta 65
Respuesta
-
refers to quality or fidelity of data
-
refers to the multiple formats and types of data that need to be supported
-
refers to usefulness of data for an enterprise
Pregunta 66
Pregunta
the value is directly related to the veracity in that de higher the data fidelity, the more value it holds for the business.
Pregunta 67
Respuesta
-
structured data, unstructured data, semi-structured data
-
structured data, unstructured data, semi-structured data, metadata
Pregunta 68
Pregunta
ERP and CRM are example of
Respuesta
-
unstructured data
-
structured data
-
semi-structured data
Pregunta 69
Pregunta
image, audio and video files are examples of
Respuesta
-
semi-structured data
-
unstructured data
-
strutured data
Pregunta 70
Pregunta
Unstructured data generally makes up 80%
Pregunta 71
Pregunta
unstructured data does generally require special or customized logic when it comes to pre-processing and storage
Pregunta 72
Pregunta
semi-structured data
Respuesta
-
has a defined level of structured and consistency can be relational in nature
-
has a defined level of structured and consistenc, but cannot be relational in nature
-
cannot be inheremtly processed or queried using SQL or traditional programming features
Pregunta 73
Pregunta
semi_structured data
Respuesta
-
CRM or ERP
-
XML or , electronic data interchanges, e-mails, spreaddheets, RSS feeds and senso data
-
image or adio files
Pregunta 74
Pregunta 75
Pregunta
metadata generally machine generated and utomatically appended to the data
Pregunta 76
Pregunta
metadata xml tag
Pregunta 77
Pregunta
semi-structured data and unstructured data have a greater noise-to-signal ratio than structured data
Pregunta 78
Pregunta
can ETL can cleansing data and verification
Pregunta 79
Respuesta
-
quantitative analysis
-
cientics analysis
-
qualitative analysis
-
data mining
Pregunta 80
Pregunta
quantitative analysis
Pregunta 81
Pregunta
qualitative analysis use
Pregunta 82
Pregunta
involve analyzing a smaller sample in greater depth
Respuesta
-
quantitave analysis
-
qualitative analysis
Pregunta 83
Pregunta
analysis that targets large datasets
Respuesta
-
quantitative analysis
-
data mining
-
qualitative analysis
Pregunta 84
Pregunta
data mining (data discovery)
Pregunta 85
Pregunta
data mining forms the basis for predictive analytics and business intelligence (BI)
Pregunta 86
Pregunta
analysis tools can automate data analyses
Pregunta 87
Pregunta
descriptive , diagnostic , predictive , prescriptive
Respuesta
-
types of analytics
-
types of analisys
-
diagnostic
Pregunta 88
Pregunta
questions about events that have already occurred
Respuesta
-
diagnostic analytics
-
descriptive analytics
-
predictive analytics
Pregunta 89
Pregunta
reporting or dahsboards. The reports are generally static, queries are executed in OLTP such CRM and ERP
Respuesta
-
descriptive analytis
-
diagnostic analytics
-
preescriptive analytics
Pregunta 90
Pregunta
determine the causes of a phenomenon that occurred in the past
Respuesta
-
descriptive analytics
-
diagnostic analytics
-
predictive analytics
Pregunta 91
Pregunta
interactive visualization to identify trends and patterns, and queries are executed in OLAP systems
Respuesta
-
descriptive analytics
-
diagnostic analytics
-
preescriptive analytics
Pregunta 92
Pregunta
attemp to determine the outcome of an event that might occur in the future
Respuesta
-
preescriptive analytics
-
predictive analytics
-
descriptive analytics
Pregunta 93
Pregunta
The focus is on which prescribed option to follow and why and when it should be followed, to gain and advantage or mitigate risk
Respuesta
-
descriptive analytics
-
prescriptive analytics
-
predictive analytics
Pregunta 94
Pregunta
incorporate internal data (historical etc..) and external data (social media, demographic data)
Respuesta
-
descriptive analytics
-
prescriptive analytics
-
diagnostic analytics
Pregunta 95
Pregunta
machine learning
Respuesta
-
is the process of teaching computers to learn from existing data and apply the acquired knowledge to formulate predictions about unknow data.
-
is the discipline of teaching computers to learn from existing data and apply the acquired knowledge to formulate predictions about unknow data.
-
is the framework refers to teaching computers to learn from existing data and apply the acquired knowledge to formulate predictions about unknow data.
Pregunta 96
Pregunta
based on the input data and categorys
Respuesta
-
supervised learning
-
unsupervised learning
Pregunta 97
Pregunta
the algorithm attemp to categorized data by grouping data with similara attributes together
Respuesta
-
supervised learning
-
unsupervised learning
Pregunta 98
Pregunta
machine learning makes predictions and identify hidden patterns
Pregunta 99
Pregunta
machine learning can use the output from data mining for further data classification.
Pregunta 100
Pregunta
traditional BI utilizes
Respuesta
-
descriptive and diagnostic
-
diagnostic and predictive
-
descriptive an prescriptive
Pregunta 101
Pregunta
ad-hoc reports and dashboards
Respuesta
-
tradictional Big Data
-
traditional BI
Pregunta 102
Pregunta
ad hoc reporting
Pregunta 103
Respuesta
-
facilitate the development of an enterprise-wide understanding of the way a bussines works
-
focus on indivicual bussines processes
-
descriptive and diagnostic to facilitate the development of an an enterprise-wide understanding of the way a bussines works
Pregunta 104
Pregunta
data visualization analytical results are graphically communicated using elements like
Respuesta
-
charts, maps, data grids, infographics and alerts
-
ad-hoc reports drill, down
-
dashboards
Pregunta 105
Pregunta
data visualization tool in Big Data use
Pregunta 106
Respuesta
-
groups data across multiple categories to show subtotals and totals
-
global an sumarized view of data across multiple context
-
enables a detail view of the data of interest by focusing in on a data subset
Pregunta 107
Pregunta
visualization features: drill down
Respuesta
-
enables a detail view of the data of interest by focusing in on a data subset
-
global an sumarized view of data across multiple context
-
groups data across multiple categories to show subtotals and totals
Pregunta 108
Pregunta
visualization features: roll up
Respuesta
-
global an sumarized view of data across multiple context
-
groups data across multiple categories to show subtotals and totals
-
enables a detail view of the data of interest by focusing in on a data subset
Pregunta 109
Pregunta
data visualization tools for Big data solutions incorporate
Pregunta 110
Pregunta
in the advance visualizations you needs an ETL