P1-Modulo 2 : Big data analysis y technology concepts

Beschreibung

modulo 2 Big Data
Carolina Colorado
Quiz von Carolina Colorado, aktualisiert more than 1 year ago
Carolina Colorado
Erstellt von Carolina Colorado vor mehr als 7 Jahre
61
2

Zusammenfassung der Ressource

Frage 1

Frage
Big data analysis differs from trditional data analysis primary because
Antworten
  • volume, value and varirety
  • volume, velocity and variety
  • veracity, volume and velocity

Frage 2

Frage
In big data analysis and analytics, a fundamental step-by-step process is needed to organize the task involved
Antworten
  • retrieving, processing, producing and visualization data
  • retrieving, processing, producing and repurposing data
  • retrieving, processing, organize and repurposing data

Frage 3

Frage
which are 1,2 and 3 stages of bigData analysis lifecycle
Antworten
  • Data analysis, data identification and ata extraction
  • bussines cased evaluation, data extraction and data analysis
  • bussines case evaluation, data identificaction and data acquisition and filtering

Frage 4

Frage
which are 4,5 and 6 stages of bigData analysis lifecycle
Antworten
  • data analysis, data visualization & utilization of analysis results
  • data extraction, data validation & cleansing and data aggregation & representation
  • data extraction, data aggregation & representation

Frage 5

Frage
which are 7,8 and 9 stages of bigData analysis lifecycle
Antworten
  • data analysis, data visualization and utilization of analysis results
  • data aggregation & represntation, data analyisis and data visualization
  • data identifcation, data acquisition & filtering and data extraction

Frage 6

Frage
The business case evaluation stage requires that a business case be ________, __________ and ______________ prior to proceeding with the actual hands-on analysis tasks.
Antworten
  • organized, created and approbed
  • created, assessed and approbed
  • created, organized and analyzed

Frage 7

Frage
An evaluation of a Big Data analysis bussines case helps decision-makers undertand
Antworten
  • the business resources that will need to be utilized and wich bussines challenges the analysis
  • the data that will need to be utilized and wich bussines challenges the analysis
  • the business resources that will need to be utilized and wich bussines objectives the analysis

Frage 8

Frage
The KPIs is ussefull in Business Case Evaluation
Antworten
  • true
  • false

Frage 9

Frage
based on the business requirements documented in the _______________________________ , it can be determined whether the business problems being addresed are really Big data problems
Antworten
  • use case
  • business case
  • requirements case

Frage 10

Frage
a bussines problem needs to be directly related to one or more of the big data characteristics
Antworten
  • veracity, velocity or variety
  • value, velocity or variety
  • volume, velocity or variety

Frage 11

Frage
Another outcome in Business case evaluation is
Antworten
  • determination of budget required to carry out the analysis project
  • determination of data required to carry out the analysis project
  • determination of resources required to carry out the analysis project

Frage 12

Frage
The invesment can be weighed against the expected benefits of achieving the goals
Antworten
  • false
  • true

Frage 13

Frage
Initial iterations of the big data analysis lifecycle will not required more up-front invesment of Big Data tecnologies, products and training compared to later iterations
Antworten
  • false
  • true

Frage 14

Frage
The data identification stage is dedicated to identifiying the _____________ required for the analysis project
Antworten
  • metadata
  • datasets
  • datamart

Frage 15

Frage
identifying a wider variety of data sources may increase the probability of finding
Antworten
  • hidden patterns and aggregations
  • hidden patterns and correlations
  • hidden resources and datasets

Frage 16

Frage
Can be beneficial to identify as many types of releated data sources and insights as possible, especilly when we don´t know exactly what we're looking for.
Antworten
  • TRUE
  • FALSE

Frage 17

Frage
Depending on the business scope of analysis project and nature of business problems being adressed, the required dataset and their sourcescan be
Antworten
  • structured and not structured
  • big or small of all enterprise
  • internal or external to enterprise

Frage 18

Frage
Internal dataset
Antworten
  • data markets and publicly avalaible datasets
  • internal sources, such as data marts and operational system
  • embedded within blogs or other types of content-based websites

Frage 19

Frage
external datasets
Antworten
  • strudtured data, unstructured data
  • data marts and operational systems
  • Case they may need to be harvested via automated tools

Frage 20

Frage
data is gathered from all of data sources that were identified during the previous stage, and is then subjected to the automated filtering of corrupt data or data that has been deemed to have no value to the analysis obvjectives
Antworten
  • Data identification
  • data acquisition & filtering
  • Data aggregation & representation

Frage 21

Frage
Depending on the type of data source , data may come as a dump of files or may require API integration
Antworten
  • false
  • true

Frage 22

Frage
in many cases , especially where external, unstructured data is concerned, some or most of the acquired data may be irrelevant (noise) and can be discarded as part of the filtering process
Antworten
  • Data acquisition & filtering
  • Data extraction
  • Data validation & cleansing

Frage 23

Frage
data classified as "corrupt" can include records with missing or nonsensical values or invalid data type
Antworten
  • false
  • true

Frage 24

Frage
Data thah is filtered out for one analysis may not be valueable for a different type of analysis
Antworten
  • false
  • true

Frage 25

Frage
it is advisable to store a verbatim copy of the original dataset proceeding with the filtering.
Antworten
  • To save on required storage space, the verbatim copy is compressed after storage
  • To save on required storage space, the verbatim copy is compressed before storage
  • To save on required storage space, the verbatim copy is compressed in the same time of storage

Frage 26

Frage
to be persisted once it gets generated or enters the enterprise boundary
Antworten
  • internal data
  • internal and external data
  • external data

Frage 27

Frage
The data is persisted to disk prior to analysis
Antworten
  • realtime analytics
  • Batch anlytics
  • realtime analytics and batch analytics

Frage 28

Frage
The data is analyzed first and then persisted to disk
Antworten
  • realtime analytics
  • batch analytics
  • realtime analytics and batch analytics

Frage 29

Frage
Can be added via automation to data from both internal and external data sources to improve the classification an querying
Antworten
  • info data
  • data analysis
  • metadata

Frage 30

Frage
Metadata example can include
Antworten
  • datamart size and structure, source information, date and time of creation or collection, language-specific information etc.
  • database size and structure, source information, date and time of creation or collection, language-specific information etc.
  • dataset size and structure, source information, date and time of creation or collection, language-specific information etc.

Frage 31

Frage
it is vital that metadata be machine-readable and passed forward along subsequent analysis stages
Antworten
  • false
  • true

Frage 32

Frage
This helps to maintain data provenance throughout the Big Data analysis lifecycle, wich helps establish and preserve data accuracy and quality
Antworten
  • metadata
  • source information
  • date and time of creation or collection

Frage 33

Frage
Some of the data identified as input for the analysis may arrive in a format incompatible with the big data solution
Antworten
  • true
  • false

Frage 34

Frage
the need to address disparate types of data is more likely with data from
Antworten
  • internal sources
  • external sources

Frage 35

Frage
is dedicated to extracting disparate data and transforming it into a format that the underliying Big Data solution can use for the purpose of the data analysis
Antworten
  • Data acquisition & filtering
  • data validation & cleansing
  • data extraction

Frage 36

Frage
The extent of extraction and transformation required depends
Antworten
  • on the types of analytics and capabilities of the Big Data solution
  • bussines case
  • Data extraction

Frage 37

Frage
Estracting the required fields from delimited textual data (such as with web server log files) may not be necessary
Antworten
  • capabilities of the Big Data Solution
  • underlying Big Data solution can already directly process those files
  • transforming it into a format that underlying Big Data solution

Frage 38

Frage
example of document that not need further transformation
Antworten
  • XML and JSON
  • facebook and twitter
  • image and video

Frage 39

Frage
The invalid data can
Antworten
  • Skew an falsify analysis results
  • Lose business objectives
  • Lose the accuracy of the analysis

Frage 40

Frage
data input into Big Data analyses can be unstructured without any indication of validity
Antworten
  • false
  • true

Frage 41

Frage
the complexity can further make it easy to arrive at a set of suitable validation constraint
Antworten
  • false
  • true

Frage 42

Frage
Is dedicated to establishing (often complex) validation rules and removing any know invalid data
Antworten
  • Data acquisition and filtering
  • Data validation and cleansing
  • Data identification

Frage 43

Frage
Big Data solutions often receive redundant data across different datasets, this redundancy can be exploited to explore interconnected datasets in order to assemble validation parameters and fill in missing valid data
Antworten
  • false
  • true

Frage 44

Frage
For Batch analytics, data validation and cleansing can be achieved via offline
Antworten
  • data minnig
  • ELT operation
  • ETL operation

Frage 45

Frage
Data input in Big Data can be unstructured without any indication of validity
Antworten
  • false
  • true

Frage 46

Frage
provenance can play an important role in determining the accuracy and quality of questionable data
Antworten
  • false
  • true

Frage 47

Frage
data that appears to be invalid may still be valuable in that it may posses
Antworten
  • most important data
  • hidden patterns an trends
  • noise

Frage 48

Frage
Data may be spread across multiple datasets, requiring that datasets be joined together via common files (date or ID)
Antworten
  • false
  • true

Frage 49

Frage
either way a method of data ________________ is required or the dataset representing ther correct value needs to be determined
Antworten
  • aggregation
  • reconciliaton
  • representation

Frage 50

Frage
Dedicated to integrating multiple datasets together to arrive at a unified view
Antworten
  • Data aggregation & representation
  • Data extraction
  • Data visualization

Frage 51

Frage
Can become complicated because od differences in : although the data format may be the same, the data model may be different
Antworten
  • semantics
  • BD engine
  • Data structure

Frage 52

Frage
Can become complicated because od differences in : A valuethat is labelled differently in two different datasets may mean the same thing (surname and last name)
Antworten
  • BD engine
  • Semantics
  • Data structure

Frage 53

Frage
In data Aggregation & Representation reconciling the differences can required complex logic that is executed ___________________.
Antworten
  • ETL process
  • human intervention
  • automatically

Frage 54

Frage
Future data analysis requirements need to be considered during the stage ___________________ to help foster data reusability
Antworten
  • Data extraction
  • Data aggregation & REpresentation
  • Data validation and cleansing

Frage 55

Frage
whether ___________________ is required or not, it is important to understand that the same data can be stored in many different forms. One form may be better suited for a particular type of analysis than another
Antworten
  • data cleansing
  • data aggregation
  • filtering

Frage 56

Frage
A data structured standarized by the Big Data solution can require establishing a central, standard analysis repository, such as a
Antworten
  • untructured database
  • structured database
  • NoSQL database

Frage 57

Frage
the data analysis stage is dedicated to carriying out the actual analysis task, which typically involves one or more types of analytics
Antworten
  • data validation %cleansing
  • Data Analysis
  • Utilization os analysis results

Frage 58

Frage
This stage can be iterative in nature, because repeated until appropiated pattern or correlation is uncovered
Antworten
  • data aggregation & representation
  • Data analysis
  • Data extraction

Frage 59

Frage
The approach taken when carrying out this stage, data analysis, an be classified as ______________________________
Antworten
  • acquisition analysis and filtering analysis
  • confirmatory analysis and exploratory analysis
  • validation analysis and cleansing

Frage 60

Frage
___________________________ adata analysis is a deductive approach where the cause of the phenomenon being investigated is proposed beforehand
Antworten
  • Confirmatory analysis
  • Exploratory analysis
  • Data analysis

Frage 61

Frage
the proposed cause or assumption is called a
Antworten
  • pattern and trend
  • deductive approach
  • hypotesis

Frage 62

Frage
data samples are tipically used
Antworten
  • exploraty analysis
  • confirmatory analysis

Frage 63

Frage
unexpected findings or anomalies are usually ignored since a predetermined cause was assumed
Antworten
  • true
  • false

Frage 64

Frage
is an inductive approach that is closely associated to data mining
Antworten
  • exploratory data analysis
  • confirmatory data analysis
  • correlation analysis

Frage 65

Frage
this analysis provides a general direction that can facilitate the discovery of patterns or annomalies
Antworten
  • confirmation analysis
  • Exploratory analysis

Frage 66

Frage
Large amounts of data and visual analysis are typically used
Antworten
  • Confirmatory analysis
  • Exploratory analysis

Frage 67

Frage
is dedicated to using _____________________ techniques and tools to graphically communicate the analysis results for effective interpretation by bussines users
Antworten
  • Data analysis
  • Data visualization
  • Utiolization of analysis results

Frage 68

Frage
Bussines users needs to be able to understand the results in order to obtain value from analysis and subsequently have de ability to provide feedback from_______________ back to stage __________________
Antworten
  • Data validation and cleaning, data extraction
  • Data analysis, data aggregation & representation
  • Data visualization, Data analysis

Frage 69

Frage
the same results may be presented ina a number a number of different ways.
Antworten
  • false
  • true

Frage 70

Frage
another aspect to keep in mind is that providing a method of drilling down to comparatively simple statistics is crucial, in order for users to understand how to statistics were generated
Antworten
  • true
  • false

Frage 71

Frage
support businessdecission-making, there may be further opportunieties to utilize the analysis results
Antworten
  • Utilization of analysis results
  • Data visualization
  • Data analysis

Frage 72

Frage
The utilization os analysis results is dedicated to determining how and where processed analysis data can be further leveraged
Antworten
  • Utilization of analysis results
  • Data visualization
  • Data analysis

Frage 73

Frage
"models" that encapsulated new insights and understandings about the nature of the patterns and realationships that exist within data that was just analyzed
Antworten
  • utilization of analysis results
  • Data analysis
  • Data validation &cleansing

Frage 74

Frage
A "model" may look like a
Antworten
  • mathematical equation or a set of rules
  • structred database
  • the differents datasets

Frage 75

Frage
Models can be used to improved bussines process logic
Antworten
  • new dataset
  • form the basis of a new system or software program
  • application system logic
  • new bussines case

Frage 76

Frage
the data analysis results may be automatically or manually fed directly into enterprise systems to enhace and optimize their behavior and performance
Antworten
  • input for enterprise systems
  • Bussines process optimization
  • Alerts

Frage 77

Frage
The identiffied patterns correlations and anomalies discovered during the data analysis are used to refine business process
Antworten
  • input for enterprise input
  • alerts
  • Bussines process optimization

Frage 78

Frage
Data analysis results can be used as input for existing events that requires them to take corrective action
Antworten
  • input for enterprise input
  • business process optimization
  • alerts

Frage 79

Frage
Big data nalysis concepts
Antworten
  • statical
  • aggregation
  • visual
  • machine learning
  • Semantic
  • Topic mapping
  • feelings

Frage 80

Frage
statistical analysis
Antworten
  • A/B Testing
  • heat maps
  • correlation
  • Regression
  • filtering

Frage 81

Frage
visual Analysis
Antworten
  • heat maps
  • outlier detection
  • time series analysis
  • Spatial Data Analysis
  • Network analysis

Frage 82

Frage
machine learning
Antworten
  • correlation
  • clasification
  • clustering
  • outlier detection
  • filtering
  • regression

Frage 83

Frage
semantic analysis
Antworten
  • classification
  • network analysis
  • Natural language processing
  • text analytics
  • sentiment analysis

Frage 84

Frage
use statistical methods based on mathematical formulas as means for analizing data
Antworten
  • visual analysis
  • statistical analysis
  • machine learning

Frage 85

Frage
it can also be used to infer patterns ans relationships within the dataset, such as regression and correlation
Antworten
  • statistical analysis
  • semantic analysis
  • analysis topic mapping

Frage 86

Frage
also know as split or bucket testing, compares two versions of an element to determine wich version is superior based on a predefined metric
Antworten
  • correlation
  • A/B testing
  • regression

Frage 87

Frage
A/B testing: the current version of the element is called the ______________ version, whereas the modified version is called the ____________
Antworten
  • official, non official
  • control,reatment
  • principal, copy

Frage 88

Frage
both version, are subjected to an experiment simultaneously. The observationsare recorded to determine wich version is more sccessful
Antworten
  • correlation
  • Regression
  • A/B testing

Frage 89

Frage
Athough ________________________can be implemented in almost domain, it is most often used in marketing
Antworten
  • A/B Testing
  • Regression
  • Correlation

Frage 90

Frage
Generally, the objective is to gauge human behavior with the goal of increasing sales (as per the example below)
Antworten
  • Regression
  • A/B testing
  • Correlation

Frage 91

Frage
is the new version of a drug better than the old one?
Antworten
  • correlation
  • Regression
  • A/B testin

Frage 92

Frage
is an analysis tecnique used to determine whether two variables are related to each other
Antworten
  • Regression
  • Correlation
  • A/B testing

Frage 93

Frage
an example of a relationship between two variables: The value of variable A increases whenever the value of variable B increases
Antworten
  • Regression
  • A/B testing
  • Correlation

Frage 94

Frage
Helps to develop an understanding of a dataset and find relationships that can assist in explaining a phenomenon
Antworten
  • Correlation
  • Regression
  • A/B testing

Frage 95

Frage
commonly used for data mining where the identification between variables in a dataset leads to the discovery of patterns ans anomalies
Antworten
  • regression
  • correlation
  • A/B testing

Frage 96

Frage
When two variables are considered to be correlated they are considered to be aligned based on a linear relationship
Antworten
  • false
  • true

Frage 97

Frage
This mean that when one variable changes, the other variable also changes proportionally and constantly
Antworten
  • A/B testing
  • regression
  • correlation

Frage 98

Frage
______________________ is expresed a a decimal number between -1 to 1, which is know as the correlation coeficient
Antworten
  • Correlation
  • Regression
  • A/B testing

Frage 99

Frage
Correlation +1
Antworten
  • Suggest that there is a strong positive relationship between the two variables
  • suggests that there is no relationship at between two variables
  • Suggest that there is a strong negative relationship between the two variables (hipotesis)

Frage 100

Frage
0 Correlation
Antworten
  • Suggest that there is a strong positive relationship between the two variables
  • suggests that there is no relationship at between two variables
  • Suggest that there is a strong negative relationship between the two variables (hipotesis)

Frage 101

Frage
-1 Correlation
Antworten
  • Suggest that there is a strong negative relationship between the two variables (hipotesis)
  • suggests that there is no relationship at between two variables
  • Suggest that there is a strong positive relationship between the two variables

Frage 102

Frage
sample: "Do students who perform well at elementary school perform equally well at high school"
Antworten
  • regression
  • Correlation
  • A/B testin

Frage 103

Frage
explores how a dependent variable is related to an independent variable within a dataset
Antworten
  • Correlation
  • Regression
  • A/B Testing

Frage 104

Frage
Helpss determine how the value od dependent variable changes in relation to changes in the value of the independent varible
Antworten
  • Correlation
  • Regression
  • A/B testing

Frage 105

Frage
what the analysts discover is that 15% of additional stock in required for enery 5-degree increase in temperature
Antworten
  • regression
  • correlation
  • A/b testing

Frage 106

Frage
more than one independent variable can be tested at the same time
Antworten
  • A/B testing
  • Regression
  • correlation

Frage 107

Frage
in such cases only one independent variable may change. The others are kept constants
Antworten
  • A/B testing
  • Correlation
  • Regression

Frage 108

Frage
can help enable a better understanding of what a phenomenin is and why it ocurred
Antworten
  • Correlation
  • Regression
  • A/B testing

Frage 109

Frage
represents a constant rate of change
Antworten
  • linear regression
  • Non-linear regression

Frage 110

Frage
Represents the variable rate of change
Antworten
  • linear regression
  • non-linear regression

Frage 111

Frage
what will be the grades of a student studying at a high school based on her primary school grades
Antworten
  • correlation
  • regression
  • A/B testing

Frage 112

Frage
_________________does not imply a causation. The change in the value of one variable may not be responsible for the change in the value of the second variable. although both may change at the same rate
Antworten
  • A/B testing
  • correlation
  • Regression

Frage 113

Frage
assumes that both variables are independent
Antworten
  • Regression
  • correlation
  • A/B testing

Frage 114

Frage
Deals with already identified dependent and independent variables
Antworten
  • Correlation
  • Regression
  • A/B Testing

Frage 115

Frage
_________________ can be applied to further explore the relationship and predict the values of the dependent variable, based on the know values of the independent variable
Antworten
  • correlation
  • Regression
  • A/B testing

Frage 116

Frage
is a form of data analysis that involves the graphic representation of data to enable or enhace its visual perception
Antworten
  • statistical analysis
  • visual analysis
  • semantic analysis

Frage 117

Frage
develop a deeper understanding of the data being analyzed. Specifically, it helps identify and highlight hidden patterns, correlations and anomalies.
Antworten
  • statistical analysis
  • visual analysis
  • semantic analysis

Frage 118

Frage
visual analysis
Antworten
  • Heat maps
  • time series analysis
  • outlier detectition
  • network analysis
  • spatial data analysis

Frage 119

Frage
Are an effective visual analysis technique for expressing patterns, data compositions via part-whole relations and geographic distribution of data
Antworten
  • time series analysis
  • heat maps
  • spatial data analysis

Frage 120

Frage
They also facilitate the identification of areas of interest ans the discovery of extreme (high/low) values wihin a dataset
Antworten
  • Network analysis
  • heat maps
  • spatial data analysis

Frage 121

Frage
___________ itself is a visual, color-coded representation of data values
Antworten
  • network analysis
  • heat-maps
  • spatial data analysis
  • time series analysis

Frage 122

Frage
A _______________ can be in the form of a chart or a map, as shown in the following pages
Antworten
  • heat maps
  • time series analysis
  • network analysis
  • spatial data analysis

Frage 123

Frage
A___________ represents a matrix of values in which each cell is color-coded according to the value
Antworten
  • chart
  • map

Frage 124

Frage
A ___________ represents a geographic measure by wich different regions are color-code according to certain theme
Antworten
  • chart
  • map

Frage 125

Frage
How can i visually identify any patterns related to carbon emission across a large number of cities around the world
Antworten
  • Heat maps
  • time series analysis
  • network analysis
  • spatial data analysis

Frage 126

Frage
____________is the analysis of data that is recorded over periodic intervals of time
Antworten
  • heat maps
  • time series analysis
  • network analysis
  • spatial data analysis

Frage 127

Frage
Helps to uncover patterns within data that are time-dependent. Once identified, the patterns can be axtrapollated for future predictions
Antworten
  • heat maps
  • time series analysis
  • network analysis
  • spatial data analysis

Frage 128

Frage
time series analyses are usually used for forecasting by identifiying long-term trends. seasonal periodic patterns and irregular short-term variations in the dataset
Antworten
  • time series analysis
  • heat map
  • network analysis
  • spatial data analysis

Frage 129

Frage
always includes time as a comparision variable
Antworten
  • network analysis
  • heat maps
  • time series analysis
  • spatial data analysis

Frage 130

Frage
is generally expressed using a line chart, with time plotted on the x-axis and the recorded data values plotted on the y-axis
Antworten
  • time series analysis
  • heat map
  • network analysis
  • spatial data analysis

Frage 131

Frage
how much yield should the farmer expect based on historical yield data
Antworten
  • network analysis
  • spatial data analysis
  • heat maps
  • time series analysis

Frage 132

Frage
is an interconected collection of entities
Antworten
  • heat maps
  • time series analysis
  • network analysis
  • spatial data analysis

Frage 133

Frage
An entity can be a person a group or some other business domain object such as a product
Antworten
  • spatial data analysis
  • heat maps
  • time series analysis
  • network analysis

Frage 134

Frage
some conectios may only be one-way, so that transversal in the reverse direction is nor possible
Antworten
  • true
  • false

Frage 135

Frage
is a techniquethat focuses on analizing relationships between entities within the network
Antworten
  • time series analysis
  • heat maps
  • network analysis
  • spatial Data analysis

Frage 136

Frage
There are specialized variations of network analysis
Antworten
  • Graphs
  • route optimization
  • social network analysis
  • spread predictions

Frage 137

Frage
is used to find the shortest routes between the central warehouse and remote stores in order to minimize the durations of deliveries
Antworten
  • heat map
  • network analysis
  • spatial data analysis
  • time series analysis

Frage 138

Frage
How can identify interaction patterns among a very large number of protein-to-protein interactiona?
Antworten
  • spatial data analysis
  • network analysis
  • heat maps
  • time series analysis

Frage 139

Frage
is focused on analizing location-based data in order to find different geographic relationships and patterns between entities
Antworten
  • network analysis
  • spatial data analysis
  • time series analysis
  • Heat maps

Frage 140

Frage
____________________________ is manipulated through a geographical information system (Gis) that plots spatial data on a map generally using its longitude and latitude coordinates
Antworten
  • Spatial data
  • structured data
  • unstructured data

Frage 141

Frage
no two stores can be within a distance of 5 kilometers of each other to prevent the stores from competing with each other.
Antworten
  • time series analysis
  • network analysis
  • heat map
  • spatial data analysis

Frage 142

Frage
how far do customers have to commute in order to get to a supermartket?
Antworten
  • spatial data analysis
  • heat maps
  • time series analysis
  • network analysis

Frage 143

Frage
if the human knowledge can be combined with the processing speed of machines, machines will be able to process large amounts of data without requiring much human intervention
Antworten
  • statisctical analysis
  • visual nalysis
  • machine learning
  • semantic analysis

Frage 144

Frage
machine learning
Antworten
  • classification
  • time series analysis
  • clustering
  • outlier detection
  • filtering

Frage 145

Frage
Two fundamental laws that pertain to machine learning
Antworten
  • law of large numbers
  • law commutative
  • Law of dimishing marginal utility

Frage 146

Frage
the law _____________________________states that the confidence with wich predictions can be made increases as the size of data that is being analyzed increases
Antworten
  • law of large numbers
  • law of dimishing marginal utility

Frage 147

Frage
in other words the accuracy and applicability of the patterns and relationshipsthat are found in a large dataset will be higher that of a smaller dataset
Antworten
  • True
  • False

Frage 148

Frage
the greater the amount of data available for analysis, the better we become of making correct decisions
Antworten
  • True
  • False

Frage 149

Frage
in the context of traditional data analysis, ___________________________ states that, starting with a reasonably large sample size, the value obtained from the analysis of additional data decreases as more data is successively added to the original sample
Antworten
  • the law of diminishing marginal utility
  • the law of large number

Frage 150

Frage
The law of dimishing marginal utility does not apply to big data
Antworten
  • True
  • False

Frage 151

Frage
The greater the volume and variety of data that Big Data solutions can process allows for each additional batch of data to carry greater potential of unearthing new patterns and anomalies. Therefore, the value of each additional batch does not diminish value: rather, it provides more value
Antworten
  • True
  • False

Frage 152

Frage
is a supervised learning technique by witch data is classified into relevant, previously learned categories
Antworten
  • classification
  • clustering
  • outlier detection
  • filtering

Frage 153

Frage
Step 1: The system is fed data that is already categorized or labeled, so that it can develop an understanding of different categories
Antworten
  • clustering
  • classification
  • filtering
  • outlier detection

Frage 154

Frage
step 2: The system is fed unknow (but similar) data for classification, based on the understanding it developed
Antworten
  • classification
  • filtering
  • outlier detection
  • clustering

Frage 155

Frage
A common application of this techniques is for the filtering of e-mail spam. Note that classification can be performed for two or more categories
Antworten
  • filtering
  • clustering
  • classification
  • outlier detection

Frage 156

Frage
Based on old data, a training dataset is compiled that contains tagged examples of customers that have or not previously defaulted
Antworten
  • clustering
  • filtering
  • classification
  • outlier detection

Frage 157

Frage
Does a fingerprint belong to a suspect based on a record of this previous fingerprints
Antworten
  • outlier detection
  • clustering
  • classification
  • filtering

Frage 158

Frage
Is an unsupervised learning technique by wich data is divided into different groups so that the data in each group has similar properties
Antworten
  • classification
  • clustering
  • outlier detection
  • filtering

Frage 159

Frage
There is no prior learning of categories required: instead categories are implicity generated based on the data groupings
Antworten
  • outlier detection
  • clustering
  • filtering
  • classification

Frage 160

Frage
Is generally used in data minig to get an understanding of properties of a given dataset. Afterdeveloping this understanding, classificatioin can be used to make better predictions about similar, but new or unseen data
Antworten
  • classification
  • clustering
  • outlier detection
  • filtering

Frage 161

Frage
In a bank each group is the introduced to one or more financial products most suitable to the characteristics of the overall profile of the group
Antworten
  • clustering
  • filtering
  • outlier detection
  • classification

Frage 162

Frage
How many different categories of elements are there in the periodic table
Antworten
  • classification
  • clustering
  • outlier detection
  • filtering

Frage 163

Frage
Detection is the process of finding data that is significantly different from or inconsistent with the rest of the data within a given dataset
Antworten
  • filtering
  • calssification
  • clustering
  • outlier detection

Frage 164

Frage
this machine learning tecnique is used to identify anomalies, abnormalities and deviations that can be opportunities or risks
Antworten
  • outlier detection
  • classification
  • clustering
  • filtering

Frage 165

Frage
it can be bsaed on either supervised or unsupervised learning
Antworten
  • clustering
  • outlier detection
  • classification
  • filtering

Frage 166

Frage
include fraud detection, medical diagnosis, network data analysis and sensor data analysis
Antworten
  • filtering
  • outlier detection
  • classification
  • clustering

Frage 167

Frage
In order ti find if a transaction is likely to be fraudulent or not, the bank´s IT team builds a sustem emplying ____________________ technique that is based on supervised learning
Antworten
  • classificaction
  • clustering
  • outlier detection
  • filtering

Frage 168

Frage
are there any wrongly identified fruits and vegetables in the training dataset used for classification task
Antworten
  • classification
  • outlier detection
  • clustering
  • filtering

Frage 169

Frage
is the automated process of finding relevant items from a pool of items
Antworten
  • classification
  • clustering
  • outlier detection
  • filtering

Frage 170

Frage
items can be filtered either based on a users own behavior or by matching the behavior of multiple users
Antworten
  • classification
  • clustering
  • outlier detection
  • filtering

Frage 171

Frage
_________________ is generally applied viat the following two approaches
Antworten
  • collaborative filtering
  • user behavior
  • content-based filtering

Frage 172

Frage
items can be filtered either based on a users own behavior or by matching the behavior of multiple users
Antworten
  • clustering
  • filtering
  • classification
  • outlier detection

Frage 173

Frage
A common medium by wich ________________is implemented is via the use of a recomender system
Antworten
  • classification
  • clustering
  • outlier detection
  • filtering

Frage 174

Frage
technique based on the collaboration of users past behavior
Antworten
  • collaborative filtering
  • classification
  • clustering
  • outlier detection
  • content-based filtering

Frage 175

Frage
based on the similarityof users behavior, items are filtered for the target user
Antworten
  • classification
  • clustering
  • outlier detection
  • filtering

Frage 176

Frage
is solely based on the similarity between users behavior, and requires a large amount of user behavior data in order to accurately
Antworten
  • filtering
  • classification
  • clustering
  • outlier detection
  • filtering collaborative

Frage 177

Frage
collaborative filtering is an example of application of law of large numbers
Antworten
  • True
  • False

Frage 178

Frage
technique focused on the similarity between users an items
Antworten
  • classification
  • clustering
  • outlier detection
  • filtering
  • content-based filtering

Frage 179

Frage
A user profile is created based on the users past behavior (likes, ratings, purchase history, etc)
Antworten
  • collaborative filtering
  • content_based filtering

Frage 180

Frage
Contrary to collaborative filtering, content-based filtering is solely dedicated to individual user preferences and does not require data about other users
Antworten
  • True
  • False

Frage 181

Frage
A recomender system predicts user preferences and generate suggestions for the user accordingly
Antworten
  • filtering
  • classification
  • clustering
  • outlier detection

Frage 182

Frage
suggestions commonly pertain to recomending items, such as movies, books, web pages, people etc
Antworten
  • clustering
  • classification
  • filtering
  • outlier dtection

Frage 183

Frage
A recomender system typically uses either collaborative filtering or content-based filtering to generate suggestions
Antworten
  • True
  • False

Frage 184

Frage
recommender system may also be based on a hybrid of both collaborative filtering and content-based filtering to fine-tune the accuracy and effectiveness of generated suggestions
Antworten
  • True
  • False

Frage 185

Frage
Based on matches found between financial product purchased by customers and the properties of similar financial products, the recommnder system automates seggestion for potential financial products that customers may also be interested in
Antworten
  • clustering
  • classification
  • filtering
  • outlier detection

Frage 186

Frage
Wich holiday destinations can be recommended based on the travel history of a holiday makes?
Antworten
  • clustering
  • classification
  • outlier detetcion
  • filtering

Frage 187

Frage
A fragment of text or speech data can carry different meanings in different contexts, whereas a complete sentence may retain its meaning, even if structured in different ways. In order for the machines to extract valuable information, text and speech data needs to be understood by the machines in the same way as humans do. Semantic analysis represents practices for extracting meaningful information from textual and speech data
Antworten
  • statistical analysis
  • semantic analysis
  • visual analysis
  • machinne learning

Frage 188

Frage
types of semantic analysis
Antworten
  • natural language processing
  • human behavior language
  • text analytics
  • sentimental analysis

Frage 189

Frage
Is a computers ability to comprehend human speech and text as naturally understood by humans
Antworten
  • text analytics
  • Natural language Processing
  • sentiment analysis

Frage 190

Frage
This allows computers to perfom a variety of useful task, such as full-text searches
Antworten
  • Text analysis
  • sentiment analysis
  • Natural language processiing (NLP)

Frage 191

Frage
instead of hard-coding the required learning rules, either supervised or unsupervised machine learning is applied to develop the computer undestanding of the natural language
Antworten
  • text analysis
  • natural language processing
  • sentiment analysis

Frage 192

Frage
in general the more learning data the computer has, the more correctly it can decipher human text and speech
Antworten
  • natural language Processing
  • text analytics
  • sentiment analysis

Frage 193

Frage
Natural language processing includes both text and speech recognition
Antworten
  • True
  • False

Frage 194

Frage
For speech recognition the system attempts to comprehend the speech and then performs an action, such as transcribing text
Antworten
  • text analytics
  • sentiment analysis
  • Natural language processing

Frage 195

Frage
How can grammatical mistakes be automaticalle identified?
Antworten
  • text analytics
  • Natural Language processing
  • sentiment analysis

Frage 196

Frage
Unstructured text is generally much more difficult to analyze and search, compared to structured text
Antworten
  • True
  • False

Frage 197

Frage
is the specialized analysis of text through the application of data mining, machine learning and natural language processing techniques to extract value out of unstructured text. Text analytics essentially provides the ability to discover text rather than just search it
Antworten
  • Natural language processing
  • text analytics
  • sentimente analysis

Frage 198

Frage
useful insights from text-based data can be gained by helping business develop an understanding of the information that is contained within a large body of text
Antworten
  • True
  • False

Frage 199

Frage
the basic tenet of text analytics is to turn unstructured text into data that can be searched and analyzed
Antworten
  • Natural language processing
  • text analytics
  • sentiment analysis

Frage 200

Frage
As the amount of digitized documents, e-mail, social media posts and log files increases, businesses have an increasing need to leverage any value that can be extracted from these forms of semi-structured and unstructured data
Antworten
  • text analytics
  • natural language processing
  • sentiment analysis
Zusammenfassung anzeigen Zusammenfassung ausblenden

ähnlicher Inhalt

MÓDULO 2. DE LA INFORMACIÓIN AL CONOCIMIENTO
Drusila Torres Zúñiga
fichas modulo 2
sebastian hoyos
Mapa mental de PARÁMETROS CURRICULARES PARA LA EDUCACIÓN INDÍGENA
rutza
Ejercicos de Práctica
yaniz2003
JUSTICIA INDIGENA
cristian pilco tipan
PRACTICA DOCENTE
camerohdz
MANUAL CONCEPTUAL DE LA METODOLOGIA GENERAL AJUSTADA MGA (2015)
Yorlay Socha
PERDIDA AUDITIVA
fonousta
Planeación por Competencias
haro.gama
Parámetros Curriculares para la Educación Indígena
majocastillo4
actividad 3. parámetros curriculares para la educación indígena
gabb_more