null
US
Iniciar Sesión
Regístrate Gratis
Registro
Hemos detectado que no tienes habilitado Javascript en tu navegador. La naturaleza dinámica de nuestro sitio requiere que Javascript esté habilitado para un funcionamiento adecuado. Por favor lee nuestros
términos y condiciones
para más información.
Siguiente
Copiar y Editar
¡Debes iniciar sesión para completar esta acción!
Regístrate gratis
2061115
DATA PREPROCESSING
Descripción
Mapa Mental sobre DATA PREPROCESSING, creado por sudhibala93 el 18/02/2015.
Mapa Mental por
sudhibala93
, actualizado hace más de 1 año
Más
Menos
Creado por
sudhibala93
hace más de 9 años
253
0
0
Resumen del Recurso
DATA PREPROCESSING
Cleaning
Missing Values
Nota:
Use the attributes mean to fill in the missing value
Use the attribute mean for all sample belonging to the same class as the given tupe
Use the most probable values to fill in the missing values
Data Cleaning
Nota:
Fill in missing values & correct inconsistencies in the data
Ignore the tuple
Nota:
Class label is missing & not effective
Fill in the missing value manuvally
Nota:
time consuming & may not be feasible
Use global constant to fill in the missing value
Nota:
replace values & same constant
Noisy Data
Binning
Nota:
It smooth sorted data value by consulting its neighborhood
smoothing by bin means smoothing by bin medians smoothing by bin boundaries
Clustering
Nota:
Similar values organized into groups & outliers may be detected by clustering
Combained computer & human Inspection
Nota:
Outlier may be identified through a combination of computer and human inspection
Regression
Nota:
Data can be smoothed by fitting the data to function
Linear regression
Multiple linear regression
Inconsistent Data
Nota:
Data inconsistencies may be corrected manually using external reference
Knowledge engineering tools may also be used to detect the violation of known data constrains
Data Reduction
Nota:
It can be applied to obtain a reduced representation of the data set yet closely maintains the integrity of the original data
Dimensionality Reduction
Data Compression
wavelet transforms
Principal components analysis
Numerosity Reduction
Data cube Aggregation
Strategies
Data cube aggregation
Dimension reduction
Nota:
step wise forward selection step wise backward elimination combination of forward selection & backward elimination decision tree induction
Numerosity reduction
Histograms
Clustering
Sampling
Nota:
SRSWOR of size n SRSWR of size n Cluster sample Stratisfied sample
Discretization & hierarchy Generation
For numeric data
Bining
Histogram & Analysis
Cluster Analysis
Entropy based Discretization
Segmantation by Natural Partitioning
For categorical data
Portion of a hierarchy by explicit data grouping
Partial ordering of attributes explicity at the schema level
Set of attributes,but not their partial orderies
Discretization & Concept hierarchy Generation
For Categorical Data
For Numeric Data
Integration & Transformation
Data Integration
Nota:
It can help improve accuracy & speed of the subsequent mining process
Reduce and avoid Redundancies & inconsistencies
Detection and resolution of data value conflicts
Data Transformation
Smoothing
Nota:
Remove the noise from data
Attribute construction
Nota:
new attributes are constructed and added to help the mining process
Aggregation
Nota:
Aggregation operation are applied to the data
Generlization
Nota:
Primitive data are replaced by high-level concept through the use of concept hierarchies
Normalization
Nota:
Attribute data are scaled with in small specified range such as -1.0 to 1.0 or 0.0 to 1.0
Why Data Preprocessing ?
Ease of Mining Process
To Improve the Quality of Data
Mostrar resumen completo
Ocultar resumen completo
¿Quieres crear tus propios
Mapas Mentales
gratis
con GoConqr?
Más información
.
Similar
GED en Español: Todo lo que necesitas saber
Diego Santos
AMÉRICA: PAÍSES~CAPITALES...
Ulises Yo
ESTADO DE FLUJOS DE EFECTIVO
Christian Muñoz
Test: The Passive voice
wendygil_22
EVENTOS EN JAVA
**CR 7**
Ecosystems
ricardico55555
constitucion de una empresa
isabel escobar
FARMACOCINETICA
sofia collazos
=ARTE=...
JL Cadenas
Test de Radicales 1 sencillo
MANUEL LUIS PÉREZ SALAZAR
Sistema óseo
Laura Mon
Explorar la Librería