EDA

Description

Bsc Hons Data Mining Mind Map on EDA, created by Steve Hiscock on 15/12/2013.
Steve Hiscock
Mind Map by Steve Hiscock, updated more than 1 year ago
Steve Hiscock
Created by Steve Hiscock over 11 years ago
32
0
1 2 3 4 5 (0)

Resource summary

EDA
  1. Data Granularity - Levels in the data. ie Time, Years, Months, Weeks, Days, Hours
    1. Consistency - Dates 01/01/2000 or 1/1/00
      1. Corruption and Accuracy - System generated problems / Human errors / Out of date
        1. Data Duplication
          1. Missing Data
            1. SOLUTIONS
              1. capitalisation (transform all)
                1. Combine or concatenations of variables
                  1. Careful use of fomats
                    1. Removals of unwanted characters
                      1. Exclusion
                      2. consistent units
                        1. Add system checks
                          1. Reduce the variable types
                          2. Data Types / Model Roles

                            Attachments:

                            1. Categorise Data
                              1. Discrete Data
                                1. Gender
                                  1. Make of car
                                    1. Number of cars
                                      1. Data that can only take certain values.
                                      2. Continuous Data
                                        1. Data that can take any value (within a range)
                                          1. Bank balances
                                            1. Measurements
                                              1. Dates
                                            2. Data Levels

                                              Attachments:

                                              Show full summary Hide full summary

                                              0 comments

                                              There are no comments, be the first and leave one below:

                                              Similar

                                              Data Warehousing and Mining
                                              i7752068
                                              Insurance Policy Advisor
                                              Sufiah Takeisu
                                              Data Mining Part 1
                                              Kim Graff
                                              Minería de Datos.
                                              Marcos Soledispa
                                              Machine Learning
                                              Alberto Ochoa
                                              Data Mining from Big Data 4V-s
                                              Prohor Leykin
                                              Data Mining Process
                                              Steve Hiscock
                                              Data Mining Tasks
                                              Steve Hiscock
                                              Model Roles
                                              Steve Hiscock
                                              Distribution Types
                                              Steve Hiscock