Ch26 Coding Systems

Character Set
1. A table that maps each character to a unique binary number.
2. ASCII
  1. American Standard Code for Information Interchange
  2. 7 bits, so holds 128 characters.
  3. Extended ASCII later added an eighth bit, giving 256 characters.
  4. Only contains basic English characters and punctuation.
  5. Not enough: the advent of the World Wide Web created the need for a universal system.
3. Unicode
  1. Was brought in to replace ASCII and include most characters from across the world.
    1. Note: this includes Emojis!
    2. Includes the characters for over 20 countries.
  2. Originally 16-bit, so it could code for 2^16 = 65,536 (around 65 thousand) characters.
  3. Constantly updated and maintained by the Unicode Consortium.
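
As a quick illustration of the character set ideas above, the following Python sketch computes how many values 7, 8 and 16 bits can hold, and looks up character codes with the built-in `ord` and `chr`. The helper name `code_count` is made up for this example.

```python
def code_count(bits):
    """Number of distinct values representable with the given number of bits."""
    return 2 ** bits

print(code_count(7))    # 128 (ASCII)
print(code_count(8))    # 256 (extended ASCII)
print(code_count(16))   # 65536 (16-bit Unicode)

# ord gives the numeric code of a character; chr goes the other way.
print(ord("A"))                  # 65
print(format(ord("A"), "07b"))   # 1000001, the 7-bit ASCII pattern for 'A'
print(chr(65))                   # A

# Characters outside ASCII have codes above 127.
print(ord("é") > 127)            # True
```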
Error Checking
1. Parity Bit
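  1. An additional bit is added to the end of a binary number to make the total number of 1s even (even parity) or odd (odd parity), so the receiver can check whether the count still matches.
  2. Unreliable: if an even number of bits flip, the parity still matches and the error goes undetected. It also cannot show which bit flipped, so the data must be retransmitted.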
2. Majority Voting
  1. Data is sent multiple (typically 3) times and if there is a discrepancy, the computer goes with the majority.
  2. Very reliable and can correct data without retransmitting; however, it uses a lot of bandwidth.
3. Check Digit
  1. An additional digit, derived from the number itself, is added to the end of the number.
  2. A simple method is to add up all the digits, then add up the digits of the sum, repeating until a single digit is left.
    1. This is unreliable: if the order of the digits changes, the check digit remains the same.
  3. A more advanced method multiplies each digit by a weight, so that the order of the digits matters. An example of this is the modulo-11 method.
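
The three error-checking methods above can be sketched in Python as follows. This is a minimal illustration, not a standard library API: all function names are made up, data is held as strings of '0' and '1' for readability, and the modulo-11 weights (2, 3, 4, ... counted from the rightmost digit, with 'X' standing for a check value of 10) are an assumption borrowed from the ISBN-10 scheme.

```python
from collections import Counter

def add_even_parity(bits):
    """Append a parity bit so the total number of 1s is even."""
    return bits + str(bits.count("1") % 2)

def parity_ok(word):
    """True if the received word (data plus parity bit) has an even number of 1s."""
    return word.count("1") % 2 == 0

def majority_vote(copies):
    """Rebuild the data position by position, keeping the most common bit."""
    return "".join(Counter(column).most_common(1)[0][0] for column in zip(*copies))

def mod11_check_digit(digits):
    """Weighted check digit; weights 2, 3, 4, ... start at the rightmost digit (assumed)."""
    total = sum(int(d) * w for w, d in enumerate(reversed(digits), start=2))
    check = (11 - total % 11) % 11
    return "X" if check == 10 else str(check)

print(add_even_parity("1011001"))   # 10110010 (four 1s, so the parity bit is 0)
print(parity_ok("10110011"))        # False: one flipped bit is detected
print(parity_ok("10110111"))        # True: two flipped bits go unnoticed

# The second copy has one corrupted bit; the other two outvote it.
print(majority_vote(["1011001", "1001001", "1011001"]))   # 1011001

print(mod11_check_digit("030640615"))   # 2
print(mod11_check_digit("030640651"))   # last two digits swapped: a different check digit
```

Because each digit is multiplied by a different weight, swapping two digits changes the weighted total, which the simple digit-sum method would miss.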
Graphics
1. Bit Maps
  1. A bit map graphic is a 2D array of pixels.
  2. Each pixel holds a colour value, stored as a binary number. Many thousands of these pixels typically make up an image.
  3. A problem with bit maps is that quality degrades when you zoom in. Additionally, they take up a large amount of space.
  4. Each pixel has a colour depth, which is the number of bits allocated to represent its colour. The higher the colour depth, the more colours can be expressed: n bits give 2^n colours.
2. Vector Graphics
  1. A vector graphic is an image generated from a set of instructions and mathematical formulae, which describe geometric shapes positioned relative to a point of origin.
  2. These are not suitable for photos as they simplify things significantly; however, they are good for diagrams and CAD/CAM images.
  3. A vector graphic typically takes up little space and can be easily scaled up without losing quality.
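
To make the link between colour depth and storage concrete, here is a small Python sketch. The function names are made up, and the size calculation covers only the raw pixel data of an uncompressed bit map, ignoring any file header or metadata.

```python
def colour_count(colour_depth):
    """Number of different colours a pixel can take with this many bits."""
    return 2 ** colour_depth

def bitmap_size_bits(width, height, colour_depth):
    """Raw pixel data size in bits: one colour value per pixel."""
    return width * height * colour_depth

print(colour_count(1))     # 2 (e.g. black and white)
print(colour_count(8))     # 256
print(colour_count(24))    # 16777216

# A 1920 x 1080 image at 24 bits per pixel.
bits = bitmap_size_bits(1920, 1080, 24)
print(bits)         # 49766400 bits
print(bits // 8)    # 6220800 bytes
```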
Audio
1. Audio is converted to digital data by sampling the sound wave many times per second.
  1. The higher the number of samples per second (the sample rate, measured in Hz), the higher the quality of the sound and the truer it is to the original.
    1. The computer then reconstructs a wave by interpolating between the sampled values. This is usually indistinguishable from the original to the human ear.
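2. Nyquist's Theorem states that to faithfully recreate a sound, you must sample at a rate of at least twice the highest frequency in that sound.
3. Humans hear roughly 20 Hz to 20 kHz, so a sample rate of at least 40 kHz is needed to cover the full range.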
4. The resolution of sound is the number of bits allocated to each sample. The more bits allocated, the more pressure levels each sample can represent.
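
The sampling ideas above can be sketched in Python as follows. This is an illustration under stated assumptions: the function names are made up, the sound is a pure sine wave with amplitude between -1 and 1, and the size calculation is for a single (mono) channel with no compression.

```python
import math

def min_sample_rate(highest_frequency_hz):
    """Nyquist: sample at least twice the highest frequency present."""
    return 2 * highest_frequency_hz

def sample_sine(frequency_hz, sample_rate_hz, resolution_bits, n_samples):
    """Sample a sine wave and store each sample as an integer level."""
    levels = 2 ** resolution_bits          # number of levels a sample can take
    samples = []
    for n in range(n_samples):
        t = n / sample_rate_hz             # time of the n-th sample in seconds
        amplitude = math.sin(2 * math.pi * frequency_hz * t)
        # Map the range [-1, 1] onto the integer levels 0 .. levels - 1.
        samples.append(round((amplitude + 1) / 2 * (levels - 1)))
    return samples

def audio_size_bits(sample_rate_hz, resolution_bits, seconds):
    """Raw size of a mono recording in bits."""
    return sample_rate_hz * resolution_bits * seconds

print(min_sample_rate(20_000))           # 40000
print(sample_sine(1000, 8000, 4, 8))     # eight 4-bit samples of a 1 kHz tone
print(audio_size_bits(44_100, 16, 60))   # 42336000 bits for one minute
```

With only 4 bits per sample there are just 16 possible levels, so the stored values are a coarse approximation of the wave; raising the resolution gives finer steps.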
Compression
1. Lossy compression
  1. Some data is discarded to reduce file size; however, the new file is usually indistinguishable from the original.
  2. An example would be replacing the hundreds of different shades of blue in an image of the sky, with just a few shades. A human would probably not notice the difference.
  3. JPEG files use this.
  4. It can be useful when transmitting images across the internet, as it significantly reduces the bandwidth required. This is especially important for people on slow connections.
2. Lossless compression
  1. Data is compressed in such a way that the original can be recovered in its entirety after decompressing, i.e. no data is discarded.
  2. One example is run-length encoding: if an image contains many consecutive pixels of the same colour, the computer stores this as x*"blue" pixels rather than "blue pixel", "blue pixel"... This doesn't lose accuracy, it just encodes repeated data more simply.
  3. Another example is dictionary-based encoding, where commonly occurring strings in a text file are replaced by a shorter token. A simple way to understand this is replacing every "and" with "&", which takes up 67% less space than writing "and". Again, there is no loss of accuracy, as the original data can be easily recovered.
    1. Dictionary based encoding can be used on binary data also, if it is considered as a string of 1s and 0s.
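
Both lossless methods can be sketched in a few lines of Python. The function names `rle_encode` and `rle_decode` are made up for this example, and the "and" to "&" substitution only round-trips safely on the assumption that "&" does not already appear in the original text.

```python
from itertools import groupby

def rle_encode(pixels):
    """Collapse each run of identical values into a (value, run length) pair."""
    return [(value, len(list(run))) for value, run in groupby(pixels)]

def rle_decode(runs):
    """Expand (value, run length) pairs back into the original sequence."""
    return [value for value, count in runs for _ in range(count)]

row = ["blue"] * 5 + ["white"] * 2 + ["blue"] * 3
encoded = rle_encode(row)
print(encoded)                      # [('blue', 5), ('white', 2), ('blue', 3)]
print(rle_decode(encoded) == row)   # True: nothing was lost

# Dictionary-style substitution: swap a common string for a shorter token.
text = "salt and pepper and vinegar"
compressed = text.replace("and", "&")
print(compressed)                               # salt & pepper & vinegar
print(len(text), len(compressed))               # 27 23
print(compressed.replace("&", "and") == text)   # True
```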

Created by Sam Sully almost 9 years ago