Page 80 - Data Science Algorithms in a Week
P. 80
64 Olmer Garcia and Cesar Diaz
Our work is inspired by the German Traffic Signs data set provided by Stallkamp,
Schlipsing, Salmen, & Igel (2011) that contained about 40k training examples and 12k
testing examples. The same problem can be used as a model for Colombia traffic signs.
This is a classification problem which aims to assign the right class to a new image of a
traffic sign by training on the provided pairs of traffic sign images and their labels. The
project can be broken down into five parts: exploratory data analysis, data preprocessing
and data augmentation, the definition of a CNN architecture, training the model, testing
the model and using it with other images.
Data Analysis
The database is a set of images which can be described computationally like a
dictionary with key/value pairs:
The image data set is a 4D array containing raw pixel data of the traffic sign
images (number of examples, width, height, channels).
The label is an array containing the type of the traffic sign (number of samples,
traffic sign id).
Traffic sign id description is a file, which contains the name and some
description for each traffic sign id.
An array containing tuples, (x1, y1, x2, y2) representing coordinates of a
bounding box around the sign in the image.
It is essential to understand the data and how to manipulate it (Figure 5 shows some
randomly selected samples). This process of understanding and observing the data can
generate important conclusions such as:
Single-image, multi-class classification problem.
Forty-three classes of a traffic sign.
Reliable ground-truth data due to semi-automatic annotation (Stallkamp,
Schlipsing, Salmen, & Igel, 2011).
The images contain one traffic sign each
Images are not necessarily squared; they contain a border of 10% around the
traffic sign and is not centered in the image.
Image sizes vary between 15x15 to 250x250 pixels
The classes were found to be highly imbalanced.