Page 102 - E1 Integrated Workbook STUDENT 2018
P. 102

Chapter 7





                           Big Data





               7.1   What is Big Data?


                                There are several definitions, for example:

                                    Big Data refers to extremely large collections of data that may
                                     be analysed to reveal patterns, trends and associations.

                                    Data collections so large that conventional methods of storing
                                     and processing the data will not work.



               7.2   Characteristics of Big Data – the 3 Vs



                    1.   Volume                     2.   Variety                     3.   Velocity






                    Organisations                   Financial and non-              Data only useful if
                    hold huge                       financial, internal             turned into
                    volumes of data,                and external,                   information in real-
                    for example                     structured and                  time or quickly.
                    supermarkets via                unstructured.
                    loyalty cards.



               A fourth ‘V’, Veracity is sometimes added, i.e. is the data accurate?


               7.3   Processing Big Data

                    The processing of Big Data is known as Big Data analytics.  For example:

                     –     Hadoop software allows the processing of large data sets by utilising large
                           multiple servers simultaneously.

                     –     Google Analytics tracks many features of website traffic.









               90
   97   98   99   100   101   102   103   104   105   106   107