Page 6 - Big Data book

Differences between Structured, Semi-structured and Unstructured data:






PROPERTIES            STRUCTURED DATA             SEMI-STRUCTURED DATA          UNSTRUCTURED DATA

Technology            Based on relational         Based on XML/RDF              Based on character and
                      database tables                                           binary data

Transaction           Mature transaction          Transaction management is     No transaction management
management            management and various      adapted from DBMS and is      and no concurrency control
                      concurrency techniques      not mature

Version               Versioning over tuples,     Versioning over tuples or     Versioned as a whole
management            rows, and tables            graphs is possible

Flexibility           Schema-dependent and        More flexible than            Very flexible; there is
                      less flexible               structured data but less      no schema
                                                  flexible than unstructured
                                                  data

Scalability           Very difficult to scale     Scaling is simpler than       Very scalable
                      the DB schema               for structured data

Robustness            Very robust                 New technology, not very      -
                                                  widespread

Query performance     Structured queries allow    Queries over anonymous        Only textual queries are
                      complex joins               nodes are possible            possible
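
The Technology, Flexibility and Query performance rows can be made concrete with a small Python sketch. It is only an illustration, not an example from the book: the table names, the XML snippet and the sample sentence are all invented, and SQLite and ElementTree simply stand in for "a relational database" and "an XML store".

```python
import sqlite3
import xml.etree.ElementTree as ET

# Structured: fixed relational schema, so a SQL join across tables is possible.
# (Hypothetical tables and rows, invented for this sketch.)
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE author (id INTEGER PRIMARY KEY, name TEXT NOT NULL);
    CREATE TABLE book (id INTEGER PRIMARY KEY, title TEXT NOT NULL,
                       author_id INTEGER REFERENCES author(id));
    INSERT INTO author VALUES (1, 'Ada');
    INSERT INTO book VALUES (10, 'Big Data Basics', 1);
""")
structured_rows = conn.execute(
    "SELECT book.title, author.name "
    "FROM book JOIN author ON book.author_id = author.id"
).fetchall()
conn.close()

# Semi-structured: XML elements are self-describing, and two <book> elements
# may carry different children (the second has no <author> but has a <year>).
catalog_xml = """
<catalog>
  <book><title>Big Data Basics</title><author>Ada</author></book>
  <book><title>Untitled Notes</title><year>2020</year></book>
</catalog>
""".strip()
catalog = ET.fromstring(catalog_xml)
semi_titles = [b.findtext("title") for b in catalog.findall("book")]
# Path query: only the books that happen to have a <year> child.
books_with_year = [b.findtext("title") for b in catalog.findall("book[year]")]

# Unstructured: plain text with no schema, so only textual search is available.
note = "Ada wrote Big Data Basics. Nobody knows who wrote the untitled notes."
text_hits = [s.strip() for s in note.split(".") if "Ada" in s]

print(structured_rows)
print(semi_titles, books_with_year)
print(text_hits)
```

The join needs the schema declared up front, the XML query adapts to whatever children each element has, and the text search can only match characters.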