Page 210 - Big Data Analytics for Connected Vehicles and Smart Cities
P. 210

190	  Big	Data	Analytics	for	Connected	Vehicles	and	Smart	Cities	  	  Building a Data Lake	  191


                                          Table 9.1
                               How the Approach Addresses the Challenges
             Data Lake Challenges  How the Approach Addresses the Challenges
             Lack of clear strategy  The approach allows the development of a clear strategy that
                                   can be tested and practiced using a pilot. This strategy will then
                                   be revised and enhanced in the light of lessons learned and
                                   information received during the pilot.
             Existing data scattered and   The adoption of an incremental approach allows early results
             not well understood   while providing the time necessary to bring scattered data
                                   together. Early results will also act as a communication tool to
                                   motivate staff to identify additional data and help to bring it
                                   together.
             Difficulty in turning data into   The delivery of early results illustrates the full process from data
             action                collection to creation of a data lake, demonstration of ability to
                                   turn data into action, and the development of real-life strategies to
                                   serve as models for future analytics.
             Lack of big data skills  The approach enables both public- and private-sector resources to
                                   be combined and transitions to be created from initial project to
                                   full-scale project.
             Insufficient governance and   Data governance and security arrangements are tested during the
             security              pilot project.
             After initial establishment,   Data quality control measures can also be tested and developed
             degradation over time without  during the pilot project, forming a practical platform for preventing
             data quality control  degradation of operation over time.
             Lack of self-service   The ability to share data and analytics in pilot use cases enables
             capabilities and long   the establishment of self-service capabilities and shortened
             development times     development times using agile development approaches.
             Lack of features to motivate   Early delivery of practical results provides the tools necessary to
             and enable smart city and   motivate smart city and transportation exponents on the use of the
             transportation exponents  data lake. This also stimulates exponents to consider their own
                                   use of the data and what further analytics would be required. This
                                   has the added benefit of enabling results-driven data collection
                                   and acquisition.


            ments. This is unlikely to provide the best return on investment for the creation
            of a smart city data lake. In order to realize the new potential and possibilities
            offered by the data lake, it is necessary to implement organizational change
            and to build awareness among end users regarding the potential. One possible
            way to address this challenge is the development of an organizational plan as
            part of a pilot project. The transportation data analytics that emanate from the
            pilot project can also be used to support pilot arrangements for fine-tuning the
            organization. This may also require a cultural change that focuses on the orga-
            nization’s ability to adopt innovative techniques rather than following existing
            processes and procedures. In this respect, data analytics can be used as a bridge
            from the data lake to job functions for end user staff. I have grappled with the
            issue of how to develop organizational arrangements that would be the best fit
            for both technological and commercial layers of the architecture for some time.
   205   206   207   208   209   210   211   212   213   214   215