Page 208 - Big Data Analytics for Connected Vehicles and Smart Cities

Building a Data Lake


                 The overall intent is to deliver early results that communicate the
            potential of the data lake. These early results also enable an effective
            dialogue with the range of users who might find the data and analytics useful,
            which in turn promotes interest and activity around the use of the data lake
            for multiple purposes.
                 The delivery of business value to smart city exponents is also crucial in the
            implementation of the pilot project. The pilot implementation allows the focus
            to be moved from data and data management to the delivery of insight and
            understanding and the subsequent incorporation of these into new strategies
            and new ways of doing business. The definition of business value involves the
            identification of the objectives to be addressed by the analytics and the problem
            statement that summarizes the need, issue, or problem to be addressed. At this stage
            in the process, a catalogue of available data sources is created. This represents
            the state of available data and captures the data’s format, structure, and current
            location. This would also include the establishment of suitable data-sharing
            agreements between smart city transportation partners. It is also beneficial to
            develop a preliminary list of analytics that will be conducted on the pilot data
            lake along with an identification of the proposed users of the analytics and the
            likely benefits that will be attained through the availability of the new insights
            and understanding. This will form part of an overall summary.
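
            As an illustration only, the catalogue of available data sources described
            above could be sketched in Python as follows. The class names, field names,
            and the two example sources are hypothetical and do not come from the pilot
            methodology itself; they simply show one way to record each source's format,
            structure, and current location, and to flag sources that still lack a
            data-sharing agreement.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class DataSource:
    """One catalogue entry. All field names are illustrative assumptions."""
    name: str
    data_format: str                 # e.g. "CSV" or "JSON"
    structure: str                   # e.g. "structured" or "semi-structured"
    location: str                    # where the data currently resides
    sharing_agreement: bool = False  # True once a data-sharing agreement exists


@dataclass
class Catalogue:
    """A simple in-memory catalogue of available data sources."""
    sources: List[DataSource] = field(default_factory=list)

    def add(self, source: DataSource) -> None:
        self.sources.append(source)

    def missing_agreements(self) -> List[str]:
        """Return the names of sources with no data-sharing agreement yet."""
        return [s.name for s in self.sources if not s.sharing_agreement]


# Hypothetical example entries, invented purely for illustration.
catalogue = Catalogue()
catalogue.add(DataSource("traffic_signal_logs", "CSV", "structured",
                         "traffic department file server", True))
catalogue.add(DataSource("transit_vehicle_positions", "JSON", "semi-structured",
                         "transit agency feed"))

pending = catalogue.missing_agreements()
print(pending)
```

            A structure of this kind could later be extended with the preliminary list of
            analytics, their proposed users, and the expected benefits, so that the
            overall summary draws on a single record.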

            Conducting the Approach Methodology on the Pilot Project
            The entire approach methodology is conducted on a pilot basis, focusing on the
            use cases selected for the pilot. This allows experience to be gained in data inges-
            tion, preparation, discovery, and exchange. During this stage of the pilot, data
            governance and data exchange arrangements would be defined and put into
            pilot operation. This would also include significant dialogue with end users on
            the use of the pilot use case analytics and on the possibility of extending the
            data lake. The analytics conducted should also address other job-specific needs.

            Developing a 12-Month Roadmap for a Full-Capability Data Lake
            Based on the results of the pilot and the experiences gained, a 12-month road-
            map for a full-capability data lake can be prepared. While the exact contents of
            the roadmap will depend on the needs of the city and organizations in question,
            it would typically contain the following at a minimum:


                 • A full set of use cases to be addressed and supported by the full-capabil-
                  ity data lake;
                 • Sequencing of the use cases across the 12 months;
                 • The development of a six-month action plan with required investment
                  and business justification based on the experiences of the pilot project;