Page 208 - Big Data Analytics for Connected Vehicles and Smart Cities
The overall intent is to deliver early results that communicate the potential
of the data lake. The early results also enable an effective dialogue with the
range of users who might find the data and analytics useful; this in turn
promotes interest and activity around the use of the data lake for multiple
purposes.
The delivery of business value to smart city exponents is also crucial in the
implementation of the pilot project. The pilot implementation allows the focus
to be moved from data and data management to the delivery of insight and
understanding and the subsequent incorporation of these into new strategies
and new ways of doing business. The definition of business value involves the
identification of the objectives to be addressed by the analytics and the problem
statement that summarizes the need, issue, or problem to be addressed. At this stage
in the process, a catalogue of available data sources is created. This represents
the state of available data and captures the data’s format, structure, and current
location. This would also include the establishment of suitable data-sharing
agreements between smart city transportation partners. It is also beneficial to
develop a preliminary list of analytics that will be conducted on the pilot data
lake along with an identification of the proposed users of the analytics and the
likely benefits that will be attained through the availability of the new insights
and understanding. This will form part of an overall summary.
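The catalogue of data sources and the preliminary list of analytics described above can be pictured as two simple record types. The following Python sketch is illustrative only: the class names, field names, and sample entries are assumptions made for this example and are not prescribed by the approach. It records the format, structure, and current location of each source, notes whether a data-sharing agreement is in place, and flags the sources a proposed analytic cannot yet use.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class DataSource:
    """One catalogue entry (all field names are illustrative assumptions)."""
    name: str
    data_format: str                 # e.g., "CSV" or "JSON"
    structure: str                   # e.g., "structured" or "semi-structured"
    location: str                    # where the data currently resides
    sharing_agreement: bool = False  # is a data-sharing agreement in place?


@dataclass
class PilotAnalytic:
    """One entry in the preliminary list of pilot analytics (hypothetical)."""
    name: str
    sources: List[str]               # names of the catalogue entries required
    proposed_users: List[str]
    expected_benefit: str


def missing_agreements(catalogue: List[DataSource],
                       analytic: PilotAnalytic) -> List[str]:
    """Return required sources that are uncatalogued or lack an agreement."""
    by_name = {source.name: source for source in catalogue}
    return [
        name for name in analytic.sources
        if name not in by_name or not by_name[name].sharing_agreement
    ]


# Hypothetical sample entries, invented purely for illustration.
catalogue = [
    DataSource("traffic_signals", "CSV", "structured",
               "city traffic management server", sharing_agreement=True),
    DataSource("transit_avl", "JSON", "semi-structured",
               "transit agency database"),
]
analytic = PilotAnalytic(
    name="corridor travel time",
    sources=["traffic_signals", "transit_avl", "parking"],
    proposed_users=["traffic operations staff"],
    expected_benefit="earlier detection of recurring congestion",
)

print(missing_agreements(catalogue, analytic))
```

A check of this kind shows, before the pilot begins, which data-sharing agreements still need to be established for each proposed analytic.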
Conducting the Approach Methodology on the Pilot Project
The entire approach methodology is conducted on a pilot basis, focusing on the
use cases selected for the pilot. This allows experience to be gained in data inges-
tion, preparation, discovery, and exchange. During this stage of the pilot, data
governance and data exchange arrangements would be defined and put into
pilot operation. This would also include significant dialogue with end users on
the use of the pilot use case analytics and on the possibility of extending the
data lake and the conduct of the analytics to address other job-specific needs.
Developing a 12-Month Roadmap for a Full-Capability Data Lake
Based on the results of the pilot and the experiences gained, a 12-month road-
map for a full-capability data lake can be prepared. While the exact contents of
the roadmap will depend on the needs of the city and organizations in question,
it would typically contain the following at a minimum:
• A full set of use cases to be addressed and supported by the full-capabil-
ity data lake;
• Sequencing of the use cases across the 12 months;
• The development of a six-month action plan with required investment
and business justification based on the experiences of the pilot project;