Page 34 - Big Data Analytics for Connected Vehicles and Smart Cities
P. 34

14	  Big	Data	Analytics	for	Connected	Vehicles	and	Smart	Cities	  	  Questions to Be Addressed	  15


            a future vision along the process that converts data to information, insights,
            and actionable strategies, then perhaps it would begin to recognize data as a
            raw material  that could transform transportation in smart cities. Many believe
            that the main issue is the inability to store large volumes of data because of cost
            constraints. Fortunately, our new perception of the value of data coincides with
            new abilities to store huge volumes of data at relatively low cost. This issue is
            addressed in Chapter 11, which covers cost and benefits estimates.
                 Advances in data science enable the storage and manipulation of data in
            ways that were not thought possible in the relatively near-term past. Thanks to
            Hadoop, Google, and Amazon, some important new abilities to store and ma-
            nipulate data are converging with the new perception of the value of data. There
            are also new possibilities for structuring and restructuring data to optimize its
            retrieval and management.
                 For example, a new concept known as a data lake allows data from numer-
            ous sources to be merged into a single repository. Data lakes—essentially cen-
            tralized data repositories that can ingest data from multiple sources and make
            it accessible across an organization or enterprise—can contain a wide variety of
            structured and unstructured data. In data lakes, the data is clean, contiguous,
            and easily accessible—unlike the contents of data swamps. Thus, data lakes
            enable an enterprise-wide view of data. Just like magnificent brick buildings
            wrought from humble clay, the data lakes and integrated data exchanges of the
            future can be constructed from data as their raw material. Chapter 10 further
            details data lakes.



            2.4  Questions Instead of Answers

            This chapter addresses a critical ingredient of the book. To engage the reader
            and explain why it’s worthwhile to understand big data and data analytics, it is
            important to define and frame the questions that can be addressed by such tools
            and techniques. The chapter also provides a high-level framework that paves the
            way for the discussion of use cases in Chapter 9. The questions to be addressed
            encapsulate the needs, issues, problems, and objectives that transportation pro-
            fessionals encounter. Identifying and defining the questions to be addressed is
            also a first important step in a results-driven approach to the use of big data and
            data analytics. An understanding of these questions will form the basis for the
            definition of use cases and objectives for the analysis work.


            2.5  Overview of the Questions

            The questions contained in this chapter are not designed to be completely ex-
            haustive but rather, to show some examples of the kinds of questions that can be
   29   30   31   32   33   34   35   36   37   38   39