Page 111 - FULL REPORT 30012024
P. 111

4.3.1.2 ETL Process



                               According to Yulianto (2019), an essential phase in data warehousing is the

                               ETL process, which stands for Extract, Transform, and Load, which involves
                               removing data from several sources, transforming it to fit the needs of the data

                               warehouse, and then loading the data warehouse with the converted data. This
                               ETL process is only being performed for the stroke mortality dataset.



                               i.     Extract

                                      Data can be extracted from the source CSV file, merged_dataset.csv,

                                      during  the  extract  step.  A  stroke  database  was  built,  including  a
                                      collection  named  'strokedashboard'.  After  choosing  the  CSV  file  to

                                      import,  the  MongoDB  Compass  import  interface  is  shown.  This

                                      interface plays a crucial role in setting the data types for each field to
                                      guarantee  data  consistency  throughout  the  import  process.  The

                                      interface seen in Figure 4.30 demonstrates the careful assignment of
                                      data  types  to  each  column.  Specifically,  the  'Country'  field  is

                                      designated  as  a  'String',  the  'Year'  field  as  'Int32',  and  the  different
                                      numerical fields such as total deaths and stroke death rates are assigned

                                      as 'Double'. Precise data type is essential for preserving the integrity of

                                      the data throughout the ETL process.

























                                                Figure 4.30 Data Import Interface in MongoDB Compass.
                                                               94
   106   107   108   109   110   111   112   113   114   115   116