Page 103 - FULL REPORT 30012024
P. 103

4.3.1  Dashboard Module




                                The  development  of  the  dashboard  module  focused  on  visualizing  stroke
                                mortality  data  comprises  several  pivotal  stages,  commencing  with  data

                                cleansing, advancing through the Extract, Transform, Load (ETL) process,

                                and culminating in the creation of the dashboard within Power BI.


                        4.3.1.1 Data Cleaning


                                  This section focuses on data cleaning, a crucial step in preparing datasets

                                  for  analysis.  This  part  details  how  stroke  mortality-related  datasets  are
                                  downloaded,  reviewed,  and  refined  to  ensure  data  quality  for  effective

                                  visualization on the dashboard.


                                  i.      Global Stroke Mortality Dataset


                                          The datasets, encompassing stroke mortality, total annual deaths,

                                          daily  smoking  prevalence,  population  growth,  and  hypertension
                                          prevalence, were meticulously curated through a thorough cleaning

                                          process  to  enhance  their  quality  for  in-depth  analysis.  Initially

                                          downloaded  in  CSV  format,  each  dataset  was  examined  and
                                          organized  in  Microsoft  Excel.  During  this  process,  unnecessary

                                          columns were systematically identified and removed, ensuring that
                                          only  essential  data  was  retained  for  analytical  purposes.  This

                                          careful  pruning  and  organization  led  to  the  merging  of  these

                                          individual datasets into a single, comprehensive dataset, optimized
                                          for analysis.


                                          The  initial  dataset,  named  "stroke-death-rate.csv,"  was  acquired

                                          from OurWorldInData.org. It consists of the following columns:

                                          Entity,  Code,  Year,  Deaths:  Stroke;  Sex:  Both;  Age:  Age-
                                          standardised (Rate). The column 'Deaths - Stroke - Sex: Both - Age:

                                          Age-standardised (Rate)' was changed to 'Total Death' in order to
                                                               86
   98   99   100   101   102   103   104   105   106   107   108