Page 105 - FULL REPORT 30012024
P. 105

cleaning  and  renaming  process,  while  Figure  4.21  displays  the

                                          dataset after these modifications were applied.



























                                                 Figure 4.20 The death rate dataset before data cleaning.























                                                  Figure 4.21 The death rate dataset after data cleaning.

                                         Next, the dataset on the prevalence of hypertension, initially titled

                                         "hypertension-adults-30-79.csv,"     was      obtained      from
                                         OurWorldInData.org. Prior to the cleaning procedure, it consisted of

                                         columns such as 'Entity', 'Code', 'Year', and a long column labelled
                                         'Indicator: Prevalence of hypertension among individuals aged 30-

                                         79  years,  age-standardised:  Sex:  Both  sexes'.  The  cleaning

                                         procedure entailed renaming the dataset as "cleanhypertension.csv"
                                         and  simplifying  the  column  name  'Indicator:Prevalence  of

                                                               88
   100   101   102   103   104   105   106   107   108   109   110