Page 69 - Annual report 2021-22
P. 69

Annual Report 2021-22 |


               discovery. A unified BC resource with ease of navigation, analysis, and visualization of the results, all
               at one place will help researchers to expedite the analyses for better insights. But data acquisition,
               harmonization, storage, and visualization can be challenging and require computational automation
               with  domain  expertise.  For  instance,  fetching  bulk  and  single-cell  datasets  from  Gene  Expression
               Omnibus  (GEO)  resource,  harmonizing,  visualization  along  with  metadata  require  advanced  data
               analytical  approaches.  They  are  developing  semi-automated  pipelines  to  merge  gene  expression    52
               series matrices with available metadata to be stored in the backend structured query language-based
               system. The respective gene count matrix will be fed into the database which will be constructed and
               managed using supporting modalities viz. PHP, Apache, SQL, and R Shiny applications. Visualization is
               planned  in  the  form  of  plots  and  graphs  generated  as  a  result  of  applied  statistical  tests  with
               significance level. Based on the data collated in this resource, they will be able to perform supervised,
               semi-supervised and unsupervised machine learning approaches for the identification and in silico
               validation of diagnostic, predictive and prognostic biomarkers, wherever sufficient data points will be
               available.  Furthermore,  they  are  working  on  identification  of  the  co-expression  based  conserved
               modules  in  independent  datasets  with  special  focus  on  immune  infiltration  and  tumor
               microenvironment  to  discover  immune  hot  and  cold  regions  in  patient-specific  manner  to  aid  in
               precision therapy. In summary, the group is working on development of the breast cancer atlas where
               user  can  search,  browse,  and  analyze  the  reported/novel  potential  multi-omics  biomarkers.  This
               resource will comprise of bulk and single-cell datasets encompassing genomics, transcriptomics, and
               proteomics data of the samples which will not only be helpful in clinical settings for identifying patient-
               specific biomarker using machine/deep learning algorithms but also in generating the data-driven
               hypotheses for patient stratification, drug responsiveness, and in estimating survival groups.























               “Light brings us the news of the universe.” – William Bragg
   64   65   66   67   68   69   70   71   72   73   74