Introduction Liver fibrosis can progress to cirrhosis, liver failure, or hepatocellular carcinoma, which often requires transplantation and burdens healthcare systems around the world. Advances in single-cell RNA sequencing and machine learning have enhanced the understanding of immune responses in many liver diseases particularly alcohol liver cirrhosis (ALC) and non-alcoholic steatohepatitis (NASH). This study aims to identify key biomarkers involved in these conditions and assess their potential as non-invasive diagnostic tools.
Methods Two gene expression profiles GSE136103 and GSE115469 were used to conduct differential gene expression (DEG) analysis. Using the results from DEG analysis, we then applied two machine learning network-based algorithms, master regulator analysis (MRA) and weighted key driver analysis (wKDA), to identify potential biomarker genes for NASH and ALC.
Results A total of 1,435 and 5,074 DEGs were identified for ALC and NASH compared to healthy controls, including 1,077 shared DEGs between the two diseases. The MRA showed HLA-DPA1, HLA-DRB1, IFI44L, ISG15, and CD74 as the potential master regulators of ALC and HLA-DPB1, HLA-DQB1, HLA-DRB5, PFN1, and TMSB4X as the potential master regulators of NASH. In addition, wKDA analysis indicated CD300A, FCGR2A, RGS1, HLA-DMB, and C1QA as the key drivers of ALC and INPP5D, NCKAP1L, RAC2, PTPRC, and TYROBP as key drivers of NASH.
Conclusion This study presented a comprehensive framework for analyzing single-cell RNA-seq data, demonstrating the potential of combining advanced network-based machine-learning techniques with conventional DEG analysis to uncover actionable prognostic markers for ALC and NASH with potential use as target biomarkers in drug development.
Competing Interest StatementThe work described herein represent the views of the authors and not of the Applied Pharmaceutical Innovation or any of its funders.
Funding StatementThis study did not receive any funding
Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The datasets analyzed in the current study are publicly available in the Gene Expression Omnibus (GEO) repository under accession numbers GSE136103 and GSE115469. These can be accessed via the following link: http://www.ncbi.nlm.nih.gov/geo
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Data AvailabilityThe datasets analyzed in the current study are publicly available in the Gene Expression Omnibus (GEO) repository under accession numbers GSE136103 and GSE115469. These can be accessed via the following link: http://www.ncbi.nlm.nih.gov/geo
Comments (0)