Diagnostic accuracy of the Fibrosis-4 index for advanced liver fibrosis in nonalcoholic fatty liver disease with type 2 diabetes: A systematic review and meta-analysis
Article information
Abstract
Background/Aims
The Fibrosis-4 index (FIB-4) is a noninvasive test widely used to rule out advanced liver fibrosis (AF) in patients with nonalcoholic fatty liver disease (NAFLD). However, its diagnostic accuracy in NAFLD patients with type 2 diabetes mellitus (T2DM) is controversial due to the high prevalence of AF in this population.
Methods
Research focusing on the diagnostic accuracy of FIB-4 for liver fibrosis as validated by liver histology in NAFLD patients with T2DM was included, and 12 studies (n=5,624) were finally included in the meta-analysis. Sensitivity, specificity, hierarchical summary receiver operating characteristic (HSROC), positive predictive values (PPVs), and negative predictive values (NPVs) at low cutoffs (1.3–1.67) and high cutoffs (2.67–3.25) for ruling in and out AF were calculated.
Results
At low cutoffs, the meta-analysis revealed a sensitivity of 0.74, specificity of 0.62, and HSROC of 0.75. At high cutoffs, the analysis showed a sensitivity of 0.33, specificity of 0.92, and HSROC of 0.85, suggesting FIB-4 as useful for identifying or excluding AF. In subgroup analyses, high mean age and F3 prevalence were associated with lower sensitivity. The calculated NPV and PPV were 0.82 and 0.49 at low cutoffs, whereas the NPV was 0.28 and the PPV was 0.70 at high cutoffs. There were insufficient estimated NPVs <0.90 at a hypothesized prevalence of AF >30% at an FIB-4 cutoff range of 1.3–1.67.
Conclusions
Collectively, FIB-4 has moderate diagnostic accuracy for identifying or excluding AF in NAFLD patients with T2DM, but more evidence must be accumulated due to the limited number of currently reported studies and their heterogeneity.
Graphical Abstract
INTRODUCTION
Among noninvasive liver fibrosis assessments, the Fibrosis-4 index (FIB-4) is a representative serum marker for the screening liver fibrosis, originally developed using cohorts of patients with chronic hepatitis C and human immunodeficiency virus co-infections [1,2]. It underpins an algorithm that excludes patients unlikely to have advanced liver fibrosis (≥F3) (AF) based on its high sensitivity. Subsequently, other noninvasive tests such as vibration-controlled transient elastography (VCTE) or magnetic resonance elastography (MRE) can be applied. Commonly used threshold values for FIB-4 for the exclusion of AF include 1.3 or 1.45 [3]. In patients with nonalcoholic fatty liver disease (NAFLD), a lower threshold (FIB-4 <1.3) identifies a low-risk group for AF, where follow-up observation is deemed sufficient, while it is recommended that intermediate- (FIB-4 1.3–2.67) or high-risk (FIB-4 >2.67) patients undergo further fibrosis assessments such as VCTE, MRE, or liver biopsy [4].
However, recent studies have indicated that NAFLD patients with type 2 diabetes mellitus (T2DM) have significantly higher rates of fibrosis progression compared to those without T2DM over periods of 4, 8, and 12 years (with cumulative incidence rates of 24% vs. 20%, 60% vs. 50%, and 93% vs. 76%, respectively) [5]. Within the low-risk group (FIB-4 <1.3), T2DM was identified as an independent risk factor for liver-related complications, and a higher prevalence of liver stiffness greater than 8 kPa with VCTE was observed in diabetic patients (77%) compared to non-diabetic patients (40%) [6]. Similarly, even among NAFLD patients with T2DM who were classified as low risk based on FIB-4, a substantial proportion (51.5%) had >8 kPa liver stiffness measured by VCTE, raising concerns about the reduced sensitivity of FIB-4 in this population [7].
Drawing on existing research, a recent proposal in the United States suggested re-evaluating FIB-4 every 1–2 years in NAFLD patients with T2DM and every 2–3 years in those without [8], although research into the diagnostic performance and utility of FIB-4 in this population remains limited. A recent meta-analysis of individual patient data, which collected data from 1,780 biopsy-proven NAFLD patients with T2DM across five studies, found the diagnostic AUC of FIB-4 for AF to be 0.75, with sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) at a threshold of 1.3 of 73%, 62%, 62%, and 72%, respectively [9], reflecting the potential insufficient sensitivity and NPV of FIB-4 alone for ruling out AF patients in this population.
However, that study was not able to compile data from all existing studies due to the nature of individual patient data analysis and only collected data from studies published between 2009–2018. Due to the heterogeneity of the reported diagnostic accuracy of FIB-4 in this population, a systemic review and meta-analysis collecting various studies with larger samples and recently reported data is needed. The present study incorporated 12 studies to investigate the diagnostic accuracy of FIB-4 using data from biopsy-confirmed AF in NAFLD patients with T2DM and showed not only the diagnostic values of FIB-4 at low and high cutoffs, but also analyzed potential factors able to affect the accuracy of this representative noninvasive marker. Importantly, we focused on whether an FIB-4 cutoff of 1.3 might be appropriate to rule out patients with AF in terms of sensitivity and NPV in this subpopulation.
MATERIALS AND METHODS
This meta-analysis was conducted in accordance with a protocol previously registered with PROSPERO (the International Prospective Register of Systematic Reviews [ID no. CRD42024454538]). The execution of this systematic review and meta-analysis was in line with the recommendations outlined in the Preferred Reporting Items for Systematic Reviews and Meta-analyses (PRISMA) Extension for Diagnostic Test Accuracy.
Inclusion criteria, exclusion criteria, and study outcome
Patients with non-alcoholic fatty liver disease (NAFLD) with T2DM are considered NAFLD with T2DM because NAFLD specifically includes metabolic risk factors like T2DM in its definition, reflecting the same underlying metabolic disturbances present in NAFLD with T2DM [10]. Therefore, studies focusing on the diagnostic accuracy of FIB-4 for liver fibrosis as validated by liver histology in NAFLD patients with T2DM were considered suitable for inclusion in this study. Selection criteria are as follows: (1) diagnosed NAFLD cases and (2) patients who underwent FIB-4 with liver biopsy. Eligible study formats for inclusion were randomized controlled trials, cross-sectional analyses, and cohort studies either prospective or retrospective in nature. We excluded (1) individual case studies, (2) case series involving fewer than five patients, (3) literature reviews, (4) studies of patients with chronic viral infections like hepatitis B or C, (5) studies of patients with excessive alcohol intake, (6) studies of patients with diagnoses of fatty liver based on imaging or serological methods without histological confirmation, (7) studies of patients with type I diabetes, (8) studies not published in English, and (9) potential inclusion of patients with excessive alcohol consumption (defined as <30 g/day in men and <20 g/day in women). The primary outcome in this meta-analysis was the diagnostic accuracy of FIB-4 compared against liver histology in NAFLD patients with T2DM.
Search strategy
We conducted paper searches across multiple databases, including PubMed (MEDLINE), EMBASE, the Cochrane Library, and the Korean Medical Database for studies in English published from January 1, 2008, to October 20th, 2023. Our search used the Population, Index Test, Comparison, and Outcome (PICO) model, with specific keywords detailed in Supplementary Table 1. Search terms included “NAFLD,” “FIB-4,” and “diabetes.” We used a combination of unstructured text and specialized indexing terms, including Medical Subject Headings in PubMed and EMTREE terms in EMBASE, to enhance the accuracy and comprehensiveness of our literature search. Details on our search methodology and the individual results from each database are documented in Supplementary Table 2.
Selection of studies and extraction of data
During the initial phase of study selection, reviewers J.W.H. and M.N.K. independently screened titles and abstracts to extract those considered relevant. Subsequent to their individual review of the full-text articles, any differences in their evaluations were reconciled through consultation with a third reviewer, either D.W.J. or S.U.K. In addition, these reviewers conducted subsequent reviews, encompassing a detailed examination of the entire texts and an assessment of the possibility of bias. Both reviewers independently extracted data on study attributes and outcomes, organizing this information in a uniform manner, and any inconsistencies were addressed and resolved through discussions with D.W.J. and S.U.K.
Quality and risk-of-bias assessment
To assess the potential for bias, we applied the Cochrane risk-of-bias assessment tool; pertinent details are presented in Supplementary Figure 1A. In cases of disagreement, discussions were held with reviewers (J.W.H. and M.N.K.) to reach a consensus. The evaluation of bias risk used the QUADAS-2 tool [11]. Summaries of these bias assessments are available in Supplementary Figure 1B. Publication bias was evaluated using funnel plots and Eager’s test (Supplementary Fig. 2).
Statistical analysis
The diagnostic meta-analysis procedure, focusing on sensitivity, specificity, positive likelihood ratio (PLR), negative likelihood ratio (NLR), PPV, NPV, and diagnostic odds ratio (DOR), was conducted using RevMan 5 (Cochrane, London, UK), MetaDisc version 1.4 (Clinical BioStatistics Unit-Hospital Ramon y Cajal, Madrid, Spain), and R 4.4.1 (R Core Team). Summary receiver operating characteristic (SROC), hierarchical summary receiver operating characteristic (HSROC) curves, and forest plots were generated, and the meta-regression analysis was performed. To identify the heterogeneity observed across the studies, a random-effects model was applied, and the I2 statistic was calculated.
RESULTS
Study characteristics
A review of titles and abstracts in the chosen databases identified 28 related studies. Of these, 16 were excluded due to a non-relevant patient group (n=3) or index test (n=2), absence of comparative tests (n=2), and a deficit of data (n=9). In the end, 12 studies qualified for the subsequent analysis, which is detailed in Supplementary Figure 3.
These 12 studies observing AF in patients with T2DM and NAFLD as determined by FIB-4 and liver biopsy were included in this systematic review and meta-analysis as shown in Table 1 [9,12-22]. The studies comprised one multinational study, one study from Japan, and the remaining studies from non-Asian regions, with two being prospective cohort studies and the remaining 10 being cross-sectional studies. The risk of bias was as outlined in Supplementary Figure 2. The low and high cutoff values for FIB-4 used across these studies ranged from 1.3–1.67 and 2.67–3.25, respectively. The mean age of subjects ranged from 50–61.9 years. Few Asian patients were included in each study except the Japanese study, although some studies did not report the percentage of Asian patients. The mean body mass index (BMI) ranged from 27.4–37 kg/m2, with the lowest values in the study from Japan [16]. Mean serum levels of hemoglobin A1c (HbA1c) and alanine aminotransferase (ALT) were 6.5–7.5%, and 28–64 U/L, respectively. The prevalence of AF measured by liver biopsy varied widely across the reviewed literature, ranging from 11.4–48.4%. Notably, the study by Pennisi et al. [9] reported a prevalence of AF at 46.2%, and Bertot et al. [12] reported 48.4%, which were higher than the prevalence rates of 11.8–34.0% reported in other studies.
Diagnostic accuracy of FIB-4 for excluding and including AF
To evaluate diagnostic accuracy, a meta-analysis was initially conducted focusing on FIB-4 lower cutoff values of 1.3–1.67 (low cutoffs), and 5,181 patients across 10 studies were included in the analysis (Table 2). There was no significant publication bias among these studies (Supplementary Fig. 2A). The results revealed a sensitivity of 0.74, specificity of 0.62, PLR of 1.98, NLR of 0.42, DOR of 4.69, SROC of 0.75, and HSROC of 0.75 (Fig. 1A and Supplementary Fig. 4). Significantly, the low heterogeneity in sensitivity, NLR, and DOR across studies underscores the reliability of our findings. These findings suggest that FIB-4 at lower cutoff values is useful for diagnosing AF, and that lower cutoff values of FIB-4 can be employed to exclude AF.
Separately, to investigate the diagnostic accuracy at FIB-4 cutoff values of 2.67–3.25 (high cutoffs), additional analysis was conducted using 4,945 patients across nine studies (Table 2). There was no significant publication bias among these studies (Supplementary Fig. 2B). The results showed a sensitivity of 0.33, specificity of 0.92, PLR of 4.51, NLR of 0.67, DOR of 7.49, SROC of 0.82, and HSROC of 0.85 (Supplementary Fig. 5). However, there was significant heterogeneity among included studies. Nevertheless, in patients with T2DM, FIB-4 at higher cutoff values might serve as a diagnostic tool for AF, exhibiting high specificity.
Influences of various factors on the diagnostic accuracy
Supplementary Table 3 shows the results of a meta-regression analysis evaluating the influence of various factors on diagnosis of AF according to FIB-4 cutoff points. Factors analyzed included the percentage of Asian participants, HbA1c level, age, BMI, ALT level, and the prevalence of AF determined by biopsy. The results, indicated by coefficients, confidence intervals, and P-values, revealed mostly not significant associations between these variables and the likelihood of diagnosing AF, with a few exceptions; Asian, for instance, displayed a slight but significant negative relationship with AF diagnosis at the high cutoff point, while age presented a small negative association at the same threshold.
Table 3 shows the subgroup analyses for sensitivity and specificity of low cutoff FIB-4 values. We could not complete a subgroup analysis with regard to race or study region because just one study was performed in Asia. The subgroup analyses showed that high mean age and F3 prevalence were associated with lower sensitivity, whereas low mean HbA1c, age, and BMI were associated with lower specificity, although the findings of this subgroup analysis should be interpreted with caution due to the limited number of studies and the existence of study heterogeneity. Nevertheless, these findings suggest the complexity of diagnosing AF and highlight the potential need for incorporating additional factors to improve accuracy in future studies.
Sensitivity of FIB-4 for screening AF according to the cutoff points
Next, we assessed the diagnostic accuracy of FIB-4 at low cutoff values of 1.3, 1.45, and 1.67 (Table 4). At a 1.3 cutoff, the analysis yielded an SROC of 0.74, sensitivity of 0.74, specificity of 0.60, PLR of 1.93, NLR of 0.43, and DOR of 4.70. For the 1.45 cutoff, the sensitivity was also 0.74, and other values also remained consistent with a minor improvement in specificity to 0.61 and an adjusted PLR of 1.94. The highest threshold of 1.67 led to a slight increase in SROC to 0.75 and specificity to 0.62 along with slight changes in both PLR and NLR, indicating a marginal enhancement in diagnostic precision, although the sensitivity was also not affected. Among reviewed articles, two applied extremely lower cutoffs of FIB-4 (0.91 [21] and 0.97 [9]), which showed respective sensitivities of 0.89 and 0.90. Considering a high prevalence of AF in NAFLD patients with T2DM, these findings suggest that either the currently recommended FIB-4 cutoff of 1.3 or larger cutoffs might have an insufficient sensitivity <0.8. Although lowering the cutoff to <1.0 might maximize sensitivity, the usefulness and diagnostic accuracy of such an approach should be investigated in future studies.
NPV and PPV of FIB-4 at low and high cutoffs
We opted to focus on NPV due to its critical role in excluding AF in patients classified as low-risk by FIB-4. This approach ensures that cases of AF are not missed, improving the precision of the diagnostic process. In the reviewed literature, the values for NPV ranged from 0.73–0.95, demonstrating variability across studies. The calculated NPV from meta-analysis for FIB-4 low cutoffs was 0.82 (Fig. 2A). Among reviewed articles, two applied extremely low cutoffs of FIB-4 (0.91 [21] and 0.97 [9]), which led to NPVs of 0.84 and 0.79, respectively. In addition, the calculated PPV of low cutoffs was 0.49. We also evaluated NPV and PPV at FIB-4 high cutoffs. PPV ranged from 0.44 to 0.91, and calculated PPV from meta-analysis was 0.70 (Fig. 2B). NPV was 0.28. Although there was significant heterogeneity among the studies, which necessitates cautious interpretation, this variability might be attributable to differences in the presence of AF in NAFLD patients with T2DM across regions and races, considering that both NPV and PPV are significantly affected by prevalence.
Estimated NPV of FIB-4 at low cutoffs according to the prevalence of AF in NAFLD patients with T2DM
Among reviewed articles, one applying extremely low cutoffs of FIB-4 yielded an NPV of 0.84 [15], which was not different from the calculated NPV of 0.82 using the FIB-4 cutoff range of 1.3–1.67. Table 5 shows the potential impact of varying prevalence rates of AF on the NPV and PPV derived from low FIB-4 cutoffs and minimum or maximum sensitivity and specificity values from included studies in the meta-analysis. With an increase in AF prevalence, there is a noticeable decline in the NPV for FIB-4 at low cutoffs. This downward trend in NPV was evident across a range of prevalence rates (from 0.01–0.5); for instance, at a prevalence rate of 0.01, the NPV decreases from 1.00 to 0.70 at a prevalence rate of 0.5, reflecting a significant reduction in the ability of FIB-4 to accurately identify individuals without AF as the prevalence increases. In particular, there were insufficient estimated NPVs below 0.90 when we hypothesized a prevalence of AF >30% at FIB-4 values of 1.3–1.67. Conversely, PPVs exhibited an upward trend, starting from 0.02 at a prevalence rate of 0.01 and escalating to 0.66 at a prevalence rate of 0.5, indicating an enhanced capacity to correctly identify individuals with AF as the prevalence rises. This analysis highlights the critical impact of AF prevalence on the NPV of FIB-4 for excluding AF, emphasizing how the NPV decreases with higher prevalence rates.
DISCUSSION
In this systemic review and meta-analysis, we performed an in-depth analysis to assess the diagnostic accuracy of FIB-4 for excluding or including AF in NAFLD patients with T2DM. A recent individual patient data meta-analysis of five studies showed the sensitivity, specificity, NPV, PPV, and AUC of noninvasive tests, including FIB-4, for diagnosing AF [9], but our study focused on the diagnostic accuracy of FIB-4 and potential related factors. Importantly, we further analyzed the potential influence of AF prevalence on the NPV at a low cutoff, which raises the necessity of further investigation to uncover the real prevalence of AF in this population. Of note, we first simulated NPV and PPV according to various prevalence rates of AF in NAFLD patients with T2DM, which could provide useful information for screening AF using FIB-4.
This diagnostic meta-analysis for FIB-4 with a cutoff range of 1.3–1.67 (low cutoffs) demonstrated a sensitivity of 0.74, specificity of 0.62, and a diagnostic AUC of 0.75. Notably, the consistency across studies in terms of sensitivity, NLR, and DOR was sufficient, lending credibility to the results. In a recent meta-analysis involving general NAFLD population, a FIB-4 value between 1.02 and 1.45 to ensure diagnostic accuracy for AF had a sensitivity of 0.69, specificity of 0.54, and a diagnostic AUC of 0.73 [23], which are values comparable to those from our analysis. These findings suggest that FIB-4, when used with lower cutoff values, might not be significantly different between diabetic and general NAFLD populations in terms of sensitivity and specificity.
For the high FIB-4 cutoff range of 2.67–3.25, the meta-analysis revealed a sensitivity of 0.33, specificity of 0.92, and diagnostic AUC of 0.85, although there was significant inconsistency across studies. Recent data on general NAFLD population for AF using FIB-4 values of 2.67–3.25 showed a sensitivity of 0.34–0.39, specificity of 0.95, and diagnostic AUC of 0.74–0.75 [23]. The high variability across synthesized studies calls for cautious interpretation, and it remains uncertain whether additional tests for diagnosing AF in NAFLD patients with T2DM can be conclusively ruled out based on current evidence. However, in this population, the high cutoff of FIB-4 might be a highly specific test for diagnosing AF, comparable to the general NAFLD population. While the specificity of the test was high in high-risk patients with FIB-4 >2.67, current evidence cannot eliminate the need for further noninvasive tests for fibrosis or liver biopsy to confirm fibrosis. Furthermore, complementing FIB-4 with related factors like race or age, as observed in the meta-regression analysis, could enhance the power of FIB-4 for ruling-in AF in this population. Overall, our findings suggest that the diagnostic AUC of FIB-4 is equally effective in NAFLD patients with T2DM as it is in general NAFLD patients.
We found that FIB-4 at low cutoffs had a sensitivity of 0.74 and NPV of 0.82; thus, it should be complemented by the adjustment of cutoff values or supplementation of other related factors of liver fibrosis due to an insufficient accuracy to rule out AF in NAFLD patients with T2DM. Both European and American guidelines have suggested that VCTE or other imaging-based tests be considered as initial tests in the tertiary setting, particularly in high-risk patients like those with T2DM [24,25]. However, due to the cost-effectiveness and the limited availability of VCTE, additional advancement in serum-based noninvasive tests is necessary to achieve a sensitivity or NPV >0.9 for the initial screening of AF in this population. Furthermore, there is controversy regarding the suitability of the commonly used cutoff value of 1.3 for excluding AF in this population. In this regard, one study suggested lowering the FIB-4 cutoff to 1.0 to maintain a sensitivity >90% in patients with T2DM [19]. Other studies have confirmed that lowering the FIB-4 cutoff to 0.91 resulted in a sensitivity of 0.89 [21], while lowering it to 0.97 resulted in a sensitivity of 0.9 [9]. Therefore, further research is necessary to determine the appropriate cutoff values, and the potential lowered cutoff in this population should also be considered in real clinical practice. Moreover, there are few studies to investigate T2DM-specific scores adding various clinical factors to the FIB-4 to minimize undetected population by initial screening for AF [22,26], although they need to be validated in future studies. As previously reported, the combination with other serum tests such as M2BPGi could also enhance diagnostic accuracy [27]. Sequential combinations with imaging-based tests such as VCTE should also be considered due to its better diagnostic accuracy [9]. A recent guideline suggested that FIB-4 can be reassessed periodically every 2–3 years in patients without T2DM, but 1–2 years in patients with T2DM [8]. Therefore, frequent, repeated assessment in this population could minimize the potential non-detection of AF patients.
NPV and PPV are influenced by sensitivity, which underscores the need to consider lower cutoff points to rule out AF in NAFLD patients with T2DM. However, they can be critically influenced by AF prevalence, although the sensitivity and specificity can be maintained regardless. Recent prospective cohort studies in the United States have shown a higher prevalence of AF detected via MRE, around 14% in patients with T2DM [19]. Other cohort studies have confirmed a significantly higher prevalence of AF in patients with T2DM compared to those without (33% vs. 19%) through biopsy [5]. A recent meta-analysis targeting general NAFLD population also showed that the NPV of FIB-4 at a cutoff of 1.3 varies with the prevalence of AF [28]. Thus, it is crucial to recognize that NPV and PPV are highly dependent on disease prevalence which can vary significantly across different populations and studies. As shown in Table 5, high prevalence can inflate PPV, whereas NPV can be overly optimistic in low-prevalence settings, masking the true likelihood of false negatives. Therefore, it is important to contextualize these values within the specific prevalence rates of the study population. Additionally, due to this variability, sensitivity and specificity could be highlighted as more stable measures, providing a clearer indication of a diagnostic performance independent of prevalence. In addition, studies investigating the prevalence of AF in NAFLD patients with T2DM should be performed, and large datasets of various races and regions should be accumulated to determine accurate cutoff points and appropriate strategies to screen AF patients using FIB-4.
Despite including a significant number of studies investigating biopsy-confirmed NAFLD patients with T2DM, there are several limitations in this study. Our findings may not be applicable to east Asian regions with varying rates of NAFLD and T2DM because most included studies were from Western countries analyzing non-Asian participants. In addition, selection bias may have arisen from the clinicians’ discretionary decisions to perform liver biopsies, and the true prevalence of AF may be affected by this factor, which is important for the NPV of FIB-4. Nevertheless, we incorporated current evidence and calculated the overall diagnostic accuracy of FIB-4 in this population.
Collectively, FIB-4 for identifying or excluding AF in NAFLD patients with T2DM might offer moderate diagnostic accuracy, although a limited number and heterogeneity of current published studies were observed. Concerns remain about potential low sensitivity and NPV due to the higher prevalence of AF in this group, and there is a pressing need for more research on the ideal cutoffs. Additionally, investigating the prevalence of AF across ethnicities and regions is mandatory for assessing the effectiveness of initial FIB-4 screening approaches.
Notes
Authors’ contribution
JW Han, JH Yu, HA Lee, and DW Jun were responsible for the concept and design of the study, the data acquisition, analysis and interpretation of the data, and manuscript drafting. M Choi helped with the statistical analysis and data interpretation. MN Kim, YE Chon, HY Kim, JH An, YJ Jin, and SU Kim helped with the data interpretation.
Conflicts of Interest
The authors have no conflict of interest to declare.
Acknowledgements
The authors thank the Clinical Practice Guideline Committee for Noninvasive Tests (NIT) to Assess Liver Fibrosis in Chronic Liver Disease of the Korean Association for the Study of the Liver (KASL) for providing the opportunity to conduct this research.
This research was supported by Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education (2022R1I1A1A01063636 to J.W.H.). This research was also supported by a grant of the Korea Health Technology R&D Project through the Korea Health Industry Development Institute (KHIDI), funded by the Ministry of Health & Welfare, Republic of Korea (HI23C1489 to J.W.H.).
SUPPLEMENTAL MATERIAL
Supplementary material is available at Clinical and Molecular Hepatology website (http://www.e-cmh.org).
Abbreviations
FIB-4
Fibrosis-4 index
AF
advanced liver fibrosis
VCTE
vibration-controlled transient elastography
MRE
magnetic resonance elastography
NAFLD
nonalcoholic fatty liver disease
T2DM
type 2 diabetes mellitus
PPV
positive predictive value
NPV
negative predictive value
NAFLD
non-alcoholic fatty liver disease
PLR
positive likelihood ratio
NLR
negative likelihood ratio
DOR
diagnostic odds ratio
BMI
body mass index
HbA1c
hemoglobin A1c
ALT
alanine aminotransferase
References
Article information Continued
Notes
Study Highlights
• This study is the first to provide a comprehensive analysis of the diagnostic accuracy of FIB-4 specifically in NAFLD patients with T2DM, focusing on its effectiveness at varying cutoff points. The meta-analysis highlights the potential of FIB-4, particularly at low cutoffs, for excluding advanced fibrosis with moderate sensitivity and specificity. Importantly, the study emphasizes the need to consider AF prevalence when interpreting negative predictive values and explores the potential of adjusting cutoff values for improved diagnostic accuracy. The findings contribute valuable insights for optimizing noninvasive screening strategies in this high-risk population.