Bing-Jie Ju, Ming Jin, Yang Tian, Xiang Zhen, De-Xing Kong, Wei-Lin Wang, Sheng Yan
Bing-Jie Ju, Ming Jin, Yang Tian, Xiang Zhen, De-Xing Kong, Wei-Lin Wang, Sheng Yan, Department of Hepatobiliary and Pancreatic Surgery, The Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou 310009, Zhejiang Province, China
Bing-Jie Ju, Ming Jin, Yang Tian, Xiang Zhen, Wei-Lin Wang, Sheng Yan, Key Laboratory of Precision Diagnosis and Treatment for Hepatobiliary and Pancreatic Tumor of Zhejiang Province, Hangzhou 310009, Zhejiang Province, China
Bing-Jie Ju, Ming Jin, Yang Tian, Xiang Zhen, Wei-Lin Wang, Sheng Yan, Research Center of Diagnosis and Treatment Technology for Hepatocellular Carcinoma of Zhejiang Province, Hangzhou 310009, Zhejiang Province, China
Bing-Jie Ju, Ming Jin, Yang Tian, Xiang Zhen, Wei-Lin Wang, Sheng Yan, Clinical Medicine Innovation Center of Precision Diagnosis and Treatment for Hepatobiliary and Pancreatic Disease of Zhejiang University, Hangzhou 310009, Zhejiang Province, China
Bing-Jie Ju, Ming Jin, Yang Tian, Xiang Zhen, Wei-Lin Wang, Sheng Yan, Clinical Research Center of Hepatobiliary and Pancreatic Diseases of Zhejiang Province, Hangzhou 310009, Zhejiang Province, China
De-Xing Kong, School of Mathematical Sciences, Zhejiang University, Hangzhou 310027, Zhejiang Province, China
Abstract BACKGROUND Post-hepatectomy liver failure (PHLF) increases morbidity and mortality after liver resection for patients with advanced liver fibrosis and cirrhosis. Preoperative liver stiffness using two-dimensional shear wave elastography (2D-SWE) is widely used to evaluate the degree of fibrosis. However, the 2D-SWE results were not accurate. A durometer measures hardness by quantifying the ability of a material to locally resist the intrusion of hard objects into its surface. However, the durometer score can only be obtained during surgery.AIM To measure correlations among 2D-SWE, palpation by surgeons, and durometermeasured objective liver hardness and to construct a liver hardness regression model.METHODS We enrolled 74 hepatectomy patients with liver hardness in a derivation cohort. Tactile-based liver hardness scores (0-100) were determined through palpation of the liver tissue by surgeons. Additionally, liver hardness was measured using a durometer. Correlation coefficients for durometer-measured hardness and preoperative parameters were calculated. Multiple linear regression models were constructed to select the best predictive durometer scale. Receiver operating characteristic (ROC) curves and univariate and multivariate analyses were used to calculate the best model’s prediction of PHLF and risk factors for PHLF, respectively. A separate validation cohort (n = 162) was used to evaluate the model.RESULTS The stiffness measured using 2D-SWE and palpation scale had good linear correlation with durometer-measured hardness (Pearson rank correlation coefficient 0.704 and 0.729, respectively, P < 0.001). The best model for the durometer scale (hardness scale model) was based on stiffness, hepatitis B virus surface antigen, and albumin level and had an R2 value of 0.580. The area under the ROC for the durometer and hardness scale for PHLF prediction were 0.807 (P = 0.002) and 0.785 (P = 0.005), respectively. The optimal cutoff value of the durometer and hardness scale was 27.38 (sensitivity = 0.900, specificity = 0.660) and 27.87 (sensitivity = 0.700, specificity = 0.787), respectively. Patients with a hardness scale score of > 27.87 were at a significantly higher risk of PHLF with hazard ratios of 7.835 (P = 0.015). The model’s PHLF predictive ability was confirmed in the validation cohort.CONCLUSION Liver stiffness assessed by 2D-SWE and palpation correlated well with durometer hardness values. The multiple linear regression model predicted durometer hardness values and PHLF.
Key Words: Hepatectomy; Liver Hardness; Durometer; Two-dimensional shear wave elastography; Post-hepatectomy liver failure; Liver failure
Post-hepatectomy liver failure (PHLF) represents postoperative dysfunction in the liver’s synthesis, excretory, and detoxification functions. Morbidity and mortality after liver resection increase, particularly in patients with advanced liver fibrosis and cirrhosis[1]. Cirrhosis is not only related to the occurrence of PHLF but also causes a coagulation disorder that increases the risk of increased blood loss and the need for blood transfusion during surgery[2,3]. In China, more than 80% of hepatocellular carcinoma (HCC) cases are related to hepatitis B virus (HBV) infections that can cause liver dysfunction and chronic fibrosis[4]. In preoperative evaluations for hepatectomy, liver function and the degree of liver fibrosis are commonly considered.
Liver fibrosis biomarkers such as serum biomarkers and fibrosis models are used alone or in combination to determine the degree of liver fibrosis[5-9]and predict PHLF[10-12]. Currently, liver stiffness measurements (LSM) using imaging methods, including transient elastography and magnetic resonance elastography (MRE), have been reported as sufficient tools to diagnose liver fibrosis[13-18]and to predict the occurrence of PHLF[19-21]. Two-dimensional shear wave elastography (2D-SWE) reportedly improves fibrosis diagnostic accuracy compared to real-time tissue elastography[22]. However, the predictive accuracy of 2D-SWE for PHLF has not yet been discussed.
In clinical work, we found that the LSM results of some patients were inconsistent with the findings of palpation by surgeons. When we reviewed the stiffness measured by 2D-SWE and fibrosis pathology in hepatectomy patients at our center, we found that a group of patients had high-grade fibrosis but a low level of stiffness. This led to situations in which cases required open surgery after visualization under laparoscopy. For patients with a high degree of stiffness but a low fibrosis grade, open surgery may be avoided. The results of SWE can be influenced by age, inflammation, and obesity[22-24].
Liver hardness measured by the durometer is correlated with the degree of liver fibrosis and is a potential parameter for predicting PHLF[25,26]. However, these studies did not analyze the predictive accuracy of hardness for PHLF. Furthermore, the correlation between liver hardness measured by the durometer and liver stiffness measured by 2D-SWE has not been discussed yet. Thus, the aim of this study was to measure correlations between 2D-SWE and durometer-measured objective liver hardness and to improve 2D-SWE's ability to accurately predict liver hardness by constructing a preoperative liver hardness linear regression model.
We screened 279 patients who were scheduled to undergo liver resection for liver masses or intrahepatic bile duct stones at the Second Affiliated Hospital of Zhejiang University School of Medicine, Hangzhou, China, from March 2019 to December 2019 for inclusion in this study.
Forty-three patients were excluded because of the following criteria: Preoperative condition precluding surgery (n= 30), no preoperative LSM (n= 7), unsuccessful LSM (n= 1), and history of hepatectomy (n= 5). To develop the new hardness scale model, a derivation cohort that included 74 patients with liver hardness, measured by the durometer, between March 2019 and June 2019 was created. The validation cohort included the remaining 162 patients between July 2019 and December 2019. All surgeries were performed by three medical teams. Liver resection of more than and less than three segments was defined as major resection and minor resection, respectively[27]. The study protocol was approved by the institution’s ethics committee of each hospital and conformed to the ethical guidelines of the 1975 Declaration of Helsinki. The requirement for informed consent was waived.
Patients underwent a thorough medical history enquiry, routine preoperative laboratory measurements, and HBV and fibrosis serologic marker testing. A biomarker model associated with liver fibrosis, fibrosis-4 (FIB-4), was calculated as previously described by Sterlinget al[28]: [age (year)] × [AST (U/L)]/[platelet count (109/L)] × [ALT (U/L)1?2] (AST, aspartate aminotransferase; ALT, alanine aminotransferase). Preoperative imaging examinations including ultrasonography, computed tomography, and magnetic resonance were performed to evaluate the tumors. Some patients underwent the indocyanine green retention rate after 15 min test to ensure minimal residual liver volume. The resection range was determined by the size and location of the lesion, and minimum residual liver volume. Depending on the condition of the patients before and during the operation, the patients were transferred to the intensive care unit for treatment, as necessary. According to the criterion of the International Study Group of Liver Surgery[1], PHLF was defined as an increased international normalized ratio and concomitant hyperbilirubinemia on or after postoperative day 5.
Liver stiffness was measured using 2D-SWE (SuperSonic Imagine, Aix-en-Provence, France), operated by sonographers, 3 d before surgery. The measurement, as recommended by the World Federation of Ultrasound in Medicine and Biology guidelines, of the intercostal space on the right lobe of the patient’s liver was recorded[29]. The mean stiffness was used for further statistical analysis. A hand-held durometer (Rex Gauge, Buffalo Grove, IL, United States) was placed perpendicularly to the liver surface to obtain an objective measurement of liver hardness during surgery (Figure 1). The area of liver tissue tested was more than 1 cm thick, and without large blood vessels or tumors[25]. Ten readings were taken at different sites of the liver tissue. The durometer measurements were scored using a 0 to 100 scale in durometer units (DU). Three surgeons who had performed over 50 cases of open hepatectomy each, performed palpation of the same part of each liver, independently, and gave a score on a scale from 1 to 100. A palpation score of 50 represented a normal liver while a higher score indicated a correspondingly harder liver.
The statistical methods used in this study were reviewed by Professor Dexing Kong from the School of Mathematical Sciences, Zhejiang University. All statistical analyses were performed using SPSS software (version 20.0; IBM Corp., Armonk, NY, United States). The chi-square test, Student’sttest, and Mann-Whitney test were used to compare the patients’ clinical features and postoperative outcomes. The medians and quartiles of parameters were used in the analysis. The means of durometric readings and palpation score of each sample were calculated. The Pearson rank correlation coefficientRand the Spearman rank correlation coefficientRwere calculated to assess the independent variables associated with the durometer measurement. Only significant parameters associated with the durometer measurement in the correlation analysis were used for multiple linear regression analysis, and the hardness scale model was determined by mixed stepwise regression. The receiver operating characteristic (ROC) and corresponding area under the ROC (AUROC) curves were used to assess the predictive ability of the hardness scale for PHLF. The optimal cutoff value was set as the value maximizing Youden index, which was defined as sensitivity-specificity+1. Univariate analysis and multivariate analysis were performed by logistic regression to identify predictive factors for PHLF. Odds ratios with 95% confidence intervals derived from logistic regression were calculated.Pvalues were two-tailed and considered statistically significant if less than 0.05.
Overall, 58 males and 16 females were included in the derivation cohort. Seventy of the patients underwent surgery for liver masses (malignant tumorsn= 62; benign massesn= 8). The remaining four patients underwent surgery for intrahepatic bile duct stones. There were 57 patients who had HBV infections defined as hepatitis B surface antigens (HBsAg)-pos/neg +anti-HBc-pos[30]. One death occurred within 1 mo after the operation due to postoperative bleeding.
The validation cohort included 126 males and 36 females. The mean age was 56.30 ± 11.20 (range: 15-85) years. Further, 148 patients underwent surgery for malignant liver tumors, and 14 for benign liver diseases. There were 148 patients who had HBV infections. Patient characteristics of the two cohorts are shown in Table 1.
The characteristics, blood tests, plasma biochemical parameters, HBV tests, and liver fibrosis biomarkers of all patients were determined. The results of the correlation analysis between the patients’ clinical parameters and durometer measurements are shown in Table 2.
The Pearson and Spearman rank correlation coefficients between the durometer measurement and SWE stiffness were 0.704 (P< 0.001) and 0.769 (P< 0.001), respectively. A scatterplot representing the correlation is shown in Figure 2.
Seven preoperative parameters were associated significantly with the durometer measurement. Among these parameters, four (thrombin time [TT], HBV surface antigen, hyaluronic acid [HA] and SWE stiffness) and three (platelet, albumin and cholinesterase) parameters had a positive and negative linear correlation (P< 0.05) with the durometer measurement, respectively. Preoperative FIB-4 as a chronic disease score also had a positive correlation with liver hardness (r= 0.378,P= 0.001). These results showed that coagulation, HBV infection, liver function, and the degree of fibrosis were factors indicating liver hardness and consistent with signs of chronic liver diseases.

Table 1 Patient characteristics from both cohorts
The parameters mentioned above were used to construct the model to determine the liver hardness scale. Multiple linear regression (R2, goodness-of-fit measure) models were developed to explore the multiple linear regression relationships between these parameters and the durometer measurement. The results of parameter estimates were obtained by mixed stepwise regression analysis (Table 3). The best multiple linear regression model for liver hardness scale was:

Table 2 Correlation between preoperative clinical parameters and durometer measurements

Figure 1 Measurement of liver hardness using a durometer.
Hardness scale = 27.172 + 0.414 × SWE + 0.001 × HBsAg - 0.103 × albumin (P< 0.001).
Hardness scale represented the predicted value of durometer measurement. TheR2and adjustedR2values of this model were 0.580 and 0.552, respectively. They were higher than the univariate linear regression model built with SWE: Hardness scale = 20.707 + 0.616 × SWE (P< 0.001). TheR2and adjustedR2values of the second model were 0.527 and 0.519, respectively.

Table 3 Parameter estimates of the best multiple linear regression models for durometer

Figure 2 Correlation between liver stiffness and durometer scale.
Depending on the ROC curve, the durometer measurement and hardness scale could be used to predict PHLF in the derivation cohort. The AUROC of the durometer measurement and the hardness scale were 0.807 (P= 0.002) and 0.785 (P= 0.005), respectively. The optimal cutoff values of the durometer measurement and hardness scale were 27.38 DU (sensitivity = 0.900, specificity = 0.660) and 27.87, respectively. The maximizing Youden index of the hardness scale was 0.487 (sensitivity = 0.700, specificity = 0.787). The ROC curve is shown in Figure 3A. To eliminate the effect of a small remnant liver volume on PHLF after major resection, we evaluated the efficacy of the hardness scale model in predicting the risk of PHLF in patients who underwent minor hepatectomy. The AUROC was 0.780 (P= 0.019).
The prognostic accuracy of the hardness scale model was also evaluated in the validation cohort. The derivation and validation cohorts exhibited similar baseline characteristics (Table 1). The model appeared to significantly predict PHLF in the validation cohort (Figure 3B), with an AUROC of 0.765 (P< 0.001). The AUROC was 0.768 (P= 0.001) in patients who underwent minor hepatectomy.
Similar prognostic accuracy for 2D-SWE was observed between the two cohorts. The AUROC in the derivation and validation cohorts was 0.762 (P= 0.010) and 0.747 (P= 0.001), respectively.

Figure 3 Receiver operating characteristic curve for post-hepatectomy liver failure.
The patients were divided into two groups based on the hardness scale cutoff value: The low scale group comprising 52 patients (hardness scale ≤ 27.87) and the high scale group comprising 22 patients (hardness scale > 27.87). Patient characteristics of the two groups were compared as indicated in Table 4. The differences between the high and low hardness scale groups were noted for age, coagulation, liver synthesis function, and fibrosis. Preoperative platelet counts, levels of albumin, and cholinesterase were significantly lower among the high hardness scale group than among the low hardness scale group. The incidence of PHLF in the high hardness scale group was higher than that in the low scale group (36.4%vs11.5%;P< 0.05). Four patients in the high hardness scale group developed massive pleural effusion and required drainage. The difference between the two groups in postoperative hospital stays indicated that patients with high hardness scale required a longer time to recover.
To identify the predictive factors for PHLF, univariate and multivariate analyses was performed for factors of patients with and without liver insufficiency. The results are shown in Table 5. The hardness scale (> 27.87) and major liver resection were identified as independent predictive factors of PHLF.
A model to transform the hardness scale to the palpation s cale was designed as follows: Predicted value of palpation scale = 2.451 × hardness scale - 0.023 × platelet (P< 0.001). TheR2and adjustedR2values were 0.647 and 0.634, respectively. The ability of the palpation scale to predict PHLF was similar to the durometer measurement (AUROC 0.781vs0.807,P< 0.05) (Figure 3A).

Table 4 Differences of characteristics for high and low hardness scale groups
LSM is a good technique to evaluate intraoperative and postoperative outcomes[31-33]; however, it is not an accurate indicator of liver hardness when used alone because it can be impacted by age, gender, fibrosis, obesity, increased parietal wall thickness and, inflammation[22,23,34]. For patients with a high stiffness score but a soft liver, open surgery may be avoided.
Given the establishment of enhanced recovery after surgery (ERAS)[35]and the motivation to avoid wasting medical resources, this study evaluated liver hardness measurements using a durometer as an objective indicator. The durometer is placed directly on the liver surface, which eliminates interference factors such as obesity and abdominal wall thickness. The positive correlation between durometer readings and liver fibrosis has been shown previously[25,26]. However, the durometer score can only be obtained during surgery. Therefore, we sought to construct a model to predict the durometer score based on 2D-SWE and preoperative laboratory parameters.
Patient selection in this study was not limited to HCC because the resection range was primarily determined by the location of the lesion, liver tolerance, and whether an R0 resection could be performed[36]. The surgical processing of malignant tumors and benign masses was similar despite the fact that these cannot be identified before surgery.
The preoperative parameters that were correlated with the durometer measurement in the present study may be divided into four categories: Coagulation, liver function, HBV infection, and fibrosis. Recent studies have demonstrated that platelets have a positive role in promoting liver regeneration and protecting hepatocytes[37]. A low platelet count caused by liver disease increases the risk of intraoperative bleeding. Operative blood loss has been reported as an independent predictor of both perioperative morbidity and mortality[36,38]. Preoperative albumin and cholinesterase were associated with liver hardness. A decrease in these two parameters indicates impaired liver synthetic function, which is consistent with the symptoms of liver fibrosis[39]. Decreased serum cholinesterase is an excellent biomarker of cirrhosis[40]. The presence of HBsAg is a hallmark of HBV infection[41], and titers of HBsAg are useful for discriminating different chronic hepatitis B phases and stages of chronic liver disease[42]. HA is one of the liver fibrosis serum biomarkers, and its level increases with advanced fibrosis. Hence, the level of HA has been used as an independent predictor of liver-related mortality in patients with liver disease[43]. Hansenet al[5]reported that a combination of multiple biomarkers improved diagnostic accuracy of high grades of liver fibrosis.

Table 5 Risk factors for post-hepatectomy liver failure
To select parameters for a linear regression model, we performed correlation analysis of preoperative parameters and durometer scores. The levels of seven parameters (TT, HBsAg, HA, stiffness, platelet count, albumin, cholinesterase) were significantly associated with the durometer measurement (Table 2). Using the plasma biochemical parameters mentioned above, we developed several possible multiple linear regression models. Model one that included three parameters (albumin, SWE, and HBsAg) had the best linear fit for durometer measurement. TheR2value for model one was 0.580, which was higher than theR2value for the model based on stiffness measured by SWE alone. These results indicated that model one, which included serum parameters and liver stiffness, improved preoperative clinical assessment for liver hardness as opposed to stiffness alone.
Based on the ROC curve, durometer measurements, hardness scale, and palpation scale had a predictive ability for postoperative liver dysfunction. The predictive accuracy of 2D-SWE for all grades of PHLF in the derivation cohort was lower than that of the durometer measurement and palpation scale and the hardness scale in both cohorts. The AUROC of the validation cohort (0.747,P< 0.001) was higher than the results of 1D-SWE of HCC reported by Chonget al[19].
In comparing the characteristics of the hardness scale ≤ 27.87 and > 27.87 groups, age, platelet count, HA, and cholinesterase were significantly different between the two groups. The > 27.87 hardness scale group had a higher incidence of PHLF, which might have prolonged postoperative hospital stays.
Preoperative platelet < 150 × 109U/L, albumin ≤ 42 g/L, ALT ≥ 44 U/L, major resection, and hardness-related parameters (stiffness, > 12 kPa, durometer scale, > 27.38 DU, hardness scale, > 27.87) were identified by univariate analyses as risk factors for PHLF. In the present study, the main preoperative factors that predicted PHLF were the presence of impaired preoperative liver function, chronic liver disease, and major liver resection, which are consistent with the conclusions drawn in previous studies[11,12,19].
Quantifying the hardness of liver tissue may aid surgeons in surgical planning and postoperative management. Liver hardness provides vital information regarding the liver. It indicates difficulty of hemostasis and possible postoperative liver dysfunction. Determining liver hardness before surgery using a hardness scale model may prompt the surgeon to prepare for intraoperative blood transfusion, low central venous pressure during liver parenchymal transection, and preventive administration of liver protective drugs. For cancer patients with a high-grade fibrosis but low hardness scale scores, appropriate expansion of the resection range while ensuring the minimum residual liver volume can be considered to ensure R0 resection. Therefore, liver hardness is important in daily surgical practice.
While the durometer is a useful tool, it remains difficult for surgeons to use durometer values in clinical decision-making. The quantified palpation method is intuitive and easier to apply than the durometer measurement. Palpation by surgeons has been shown to be correlated with hardness obtained by a durometer in the pancreas[44,45]. In the present study, we found that the palpation scale was correlated with the durometer measurement and was a reliable predictive factor for PHLF. Although the palpation scale is not a precise method for liver hardness, it provides surgeons with an intuitive method. The present study has several limitations. This study was restricted to the Chinese population. In China, HBV infection is the major cause of chronic fibrosis and cirrhosis[4]. Thus, the parameter reflecting the grades of HBV infections was included in the hardness scale model. In locations with other major causes of chronic fibrosis beside HBV infections, the parameters correlated with liver hardness may be different. The derivation cohort was derived from patients with durometer-measured-hardness. However, this resulted in a small cohort that did not allow us to allot some of these patients to a verification cohort. Although the predictive accuracy of the hardness scale model for PHLF has been validated, the predictive accuracy for durometer-measured hardness of the new model requires further validation for a larger cohort. In this cohort, the lack of patients with grade C PHLF led to the failure of 2D-SWE’s predictive efficiency for high-grade PHLF. The accuracy of palpation of different surgeons might be related to their surgical expertise. Therefore, the palpation scale should be verified by more surgeons with a range of surgical expertise.
In subsequent studies, additional surgical groups and patients should be included to improve the efficiency and accuracy of the hardness scale model. Moreover, the significance of liver hardness in perioperative management and long-time postoperative outcomes should continue to be explored. Considering that the degree of fibrosis may not be uniform in the liver, efforts to combine durometer readings and imaging LSM to establish a liver stiffness contour system must be useful. The ultimate goal should be to combine 3D printing technology with guide surgical planning and drill surgery procedures, which may in turn improve the accuracy of medicine and ERAS.
In this study, 2D-SWE showed a good correlation with durometer-measured liver hardness. The hardness scale model based on liver stiffness, HBsAg and albumin could predict liver hardness before surgery. Additionally, the model and liver hardness demonstrated PHLF predictive ability.
Preoperative noninvasive measurement of liver stiffness using shear wave elastography (SWE) is widely used to evaluate the degree of fibrosis. The SWE measurement involves applying a time-varying force to the tissue, and liver tissue with increased stiffness is considered harder, indicating severe fibrosis. Some studies reported that SWE predicted high-grade post-hepatectomy liver failure (PHF).
The results of the two-dimensional SWE are not a good representation of the actual hardness of the liver. Predicting liver hardness before surgery can prompt the surgeon to prepare for surgical practice.
This study aimed to construct a preoperative liver hardness model.
Correlation coefficients for durometer-measured hardness and preoperative parameters were calculated. Multiple linear regression models were constructed to select the best predictive durometer scale.
In the present study, we developed a linear regression model to predict liver hardness and found that surgeons’ subjective palpation scores were comparable with durometer measures of liver hardness. The hardness model had a good ability to predict PHF.
Liver stiffness assessed by two-dimensional shear wave elastography correlated well with durometer hardness values. The multiple linear regression model predicted durometer hardness values and post-hepatectomy liver failure.
The ultimate goal should be to combine three-dimensional printing technology to build a model that works in both touch and vision to guide surgical planning and drill surgery procedure with the spirit of accuracy medicine and enhanced recovery after surgery.
World Journal of Gastrointestinal Surgery2021年2期