Introduction

Oesophageal cancer (OC) is the sixth most common cause of cancer-related death worldwide and is a major global health challenge [1]. Neoadjuvant chemotherapy (NACT) is generally used for locally advanced oesophageal squamous cell carcinoma (OSCC) [2]. The response of the primary tumour to NACT is of great prognostic importance [3,4,5], and evaluation by positron emission tomography-computed tomography (PET-CT) has been reported to be useful in developing treatment strategies for OSCC [6, 7]. A relationship has been reported between the standardised uptake value (SUV), a commonly used parameter for semi-quantitative analysis of PET-CT images, and the prognosis and treatment response in OC [8, 9]. However, the SUV is influenced by several factors, including body composition and habitus, length of uptake period, plasma glucose, and partial volume effects. Furthermore, maximum SUV (SUVmax) does not represent the whole tumour [10,11,12].

In contrast, tumour volume indices that take into account metabolic activity, such as metabolic tumour volume (MTV) and total lesion glycolysis (TLG), have been attracting attention as new indices for PET-CT [13]. MTV is measured by contouring margins defined by thresholds, whereas TLG is calculated by multiplying MTV by the mean SUV (SUVmean) [14]. The use of MTV and TLG has been proposed to assess disease burden and tumour invasiveness by quantifying the metabolic volume burden and activity of tumours [15]. Several reports have been published on the relationship between these PET-CT parameters before and after chemoradiotherapy (CRT) and the prognosis of OC [16,17,18,19,20,21]. However, evidence of PET-CT parameters, particularly volumetric ones, before and after NACT being associated with OC patient survival is limited [22, 23].

Therefore, we aimed to investigate the utility of measuring SUVmax, MTV, and TLG values of the primary tumour in addition to the number of positive lymph nodes (LNs) using PET-CT before and after NACT as indicators of treatment efficacy and prognosis in patients with locally advanced OSCC.

Methods

Patient eligibility

This retrospective study included 285 consecutive patients with thoracic OC without distant metastasis who underwent surgery after NACT at Osaka University Hospital from January 2010 to December 2016. Among these patients, 233 had histologically confirmed squamous cell carcinoma and underwent PET-CT before and after NACT. With the availability of data on PET-CT parameters (SUVmax, MTV, and TLG), 226 cases were finally analysed after excluding 7 with SUVmax values <2.5 in the primary tumour before NACT. Patients with cervical or coeliac LN metastasis were eligible for inclusion. All patients were staged according to UICC criteria before and after surgery. Clinical staging before NACT was based on esophagography, endoscopy, and computed tomography (CT) of the neck, chest, and upper abdomen using continuous 5-mm-thick slices. All patients had adequate cardiac, hepatic, renal, and bone marrow reserves and could tolerate both the NACT and surgery. This study was approved by the Human Ethics Review Committee of the Osaka University Graduate School of Medicine (Osaka, Japan, approval number; 08226), and signed consent for participation and publication was obtained from each patient.

Surgical treatment

Our standard surgical procedures comprised subtotal esophagectomy with mediastinal lymphadenectomy via right thoracotomy, upper abdominal lymphadenectomy, gastric tube reconstruction, and anastomosis in the cervical incision [6, 7, 24]. A three-field lymphadenectomy was performed for patients with supraclavicular or recurrent laryngeal nerve lymph node metastases on preoperative staging or intraoperative diagnosis, and patients with a primary tumour located in the upper third of the thoracic oesophagus [25]. The remaining patients underwent a two-field lymphadenectomy.

18F-FDG PET-CT acquisition and analysis

All patients underwent 18F-fluorodeoxyglucose (18F-FDG) PET-CT before and within 2ā€“3 weeks after completion of NACT as described previously [6, 7, 22, 26]. 18F-FDG PET-CT was performed with an integrated scanner (Gemini GXL; Philips, Amsterdam, the Netherlands), and whole-body images, generally from the top of the skull to mid-thigh, were acquired approximately 60ā€‰min after intravenous injection of [18F]-FDG at a dose of 3.7 MBq (0ā€“10ā€‰mCu) per kilogram of body weight. Regions of interest (ROIs) were placed over the areas of the primary tumours with maximum FDG uptake on the baseline scans for semi-quantitative analysis. The SUVmax was calculated using previously reported methods [26], and MTV was defined as the tumour volume with SUVā€‰>2.5 [22]. TLG was defined as the SUVmean multiplied by MTV. Semi-quantitative and volumetric analyses of the primary tumours were performed to measure the PET-CT parameters reported in the present study using the volume viewer software SYNAPSE VINCENTĀ® (Fujifilm Medical, Tokyo, Japan), which can easily delineate a volume of interest (VOI) with an iso-contour threshold method based on the SUV. The software automatically calculates SUVmax, MTV, and TLG within the entire primary tumour when a spherical VOI is drawn to encompass the primary lesion (Supplemental Fig.Ā 1). SUVmax, MTV, and TLG were measured before and after NACT in all patients, and the reduction of each parameter was calculated and included in the analysis. LNs with SUVmax ā‰„2.5 by PET-CT were considered positive. All assessments were performed by one radiologist and more than two surgeons that specialise in OC.

Neoadjuvant chemotherapy

Our hospital adopted the NACT regimen comprising triplet chemotherapy with either 5-fluorouracil (5-FU), cisplatin, and doxorubicin (FAP), or 5-FU, cisplatin, and docetaxel (DCF) as described previously [27,28,29,30]. For the FAP regimen, 700ā€‰mg/m2 5-FU was given by continuous intravenous infusion on days 1ā€“7, along with 70ā€‰mg/m2 cisplatin by intravenous infusion, and 35ā€‰mg/m2 doxorubicin by rapid intravenous infusion on day 1 [31,32,33]. For the DCF regimen, cisplatin was administered at 70ā€‰mg/m2, docetaxel at 70ā€‰mg/m2 by rapid intravenous infusion on day 1, and 5-FU at 700ā€‰mg/m2 by continuous intravenous infusion on days 1ā€“5 [31]. With either regimen, two courses of NACT were usually used at a 3ā€“4-week interval. Other regimens used in the present study included 5-FU plus cisplatin (5-FU, 800ā€‰mg/m2/day, days 1ā€“5; cisplatin, 80ā€‰mg/m2/day, day 1, repeated every 3 weeks) for three patients and nedaplatin plus paclitaxel for one patient [34].

Evaluation of clinical response

All patients were re-staged by CT, endoscopy, and PET-CT to evaluate the clinical response 2 to 3 weeks after the completion of NACT [22, 35]. The response was categorised based on the World Health Organisation response criteria for measurable disease and the Japanese Society for Oesophageal Disease criteria. Histopathological findings were classified according to UICC TNM classifications, and the degree of histopathological tumour regression in the surgical specimens was classified into five categories. The extent of viable residual carcinoma at the primary tumour was assessed semi-quantitatively based on the estimated percentage of viable residual carcinoma about the macroscopically identifiable tumour bed that was evaluated histopathologically. The percentage of viable residual tumour cells within the entire cancerous tissue was assessed as follows: grade 0, no significant response to NACT; grade 1a, more than two-thirds of residual tumour cells; grade 1b, one-third to two-thirds of residual tumour cells; grade 2, less than one-third of residual tumour cells; grade 3, no viable residual tumour cells [36, 37]. The severity of postoperative complications was evaluated according to the Clavienā€“Dindo classification system [38]. Progression-free survival (PFS) was defined as the time from surgery to either the first recurrence or death from any cause. Overall survival (OS) was defined as the time from surgery to death from any cause.

Statistical analysis

We used the Studentā€™s t test to compare the averages of continuous variables between two groups, and the Mannā€“Whitney U-test, chi-squared test, and Fisherā€™s exact test to compare the proportions of categorical variables. Receiver operating characteristic (ROC) curve analysis was applied to identify the best discriminating cut-off values for SUVmax, MTV, and TLG. A multivariate Cox proportional hazard regression model with stepwise comparisons was used to identify independent prognostic markers. Prognostic variables were assessed by the log-rank test, and PFS and OS were analysed by the Kaplanā€“Meier method. Univariate and multivariate analyses of clinico-pathological variables were used to compare the reliability of the PET-CT parameters. Pā€‰<ā€‰0.05 indicated significant differences. All statistical analyses were carried out using JMPĀ®14 (SAS Institute Inc, Cary, NC, USA).

Results

Patient characteristics

The patient characteristics are summarised in Supplemental TableĀ 1. Twenty-three patients had cM1 disease, which was confined to metastasis of the supraclavicular LNs. All 226 patients underwent surgical resection after NACT. The most common NACT regimen was DCF (69.5%), followed by FAP (28.8%). Curative (R0) surgery was performed for the majority of patients (97.3%) while six patients (2.7%) received R1 resection. Three-field lymphadenectomy was performed in 129 cases (57.1%).

PET-CT evaluation of primary tumour and lymph nodes

The median SUVmax, MTV, and TLG values for the primary tumour before NACT were 12.2 (2.8ā€“40.9), 19.2 (0.4ā€“232.7), and 96.3 (1.2ā€“1619.3), respectively, and after NACT were 3.4 (1.5ā€“17.9), 1.6 (0ā€“52.9), and 4.5 (0ā€“345.8), respectively. All three PET-CT parameters were significantly lower after NACT (each Pā€‰<ā€‰0.0001). The number of PET-positive LNs before NACT (pre-NACT PET-N) was 0, 1, and >2 in 93 (41.2%), 49 (21.7%), and 84 (37.2%) cases, respectively, and the number of PET-positive LNs after NACT (post-NACT PET-N) was 0, 1, and >2 in 161 (71.2%), 33 (14.6%), and 32 (14.2%) cases, respectively.

Optimal cut-off values based on survival analysis

We conducted a stepwise analysis to determine the optimal cut-off for each parameter that clearly discriminates between responders and non-responders to NACT based on PFS. We achieved this by evaluating cut-off values for each PET-CT parameter at each 10% reduction from 0% to 90% (TableĀ 1). The cut-off values of 60%, 80%, and 80% for the SUVmax, MTV, and TLG reduction rate, respectively, showed the largest difference in PFS between responders and non-responders and the lowest P values (SUVmax: hazard ratio [HR]ā€‰=ā€‰1.75, Pā€‰=ā€‰0.0026; MTV: HRā€‰=ā€‰1.70, Pā€‰=ā€‰0.0054; TLG: HRā€‰=ā€‰2.04, Pā€‰=ā€‰0.0003). Therefore, we defined 60%, 80%, and 80% as the optimal cut-off values of the reduction in SUVmax, MTV, and TLG, respectively, to separate responders and non-responders to NACT for locally advanced OSCC.

Table 1 Stepwise regression analysis for the identification of optimal each parameter reduction rate cut-off value (nā€‰=ā€‰226).

We performed a ROC analysis of disease recurrence to determine the optimal cut-off value for the reduction of three PET-CT parameters as shown in Fig.Ā 1. The area under the curve (AUC) identified the TLG reduction to be the largest among the parameters. For all parameters, the cut-off values obtained by the stepwise method were the approximate values of those determined by the ROC analysis.

Fig. 1: Receiver operating characteristic (ROC) analysis of postoperative recurrence.
figure 1

The area under the curve (AUC) and cut-off for each PET-CT indicator are shown.

Clinico-pathological parameters associated with PET-CT evaluation

Using the cut-off value for the reduction in TLG determined by the stepwise method, we divided all patients into two groups: TLG responders (TLG reduction rate ā‰„80%) and TLG non-responders (TLG reduction rate <80%). We then compared various clinico-pathological factors as shown in TableĀ 2. We did not find any significant differences between the two groups in terms of age, sex, tumour location, tumour size, differentiation, pre-therapeutic PET-CT value, clinical TNM status, residual tumour, pre- and post-NACT PET-N, or postoperative complication. Compared to the TLG non-responders, TLG responders had a higher proportion of patients receiving the DCF regimen (77.5% vs. 50.0%; Pā€‰<ā€‰0.0001), better histological response (Grade 1b-3; 72.5% vs. 45.5%; Pā€‰=ā€‰0.0001), and less advanced pT stage (pT0-2; 65.0% vs. 30.3%; Pā€‰<ā€‰0.0001; TableĀ 2).

Table 2 Correlation between TLG reduction rate and clinico-pathological parameters.

Prognostic significance of PET-CT evaluation

The median follow-up time was 4.51 (range, 0.22ā€“10.88) years. The TLG responders achieved significantly more favourable prognoses than non-responders (2-year PFS rate: 64.1% vs. 38.5%, Pā€‰=ā€‰0.0001, Fig.Ā 2a; 2-year OS rate: 84.0% vs. 56.8%; Pā€‰<ā€‰0.0001). When classified into three groups according to TLG reduction rate (80ā€“100%, 60ā€“80%, <60%), the 2-year PFS rates were 64.1%, 46.3%, and 29.9%, respectively (Fig.Ā 2b). Univariate analysis of PFS indicated a correlation between PFS and cT, post-NACT SUVmax, pre-NACT MTV, post-NACT MTV, pre-NACT TLG, post-NACT TLG, the reduction rates for each PET-CT parameter, pre- and post-NACT PET-N, pT, pN, and histological grade (TableĀ 3). Among preoperative factors, TLG reduction rate (HRā€‰=ā€‰2.58; 95% CI 1.16ā€“5.73; Pā€‰=ā€‰0.020) and post-NACT PET-N (HRā€‰=ā€‰1.79; 95% CI 1.04ā€“3.08; Pā€‰=ā€‰0.035) were significant independent prognostic factors in a multivariate analysis of PFS (multivariate model #1, TableĀ 3). In addition, the TLG reduction rate (HRā€‰=ā€‰2.51; 95% CI 1.14ā€“5.51; Pā€‰=ā€‰0.022) and pN (HRā€‰=ā€‰1.96; 95% CI 1.24ā€“3.10; Pā€‰=ā€‰0.004) were both significant in the multivariate analysis of all factors for PFS (multivariate model #2, TableĀ 3). Lastly, a Kaplanā€“Meier analysis of PFS for all patients classified into four groups based on TLG reduction and post-NACT PET-N, both of which were independent prognostic factors in the multivariate analysis (model #1), is shown in Fig.Ā 3a. The same analysis of PFS for all patients classified based on TLG reduction and pN, both of which were independent prognostic factors in the multivariate analysis (model #2), is shown in Fig.Ā 3b. The 5-year PFS rates were 60.7% among post-NACT PET-N (0ā€“1)/TLG-responders, 36.7% among post-NACT PET-N (0ā€“1)/TLG-non-responders, 33.3% among post-NACT PET-N (ā‰„2)/TLG-responders, and only 9.1% among post-NACT PET-N (ā‰„2)/TLG-non-responders (Fig.Ā 3a). Furthermore, the 5-year PFS rates were 76.3% among pN(āˆ’)/TLG-responders, 50.0% among pN(āˆ’)/TLG-non-responders, 45.7% among pN(+)/TLG-responders, and 26.3% among pN(+)/TLG-non-responders (Fig.Ā 3b).

Fig. 2: Progression-free survival (PFS) classified by the total lesion glycolysis (TLG) reduction rate in the primary tumour during neoadjuvant chemotherapy.
figure 2

a The TLG responders (TLG reduction rate ā‰„80%) achieved significantly more favourable prognoses than non-responders (TLG reduction rate <80%). Two-year PFS rate: 64.1 vs. 38.5%, Pā€‰=ā€‰0.0001. b When classified into three groups according to TLG reduction rate (80ā€“100%, 60ā€“80%, <60%), 2-year PFS rates were 64.1%, 46.3%, and 29.9%, respectively.

Table 3 Univariate and multivariate survival analysis of progression-free survival.
Fig. 3: Kaplanā€“Meier analysis of progression-free survival (PFS).
figure 3

a For all patients classified into four groups based on the total lesion glycolysis (TLG) reduction rate and number of post-NACT PET-positive lymph nodes (PET-N). b For all patients classified into four groups based on TLG reduction rate and pN.

Discussion

In the ROC analysis of recurrence and multivariate analysis of survival, the TLG reduction in the primary tumour during NACT was the best marker among the PET-CT parameters that we examined. Furthermore, a TLG reduction rate of 80% was the optimal cut-off to clearly discriminate between responders and non-responders to NACT based on the stepwise analysis and its correlation with pathological response. Multivariate survival analysis demonstrated that TLG reduction and post-NACT PET-N are independent prognostic parameters, suggesting the potential utility of a PET-based treatment strategy in advanced OSCC patients undergoing NACT plus surgery.

MTV and TLG have been reported to be better prognostic factors for survival than SUV in several types of cancer [39, 40]. Furthermore, in radiotherapy or preoperative CRT for advanced OC, TLG was recently reported to be superior to SUV and MTV in predicting the prognosis [41, 42]. Consistent with these reports, we have shown that the TLG reduction rate is only an independent prognostic parameter in patients with advanced OSCC who undergo surgery after NACT. In a multivariate analysis of MTV and TLG on PET-CT, Choi et al. previously reported that TLG was a more useful prognostic marker before surgical resection of sarcoma [43]. Other studies have also reported that TLG is the best marker in several types of cancer [44,45,46,47]. The reasons are speculated to be as follows. Though MTV represents only the amount of active metabolic tumour cells, TLG, calculated by multiplying the tumour volume by the SUVmean of the tumour, can provide more detailed information about the pathology than other PET-CT parameters because it reflects both biological features and total tumour volume throughout the body [44,45,46]. However, whether MTV or TLG is more useful in predicting prognosis and determining treatment efficacy is still controversial. As these factors are strongly related, it is difficult to compare which is the better marker, and there is no standardised method for comparing these parameters [15]. Therefore, the results of this study should be validated by another cohort in a future prospective study.

The advantage of the present study is a large number of cases compared to previous, similar PET-CT studies [23]. To the best of our knowledge, no previous report has shown the prognostic value of TLG reduction rate before and after NACT in such a large number of OSCC patients. Furthermore, all patients in this study had squamous cell carcinoma treated with NACT with a triplet regimen followed by surgery, whereas treatment methods were not standardised in many previous studies. Moreover, by directly comparing the correlations between the three PET-CT indices and clinico-pathological indices using the optimal cut-offs obtained by the stepwise analysis, a TLG reduction rate of 80% was the best index. Among preoperative parameters, post-NACT PET-N and the TLG reduction rate were identified as independent prognostic factors in a multivariate analysis of survival. This result suggests that PET-CT evaluation of both the primary tumour and LNs can preoperatively select patients with extremely poor prognosis who may benefit from additional chemotherapy with a different regimen or chemoradiation instead of immediate surgery. Goodman et al. also reported that changing chemotherapy regimens improved prognosis in oesophageal cancer patients with small reductions in SUVmax by PET-CT before and after induction chemotherapy [8]. In other words, early stratification of patients with poor response to treatment by PET-CT would be extremely important. Furthermore, these patients at high risk of recurrence may be good candidates for adjuvant therapy with immune checkpoint inhibitors, especially based on the results of the recent CheckMate577 trial [48]. Treatment strategies for the stratified patients with poor prognosis are an urgent issue for future study. Thus, it is notable that PET-CT during NACT, which is currently the standard treatment for advanced OSCC, may be better utilised to identify patients with poor prognosis at an early stage of multimodal treatment. Although TLG reduction is an indicator focusing only on the primary tumour, it was found to be a more accurate prognostic marker when combined with the post-NACT PET-N or pN stage. As the primary tumour and metastatic LNs behave differently during chemotherapy [7, 25, 35, 37], it would be of interest to evaluate changes in PET-CT parameters in metastatic LNs, combined or compared with the changes in the primary tumour.

This study has some limitations. First, this study is a retrospective investigation at a single institution, though a large cohort of patients was analysed. Second, we used a SUVmax of 2.5 as an absolute threshold for measuring MTV. There are two ways to define MTV, the absolute SUV threshold method and the fixed % SUVmax threshold method, but which method is better has not been decided. No statistical difference has been reported between the two methods [44]. The present results need to be verified in the future in a separate cohort. Third, this study included patients who received FAP and DCF therapy as NACT. Subgroup analysis by NACT may be necessary in the future.

In conclusion, the present study demonstrated that the TLG reduction during NACT, in addition to post-NACT PET-N, is useful for accurately predicting prognosis in advanced OSCC patients undergoing surgery following NACT.