Background: Artificial intelligence (AI) models are emerging as promising tools to identify predictive features among data coming from health records. Their application in clinical routine is still challenging, due to technical limits and to explainability issues in this specific setting. Response to standard first-line immunotherapy (ICI) in metastatic Non-Small-Cell Lung Cancer (NSCLC) is an interesting population for machine learning (ML), since up to 30% of patients do not benefit. Methods: We retrospectively collected all consecutive patients with PD-L1 ≥ 50 % metastatic NSCLC treated with first-line ICI at our institution between 2017 and 2021. Demographic, laboratory, molecular and clinical data were retrieved manually or automatically according to data sources. Primary aim was to explore feasibility of ML models in clinical routine setting and to detect problems and solutions for everyday implementation. Early progression was used as preliminary endpoint to test our algorithm. ...

Background: Artificial intelligence (AI) models are emerging as promising tools to identify predictive features among data coming from health records. Their application in clinical routine is still challenging, due to technical limits and to explainability issues in this specific setting. Response to standard first-line immunotherapy (ICI) in metastatic Non-Small-Cell Lung Cancer (NSCLC) is an interesting population for machine learning (ML), since up to 30% of patients do not benefit. Methods: We retrospectively collected all consecutive patients with PD-L1 ≥ 50 % metastatic NSCLC treated with first-line ICI at our institution between 2017 and 2021. Demographic, laboratory, molecular and clinical data were retrieved manually or automatically according to data sources. Primary aim was to explore feasibility of ML models in clinical routine setting and to detect problems and solutions for everyday implementation. Early progression was used as preliminary endpoint to test our algorithm. Results: Out of 123 patients, 106 were included, 52/106 (49 %) had disease progression or died within 3 months of start of ICI. Early progression correlated with increased neutrophil percentage (>80 % of white blood cells), neutrophil/lymphocyte ratio (≥8) and lower-range PD-L1 status (<70 %) at baseline, which was consistent with literature. Automated ML (AutoML) models run on our dataset reached precision scores around 80 %, with Voting Ensemble emerging as best performing model, while white-box models (such as Shapley Additive exPlanations) provided better explainability. In all AutoML models, laboratory features were the top selected features, whilst clinical ones needed more pre-processing before gaining relevance, which was consistent with different data extraction (automatic versus manual) and missing data rates. Conclusions: ML models’ application is feasible in clinical practice and can trustworthily predict early progression during first-line ICI for metastatic NSCLC. Solving pre-analytical issues is key for future improvement, focusing on automatic tools for data extraction, collection and explainability.

Exploring machine learning tools in a retrospective case-study of patients with metastatic non-small cell lung cancer treated with first-line immunotherapy: A feasibility single-centre experience / Ogliari, Francesca Rita; Traverso, Alberto; Barbieri, Simone; Montagna, Marco; Chiabrando, Filippo; Versino, Enrico; Bosco, Antonio; Lin, Alessia; Ferrara, Roberto; Oresti, Sara; Damiano, Giuseppe; Viganò, Maria Grazia; Ferrara, Michele; Riva, Silvia Teresa; Nuccio, Antonio; Venanzi, Francesco Maria; Vignale, Davide; Cicala, Giuseppe; Palmisano, Anna; Cascinu, Stefano; Gregorc, Vanesa; Bulotta, Alessandra; Esposito, Antonio; Tacchetti, Carlo; Reni, Michele. - In: LUNG CANCER. - ISSN 0169-5002. - 199:(2025). [10.1016/j.lungcan.2024.108075]

Exploring machine learning tools in a retrospective case-study of patients with metastatic non-small cell lung cancer treated with first-line immunotherapy: A feasibility single-centre experience

Ogliari, Francesca Rita;Montagna, Marco;Chiabrando, Filippo;Ferrara, Roberto;Damiano, Giuseppe;Ferrara, Michele;Nuccio, Antonio;Venanzi, Francesco Maria;Vignale, Davide;Cicala, Giuseppe;Palmisano, Anna;Cascinu, Stefano;Esposito, Antonio;Tacchetti, Carlo;Reni, Michele
2025-01-01

Abstract

Background: Artificial intelligence (AI) models are emerging as promising tools to identify predictive features among data coming from health records. Their application in clinical routine is still challenging, due to technical limits and to explainability issues in this specific setting. Response to standard first-line immunotherapy (ICI) in metastatic Non-Small-Cell Lung Cancer (NSCLC) is an interesting population for machine learning (ML), since up to 30% of patients do not benefit. Methods: We retrospectively collected all consecutive patients with PD-L1 ≥ 50 % metastatic NSCLC treated with first-line ICI at our institution between 2017 and 2021. Demographic, laboratory, molecular and clinical data were retrieved manually or automatically according to data sources. Primary aim was to explore feasibility of ML models in clinical routine setting and to detect problems and solutions for everyday implementation. Early progression was used as preliminary endpoint to test our algorithm. ...
2025
Background: Artificial intelligence (AI) models are emerging as promising tools to identify predictive features among data coming from health records. Their application in clinical routine is still challenging, due to technical limits and to explainability issues in this specific setting. Response to standard first-line immunotherapy (ICI) in metastatic Non-Small-Cell Lung Cancer (NSCLC) is an interesting population for machine learning (ML), since up to 30% of patients do not benefit. Methods: We retrospectively collected all consecutive patients with PD-L1 ≥ 50 % metastatic NSCLC treated with first-line ICI at our institution between 2017 and 2021. Demographic, laboratory, molecular and clinical data were retrieved manually or automatically according to data sources. Primary aim was to explore feasibility of ML models in clinical routine setting and to detect problems and solutions for everyday implementation. Early progression was used as preliminary endpoint to test our algorithm. Results: Out of 123 patients, 106 were included, 52/106 (49 %) had disease progression or died within 3 months of start of ICI. Early progression correlated with increased neutrophil percentage (>80 % of white blood cells), neutrophil/lymphocyte ratio (≥8) and lower-range PD-L1 status (<70 %) at baseline, which was consistent with literature. Automated ML (AutoML) models run on our dataset reached precision scores around 80 %, with Voting Ensemble emerging as best performing model, while white-box models (such as Shapley Additive exPlanations) provided better explainability. In all AutoML models, laboratory features were the top selected features, whilst clinical ones needed more pre-processing before gaining relevance, which was consistent with different data extraction (automatic versus manual) and missing data rates. Conclusions: ML models’ application is feasible in clinical practice and can trustworthily predict early progression during first-line ICI for metastatic NSCLC. Solving pre-analytical issues is key for future improvement, focusing on automatic tools for data extraction, collection and explainability.
Artificial intelligence application; Automated Machine Learning algorithms; Automatic data extraction; Explainable AI; White-box and black-box models pairing;
Artificial intelligence application
Automated Machine Learning algorithms
Automatic data extraction
Explainable AI
White-box and black-box models pairing
File in questo prodotto:
File Dimensione Formato  
1-s2.0-S0169500224006093-main.pdf

solo gestori archivio

Tipologia: PDF editoriale (versione pubblicata dall'editore)
Licenza: Copyright dell'editore
Dimensione 3.39 MB
Formato Adobe PDF
3.39 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.11768/178976
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact