Expand this Topic clickable element to expand a topic
Skip to content
Optica Publishing Group
  • Applied Spectroscopy
  • Vol. 76,
  • Issue 4,
  • pp. 462-474
  • (2022)

Group and Basis Restricted Non-Negative Matrix Factorization and Random Forest for Molecular Histotype Classification and Raman Biomarker Monitoring in Breast Cancer

Open Access Open Access

Abstract

Raman spectroscopy is a non-invasive optical technique that can be used to investigate biochemical information embedded in cells and tissues exposed to ionizing radiation used in cancer therapy. Raman spectroscopy could potentially be incorporated in personalized radiation treatment design as a tool to monitor radiation response in at the metabolic level. However, tracking biochemical dynamics remains challenging for Raman spectroscopy. Here we developed a novel analytical framework by combining group and basis restricted non-negative matrix factorization and random forest (GBR-NMF-RF). This framework can monitor radiation response profiles in different molecular histotypes and biochemical dynamics in irradiated breast cancer cells. Five subtypes of; human breast cancer (MCF-7, BT-474, MDA-MB-230, and SK-BR-3) and normal cells derived from human breast tissue (MCF10A) which had been exposed to ionizing radiation were tested in this framework. Reference Raman spectra of 20 biochemicals were collected and used as the constrained Raman biomarkers in the GBR-NMF-RF framework. We obtained scores for individual biochemicals corresponding to the contribution of each Raman reference spectrum to each spectrum obtained from the five cell types. A random forest classifier was then fitted to the chemical scores for performing molecular histotype classifications (HER2, PR, ER, Ki67, and cancer versus non-cancer) and assessing the importance of the Raman biochemical basis spectra for each classification test. Overall, the GBR-NMF-RF framework yields classification results with high accuracy (>97%), high sensitivity (>97%), and high specificity (>97%). Variable importance calculated in the random forest model indicated high contributions from glycogen and lipids (cholesterol, phosphatidylserine, and stearic acid) in molecular histotype classifications.

© 2021 The Author(s)

PDF Article
More Like This
Classification for breast cancer diagnosis with Raman spectroscopy

Qingbo Li, Qishuo Gao, and Guangjun Zhang
Biomed. Opt. Express 5(7) 2435-2445 (2014)

Low coherence quantitative phase microscopy with machine learning model and Raman spectroscopy for the study of breast cancer cells and their classification

Vishesh Dubey, Azeem Ahmad, Ankit Butola, Darakhshan Qaiser, Anurag Srivastava, and Dalip Singh Mehta
Appl. Opt. 58(5) A112-A119 (2019)

Rapid identification of breast cancer subtypes using micro-FTIR and machine learning methods

Sajid Farooq, Matheus Del-Valle, Moises Oliveira dos Santos, Sofia Nascimento dos Santos, Emerson Soares Bernardes, and Denise Maria Zezell
Appl. Opt. 62(8) C80-C87 (2023)

Supplementary Material (1)

NameDescription
Supplement 1       sj-pdf-1-asp-10.1177_00037028211035398 - Supplemental material for Group and Basis Restricted Non-Negative Matrix Factorization and Random Forest for Molecular Histotype Classification and Raman Biomarker Monitoring in Breast Cancer

Cited By

Optica participates in Crossref's Cited-By Linking service. Citing articles from Optica Publishing Group journals and other participating publishers are listed here.


Select as filters


Select Topics Cancel
© Copyright 2024 | Optica Publishing Group. All rights reserved, including rights for text and data mining and training of artificial technologies or similar technologies.