Genetic mutation and biological pathway prediction based on whole slide images in breast carcinoma using deep learning

Qu, Hui; Zhou, Mu; Yan, Zhennan; Wang, He; Rustgi, Vinod K.; Zhang, Shaoting; Gevaert, Olivier; Metaxas, Dimitris N.

doi:10.1038/s41698-021-00225-9

Download PDF

Article
Open access
Published: 23 September 2021

Genetic mutation and biological pathway prediction based on whole slide images in breast carcinoma using deep learning

npj Precision Oncology volume 5, Article number: 87 (2021) Cite this article

9397 Accesses
33 Citations
8 Altmetric
Metrics details

Subjects

Abstract

Breast carcinoma is the most common cancer among women worldwide that consists of a heterogeneous group of subtype diseases. The whole-slide images (WSIs) can capture the cell-level heterogeneity, and are routinely used for cancer diagnosis by pathologists. However, key driver genetic mutations related to targeted therapies are identified by genomic analysis like high-throughput molecular profiling. In this study, we develop a deep-learning model to predict the genetic mutations and biological pathway activities directly from WSIs. Our study offers unique insights into WSI visual interactions between mutation and its related pathway, enabling a head-to-head comparison to reinforce our major findings. Using the histopathology images from the Genomic Data Commons Database, our model can predict the point mutations of six important genes (AUC 0.68–0.85) and copy number alteration of another six genes (AUC 0.69–0.79). Additionally, the trained models can predict the activities of three out of ten canonical pathways (AUC 0.65–0.79). Next, we visualized the weight maps of tumor tiles in WSI to understand the decision-making process of deep-learning models via a self-attention mechanism. We further validated our models on liver and lung cancers that are related to metastatic breast cancer. Our results provide insights into the association between pathological image features, molecular outcomes, and targeted therapies for breast cancer patients.

Prediction of tumor origin in cancers of unknown primary origin with cytology-based deep learning

Article Open access 16 April 2024

PERCEPTION predicts patient response and resistance to treatment using single-cell transcriptomics of their tumors

Article 18 April 2024

Demographic bias in misdiagnosis by computational pathology models

Article 19 April 2024

Introduction

Breast carcinoma is the most common cancer among women worldwide that consists of a heterogeneous group of diseases with different histological, prognostic, and clinical outcomes¹. Approximately 50% of all women diagnosed with breast cancer can develop metastatic diseases, such as liver and lung cancers². In the past decades, substantial efforts have been made to deepen our understanding of breast cancer risk factors, molecular pathogenesis, and treatment development. Especially, high-throughput molecular profiling reveals that multiple genetic mutations and biological signaling pathways could have a great influence on tumor progression and overall survival³.

Comprehensive genomic analysis has identified key driver genetic mutations that are responsible for therapeutic implication and outcome prediction of breast cancer. The tumor suppressor gene TP53 is found altered in breast carcinoma in ~30% of all cases with prognostic implication⁴. Overexpression of ERBB2 is also an adverse prognostic indicator correlated with decreased survival in breast cancer⁵. Given certain types of mutations, targeted therapies for patient subgroups have been developed. For example, the PI3K inhibitor is designed to be responsive for patients with the PIK3CA mutation, which is a key driver gene associated with oncogenesis and hyperactivity of the PI3K pathway. The identification of driver mutations is essential for targeted therapy and clinical diagnosis of breast malignancies.

Digital whole-slide images (WSI) can potentially offer a computationally effective and efficient means to quantitatively characterize cell-level heterogeneity of cancer specimens. Pathologists routinely use WSIs to identify nuclei features, diagnose cancer status and measure the histopathological grade of cancer tissues. However, there is a lack of research linking WSI with gene mutations and pathway activities for advancing clinical assessment in breast cancer. Preliminary evidence suggests that it is possible to apply deep-learning approaches to automatically predict cancer subtypes in multiple cancers^6,7,8, predict mutations in lung⁶ and liver cancers⁹, classify mesothelioma¹⁰, detect DNA methylation patterns¹¹, estimate human epidermal growth factor receptor 2 status in breast cancer¹², and predict pan-cancer prognosis for patients¹³. However, pan-cancer studies^14,15,16 are unable to provide deep characterization of breast cancer across histopathology, mutation, and pathway activity levels.

In this study, we develop WSI-based deep-learning classifiers for predicting key mutation outcomes and important biological pathway activities in breast cancer. We directly provide slide-level predictions with a self-attention mechanism. This self-attention technique can capture the relationship between patches and empower us to visualize representative tiles during the decision-making process. Our study highlights WSI visual interactions between mutation and its related pathway, enabling a head-to-head comparison to reinforce our major findings. Furthermore, we validate our analysis in a pan-cancer setting on liver and lung cancer cohorts to gain additional insights of mutation prediction across cancers based on the metastatic associations derived from breast cancer.

Results

Datasets

We collected 659 patients with breast invasive carcinoma from The Cancer Genome Atlas (TCGA)¹⁷. Data inclusion criteria for each patient contain: (1) one hematoxylin and eosin (H&E) stained histopathology whole-slide image (WSI), (2) mutational data with the point mutation status of 18 driver genes and copy number alteration (CNA) of 35 genes (see Methods), and (3) omics data with the mRNA expression data and CNA data of all genes. The WSI data were downloaded from Genomic Data Commons data portal (https://portal.gdc.cancer.gov/) and the other molecular data were obtained from the cBioPortal (https://www.cbioportal.org/). The 659 cases in the TCGA-BRCA dataset were randomly partitioned into training, validation and test sets based on 70%, 15%, and 15% ratios, respectively. The characteristics of the breast cancer patients in each set are shown in Table 1. In addition, we collected 350 patients with lung adenocarcinoma from TCGA-LUAD cohort and 316 patients with liver hepatocellular carcinoma from TCGA-LIHC cohort to validate our method following the same data inclusion criteria. Each dataset is randomly split into training and testing sets with 20 and 80% ratios, where the training set is utilized to fine-tune the developed models trained from the breast cancer data. The patients’ characteristics of the lung and liver datasets are shown in Table 2. After WSI tile extraction and tumor tile selection (see Methods), there are 703,804, 140,981, and 167,530 tiles used in the training, validation and testing sets of breast cancer, 130,659 and 465,925 tiles in the training and testing sets of lung cancer, and 116,635 and 570,073 tiles in the training and testing sets of liver cancer.

Table 1 Patient characteristics on the 659 cases from TCGA-BRCA cohort.

Full size table

Table 2 Patient characteristics on the 350 cases from TCGA-LUAD cohort and 316 cases from TCGA-LIHC cohort.

Full size table

Prediction of gene mutation status in breast cancer from pathology images

We trained our models on pathology images to predict significant mutations profiles in breast cancer. More specifically, our deep-learning model extracted mutation-specific feature vectors from tumor tiles and predicted the gene mutation probability of the corresponding patient (see Fig. 1 and Methods). We seek to predict two types of gene mutations including point mutation and CNA. Our model demonstrated high-level performance on predicting the point mutation status of multiple important genes. The AUC scores for point mutation and CNA are shown in Table 3 and Table 4, respectively, and the corresponding ROC curves are shown in Fig. 2. For example, we found that our model is highly predictive on TP53 (AUC = 0.729), which is the most frequently mutated gene in breast cancer with prognostic implication. Our models also showed good results on predicting mutations of RB1 (AUC 0.852), CDH1 (AUC 0.776), NF1 (AUC 0.768), NOTCH2 (AUC 0.740) in breast cancer.

**Fig. 1: The proposed network structure.**

Table 3 AUC (with 95% CI) achieved by the models trained on the point mutation data of breast cancer.

Full size table

Table 4 AUC (with 95% CI) achieved by the models trained on the copy number alteration (CNA) data of breast cancer.

Full size table

**Fig. 2: ROC curves of the top prediction outcomes.**

We also found that our deep-learning classifier predicted well (AUC > 0.65) on the CNA status in breast cancer, including six genes of FGFR1, EIF4EBP1, KAT6A, HEY1, ZNF217, and RAB25. More importantly, the use of the self-attention mechanism makes our deep-learning approach explainable, which enabled us to identify key tiles in the process of model prediction (Fig. 3). For example, we computed each tile’s weight that contributes to the final global feature vector and presented the weight map of a patient for TP53 in Fig. 3b and RB1 in Fig. 4b. The corresponding top 20 weighted tiles are also shown in Figs. 3d and 4d, respectively. Additional results of gene point mutation and CNA predictions can be found in Supplementary Table 1, Supplementary Table 2a, and Supplementary Table 2b.

**Fig. 3: Weight maps of tiles when predicting the point mutation status of TP53 and p53 pathway activity from mRNA expression data in breast cancer.**

**Fig. 4: Weight maps of tiles when predicting the point mutation status of RB1 and cell cycle pathway activity from mRNA expression data in breast cancer.**

Prediction of biological pathway activity from histopathology images

We developed deep-learning models to predict the activities of ten canonical biological pathways¹⁸ identified in breast cancer for each patient. The pathway activity levels were derived from either mRNA expression data or the CNA data (see Methods) to supervise the model training. The model structure and training method were kept the same as in the mutation prediction task.

When using the mRNA expression data to represent pathway activity, we found that the p53, PI3K, and cell cycle pathways are predictable (AUC > 0.65, Table 4). When using the CNA data for pathway activity, the Myc pathway achieves the highest AUC (0.7950). The notch pathway (AUC 0.668) and p53 pathway (AUC 0.640) also have significant performance. Additional results of pathway activity predictions can be found in Supplementary Table 3 and Supplementary Table 4.

We also explored the visual interpretation of biological pathway activity and driver mutations as shown in histopathology images. To allow a joint analysis, we chose the biological pathway¹⁸ that is associated with the available driver gene mutations. For example, the visualization of the weight map on the p53 pathway is shown in Fig. 3c and the top 20 weighted tiles are also offered in Fig. 3e. Meanwhile, we used the same patient as the result in gene mutation prediction (Fig. 3), which shows the highlighted areas in TP53 mutation prediction (Fig. 3b) and the top 20 weighted tiles (Fig. 3d). We found that those tiles are highly correlated to both TP53 mutation and p53 pathway activity. This finding increases the confidence in our prediction because TP53 is a key gene in the p53 pathway, and one would expect a relationship between the mutation status and pathway activity (Tables 5, 6). Similarly, the same observation can be found between RB1 mutation prediction result and cell cycle pathway prediction result in Fig. 4. Additional results can be found from Supplementary Fig. 1 to Supplementary Fig. 10. Overall, we have seen a shared similarity among highlighted tiles despite the complexity of biological pathway activities.

Table 5 AUC (with 95% CI) achieved by the models trained on the pathway activity derived from mRNA expression data of breast cancer.

Full size table

Table 6 AUC (with 95% CI) achieved by the models trained on the pathway activity derived from copy number alteration data of breast cancer.

Full size table

Validation of our deep-learning model on lung and liver cancers

Next, we validated our modeling approach in two different cancers namely lung adenocarcinoma and hepatocellular carcinoma. In the lung adenocarcinoma (TCGA-LUAD) cohort, 9 genes for point mutation and 14 genes for CNA were used for model testing (see Methods). Our fine-tuned models developed from the breast cancer cohort can predict the point mutation of TP53 (AUC 0.705) and Notch2 (AUC 0.656), the copy number alteration of FGFR1 (AUC 0.676), the p53 pathway activity (AUC 0.602) from mRNA expression data, and the activities of Myc pathway (AUC 0.658) and PI3K pathway (AUC 0.601) from CNA data. Overall, the responses on the pathway prediction are not as good as those on mutation prediction. Notably, the mutations of TP53 gene occur in about 50% of non-small cell lung cancer (NSCLC) and TP53 mutation is associated with worse prognosis with treatment resistance¹⁹, therefore the prediction of TP53 mutation is also helpful for the diagnosis of lung cancer.

In the liver hepatocellular carcinoma (TCGA-LIHC) cohort, the numbers of tested genes are 7 and 25 for point mutation and CNA, respectively. Our fine-tuned models can predict the point mutation of RB1 (AUC 0.795), the copy number alteration of TGFβ2 (AUC 0.718), the pathway activity of cell cycle from mRNA expression data (AUC 0.614), and Myc pathway activity from CNA data (AUC 0.602). In particular, RB1 is a key inhibitor of cell cycle progression in HCC patients^20,21,22, and RB1 mutations are significantly associated with reduced cancer-specific and recurrence-free survival after resection in HCC patients^20,21,22. Therefore, the prediction of RB1 mutation has potential prognosis value for those patients.

We also visualized the weight maps of TP53 mutation and p53 pathway of a representative patient in lung cancer in Fig. 5, and those of RB1 mutation and cell cycle pathway in the liver cancer in Fig. 6. In both examples, we can observe similar morphological patterns identified from a pathologist’s perspective in the two weight maps and tile appearances in the top 20 weighted tiles, which are similar to our observations for breast cancer. More results can be found in Supplementary Fig. 11 and Supplementary Fig. 12.

**Fig. 5: Weight maps of tiles when predicting the point mutation status of TP53 and p53 pathway activity from mRNA expression data in lung cancer.**

**Fig. 6: Weight maps of tiles when predicting the point mutation status of RB1 and cell cycle pathway activity from mRNA expression data in liver cancer.**

Discussion

In this study, we demonstrated that key gene mutation outcomes and biological pathway activity of breast cancer can be predicted by deep-learning classifiers from whole-slide images. We further validated the deep-learning model to infer mutation status on liver and lung cancers, respectively. Our WSI-based deep-learning models can identify the point mutation status of six genes (RB1, CDH1, NF1, NOTCH2, TP53, and MAP3K1) and the copy number alteration of another six genes (FGFR1, EIF4EBP1, KAT6A, HEY1, ZNF217, and RAB25) in breast cancer. To deepen our understanding of cancer biology, we explored the predictive power of deep learning to predict underlying biological pathway activity, which is a challenging task involving complex biological relations among gene expressions. From the activity levels of 10 canonical signaling pathways derived from the mRNA expression data and copy number alteration inputs⁶, we found that three important pathways (p53, pi3k, and cell cycle) measured by mRNA expression and two pathways (Myc and Notch) measured by copy number alteration can be well predicted from our analysis.

Cancers are caused by gene mutations and therefore the prediction of key gene mutations based on whole-slide images will positively impact the targeted treatment of cancer patients^{4,23,24,25,26,27,28,29}. For example, our models can predict TP53 point mutation (AUC 0.729) and FGFR1 copy number alteration (AUC 0.794) with high accuracies. TP53 is a tumor suppressor gene that plays a key role in many cellular pathways controlling cell proliferation, cell survival, and genomic integrity²³. It is mutated frequently in breast cancer^4,23 and has been associated with poor prognosis^4,23,24. The FGFR1 gene is a member of the fibroblast growth factor receptor (FGFR) family that regulates important biological processes including cell proliferation and differentiation during development and tissue repair²⁵. In breast cancer, FGFR1 amplification is the most frequent genomic aberration²⁶, and may lead to dysregulated FGF receptors and promote cancer growth and metastasis. Extensive works^26,27,28,29 have shown that FGFR1 could be a therapy target in breast cancer (e.g., the anti-FGFR1 dovitinib (TKI1258) therapy²⁷). With the prediction of TP53 mutation and FGFR1 alteration, our models offered insights into selecting patient subgroups for the targeted therapy from digitalized WSI scans.

We extended our study to analyze biological pathway prediction based on whole-slide images that has seldom been addressed previously. Biological pathways are the interactions among molecules in a cell that result in certain products or changes in cancer³⁰. Several important signaling pathways have been identified as frequently and genetically altered in cancer¹⁸. We showed that deep learning can predict pathology activity levels, providing valuable information for prognosis and therapeutic planning. For example, the p53 pathway activity (predicted with 0.798 AUC in our method) is associated with more aggressive disease and worse overall survival in breast cancer³¹. The Myc pathway (predicted with 0.795 AUC) acts as a key regulator of cell growth and proliferation, which has been linked to the basal-like breast cancer^32,33, and can serve as a target for this aggressive subtype in breast cancer.

To overcome the interpretability challenges of AI-powered models, we employed a self-attention mechanism that is able to visualize the region of interest that contributed to outcomes prediction. In other words, we can display the weight map of each tumor tile to understand the decision making of the classifiers, highlighting the regions that contribute most to the final prediction. An example of the visualized weights map when predicting TP53 point mutation and p53 pathway activity is shown in Fig. 3b, c, respectively. Tiles with brighter green colors have larger weights, indicating that those tiles are most important in the decision-making process. Interestingly, the highlighted regions in both tasks are approximately located in the same part of the whole-slide image, and the top 20 weighted tiles in the two tasks shown in Fig. 3d, e, are also similar. The possible reason could be that TP53 is a crucial gene in the p53 pathway thus the predictions depend on similar image features in this example. This type of methodology and visualization has the potential to enable the improved exploration of the relationship between the image morphological features and molecular outcomes, and the relationship between genes and biological pathways, which can lead to new discoveries in breast cancer development.

To validate our method, we further extended the trained deep-learning models on lung and liver cancers with transfer learning. We hypothesize that the models trained using breast cancer data can also predict important gene mutations and pathway activities in lung and liver cancers since they are two common sites for the breast cancer metastatic spread³⁴. Out of the well-predicted genes in breast cancer, the point mutations of TP53 and Notch2, and the copy number alteration of FGFR1 can also be predicted in lung cancer (LUAD). In the liver cancer (LIHC), the well-predicted genes are RB1 and TGFβ2. These genes are indeed highly related to the diagnosis of lung cancer^19,35,36 and liver cancer^20,37. The different results on liver and lung cancers may be caused by the tissue differences of the two cancers. The pathway prediction results in the two cancers are not as good as gene mutation predictions, probably because pathways are more complicated than gene mutations thus are more challenging to predict. However, the well-predicted pathways in breast cancer still get the highest AUC scores in lung (p53) and liver (cell cycle, Myc) cohorts.

Overall, our study highlights deep characterization of breast cancer, its mutation outcome, and biological pathway activity. We present unique insights into WSI visual interactions between mutation and its pathway, enabling a head-to-head comparison to reinforce our major findings. Our approach can be a useful computational tool for gene mutation pre-screening, prior to the costly gene mutation analysis such as next-generation sequencing. Our evaluation strategy differs from pan-cancer studies^14,15,16, which evaluate performance on each cancer individually. We measured the performance across cancer types by training on breast cancer and validating on liver and lung cancers, which is more challenging due to inherent differences of cancer tissues³⁸. In terms of model development, we directly provide slide-level predictions without assuming that each tile or super-tile shares the same label as the whole slide. Unlike the regular attention mechanism used in the related works^39,40 that calculates the weight of each patch according to the prediction, the self-attention in our work measures the similarities between each patch and all other patches and can capture the relationship between patches when making predictions. The self-attention mechanism further enables us to visualize the importance of tiles during the decision-making process, instead of the probabilities of mutations or expression for tiles. This finding can be used to better understand which image-based morphological features are related to certain gene mutations or pathway activities. Finally, our approach is a data-driven workflow that does not require nuclei detection¹² as a prerequisite for specific prediction tasks.

While building associations between histopathology and molecular profiles is promising, the identified genotype–phenotype relationships here are not intended to replace standard transcriptomic tests. Given the confirmation from our collaborative pathologist that there is a lack of consensus on molecularly defined patterns seen from histopathological scans, we expect our detectable findings could complement pathologists’ routine workflow. The identified pathological descriptions were only exploratory rather than drawing conclusive associations, which warrant more clinical examinations in future studies. A limitation of the study is that the workflow is based on formalin-fixed, paraffin-embedded (FFPE) slides given their quality of preserving microscopic characteristics of tissues, while frozen tissues could also be considered for extended analysis in the future. Our computational analysis has a dependence on the feature extractor pretrained on natural images (ImageNet dataset⁴¹). There is a domain gap between natural images and pathology images. Therefore, the exploration of appropriate feature representations of pathology tiles and their parameters will be crucial to assess the validity and reproducibility of algorithms. To maximize the power of deep-learning approaches, it is also necessary to address data scarcity in histopathology-related tasks. There are often a significant portion of data samples that are insufficient and under-represented for certain mutation prediction tasks (e.g., only <5% mutant samples). High-quality, large-scale pathological data with precise molecular annotations will be needed to boost model development. Alternatively, transfer learning has proven to be useful in computer vision tasks when training samples are less available^42,43. Therefore, a pretrained classifier built from diverse pathological datasets may provide superior results compared with our cancer-type-specific model.

In conclusion, we demonstrated that deep neural networks can be used to predict molecular outcomes in breast cancer including gene mutations and biological pathway activities from histopathology whole-slide images. Our extensive results highlighted new findings among genotype–phenotype associations, offering insights into the identification of targeted therapies for breast cancer treatment.

Methods

Data selection

The original TCGA-BRCA cohort consists of 1098 patients with H&E stained whole-slide images, genomic data, and additional clinical information. We analyzed the 1133 Formalin-Fixed Paraffin-Embedded (FFPE) slides that were generated by fixing a specimen in formaldehyde and then embedding it in a paraffin wax block for cutting. We further filtered out low quality FFPE slides according to the following criteria: (1) There is no diagnostic time information in a slide with low visual quality. (2) A slide has extensive blurred areas or is abnormally stained with little informative tissue areas. For patients with multiple slides, we only kept the slide with the best visual quality. After slide selection preprocessing, we collected 659 slides (659 patients) along with the corresponding omics data (mRNA expression and copy number alteration). The same selection process is performed on the validation datasets of TCGA-LUAD and TCGA-LIHC, resulting in 350 and 316 cases with both pathology images and omics profiles, respectively. The slide lists of the three cancers after data selection can be found in the Supplementary Note 1. These public TCGA cohorts were available online without restriction and authentication.

Histopathology data preprocessing

For each slide, we extracted nonoverlapping tiles of 512 × 512 at ×20 magnification and removed background tiles (Fig. 7a). A background tile was determined if its mean pixel value is higher than 220. We focused on tumor areas in the whole-slide images therefore we adopted a semi-automatic labeling method to identify tumor tiles. The labeling process was implemented by the initial clustering and manual refinement. In the first step, k-means clustering was performed on all tiles for each slide. Specifically, each 512 × 512 tile was downsampled to 128 × 128, which was then flattened into a 49152-length feature vector. These feature vectors were then clustered into two groups (i.e., tumor and nontumor regions). In the second step, a pathologist (20 years of clinical experience) additionally verified the segmentation quality of tumor regions and revised inaccurate results of slides to ensure the tumor labeling results were reasonable. For example, the tiles with artefacts in the annotated slides were removed manually if they were left after the clustering step. We then performed color normalization using the method⁴⁴ to eliminate the color variations in different slides. The tiles of the same slide were processed by using the same slide-level pixel mean and standard deviation during the normalization.

**Fig. 7: Illustration of the deep-learning workflow for data processing and model evaluation.**

Mutated genes and pathway activity identification

To ensure a sufficient amount of training WSIs for mutated genes, for point mutation we selected 18 important genes in breast cancer, which were related to cell functions (e.g., cell cycle, p53 signaling, notch signaling, and DNA damage response) and they were mutated at least 3%. For CNA data, we selected 35 genes with mutation percentage greater than 5%. In the validation tasks of lung and liver cancers, we used the same criterion to select gene profiles, resulting in 9 point mutation genes and 14 CNA genes in the lung cancer and 7 point mutation genes and 25 CNA genes in the liver cancer for analysis.

In the pathway activity prediction task, we identified ten canonical signaling pathways with frequent genetic alterations. The pathway activity in each patient was obtained by a weighted sum of the genes’ expression data or CNA data in the pathway. Then the activity was binarized as activated if it is greater than zero and inactivated otherwise. For each pathway, we generated two types of activity labels from mRNA expression data or CNA data for each patient as follow:

$$\begin{array}{*{20}{c}} {v^{s,i} = \frac{1}{{N_{gene}^i}}\mathop {\sum }\limits_{n = 1}^{N_{gene}^i} w_n^{s,i}u_n^{s,i},i = 1,2, \ldots ,10,s = 1,2, \ldots ,659} \end{array}$$

(1)

$$\begin{array}{*{20}{c}} {l^{s,i} = \left\{ {\begin{array}{*{20}{c}} {1\;if\;v^{s,i}\, >\, 0} \\ {0\;{\mathrm{otherwise}}} \end{array}} \right.,} \end{array}$$

(2)

where v^s,i is the activity level of pathway i in patient s, l^s,i is the binary activity label of pathway i in patient s, $N_{{\mathrm{gene}}}^i$ is the number of important genes that are involved in the pathway i according to Sanchez-Vega et al.’s work¹⁸, $u_n^{s,i}$ is the expression level or CNA level of gene n in pathway i and patient s, $w_n^{s,i}$ is the corresponding weight, which takes value 1 if the gene is an oncogene and −1 if it is a tumor suppressor. The CNN aims to predict the binary label l^s,i, i.e., whether a pathway is activated (l^s,i = 1) or inactivated (l^s,i = 0) in a patient.

Model structure

We developed deep-learning models to predict mutation status and pathway activity from histopathology images. Our model architecture consists of two main sections (Fig. 1):

Feature extractor: This module aims to obtain a feature vector representing the input tile. We use the convolutional layers of ResNet-101⁴⁵ as the feature extractor, which is widely used in image classification tasks and has shown powerful feature representation ability in various applications. This subnetwork is pretrained on the ImageNet dataset⁴¹ and kept unchanged during training and testing. Through the feature extraction, each input tile is represented by a 2048-dimensional feature vector, resulting in a feature matrix of N × 2048 for slide of patient s, where N is the number of tumor tiles in the slide and varies from slide to slide.
Multi-layer perceptron (MLP) predictor with self-attention: This subnetwork follows the feature extractor to output the final prediction. It consists of three fully connected layers and one self-attention layer (Fig. 1). The first two fully connected layers have 512 and 128 neurons, respectively, reducing the size of feature matrix to N × 128. The self-attention layer is used to compute the importance weight of each tile’s feature vector and guide the network to pay more attention to the crucial tiles. Self-attention has been used successfully in a variety of tasks in natural language processing^46,47,48 and computer vision⁴⁹ to model relationships between widely separated spatial regions. In this paper, we make slight modifications based on the method in Zhang et al.⁴⁹:

$$\begin{array}{*{20}{c}} {f\left( x \right) = W_fx,g\left( x \right) = W_gx} \end{array}$$

(3)

$$\begin{array}{*{20}{c}} {\alpha _{i,j} = {\mathrm{softmax}}\left( {f\left( {x_i} \right)^Tg\left( {x_j} \right)} \right)} \end{array}$$

(4)

$$\begin{array}{*{20}{c}} {o_j = \mathop {\sum }\limits_{i = 1}^N \alpha _{j,i}x_i,y = x + \gamma \cdot o} \end{array}$$

(5)

where x is the input feature matrix, W_f and W_g are 1 × 1 convolution filters, α_j,i indicates how much attention the model pays to the ith tile’s features when computing the jth tile’s activation o_j, γ is a trainable parameter controlling the scale of the attention. y is the output of the self-attention layer after an average pooling, which is the global feature vector representing all tumor tiles of a slide. The final fully connected layer transforms the global feature to a prediction.

In our study, the gene mutation status prediction and pathway activity prediction are formulated as classification tasks, and thus cross-entropy loss is used to train the models.

Model training and evaluation

The feature extractor subnetwork (ResNet-101 without fully connected (FC) layer) is pretrained and fixed during training for all prediction tasks. Therefore, we extract the feature vectors of tumor tiles for all patients beforehand and save them to the disk. Training the prediction module from the saved feature vectors can greatly accelerate the training speed. During feature extraction, each 512 × 512 tile is resized to 224 × 224 image and normalized by the mean and standard deviation of the ImageNet dataset³⁹ before feeding to the pretrained ResNet-101. The prediction subnetwork is trained with the Adam optimizer for 30 epochs. The initial value of γ in the self-attention layer is 1. The learning rate of γ is set to 0.001 and all other parameters have a learning rate of 0.0001. The best model is saved when achieving the best performance on the validation set. For different tasks (e.g., point mutation, pathway activity), the models for breast cancer are all trained from scratch. It took ~6 min to train the MLP with self-attention (30 epochs, batch size 8) on a NVIDIA TITAN Xp GPU. Our training is efficient because our method can directly provide slide-level prediction instead of tile-level predictions as done in Fu et al¹⁵.

To evaluate the model’s performance on the validation set (for model selection) and test set, we use the area-under-the-curve (AUC) in both mutation prediction and pathway activity prediction tasks. The AUC is the area under the ROC curve, which is created by plotting the true positive rate (TPR) against the false positive rate (FPR) at various threshold settings. AUC informs the capability of a model in distinguishing between classes. The 95% confidence interval (CI) of each AUC score is calculated by 1000 bootstrapping to estimate the uncertainty of AUC.

Model fine-tuning on the lung and liver cancers data

During model fine-tuning, we fix the parameters of the first two fully connected (fc) layers of the prediction subnetwork and fine-tune the self-attention layer and the last fc layer. We assume that image features learned from breast cancer data could be also useful in a pan-cancer setting. This fine-tuning strategy could help to investigate if there is any underlying relationship between the data of breast cancer and lung or liver cancer. We did not fine-tune models for all possible gene profiles from TCGA, because the point mutation and CNA percentage of some genes are extremely low in lung or liver cancers. Only genes with >3% point mutation or >5% CNA were fine-tuned and tested in our study.

Visualization

The self-attention layer in our model can produce the importance weights of tiles in a slide in the prediction tasks, which is helpful for us to explore the biological interpretation value of deep-learning classifiers. We compute the log value of the weight of each tile and project it to the original location in the whole-slide image, resulting in the weight map (Fig. 3). The weight β_i is computed according to Eqs. (3), (4), (5):

$$\begin{array}{*{20}{c}} {\beta _i = 1 + \gamma \mathop {\sum }\limits_{j = 1}^N \alpha _{i,j}} \end{array}$$

(6)

Besides, we select tiles with top 20 largest weights in a slide to show the appearance of important tiles (Figs. 3d and e).

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The whole-slide images used in this study are publicly available through the Genomic Data Commons data portal (https://portal.gdc.cancer.gov/). The omics data (mutation, copy number alteration and mRNA expression data) are publicly available through cBioPortal (https://www.cbioportal.org/), and the download links are provided in the Supplementary Note 2.

Code availability

The source code used in this study can be found at https://github.com/huiqu18/GeneMutationFromHE.

References

Bray, F. et al. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 68, 394–424 (2018).
Article PubMed Google Scholar
Bale, R., Putzer, D. & Schullian, P. Local treatment of breast cancer liver metastasis. Cancers 11, 1341 (2019).
Article CAS PubMed Central Google Scholar
Feng, Y. et al. Breast cancer development and progression: Risk factors, cancer stem cells, signaling pathways, genomics, and molecular pathogenesis. Genes Dis. 5, 77–106 (2018).
Article CAS PubMed PubMed Central Google Scholar
Børresen‐Dale, A. L. TP53 and breast cancer. Hum. Mutat. 21, 292–300 (2003).
Article PubMed Google Scholar
Blackwell, K. L. et al. Randomized study of Lapatinib alone or in combination with trastuzumab in women with ErbB2-positive, trastuzumab-refractory metastatic breast cancer. J. Clin. Oncol. 28, 1124–1130 (2010).
Article CAS PubMed Google Scholar
Coudray, N. et al. Classification and mutation prediction from non–small cell lung cancer histopathology images using deep learning. Nat. Med. 24, 1559–1567 (2018).
Article CAS PubMed Google Scholar
Hou, L. et al. Patch-based convolutional neural network for whole slide tissue image classification. In Proc. IEEE Conference on Computer Vision and Pattern Recognition. (IEEE, 2016).
Couture, H. D. et al. Image analysis with deep learning to predict breast cancer grade, ER status, histologic subtype, and intrinsic subtype. npj Breast Cancer 4, 1–8 (2018).
Article Google Scholar
Chen, M. et al. Classification and mutation prediction based on histopathology H&E images in liver cancer using deep learning. npj Precision Oncol. 4, 1–7 (2020).
Google Scholar
Courtiol, P. et al. Deep learning-based classification of mesothelioma improves prediction of patient outcome. Nat. Med. 25, 1519–1525 (2019).
Article CAS PubMed Google Scholar
Zheng, H., Momeni, A., Cedoz, P.-L., Vogel, H. & Gevaert, O. Whole slide images reflect DNA methylation patterns of human tumors. npj Genomic Med. 5, 1–10 (2020).
Article Google Scholar
Anand, D. et al. Deep learning to estimate human epidermal growth factor receptor 2 status from hematoxylin and eosin-stained breast tissue images. J. Pathol. Inf. 11, 11–20 (2020).
Cheerla, A. & Gevaert, O. Deep learning with multimodal representation for pancancer prognosis prediction. Bioinformatics 35, i446–i454 (2019).
Article CAS PubMed PubMed Central Google Scholar
Kather, J. N. et al. Pan-cancer image-based detection of clinically actionable genetic alterations. Nat. Cancer 1, 789–799 (2020).
Article PubMed PubMed Central Google Scholar
Fu, Y. et al. Pan-cancer computational histopathology reveals mutations, tumor composition and prognosis. Nat. Cancer 1, 800–810 (2020).
Article Google Scholar
Schmauch, B. et al. A deep learning model to predict RNA-Seq expression of tumours from whole slide images. Nat. Commun. 11, 1–15 (2020).
Article Google Scholar
Hoadley, K. A. et al. Cell-of-origin patterns dominate the molecular classification of 10,000 tumors from 33 types of cancer. Cell 173, 291–304. e296 (2018).
Article CAS PubMed PubMed Central Google Scholar
Sanchez-Vega, F. et al. Oncogenic signaling pathways in the cancer genome atlas. Cell 173, 321–337. e310 (2018).
Article CAS PubMed PubMed Central Google Scholar
Mogi, A. & Kuwano, H. TP53 mutations in nonsmall cell lung cancer. BioMed Res. Int. 2011, https://doi.org/10.1155/2011/583929 (2011).
Ahn, S. M. et al. Genomic portrait of resectable hepatocellular carcinomas: implications of RB1 and FGF19 aberrations for patient stratification. Hepatology 60, 1972–1982 (2014).
Article CAS PubMed Google Scholar
Huang, W., Skanderup, A. J. & Lee, C. G. Advances in genomic hepatocellular carcinoma research. Gigascience 7, giy135 (2018).
Article PubMed Central Google Scholar
Schulze, K. et al. Exome sequencing of hepatocellular carcinomas identifies new mutational signatures and potential therapeutic targets. Nat. Genet. 47, 505–511 (2015).
Article CAS PubMed PubMed Central Google Scholar
Olivier, M. et al. The clinical value of somatic TP53 gene mutations in 1,794 patients with breast cancer. Clin. Cancer Res. 12, 1157–1167 (2006).
Article CAS PubMed Google Scholar
Ungerleider, N. A. et al. Breast cancer survival predicted by TP53 mutation status differs markedly depending on treatment. Breast Cancer Res. 20, 115 (2018).
Article PubMed PubMed Central Google Scholar
Tiong, K. H., Mah, L. Y. & Leong, C.-O. Functional roles of fibroblast growth factor receptors (FGFRs) signaling in human cancers. Apoptosis 18, 1447–1468 (2013).
Article CAS PubMed PubMed Central Google Scholar
Sobhani, N. et al. Current status of fibroblast growth factor receptor-targeted therapies in breast cancer. Cells 7, 76 (2018).
Article PubMed Central Google Scholar
André, F. et al. Targeting FGFR with dovitinib (TKI258): preclinical and clinical data in breast cancer. Clin. Cancer Res. 19, 3693–3702 (2013).
Article PubMed Google Scholar
Brady, N. J., Chuntova, P., Bade, L. K. & Schwertfeger, K. L. The FGF/FGF receptor axis as a therapeutic target in breast cancer. Expert Rev. Endocrinol. Metab. 8, 391–402 (2013).
Article CAS PubMed PubMed Central Google Scholar
Tenhagen, M., van Diest, P. J., Ivanova, I. A., van der Wall, E. & van der Groep, P. Fibroblast growth factor receptors in breast cancer: expression, downstream effects, and possible drug targets. Endocr. Relat. Cancer 19, R115–R129 (2012).
Article CAS PubMed Google Scholar
Courtesy: National Human Genome Research Institute. Biological Pathways Fact Sheet. https://www.genome.gov/about-genomics/fact-sheets/Biological-Pathways-Fact-Sheet. (2020).
Gasco, M., Shami, S. & Crook, T. The p53 pathway in breast cancer. Breast Cancer Res. 4, 70 (2002).
Article CAS PubMed PubMed Central Google Scholar
Xu, J., Chen, Y. & Olopade, O. I. MYC and breast cancer. Genes Cancer 1, 629–640 (2010).
Article CAS PubMed PubMed Central Google Scholar
Palaskas, N. et al. 18F-fluorodeoxy-glucose positron emission tomography marks MYC-overexpressing human basal-like breast cancers. Cancer Res. 71, 5164–5174 (2011).
Article CAS PubMed PubMed Central Google Scholar
Weigelt, B., Peterse, J. L. & Van’t Veer, L. J. Breast cancer metastasis: markers and models. Nat. Rev. Cancer 5, 591–602 (2005).
Article CAS PubMed Google Scholar
Chen, C.-Y. et al. Expression of Notch gene and its impact on survival of patients with resectable non-small cell lung cancer. J. Cancer 8, 1292 (2017).
Article PubMed PubMed Central Google Scholar
Heist, R. S. et al. FGFR1 amplification in squamous cell carcinoma of the lung. J. Thorac. Oncol. 7, 1775–1780 (2012).
Article CAS PubMed PubMed Central Google Scholar
Katz, L. H. et al. TGF-β signaling in liver and gastrointestinal cancers. Cancer Lett. 379, 166–172 (2016).
Article CAS PubMed PubMed Central Google Scholar
Levy-Jurgenson, A., Tekpli, X., Kristensen, V. N. & Yakhini, Z. Spatial transcriptomics inferred from pathology whole-slide images links tumor heterogeneity to survival in breast and lung cancer. Sci. Rep. 10, 1–11 (2020).
Article Google Scholar
Yao, J., Zhu, X., Jonnagaddala, J., Hawkins, N. & Huang, J. Whole slide images based cancer survival prediction using attention guided deep multiple instance learning networks. Med. Image Anal. 65, 101789 (2020).
Article PubMed Google Scholar
Mobadersany, P., Cooper, L. A. & Goldstein, J. A. GestAltNet: aggregation and attention to improve deep learning of gestational age from placental whole-slide images. Lab. Invest. 1–10 (2021).
Russakovsky, O. et al. Imagenet large scale visual recognition challenge. Int. J. Computer Vis. 115, 211–252 (2015).
Article Google Scholar
Tan, C. et al. A survey on deep transfer learning. In International Conference on Artificial Neural Networks. (Springer, 2018).
Zamir, A. R. et al. Taskonomy: Disentangling task transfer learning. in Proc. IEEE Conference on Computer Vision and Pattern Recognition. (IEEE, 2018).
Reinhard, E., Adhikhmin, M., Gooch, B. & Shirley, P. Color transfer between images. IEEE Computer Graph. Appl. 21, 34–41 (2001).
Article Google Scholar
He, K., Zhang, X., Ren, S. & Sun, J. Deep residual learning for image recognition. in Proc. IEEE Conference on Computer Vision and Pattern Recognition. (IEEE, 2016).
Lin, Z. et al. A structured self-attentive sentence embedding. arXiv preprint arXiv:170303130 (2017).
Cheng, J., Dong, L. & Lapata, M. Long short-term memory-networks for machine reading. arXiv preprint arXiv:160106733 (2016).
Vaswani, A. et al. Attention is all you need. in Advances in Neural Information Processing Systems. (Curran Associates, 2017).
Zhang, H., Goodfellow, I., Metaxas, D. & Odena, A. Self-attention generative adversarial networks. In International conference on machine learning, Proceedings of Machine Learning Research, 7354–7363 (2019).

Download references

Acknowledgements

This work was supported in part by grants from National Science Foundation (IIS-1703883, CNS-1747778, CCF-1733843, IIS-1763523, IIS-1849238-825536, ARO MURI-Z8424104-440149). This work was partially supported by Centre on Perceptual and Interactive Intelligence (CPII) Limited. This work was supported by National Institute of Dental & Craniofacial Research (NIDCR) (U01 DE025188), the National Institute of Biomedical Imaging and Bioengineering (R56 EB020527), and the National Cancer Institute (U01 CA217851).

Author information

These authors contributed equally: Hui Qu, Mu Zhou.
These authors jointly supervised this work: Shaoting Zhang, Olivier Gevaert, Dimitris N. Metaxas.

Authors and Affiliations

Department of Computer Science, Rutgers University, Piscataway, NJ, USA
Hui Qu & Dimitris N. Metaxas
Sensebrain Research, Princeton, NJ, USA
Mu Zhou & Zhennan Yan
School of Medicine, Yale University, New Haven, CT, USA
He Wang
Department of Medicine, Rutgers Robert Wood Johnson Medical School, New Brunswick, NJ, USA
Vinod K. Rustgi
SenseTime Research and Shanghai AI Laboratory, Shanghai, China
Shaoting Zhang
Stanford Center for Biomedical Informatics Research (BMIR), Department of Medicine, Stanford University, Stanford, CA, USA
Olivier Gevaert

Authors

Hui Qu
View author publications
You can also search for this author in PubMed Google Scholar
Mu Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Zhennan Yan
View author publications
You can also search for this author in PubMed Google Scholar
He Wang
View author publications
You can also search for this author in PubMed Google Scholar
Vinod K. Rustgi
View author publications
You can also search for this author in PubMed Google Scholar
Shaoting Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Olivier Gevaert
View author publications
You can also search for this author in PubMed Google Scholar
Dimitris N. Metaxas
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

H.Q., M.Z. and Z.Y. were involved in the study design, data collection, and computational analysis. H.W., R.K.V. and O.G. collected and checked data. H.W. and R.K.V. provided clinical expertise. All authors participated paper writing.

Corresponding authors

Correspondence to Shaoting Zhang, Olivier Gevaert or Dimitris N. Metaxas.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Qu, H., Zhou, M., Yan, Z. et al. Genetic mutation and biological pathway prediction based on whole slide images in breast carcinoma using deep learning. npj Precis. Onc. 5, 87 (2021). https://doi.org/10.1038/s41698-021-00225-9

Download citation

Received: 07 November 2020
Accepted: 14 July 2021
Published: 23 September 2021
DOI: https://doi.org/10.1038/s41698-021-00225-9

This article is cited by

A systematic pan-cancer study on deep learning-based prediction of multi-omic biomarkers from routine pathology images
- Salim Arslan
- Julian Schmidt
- Pahini Pandya
Communications Medicine (2024)
Current status and prospects of artificial intelligence in breast cancer pathology: convolutional neural networks to prospective Vision Transformers
- Ayaka Katayama
- Yuki Aoki
- Tetsunari Oyama
International Journal of Clinical Oncology (2024)
A novel transformer-based aggregation model for predicting gene mutations in lung adenocarcinoma
- Kai Sun
- Yuanjie Zheng
- Weikuan Jia
Medical & Biological Engineering & Computing (2024)
A Large-scale Synthetic Pathological Dataset for Deep Learning-enabled Segmentation of Breast Cancer
- Kexin Ding
- Mu Zhou
- Shaoting Zhang
Scientific Data (2023)
Biological insights and novel biomarker discovery through deep learning approaches in breast cancer histopathology
- Divneet Mandair
- Jorge S. Reis-Filho
- Alan Ashworth
npj Breast Cancer (2023)

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Datasets

Prediction of gene mutation status in breast cancer from pathology images

Prediction of biological pathway activity from histopathology images

Validation of our deep-learning model on lung and liver cancers

Discussion

Methods

Data selection

Histopathology data preprocessing

Mutated genes and pathway activity identification

Model structure

Model training and evaluation

Model fine-tuning on the lung and liver cancers data

Visualization

Reporting summary

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links