Ultra-high resolution, multi-scale, context-aware approach for detection of small cancers on mammography

Rangarajan, Krithika; Gupta, Aman; Dasgupta, Saptarshi; Marri, Uday; Gupta, Arun Kumar; Hari, Smriti; Banerjee, Subhashis; Arora, Chetan

doi:10.1038/s41598-022-15259-7

Download PDF

Article
Open access
Published: 08 July 2022

Ultra-high resolution, multi-scale, context-aware approach for detection of small cancers on mammography

Krithika Rangarajan¹^nAff2,
Aman Gupta³,
Saptarshi Dasgupta³,
Uday Marri²,
Arun Kumar Gupta²,
Smriti Hari²,
Subhashis Banerjee³^nAff4 &
…
Chetan Arora³

Scientific Reports volume 12, Article number: 11622 (2022) Cite this article

3088 Accesses
4 Citations
2 Altmetric
Metrics details

Subjects

Abstract

While detection of malignancies on mammography has received a boost with the use of Convolutional Neural Networks (CNN), detection of cancers of very small size remains challenging. This is however clinically significant as the purpose of mammography is early detection of cancer, making it imperative to pick them up when they are still very small. Mammography has the highest spatial resolution (image sizes as high as 3328 × 4096 pixels) out of all imaging modalities, a requirement that stems from the need to detect fine features of the smallest cancers on screening. However due to computational constraints, most state of the art CNNs work on reduced resolution images. Those that work on higher resolutions, compromise on global context and work at single scale. In this work, we show that resolution, scale and image-context are all important independent factors in detection of small masses. We thereby use a fully convolutional network, with the ability to take any input size. In addition, we incorporate a systematic multi-scale, multi-resolution approach, and encode image context, which we show are critical factors to detection of small masses. We show that this approach improves the detection of cancer, particularly for small masses in comparison to the baseline model. We perform a single institution multicentre study, and show the performance of the model on a diagnostic mammography dataset, a screening mammography dataset, as well as a curated dataset of small cancers < 1 cm in size. We show that our approach improves the sensitivity from 61.53 to 87.18% at 0.3 False Positives per Image (FPI) on this small cancer dataset. Model and code are available from https://github.com/amangupt01/Small_Cancer_Detection

Deep Learning to Improve Breast Cancer Detection on Screening Mammography

Article Open access 29 August 2019

Attention-based deep learning for breast lesions classification on contrast enhanced spectral mammography: a multicentre study

Article 15 December 2022

A convolutional deep learning model for improving mammographic breast-microcalcification diagnosis

Article Open access 14 December 2021

Introduction

Breast cancer is the most common cancer in women, and the second most common cancer overall in the world¹. Early detection is a crucial factor that aids in reducing mortality rates due to breast cancer². Mammography is therefore offered as a screening modality for women over a certain age in many countries for early detection of cancer³. In fact, the 10 year survival rate due to breast cancer falls from over 95% in patients where the cancer is less than 1 cm to about 60% when the size of the cancer is over 3 cm⁴. However, detecting cancer on mammograms is a tedious job⁵, apart from requiring highly specialized breast radiologists. Only about 5 out of 1000 scans would harbor a cancer⁶. The cancer may also only occupy less than 1% of the image. The ability to identify these tiny cancers which are likely to be missed by a fatigued radiologist would therefore be an important contribution of computer vision.

State-of-art deep learning based image classifiers^7,8 and object detectors^9,10,11 have performed exceedingly well in mammography. The reader is directed to the systematic review by Freeman et al.¹² for a more detailed description of currently available deep learning tools. However the sensitivity for detection of small masses is lower than for large masses¹³. Recently, some authors have attempted to design approaches specific to small cancer detection. Savelli et al.¹⁴ in their work designed a multi-context network for small lesion detection, however they show their results only on microcalcification. Agarwal et al.¹³ trained a patch-based classifier while showing the benefit of domain adaptation in mammography. They showed their results on masses of different sizes in their analysis, showing that the performance of CNNs drops for small masses. Lotter et al.¹⁵ perform multi-scale curriculum learning on mammograms to deal with the problem of very small lesions in comparison to image size. However, they perform only image level classification, without precise cancer localisation. In this work, we deliver a concerted effort towards the detection of small sized cancers, which represent a clinically significant problem. We analyze this problem from the perspective of mammography and propose a solution by combining resolution, scale as well as image-context. (Fig. 1).

Importance of spatial resolution

Fine Microcalcifications and details such as spiculations are central to identifying breast cancers, the visibility of which on mammography is critically dependent on the spatial resolution of a mammogram. In fact, out of all imaging modalities, mammography has the highest spatial resolution. Digital Mammograms have a pixel size of around 50–100 microns¹⁶. DMs tend to be very large images, ranging from 2300 × 1800 pixels (of dimension 100 microns) to 4096 × 5625 pixels (of dimension 54 micron)¹⁷. Most CNNs however take much smaller input sizes, thus losing this information so critical to the diagnosis of small masses.

Importance of scale

When mammograms are used in full resolution as input to a network, the receptive field typically includes only a fraction of the image. While full resolution is vital for identifying fine details (such as spiculations), masses may not be seen in its entirety within the receptive field. Thereby the shape and margins of the mass would not be visualized, which are important descriptors of any mammographic mass (Fig. 1). Thus, apart from seeing mammograms at original resolution, we propose that a multi-scale approach where the image is also seen at reduced scale would be particularly beneficial in the context of mammograms.

Importance of image-context

The features essential for detection of small cancers, may be significantly different from those for large (or average sized) cancers, much like in case of small objects in natural images¹⁸. While shape, margins and density are important descriptors for average sized masses, for a very small mass, features such as margins and shape may not be visible directly. Other cues in surrounding parenchyma such as architectural distortion and the skin bulge or retraction play an important role in the detection. Thus we propose that image-context is particularly important in the case of small sized cancers.

Approach to detection of small cancers

Though resolution, scale and context are all well described factors playing an important role in object detection, designing models which can do all these together is challenging. Our ideas and network for incorporating scale and context for detection are inspired by the work of Hu et al.¹⁸ on small object detection in natural images. We hereby present a concerted effort towards small cancer detection with our network, which takes into account all the above factors, without compromising on each other.

Materials and methods

Data

This was a single institution, multi-centre study. We show our results on one diagnostic mammography dataset, one screening mammography dataset and a curated dataset with small cancers.

Training dataset

For training our network, we collected a dataset from our hospital consisting of Full Field Digital Mammograms (FFDM) acquired on Selenia Dimensions, a Hologic Mammography unit from Jan 2015 to Dec 2015. In order to make a balanced dataset suitable for training, we collected consecutive patients who had been assigned a BIRADS 4 category and had a histopathological diagnosis. Thus this dataset consisted of 839 images, with 393 cancers. The images of the contralateral breast provided normal examples to the network.

Test datasets

Diagnostic mammography dataset

Our country has no formal mammography screening program. Thus our test dataset (acquired from the same centre as the training dataset) had a distribution of diagnostic mammography practice. For this dataset, consecutive patients who underwent mammography from January 2018 to June 2018 were chosen. Patients who had been given a BIRADS 4 or above but did not have histopathological report were excluded from the study. There were 2569 images with 243 cancers in this dataset. This is referred to as the DM dataset in further discussion.

Screening mammography dataset

This was an external dataset obtained from our cancer centre, where opportunistic screening is offered to all eligible women. All patients who underwent mammography from January to April 2021 were selected, except those without histological proof for BIRADS 4 or above lesions. This dataset provided an external test dataset, as well as helped ascertain our efficacy in a screening setting. These images were acquired on a Hologic system. There were 2146 images with 59 cancers, and this is referred to as the SM dataset in further discussion.

Small cancer dataset

In order to establish the value of our network in small cancers, we curated a dataset of patients with cancers less than 1 cm in size (diameter of mass was used for masses and longest dimension of the cluster was used for a cluster of microcalcification). There were 79 images in this dataset with an average cancer size of 5.8 mm diameter. This is referred to as the SC dataset. Images in this dataset came from both the above centres.

All datasets were collected after obtaining ethical clearance from the Institutional Review Board (IRB) of the All India Institute of Medical Sciences with reference number IEC-247/04.05.2018. This data was de-identified, informed consent was obtained for use of data from all patients participating in the study. All experimental protocols were approved by the IRB of the All India Institute of Medical Sciences, New Delhi and all methods were carried out in accordance with prevailing guidelines and regulations. Bounding box annotations were performed by 3 breast radiologists with 2, 8 and 15 years of experience in breast imaging. All images were of size 3328 × 4096 pixels.

Model architecture

Pre-processing

All input images were of size 3328 × 4096. These were initially cropped to remove the portion of the mammogram that had no breast in it. These images were thus of variable size (depending on the size of the breast) on one dimension and 4096 pixels on the other dimension. These were then passed forward in the network.

Network architecture

Our proposed architecture is shown in Fig. 2. The network involved the following steps (1) Generating multiple scales (2) Systematic crop of images (3) Passing through baseline architecture (4) Combination at test time.

Generation of multiple scales

The full resolution image is rescaled to give images at 3 scales- X, 0.5X and 0.25 X, where X is the original image.

Systematic crops

Crops of size 0.25 times the original image are taken from all 3 scales. These crops were 1024 pixels on one dimension, and variable size on the other dimension. These crops constitute the input to the network. The crops are systematically taken from the larger images from right to left and top to bottom ensuring that no part of the image is left out.

Baseline architecture

We chose YOLO v5 (You Only Look Once version 5)¹⁹ as our baseline architecture as it is fully convolutional and thus allows us to pass images of all input sizes. Our ablation study to choose the baseline network is presented in the results section. We used YOLO v5 with a CSPDarknet backbone, a PANet neck with upsampling and Concatenations from different layers and a final YOLO head Convolution. The CSPDarknet is used for feature extraction, PANet for feature fusion and the YOLO layer for computing class and objectness scores. Concatenation of feature vectors at varying layers within the back-bone architecture helps to leverage the varying contextual information available in different layers. Each layer therefore corresponds to increasingly larger receptive fields, which is instrumental in detecting masses of very small sizes.

Combination of output at test time

At test time, full images were given as input in 3 scales- X, 0.5 X and 0.25X. Predictions were generated for each scale separately. These predictions were finally combined using Weighted Box Fusion²⁰(WBF) as described by Wang et al. WBF uses the confidence scores from each of the models and then combines them to construct an "average” bounding box that captures the underlying ground truth boxes better than any of the individual predictions from the models. We experimented with simple Non-Maximal Suppression (NMS) and NMS with thresholds and found WBF to perform the best.

Implementation details

As in yolov5 default implementation, we used a Binary Cross-Entropy with Logits Loss for computation of object scores, and SGD as the Optimizer. We kept the batch size to 16, and the initial learning rate to be 0.01. All computations were carried out on High Performance Computing Cluster having 32 GB V100 GPUs.

Results and experiments

We evaluated the model using Free-Response Operator Characteristic (FROC) curves by plotting the sensitivity of the network against the false positive marks per image. A detection was considered a true detection if the center of the predicted box fell anywhere within the ground-truth box, as is the standard practice in mammography^21,22.

Selection of baseline architecture

For selection of baseline network, 4 object detection networks which were fully convolutional were trained on our training data and tested on the DM dataset. The results are given in Table 1.

Table 1 Selection of a fully convolutional network as baseline for our proposed network.

Full size table

Results on the DM, SM and small cancer dataset

Our FROC curves on the DM, SM and SC datasets are given in Fig. 3a–c respectively . The results of the baseline architecture is also plotted for comparison. Table 2 summarizes the performance of the network on the 3 datasets.

Table 2 Summary of performance of our proposed network on the 3 datasets.

Full size table

Ablation studies

The architecture we have described has been built on 3 basic principles: resolution, scale and context. In order to study the effect of each of these components, we performed a few ablation experiments on our Small Cancer dataset (Fig. 5).

First, to study the effect of resolution, we tested the small cancer dataset by upsampling the low resolution images instead of using higher resolutions for crops. Here the images were first downsampled to 0.25X, and then 0.5X and X were generated by upsampling the 0.25X image. Crops were then taken from the upsampled images. Figure 4a shows the FROC thus obtained and compares it with our proposed model. As seen here, the model performs much better when crops are taken from high resolution images, rather than simply from different scales, demonstrating the importance of resolution.

Second, in order to study the effect of scale, we train our baseline architecture with only full resolution images without re-scaling them (Fig. 4b). As seen in Fig. 4b, our proposed model performs better than the model trained and tested on only the high resolution image indicating the importance of a multi-scale approach.

Finally, in order to study the effect of context, we first negate the effect of resolution by using upsampled images (upsampled from 0.25X images) rather than original resolution images as input. Crops were now systematically taken from each scale. Thus here crops from 0.25X have maximum context and crops from X have least context, though all have the same resolution. The performance on 0.25X and X are thus studied in Fig. 4c. As seen here, 0.25 X performs much better than X, demonstrating the importance of image context in cancer detection.

In order to analyze the importance of each scale/ resolution in relation to size of image, we analysed the results of each individual scale prior to WBF on the SM dataset. We analysed masses exclusively caught only on that particular scale. These are summarized in Table 3.

Table 3 Analysis of detection performance on individual scales prior to WBF.

Full size table

Thus we see that each factor, resolution, scale as well as context had significant independent contribution towards achieving the best accuracy using our model.

Discussion

Context, scale and resolution have been known to play a role in small object detection in natural images. In this work we explore the role of each of these factors towards small cancer detection specific to mammography, which presents a unique problem due to the presence of very small objects (cancers) in very large images. We showed through experiments how each of these factors have an important role to play in detection of small cancers.

We performed a single institution, multi-centre study to validate the results of our proposed network. We validated our results on a dataset with a distribution of diagnostic mammography practice as well as a screening mammography dataset (external validation). We show that our network performs comparably in both settings. In both datasets we showed an improvement in comparison to our baseline architecture. In addition, we tested our results on a curated dataset with cancers < 1 cm in size. We show that the improvement is most marked in this dataset with our approach, suggesting that this approach is particularly suited for detection of small cancers. Some examples from this dataset which were captured by our network are given in Fig. 5. We also note that despite being an external dataset, we perform better in our SM dataset than our DM dataset. This is due to a large number of benign masses in our DM dataset which were detected as cancers, contributing to false positives in this dataset.

We analyze the importance of resolution, context and scale by comparing our performance of each factor in isolation with the performance of our proposed network. Through our ablation studies (Fig. 4) we demonstrate that each of these factors have an important role to play. Our analysis of performance of each individual scale (Table 3) also showed that though the sensitivity was the lowest at full resolution, this scale was particularly important for detection of small masses.

Our study has some limitations. We acknowledge that all mammograms came from a single vendor, thus the effectiveness of the network in other situations need further study. Analysis of masses that were missed by this network revealed that we missed isodense, obscure masses, and masses placed in the peripheral breast tissue. There were only 3 images with microcalcification clusters in the DM dataset and 9 images in the SM dataset. Therefore the performance of the network on microcalcifications has not been adequately studied. Analysis of the false positives revealed that some benign lesions, such as cysts and fibroadenomas were also detected as cancers by the network. This indicates the direction of further research in this area.

To conclude, in this work we present a concerted effort towards the detection of small cancers. In a multicentre study we show the effectiveness of our approach in the setting of diagnostic mammography, screening mammography, as well as in a subset of small cancers < 1 cms in size.

Data availability

All our datasets were from the All India Institute of Medical Sciences, New Delhi, and are not available publicly.

References

Bray, F. et al. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA. Cancer J. Clin. 68, 394–424 (2018).
Article Google Scholar
Cancer Facts and Statistics 2015 | Research | American Cancer Society. https://www.cancer.org/research/cancer-facts-statistics/all-cancer-facts-figures/cancer-facts-figures-2015.html.
Seely, J. M. & Alhassan, T. Screening for breast cancer in 2018-what should we be doing today?. Curr. Oncol. Tor. Ont 25, S115–S124 (2018).
Article CAS Google Scholar
Zheng, Y.-Z., Wang, L., Hu, X. & Shao, Z.-M. Effect of tumor size on breast cancer-specific survival stratified by joint hormone receptor status in a SEER population-based study. Oncotarget 6, 22985–22995 (2015).
Article Google Scholar
Cheng, H. D. et al. Approaches for automated detection and classification of masses in mammograms. Pattern Recognit. 39, 646–668 (2006).
Article ADS Google Scholar
Grabler, P., Sighoko, D., Wang, L., Allgood, K. & Ansell, D. Recall and cancer detection rates for screening mammography: Finding the sweet spot. AJR Am. J. Roentgenol. 208, 208–213 (2017).
Article Google Scholar
Hepsağ, P. U., Özel, S. A. & Yazıcı, A. Using deep learning for mammography classification. In 2017 International Conference on Computer Science and Engineering (UBMK) 418–423. https://doi.org/10.1109/UBMK.2017.8093429 (2017).
Zhu, W., Lou, Q., Vang, Y. S. & Xie, X. Deep multi-instance networks with sparse label assignment for whole mammogram classification. ArXiv170508550 Cs (2017).
Ribli, D., Horváth, A., Unger, Z., Pollner, P. & Csabai, I. Detecting and classifying lesions in mammograms with Deep Learning. Sci. Rep. 8, 4165 (2018).
Article ADS Google Scholar
Detection of Masses in Mammograms Using a One-Stage Object Detector Based on a Deep Convolutional Neural Network. https://journals.plos.org/plosone/article?id=https://doi.org/10.1371/journal.pone.0203355.
Shen, L. et al. Deep Learning to Improve Breast Cancer Detection on Screening Mammography. Sci. Rep. 9, 12495 (2019).
Article ADS Google Scholar
Freeman, K. et al. Use of artificial intelligence for image analysis in breast cancer screening programmes: Systematic review of test accuracy. BMJ 374, n1872 (2021).
Article Google Scholar
Agarwal, R., Diaz, O., Lladó, X., Yap, M. H. & Martí, R. Automatic mass detection in mammograms using deep convolutional neural networks. J. Med. Imaging 6, 031409 (2019).
Article Google Scholar
Savelli, B., Bria, A., Molinara, M., Marrocco, C. & Tortorella, F. A multi-context CNN ensemble for small lesion detection. Artif. Intell. Med. 103, 101749 (2020).
Article CAS Google Scholar
Lotter, W., Sorensen, G. & Cox, D. A multi-scale cnn and curriculum learning strategy for mammogram classification. ArXiv170706978 Cs (2017).
McNitt-Gray, M. F. AAPM/RSNA physics tutorial for residents: Topics in CT. Radiation dose in CT. Radiographics 22, 1541–1553 (2002).
Article Google Scholar
Maidment, A. D. A. Digital mammography. Semin. Roentgenol. 38, 216–230 (2003).
Article Google Scholar
Hu, P. & Ramanan, D. Finding tiny faces. In 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 1522–1530. https://doi.org/10.1109/CVPR.2017.166 (2017).
YOLOv5 Documentation. https://docs.ultralytics.com/.
Solovyev, R., Wang, W. & Gabruseva, T. Weighted boxes fusion: Ensembling boxes from different object detection models. Image Vis. Comput. 107, 104117 (2021).
Article Google Scholar
Bunch, P. C., Hamilton, J. F., Sanderson, G. K. & Simmons, A. H. A free response approach to the measurement and characterization of radiographic observer performance. In Application of Optical Instrumentation in Medicine VI Vol. 0127 124–135 (International Society for Optics and Photonics, 1977).
Chapter Google Scholar
Ellis, R. L., Meade, A. A., Mathiason, M. A., Willison, K. M. & Logan-Young, W. Evaluation of computer-aided detection systems in the detection of small invasive breast carcinoma. Radiology 245, 88–94 (2007).
Article Google Scholar

Download references

Acknowledgements

This work was supported in part by the Department of Biotechnology, Government of India, under Grant BT/PR33193/AI/133/5/2019. We thank support of Google in the form of google cloud credits for this work. We acknowledge the contribution of Gaurav Jain, Shivang Chopra, Gaurav Munjal and Rohan Dhanakshirur for some initial research that culminated in this work. Rangarajan K and Gupta Aman contributed equally towards this paper. Use of all non-public data was started after obtaining ethical clearance from IRB, reference number IEC-247/04.05.2018.

Author information

Krithika Rangarajan
Present address: Department of Radiology, All India Institute of Medical Sciences, New Delhi, India
Subhashis Banerjee
Present address: Department of Computer Science, Ashoka University, Sonepat, Harayana, India

Authors and Affiliations

School of Information Technology, Indian Institute of Technology, Delhi, India
Krithika Rangarajan
Department of Radiology, All India Institute of Medical Sciences, New Delhi, India
Uday Marri, Arun Kumar Gupta & Smriti Hari
Department of Computer Science and Engineering, Indian Institute of Technology, Delhi, India
Aman Gupta, Saptarshi Dasgupta, Subhashis Banerjee & Chetan Arora

Authors

Krithika Rangarajan
View author publications
You can also search for this author in PubMed Google Scholar
Aman Gupta
View author publications
You can also search for this author in PubMed Google Scholar
Saptarshi Dasgupta
View author publications
You can also search for this author in PubMed Google Scholar
Uday Marri
View author publications
You can also search for this author in PubMed Google Scholar
Arun Kumar Gupta
View author publications
You can also search for this author in PubMed Google Scholar
Smriti Hari
View author publications
You can also search for this author in PubMed Google Scholar
Subhashis Banerjee
View author publications
You can also search for this author in PubMed Google Scholar
Chetan Arora
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

K.R., U.M. and S.H. were involved in data collection and curation. K.R., S.H. and A.K.G. were involved in giving critical inputs to network design from a radiological perspective. A.G., S.D. were involved in code generation and testing. K.R., S.B. and C.A. were involved in network design and execution. K.R., C.A. and S.B. oversaw the entire project, and were involved in paper writing.

Corresponding author

Correspondence to Krithika Rangarajan.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Rangarajan, K., Gupta, A., Dasgupta, S. et al. Ultra-high resolution, multi-scale, context-aware approach for detection of small cancers on mammography. Sci Rep 12, 11622 (2022). https://doi.org/10.1038/s41598-022-15259-7

Download citation

Received: 31 May 2021
Accepted: 21 June 2022
Published: 08 July 2022
DOI: https://doi.org/10.1038/s41598-022-15259-7

This article is cited by

Emerging Clinical Applications for Cone Beam Breast CT: Changing the Breast Imaging Paradigm
- Kristina Siddall
- Xiaohua Zhang
- Avice O’Connell
Current Breast Cancer Reports (2024)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Deep Learning to Improve Breast Cancer Detection on Screening Mammography

Attention-based deep learning for breast lesions classification on contrast enhanced spectral mammography: a multicentre study

A convolutional deep learning model for improving mammographic breast-microcalcification diagnosis

Introduction

Importance of spatial resolution

Importance of scale

Importance of image-context

Approach to detection of small cancers

Materials and methods

Data

Training dataset

Test datasets

Diagnostic mammography dataset

Screening mammography dataset

Small cancer dataset

Model architecture

Pre-processing

Network architecture

Generation of multiple scales

Systematic crops

Baseline architecture

Combination of output at test time

Implementation details

Results and experiments

Selection of baseline architecture

Results on the DM, SM and small cancer dataset

Ablation studies

Discussion

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Emerging Clinical Applications for Cone Beam Breast CT: Changing the Breast Imaging Paradigm

Comments

Search

Quick links