• United States
IBM®
               

BRIGHT Challenge
BReast tumor Image classification on Gigapixel HisTopathological images

A new approach to benchmark the acceleration of scientific discovery for cancer subtyping

  • About BRIGHT
  • The Challenge
  • Evaluation
  • Organizers

Important Dates

Release of data, code, and metrics for training Nov. 15, 2021
Release of examples for submission files Jan. 21, 2022
Release of data and metrics for testing Jan. 31, 2022
Challenge workshop website goes live Feb. 01, 2022
Submission deadline of results and papers Feb. 28, 2022
Response from reviewers Mar. 21, 2022
Registration deadline and BRIGHT Workshop Mar. 27, 2022
Camera Ready Version of the paper Apr. 03, 2022
Post-workshop leaderboard release Apr. 04, 2022
Submission deadline for manuscripts Apr. 21, 2022
Publication of challenge outcomes Oct. 01, 2022

Like the BRIGHT Challenge?

Try the KNIGHT challenge for kidney images.

Data

The participants will use the BReAst Carcinoma Subtyping (BRACS) dataset, a cohort of H&E-stained breast tissue biopsies. BRACS is an open source dataset but the BRIGHT challenge data organization, which is different to the public source data, is only available through registration for the Challenge. The slides pertained to breast tumor patients with one of the following six breast tumor subtypes, i.e., Pathological Benign (PB), Usual Ductal Hyperplasia (UDH), Flat Epithelia Atypia (FEA), Atypical Ductal Hyperplasia (ADH), Ductal Carcinoma in Situ (DCIS), and Invasive Carcinoma (IC) (examples in Figure 1). The slides were scanned with an Aperio AT2 scanner at 0:25 \mu m/pixel using a magnification factor of 40\times to result in WSIs.

The WSIs were assigned a subtype in a two-step procedure. First, inline with pathological diagnostic protocol, three board-certified pathologists independently annotated each WSIs according to the highest tumor lesion inside the WSIs. Then, the annotations with disagreement were discussed and re-annotated by the consensus of the three pathologists to reduce observer variability. A set of representative Regions of Interest (ROIs) per many WSIs were also annotated according to the same procedure. We have included the WSIs and the identified ROIs in the dataset.

The major differentiators of BRACS are, (i) the inclusion of pre-cancerous (ADH and FEA) subtypes, which pose a major diagnostic challenge in practice, and (ii) to the best of our knowledge, BRACS is the largest annotated and publicly available dataset for breast tumor subtyping with diverse WSIs and ROIs. Table 1 summarizes the distribution of the subtype-wise number of ROIs and WSIs in the training and validation sets of the BRACS dataset. We are currently in the process of preparing an external cohort of WSIs for the testing.

  Non-cancerous Pre-cancerous Cancerous Total
  PB UDH FEA ADH DCIS IC  
Train ROIs 757 435 673 428 705 568 3566
WSIs 131 65 30 36 49 112 423
Validation ROIs 79 82 83 79 85 81 489
WSIs 16 9 11 12 12 20 80
Test WSIs In progress 200

Table 1: The number of breast tumor subtype-wise ROIs and WSIs in BRACS dataset.

The Challenge

The presence of atypical lesions in a tissue specimen indicates a higher risk for attaining invasive carcinoma, but their identification via tissue examination requires a very high confidence of expert pathologists. For instance, ADH shares morphological similarities with DCIS, and in certain cases, ADH possesses all the features of DCIS and is only limited in size. Also, UDH, ADH and DCIS are all characterized by an intraductal growth pattern which makes these classes difficult to differentiate in H&E stained sections. While atypical lesions are very important, their presence cannot be detected by reviewing a mammogram or other breast imaging studies. They also cannot be felt on a clinical breast exam and if they are found in a core biopsy, more frequent imaging follow-up and often surgical excision are recommended [8]. Finally, including atypias in the profiling of the patients phenotypes paves the way for a more disease progression gnostic biomarker discovery. To that end, an AI solution that can distinguish atypias from other cancer subtypes in histology images is of great value for clinical practice.

To encourage the automated detection of breast tumor subtypes, we define two WSI classification tasks based on the BRACS dataset.

Task 1: A 3-class WSI classification task where we group the six original subtypes according to cancer risk, i.e., Non cancerous (PB+UDH), Pre-cancerous or Atypical (ADH+FEA), and Cancerous (DCIS+IC). Participants can train their algorithms on both annotated WSIs and ROIs. The testing will be done at WSI-level only.

Task 2: A 6-class WSI classification task where the participants have to perform finegrained tumor subtyping of WSIs. This task is more challenging as it includes more number of classes and higher inter-class ambiguities. Similar to Task 1, participants can train on the WSIs and ROIs, and testing will be done at WSI-level only.

Baseline

We developed Deep Learning (DL)-based multi-class classification baselines for the two aforementioned tasks on the BRACS dataset. For both the tasks, the classifications were performed at WSI-level without using the auxiliary annotated ROIs. The baseline CNN models were based on the network architecture presented in [4]. Each CNN model consisted of a compressing path and a learning path. In the compressing path, a WSI was encoded into a grid-based feature map by using a residual feature extractor. In the learning path, attention modules were employed to find ROIs by considering the spatial correlations of neighboring patch features. Following the stain normalization [9] of the WSIs at 10\times magnification, the images were augmented using affine transformations. For both tasks, the models were trained by using a batch size of 8, a dropout of 0:2, and an Adam optimizer with a learning rate of 10-5. The models were tested on the test set of the BRACS dataset, i.e., the Validation set in Table 1, and F1-scores were computed over four runs by using bootstrapped sampling. The results for Task 1 and 2 are reported in Table 2. The method details and code are available in [4].

  Noncancerous Precancerous Cancerous Avg. F1-score
F1-score 0.57\pm0.05 0.43\pm0.06 0.74\pm0.02 0.58\pm0.01
 
  PB UDH FEA ADH DCIS IC Avg. F1-score
F1-score 0.43\pm0.06 0.23\pm0.08 0.20\pm0.08 0.16\pm0.08 0.41\pm0.05 0.89\pm0.01 0.39\pm0.02

Table 2: Mean and standard deviation of F1-score for 3-class and 6-class WSI classification.

The BRACS dataset was used to evaluate Histocartography, an open source graph representation and learning library that can be leveraged by participants in the challenge. Visit the main repository and the Histocartography github organization for more details.

  • Registration
  • Submission (TBD)
  • Contact the Organizers
  • Leaderboard (TBD)

Footer links

  • Contact
  • Privacy
  • Terms of use
  • Accessibility