Introduction

The most common type of cancer in women is Breast cancer, and the most common form of breast cancer is invasive ductal carcinoma (IDC). Precise identification and categorization of breast cancer subtypes is an important clinical task, and the use of automated methods can save time and reduce error. Curretly, the procedure used to diagnose breast cancer is time intensive and can miss small malignant regions. In order to diagnose cancer, a portion of tissue is inserted on a glass slide. Afterwards a pathologist tests this slide under a microscope. To eventually identify malignant regions, the pathologist has to scan large regions where there is no cancer visually.

Since these glass slides can now be digitized, computer vision can be used to speed up the workflow of a pathologist and to provide help for diagnosis. In this tutorial, you will learn how to train a FAST.AI deep learning model to predict breast cancer in breast histology images.

Dataset

The dataset was originally curated by Janowczyk and Madabhushi and Roa et al. but is available in public domain on Kaggle’s website. It is consisted of 162 whole mount slide images of Breast Cancer (BCa) specimens scanned at 40x.

In general, slide images are huge (spatial dimensions). As a result, 277,524 patches of size 50 x 50 pixels were extracted, including:

78,786 IDC positive (i.e., indicating breast cancer was found in the patch)
198,738 IDC negative (i.e., no breast cancer)

Each patch’s file name is of the format: uxXyYclassC.png — > example 10253idx5x1351y1101class0.png . Where u is the patient ID (10253idx5), X is the x-coordinate of where this patch was cropped from, Y is the y-coordinate of where this patch was cropped from, and C indicates the class where 0 is non-IDC and 1 is IDC.

Inspiration

You can write regular markdown here and Jekyll will automatically convert it to a nice webpage. I strongly encourage you to take 5 minutes to learn how to write in markdown - it’ll teach you how to transform regular text into bold/italics/headings/tables/etc.

Here is some bold text

Here is a secondary heading

Here’s a useless table:

Number	Next number	Previous number
Five	Six	Four
Ten	Eleven	Nine
Seven	Eight	Six
Two	Three	One

How about a yummy crepe?

Crepe

It can also be centered!

Crepe

Here’s a code chunk:

var foo = function(x) {
  return(x + 5);
}
foo(3)

And here is the same code with syntax highlighting:

var foo = function(x) {
  return(x + 5);
}
foo(3)

And here is the same code yet again but with line numbers:

1
2
3
4
var foo = function(x) {
  return(x + 5);
}
foo(3)

Boxes

You can add notification, warning and error boxes like this:

Notification

Note: This is a notification box.

Warning

Warning: This is a warning box.

Error

Error: This is an error box.