Prerequisites (this notebook can be run either online using Google Colab or on your local machine)
- A Google account for accessing Google Colab (link to notebook). If running the exercise in Google Colab, copy the notebook to your own Google Drive and follow the exercise; you do not need to download the dataset.
or
- A Python environment with the necessary libraries (manual). Download this notebook (click the download button in the top right corner of the page) and follow the exercise.
- Downloaded data (module4.zip/theme4_exercise_machine_learning). The dataset consists of:
- Hyperspectral RPAS imagery of Bílá Louka, Czechia (50.728N, 15.682E), acquired in August 2020 and resampled to 54 spectral bands with a ground sampling distance of 9 cm: BL_202008_imagery.tif
- A raster with reference data: BL_202008_reference.tif
- Pretrained models and corresponding classified rasters: /sample_results/*
Tasks
- Preprocess hyperspectral imagery
- Classify the hyperspectral image using Spectral Information Divergence (SID)
- Observe how changing the training set size changes the resulting classification
- Evaluate your results and compare to our pretrained classifier
- Optional: Classify an urban scene
0. Load external libraries and set paths¶
First, we need to import external libraries:
numpy - Arrays to hold our data
matplotlib.pyplot - Draw images
pysptools.classification - Define and use a SID classifier
sklearn.metrics - Compute accuracy metrics using scikit-learn
sklearn.preprocessing - Normalize input data using scikit-learn
time.perf_counter - Track how long individual functions take to run
os.path - Path manipulation
tqdm - Show progress bars during training
joblib - Saving and loading trained classifiers
etrainee_m4_utils.image_preprocessing - Our library holding functions for image tiling, preprocessing, etc.
etrainee_m4_utils.inference_utils - Our library for correctly exporting classified images
etrainee_m4_utils.visualisation_utils - Our library for visualising the data
Two external libraries are not imported directly in this notebook, but are used by functions in image_preprocessing and inference_utils:
- gdal - Manipulates spatial data
- scipy.io - Reads .mat files
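For orientation, reading a raster with GDAL essentially boils down to the following (a simplified sketch of what read_gdal() does internally, not its exact code):
from osgeo import gdal
import numpy as np
dataset = gdal.Open('BL_202008_imagery.tif')  # open the GeoTIFF
array = dataset.ReadAsArray()                 # multiband rasters load as (bands, rows, cols)
array = np.transpose(array, (1, 2, 0))        # reorder to (rows, cols, bands)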
import numpy as np
import matplotlib.pyplot as plt
from time import perf_counter
from os.path import join
from pysptools.classification import SID
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import MinMaxScaler
from sklearn.metrics import classification_report
from tqdm import notebook as tqdm
from joblib import dump, load
from etrainee_m4_utils import image_preprocessing
from etrainee_m4_utils import inference_utils
from etrainee_m4_utils import visualisation_utils
# GLOBAL SETTINGS
plt.rcParams['figure.figsize'] = [5, 5]
np.set_printoptions(precision=2, suppress=True) # Array print precision
# Set dataset name (used by visualisation functions)
# either 'bila_louka' or 'pavia_centre'
ds_name = 'bila_louka'
# Get a list of class names
_, class_names = visualisation_utils._create_colorlist_classnames(ds_name=ds_name)
Please fill in the correct paths to your training and reference rasters (pointing the root_path variable to the project folder should be enough):
root_path = 'C:/folder/where/this/project/is/saved'
# PATHS TO TRAINING DATA
# Bila Louka
imagery_path = join(root_path, 'BL_202008_imagery.tif')
reference_path = join(root_path, 'BL_202008_reference.tif')
# Pavia
#imagery_path = join(root_path, 'Pavia.mat')
#reference_path = join(root_path, 'Pavia_gt.mat')
# PATH TO SAVE MODELS
model_save_folder = join(root_path, 'models')
# PATH TO SAVE CLASSIFIED IMAGE
out_path = join(root_path, 'results/Krkonose_SID.tif')
# PATH TO THE SAMPLE RESULTS
sample_result_path = join(root_path, 'sample_results/SID_sample_result.tif')
# PATH TO THE TEST REFERENCE USED IN SECTION 5
# (the filename is an assumption - adjust it to the file in sample_results)
test_path = join(root_path, 'sample_results/SID_test_reference.tif')
1. Load and preprocess training data¶
1.1. Data loading into NumPy¶
Let's start by reading an image into a numpy array; under the hood, this is done using GDAL.
The result of our function is a dictionary named loaded_raster, which contains two numpy arrays under keys imagery and reference. As we can see, the loaded hyperspectral dataset has 1088 by 1088 pixels with 54 spectral bands. The raster containing our reference data has the same dimensions in height and width.
For loading most raster datasets, we created a read_gdal() function in the image_preprocessing module. But loading .mat files for the Pavia City Centre requires a specific function (read_pavia_centre()). Both read_pavia_centre() and read_gdal() return a dictionary containing two numpy arrays with keys imagery and reference.
If using the Pavia City Centre dataset, you may notice that the original image has a shape of (1096, 1096, 102), but to make the data easier to tile for neural networks, we crop the image to (1088, 1088, 102) here.
loaded_raster = image_preprocessing.read_gdal(imagery_path, reference_path)
#loaded_raster = image_preprocessing.read_pavia_centre(imagery_path,
# reference_path, out_shape=(1088, 1088, 102))
print(f'Loaded imagery shape {loaded_raster["imagery"].shape}')
print(f'Loaded reference shape {loaded_raster["reference"].shape}')
visualisation_utils.show_img_ref(loaded_raster["imagery"][:, :, [25, 15, 5]],
loaded_raster["reference"], ds_name=ds_name)
1.2. Flatten array¶
We will be classifying individual pixels, so we can transform the 3D image (height, width, spectral bands) into a 2D array (number of pixels, spectral bands). This transformation destroys spatial relationships within the image; however, the classifier only uses the spectral dimension anyway, and flattening simplifies the next step (filtering NoData).
orig_shape = loaded_raster['imagery'].shape
flat_arrs = {}
flat_arrs['imagery'] = loaded_raster['imagery'].reshape(
orig_shape[0]*orig_shape[1], orig_shape[2])
flat_arrs['reference'] = loaded_raster['reference'].reshape(
orig_shape[0]*orig_shape[1])
print(f'The flat imagery array has shape {flat_arrs["imagery"].shape}')
1.3. Filter out NoData pixels¶
We can only train the classifier on pixels with a reference value, therefore we remove all pixels belonging to class 0 (NoData). This operation reduces our training dataset from ~1.18 million to ~130 thousand pixels. We then visualise the spectral curves of individual pixels.
filtered_arrs = {}
filtered_arrs['imagery'] = flat_arrs['imagery'][flat_arrs['reference'] > 0]
filtered_arrs['reference'] = flat_arrs['reference'][flat_arrs['reference'] > 0]
print(f'The filtered array has shape {filtered_arrs["imagery"].shape}')
pixel_number = 1000
visualisation_utils.show_spectral_curve(filtered_arrs, pixel_number,
ds_name=ds_name)
1.4. Splitting data for training/testing¶
Our reference dataset has to be split into three groups:
training / validation / test
In this step we divide the data into train+val and test groups. The variable train_fraction sets what fraction of the reference dataset is used for training and validation. By default, one third of the reference data is used for training and validation, while the other two thirds are used for testing model performance. The splitting is done using the scikit-learn function train_test_split, which here performs stratified sampling across all classes.
Please experiment with different fractions of training data.
train_fraction = 1/3
X_train, X_test, y_train, y_test = train_test_split(
    filtered_arrs['imagery'], filtered_arrs['reference'],
    train_size=train_fraction, stratify=filtered_arrs['reference'])
train = {'imagery': X_train, 'reference': y_train}
test = {'imagery': X_test, 'reference': y_test}
# How many training samples do we have for each class?
unique, counts = np.unique(train['reference'], return_counts=True)
print(f'The individual classes contain {counts} training pixels.')
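To get a feel for how train_fraction changes the amount of training data per class, you can loop over several fractions (a sketch reusing the variables above; re-run sections 2-4 with each split to compare the resulting classifications):
for fraction in [0.05, 0.1, 1/3, 0.5]:
    X_tr, _, y_tr, _ = train_test_split(
        filtered_arrs['imagery'], filtered_arrs['reference'],
        train_size=fraction, stratify=filtered_arrs['reference'])
    _, cnts = np.unique(y_tr, return_counts=True)
    print(f'train_fraction = {fraction:.2f}: {cnts} pixels per class')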
2. Extract mean spectral curves from training data¶
The training data can then be used to extract a mean spectral curve for each class, thus creating a spectral library. The spectral library is then used by the classifier.
num_classes = np.unique(train['reference']).shape[0]
num_bands = train['imagery'].shape[1]
mean_bands = np.empty((num_classes, num_bands))
for class_val in np.unique(train['reference']):
mean_bands[class_val-1, :] = np.mean(train['imagery'][train['reference'] == class_val], axis=0)
spectral_lib = {'imagery': mean_bands, 'reference': np.unique(train['reference'])}
Let's visualise our spectral library:
for cls in range(mean_bands.shape[0]):
plt.plot(mean_bands[cls, :], label=class_names[cls+1])
plt.legend()
3. Model application & evaluation¶
3.1. Loading and preprocessing the data¶
Load a raster to classify. This can be the one that we used for training, but it can also be a different raster with the same number of bands.
By default, the training raster (imagery_path) is used.
# Load raster
raster = image_preprocessing.read_gdal_with_geoinfo(imagery_path, (0,0))
# Flatten the spatial dimensions of the raster
raster_shape = raster['imagery'].shape
raster_flat = raster['imagery'].reshape(raster_shape[0]*raster_shape[1],
raster_shape[2])
3.2. Defining and applying the classifier¶
The following snippet applies the classifier to the loaded imagery and then transforms the flattened array back into a raster. pysptools expects a 3D (rows, columns, bands) image, so we add a dummy axis before classifying and drop it again afterwards.
sid = SID()
predicted_flat = sid.classify(raster_flat[:, None], mean_bands, threshold=0.9)[:, 0]
predicted_raster = predicted_flat.reshape(raster_shape[0], raster_shape[1])
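For intuition: SID (Spectral Information Divergence) treats each spectrum as a probability distribution and measures the symmetric relative entropy between a pixel and each class mean; the pixel is assigned to the closest class, subject to the threshold. A minimal NumPy sketch of the measure itself (not pysptools' exact implementation):
def spectral_information_divergence(x, y, eps=1e-12):
    # normalise both spectra so they sum to one (pseudo-probabilities)
    p = x / (x.sum() + eps) + eps
    q = y / (y.sum() + eps) + eps
    # symmetric sum of the two Kullback-Leibler divergences
    return np.sum(p * np.log(p / q)) + np.sum(q * np.log(q / p))
# divergence of one pixel to every class mean; the smallest wins
divs = [spectral_information_divergence(raster_flat[0], m) for m in mean_bands]
print(f'Closest class: {np.argmin(divs) + 1}')  # +1 because class 0 is NoData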
You can also visualise the result:
visualisation_utils.show_classified(loaded_raster['imagery'][:, :, [25, 15, 5]],
loaded_raster['reference'],
predicted_raster, ds_name=ds_name)
3.3. Save/load trained SID for potential future use¶
# save using joblib.dump(object, filename)
model_path = join(model_save_folder, 'SID.joblib')
dump(sid, model_path)
# load using joblib.load(filename)
sid = load(model_path)
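Note that the SID object itself holds no fitted parameters; the training information lives in the spectral library. It can therefore make sense to persist mean_bands (here via spectral_lib) alongside the model:
# save the spectral library next to the model
dump(spectral_lib, join(model_save_folder, 'SID_spectral_lib.joblib'))
# ...and restore it later
spectral_lib = load(join(model_save_folder, 'SID_spectral_lib.joblib'))
mean_bands = spectral_lib['imagery']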
3.4. Export resulting raster¶
Export the resulting classified raster into out_path for distribution or further analysis (e.g. validation in GIS).
inference_utils.export_result(out_path, predicted_raster, raster['geoinfo'])
4. Evaluate Classification Result¶
4.1. Apply classifier to the test dataset¶
sid = SID()
classified_test = sid.classify(test['imagery'][:, None], mean_bands, threshold=0.9)[:, 0]
4.2. Compute accuracy metrics¶
print(classification_report(test['reference'], classified_test,
target_names=class_names[1:]))
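If you want single-number summaries for comparing runs (e.g. with different train_fraction values), scikit-learn's accuracy_score and cohen_kappa_score are handy:
from sklearn.metrics import accuracy_score, cohen_kappa_score
print(f"Overall accuracy: {accuracy_score(test['reference'], classified_test):.3f}")
print(f"Cohen's kappa: {cohen_kappa_score(test['reference'], classified_test):.3f}")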
4.3. Show Confusion Matrix¶
visualisation_utils.show_confusion_matrix(test['reference'], classified_test,
ds_name=ds_name)
5. Sample Solution¶
Our classification produced the following result (note that your result may differ):
# Read test reference
test_arr = image_preprocessing.read_gdal(imagery_path, test_path)
test_flat = test_arr['reference'].reshape(
test_arr['reference'].shape[0]*test_arr['reference'].shape[1])
test_filtered = test_flat[test_flat > 0]
# Read sample result
sample_arr = image_preprocessing.read_gdal(imagery_path, sample_result_path)
sample_flat = sample_arr['reference'].reshape(
sample_arr['reference'].shape[0] * sample_arr['reference'].shape[1])
sample_filtered = sample_flat[test_flat > 0]
# Visualise the sample result
visualisation_utils.show_classified(loaded_raster['imagery'][:, :, [25, 15, 5]],
loaded_raster['reference'],
sample_arr['reference'],
ds_name=ds_name)
# Print a classification report for the sample result
print(classification_report(test_filtered, sample_filtered,
target_names=class_names[1:]))
# Show a Confusion matrix for the sample result
visualisation_utils.show_confusion_matrix(test_filtered, sample_filtered,
ds_name=ds_name)
Optional: Classify an urban scene¶
Try using this notebook to classify an urban scene (Pavia City Centre). Reflect on how the different landscape structure and clearer class definitions influence the classification result.
Pavia City Centre is a common benchmark for hyperspectral data classification and can be obtained from http://www.ehu.eus/ccwintco/index.php/Hyperspectral_Remote_Sensing_Scenes#Pavia_Centre_and_University. You will need to change the paths to the input data and use the read_pavia_centre() function to load the MATLAB matrices into numpy.
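Concretely, switching to Pavia City Centre means changing a handful of lines in the earlier cells (the paths below are placeholders):
ds_name = 'pavia_centre'
imagery_path = join(root_path, 'Pavia.mat')
reference_path = join(root_path, 'Pavia_gt.mat')
loaded_raster = image_preprocessing.read_pavia_centre(
    imagery_path, reference_path, out_shape=(1088, 1088, 102))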
Return to exercises