Unsupervised sentinel-2 imagery Classification

May 3, 2025 · 2 min read

Unsupervised sentinel-2 imagery Classification

Unsupervised Sentinel 2 imagery land cover classification using GEE and geemap package

Two clustering methods have been tested:

Method 1 : Xmeans

X-Means extends the K-Means clustering algorithm by efficiently estimating the optimal number of clusters within a specified range. The algorithm iteratively evaluates potential cluster splits using a Bayesian Information Criterion (BIC) to determine the most likely number of clusters. Users can customize parameters like the minimum and maximum number of clusters, iterations, distance function, and randomization seed for fine-grained control over the clustering process. Implemented within Earth Engine, X-Means offers a scalable solution for geospatial data analysis and pattern recognition tasks.

Method 2: LVQ

The algorithm learns by adjusting cluster prototypes based on the input data during training epochs. LVQ learns a set of codebook vectors (prototypes) that represent clusters in the data. It assigns each data point to the cluster whose prototype is most similar. The prototypes are adjusted iteratively based on training data (even in unsupervised mode, it needs training samples to define the initial prototypes).

Discussions:

  • LVQ is potentially better at defining complex cluster shapes. Unlike X-means,LVQ isn’t strictly tied to spherical clusters. It can adapt to more irregular cluster boundaries depending on the distribution of the training data.

  • K-means-based algorithms are generally efficient for large datasets like satellite imagery and offers the ability to automatically estimates the number of clusters. A significant advantage over LVQ and standard K-means is its ability to determine a suitable number of clusters from the data, which can be very useful when the exact number of land cover types is unknown beforehand.

XMeans model

LVQ model