Group Buildings into Settlements

Given building locations, cluster the buildings into settlements. The algorithm makes the following assumptions:

  • Household count is proportional to building count.
  • The area of a settlement is constrained by the maximum distance that a person is willing to travel from their household to the marketplace.
  • The marketplace is located near the centroid of each settlement.
  • The number of people in each household is specified in an optional Population column. If the population of each household is not specified, use the Average Population per Household to estimate the population of each settlement.

Tool Arguments

  • Building Locations ? Specify building locations and optionally specify the number of people in each building by specifying a column called Population
  • Maximum Household Distance from Marketplace in Kilometers ? Maximum distance that a person is willing to travel from their household to the marketplace (e.g. 8 km by foot and 16 km by horse)
  • Minimum Population per Settlement ? Minimum number of people per settlement
  • Average Population per Household ? Average number of people per household (used if population is undefined for a household)

Result Properties

  • Settlement Buildings in CSV format (longitude, latitude, population, settlement_id, marketplace_distance_in_km) as buildings.csv
  • Settlements in CSV format (name=settlement_id, longitude, latitude, population) as settlements.csv
  • Settlements Snapshot in PNG format
  • Settlement Count
  • Silhouette Score
  • Histogram of Maximum Household Distance from Marketplace (x-axis = Maximum Household Distance from Marketplace, y-axis = Settlement Count)

Algorithm Requirements

Functionality

  • [Done] Handle case when Population is undefined.
  • [Done] Handle case when Population is lowercase.
  • [Done] Satisfy Minimum Population per Settlement Constraint.
  • [Done] Satisfy Maximum Household Distance from Marketplace in Kilometers constraint.
  • [Done] Combine clusters, centroids, metrics into a single tool.
  • [Done] Remove axis labels and box frame from cluster plot.
  • [Done] Output the following columns for settlements.csv: name, longitude, latitude, population

Configuration

  • [Done] Use uganda-ruhiira-200306.shp.zip as the default for Building Locations.
  • [Done] Use 100 as Minimum Population per Settlement.
  • [Done] Use 5.3 as default for Average Population per Household.
  • [Done] Use 16 as default for Maximum Household Distance from Marketplace in Kilometers.