Display Name: Density based Clustering
Description: Finds clusters of point features within surrounding noise based on their spatial distribution. Time can also be incorporated to find space-time clusters.
Category:
Help URL: https://gisdev.wsscwater.com/server/rest/directories/arcgisoutput/ClusteringTool/DensityBasedClustering_GPServer/ClusteringTool_DensityBasedClustering/DensityBasedClustering.htm
Execution Type: esriExecutionTypeAsynchronous
Parameters:
Parameter: in_features
Data Type: GPString
Display Name Input Point Features
Description: The point features for which density-based clustering will be performed.
Direction: esriGPParameterDirectionInput
Filter:
Dependency:
Default Value: DensityBasedClustering\WQIMS.mmisWorkOrders
Parameter Type: esriGPParameterTypeRequired
Category:
Choice List: [ WQIMS_DensityBasedClustering602, WQIMS.mmisWorkOrders ]
Parameter: output_features
Data Type: GPFeatureRecordSetLayer
Display Name Output Features
Description: The output feature class that will receive the cluster results.
Direction: esriGPParameterDirectionOutput
Filter:
Dependency:
Default Value:
Geometry Type: esriGeometryPoint
HasZ: false
HasM: false
Spatial Reference: 102100
(3857)
Fields:
-
OBJECTID
(
type: esriFieldTypeOID, alias: OBJECTID
)
-
SOURCE_ID
(
type: esriFieldTypeInteger, alias: Source ID
)
-
CLUSTER_ID
(
type: esriFieldTypeInteger, alias: Cluster ID
)
-
COLOR_ID
(
type: esriFieldTypeInteger, alias: Color ID
)
-
RECEIVEDDATE
(
type: esriFieldTypeDate, alias: RECEIVEDDATE, length: 8
)
-
START_TIME
(
type: esriFieldTypeDate, alias: Start Time, length: 8
)
-
END_TIME
(
type: esriFieldTypeDate, alias: End Time, length: 8
)
-
MEAN_TIME
(
type: esriFieldTypeDate, alias: Mean Time, length: 8
)
-
TIME_EXAGG
(
type: esriFieldTypeDouble, alias: Time Exaggeration
)
Features: None.
Parameter Type: esriGPParameterTypeRequired
Category:
Parameter: cluster_method
Data Type: GPString
Display Name Clustering Method
Description: Specifies the method that will be used to define clusters.Defined distance (DBSCAN)— A specified distance will be used to separate dense clusters from sparser noise. DBSCAN is the fastest of the clustering methods but is only appropriate if there is a very clear distance to use that works well to define all clusters that may be present. This results in clusters that have similar densities.Self-adjusting (HDBSCAN)— Varying distances will be used to separate clusters of varying densities from sparser noise. HDBSCAN is the most data-driven of the clustering methods and requires the least user input.Multi-scale (OPTICS)—The distance between neighbors and a reachability plot will be used to separate clusters of varying densities from noise. OPTICS offers the most flexibility in fine-tuning the clusters that are detected, though it is computationally intensive, particularly with a large search distance.
Direction: esriGPParameterDirectionInput
Filter:
Dependency:
Default Value: DBSCAN
Parameter Type: esriGPParameterTypeRequired
Category:
Choice List: [ DBSCAN, HDBSCAN, OPTICS ]
Parameter: min_features_cluster
Data Type: GPLong
Display Name Minimum Features per Cluster
Description: The minimum number of points that will be considered a cluster. Any cluster with fewer points than the number provided will be considered noise.
Direction: esriGPParameterDirectionInput
Filter:
Type: Range
Minimum: 2.0
Maximum: 1.0E8
Dependency:
Default Value: 2
Parameter Type: esriGPParameterTypeRequired
Category:
Parameter: search_distance
Data Type: GPLinearUnit
Display Name Search Distance
Description: The maximum distance that will be considered.For the Clustering Method parameter's Defined distance (DBSCAN) option, the Minimum Features per Cluster parameter value must be found within this distance for cluster membership. Individual clusters will be separated by at least this distance. If a point is located farther than this distance from the next closest point in the cluster, it will not be included in the cluster.For the Clustering Method parameter's Multi-scale (OPTICS) option, this parameter is optional and is used as the maximum search distance when creating the reachability plot. For OPTICS, the reachability plot, combined with the Cluster Sensitivity parameter value, determines cluster membership. If no distance is specified, the tool will search all distances, which will increase processing time.If left blank, the default distance used will be the highest core distance found in the dataset, excluding those core distances in the top 1 percent (the most extreme core distances). If the Time Field parameter value is provided, a search distance must be provided and does not include a default value.
Direction: esriGPParameterDirectionInput
Filter:
Dependency:
Default Value: 1.0 (esriMiles)
Parameter Type: esriGPParameterTypeOptional
Category:
Parameter: cluster_sensitivity
Data Type: GPLong
Display Name Cluster Sensitivity
Description: An integer between 0 and 100 that determines the compactness of clusters. A number close to 100 will result in a higher number of dense clusters. A number close to 0 will result in fewer, less compact clusters. If left blank, the tool will find a sensitivity value using the Kullback-Leibler divergence that finds the value in which adding more clusters does not add additional information.
Direction: esriGPParameterDirectionInput
Filter:
Type: Range
Minimum: 0.0
Maximum: 100.0
Dependency:
Default Value:
Parameter Type: esriGPParameterTypeOptional
Category:
Parameter: time_field
Data Type: GPString
Display Name Time Field
Description: The field containing the time stamp for each record in the dataset. This field must be of type Date. If provided, the tool will find clusters of points that are close to each other in space and time. The Search Time Interval parameter value must be provided to determine whether a point is close enough in time to a cluster to be included in the cluster.
Direction: esriGPParameterDirectionInput
Filter:
Dependency: in_features
Default Value: RECEIVEDDATE
Parameter Type: esriGPParameterTypeOptional
Category:
Choice List: [ RECEIVEDDATE, COMPLETEDDATE ]
Parameter: search_time_interval
Data Type: GPString
Display Name Search Time Interval
Description: The time interval that will be used to determine whether points form a space-time cluster. The search time interval spans before and after the time of each point, so, for example, an interval of 3 days around a point will include all points starting 3 days before and ending 3 days after the time of the point.For the Clustering Method parameter's Defined distance (DBSCAN) option, the Minimum Features per Cluster parameter value must be found within the search distance and the search time interval to be included in a cluster.For the Clustering Method parameter's Multi-scale (OPTICS) option, all points outside of the search time interval will be excluded when calculating core distances, neighbor-distances, and reachability distances.The search time interval does not control the overall time span of the resulting space-time clusters. The time span of points within a cluster can be larger than the search time interval as long as each point has neighbors within the cluster that are within the search time interval.
Direction: esriGPParameterDirectionInput
Filter:
Dependency:
Default Value: 60 Minutes
Parameter Type: esriGPParameterTypeOptional
Category:
Parameter: esri_out_feature_service_name
Data Type: GPString
Display Name Output Feature Service Name
Description: The name of the optional feature service to create on the federated server containing the result of this tool. If no name is specified an output feature service will not be created.
Direction: esriGPParameterDirectionInput
Filter:
Dependency:
Default Value:
Parameter Type: esriGPParameterTypeOptional
Category:
Supported Operations:
Submit Job
Child Resources:
Metadata