The variables were adjusted around the mean, with 1 representing the mean value. This ensured that each variable was given equal weight in the clustering process. The description I gave to each cluster was based on the information in this table. For instance cluster 1 is high in all three variables so areas where violence is common, thus my description, cluster 4 seemed to me to be pubs and club venues from the very high value of the second variable.
I am not saying my descriptions are perfect, I am explaining the process. Of course the members of the clusters are varying distances from the cluster center (this information is also given) and therefore will not necessarily exactly match the mean characteristics of the cluster. It should also be borne in mind that it is the dominant relevant features of the Ward that are being described. It is highly unlikely that all parts of the Ward will comply with that description. We then get into a discussion about the scale of analysis which I have discussed before on this blog.
So what is the answer to the question? Classification is the next stage in the process after clusters have been created; classification involves the description of clusters based on the characteristics of the cluster members.
Pattern Classification
ReplyDeleteThe process of separating data into categories, or classes, characterized by a distinct set of features
It pertains to known number of groups and the objective is to assign new data points to one of these groups
Clustering:
The process of partitioning objects into groups called clusters whose members are similar in some way , so that the data points in a group are similar to each another, while those in distinct groups aren’t similar to those in the other groups