NBA positions dataset clustered using k-means with a range of values of k
Format
nba_clusts
is a data.frame containing the NBA positions dataset
with additional columns holding k-means clusterings at different values of
k
and the first two principal components
Position - Player position
TurnoverPct - Turnover percentage
ReboundPct - Rebound percentage
AssistPct - Assist percentage
FieldGoalPct - Field goal percentage
K1 - K5 - Results of k-means clustering
PC1 - First principal component
PC2 - Second principal component
Source
NBA positions downloaded from https://github.com/lazappi/nba_positions.
The source dataset is available from Kaggle at https://www.kaggle.com/drgilermo/nba-players-stats/data?select=Seasons_Stats.csv and was originally scraped from Basketball Reference.
See https://github.com/lazappi/clustree/blob/master/data-raw/nba_clusts.R for details of how clustering was performed.