#机器学习# 文章列表浅入浅出：PageRank算法使用 TextRank 算法为文本生成关键字和摘要基于物品的协同过滤如何使用MapReduce实现基于物品的协同过滤（1）如何使用MapReduce实现基于物品的协同过滤（2）浅入浅出：K近邻算法使用mahout下的朴素贝叶斯分类器对新闻分类使用Affinity Propagation进行聚类 K-medoids聚类矩阵分解在推荐系统中的应用：NMF和经典SVD实战使用特征递归消除筛选特征如何分配权重比较NMF、PCA和VQ 方差和协方差基于SVD的协同过滤逻辑斯谛回归代码实现隐语义模型和NMF（非负矩阵分解）使用PCA处理MNIST数据集使用GBDT选取特征基于贝叶斯的文本分类系统的数据库设计在hadoop1.2.1上安装mahout 0.9 Hadoop 2.4 实现Kmeans聚类算法在Iris数据集上对比PCA、LDA、NMF 基于贝叶斯的文本分类实战单层决策树 Logistic regression（逻辑斯蒂回归）基于用户的协同过滤词袋模型与文档-词矩阵如何实现拼音与汉字的互相转换梯度下降法如何判定相似度 MovieLens数据集介绍基于KNN的文本分类实战 Jasper文本分类系列博客阅读摘录使用 Mean Shift进行聚类朴素贝叶斯的三个常用模型：高斯、多项式、伯努利使用决策树处理iris数据集浅入浅出：从Kmeans到Kmeans++ 如何持久化scikit-learn中训练好的模型浅入浅出：DBSCAN聚类算法（1）浅入浅出：DBSCAN聚类算法（2） 2015阿里移动推荐算法比赛第一赛季总结爬山算法使用朴素贝叶斯分类器划分邮件层次聚类基于MapReduce的频繁项集挖掘搜狗实体关系提取比赛

K-medoids聚类

2014-09-07

k-medoids聚类也可以叫做K中心点聚类，属于划分算法，在维基百科上给出了很详细的解释和示例，见k-medoids。这个方法和K-means很像，但不是K-means的变种。像k-medians等聚类方法，可以看做K-means的变种。

相对于K-means而言，k-medoids的优点是聚类结果不易受离群点、异常值的影响，缺点是算法复杂度稍高。

medoids在谷歌翻译中，翻译为中心点。

维基百科给出的算法如下：

Initialize: randomly select (without replacement) k of the n data points as the medoids
Associate each data point to the closest medoid. ("closest" here is defined using any valid distance metric, most commonly Euclidean distance, Manhattan distance or Minkowski distance)
...

For each medoid m {
    For each non-medoid data point o {
        Swap m and o and compute the total cost of the configuration
    }
}

Select the configuration with the lowest cost.
Repeat steps 2 to 4 until there is no change in the medoid.

再添加个资料：Partitioning Around Medoids (PAM)。

( 本文完 )