admin 管理员组

文章数量: 887021

cluster

无监督的cluster_acc计算
首先实现是分为两种形式，一种sklearn低版本的，一种是sklearn高版本的

实现1

from sklearn.utils.linear_assignment_ import linear_assignment
import sklearn
print(sklearn.__version__)
import numpy as npdef acc(ypred, y):"""Calculating the clustering accuracy. The predicted result must have the same number of clusters as the ground truth.ypred: 1-D numpy vector, predicted labelsy: 1-D numpy vector, ground truthThe problem of finding the best permutation to calculate the clustering accuracy is a linear assignment problem.This function construct a N-by-N cost matrix, then pass it to scipy.optimize.linear_sum_assignment to solve the assignment problem."""assert len(y) > 0assert len(np.unique(ypred)) == len(np.unique(y))s = np.unique(ypred)t = np.unique(y)N = len(np.unique(ypred))C = np.zeros((N, N), dtype = np.int32)for i in range(N):for j in range(N):idx = np.logical_and(ypred == s[i], y == t[j])C[i][j] = np.count_nonzero(idx)# convert the C matrix to the 'true' costCmax = np.amax(C)C = Cmax - C# indices = linear_assignment(C)row = indices[:][:, 0]col = indices[:][:, 1]# calculating the accuracy according to the optimal assignmentcount = 0for i in range(N):idx = np.logical_and(ypred == s[row[i]], y == t[col[i]] )count += np.count_nonzero(idx)return 1.0*count/len(y)if __name__ == '__main__':"""Using accuracy to evaluate clustering is usually not a good idea, the following example shows that even a completely wrong assignment yield accuracy of 0.5.Consider use more standard metrics, such as NMI or ARI."""              s = np.array([1, 2, 2 ,3, 1, 3,2,2,1,2,1,1,1])t = np.array([1, 1, 2, 2, 3, 3,1,1,2,2,2,2,2])   ac = acc(s, t)print(ac)

结果如下

实现2

from typing import List, Optional, Union
import numpy as np
import torch
from scipy.optimize import linear_sum_assignmentdef unsupervised_clustering_accuracy(y: Union[np.ndarray, torch.Tensor], y_pred: Union[np.ndarray, torch.Tensor]
) -> tuple:"""Unsupervised Clustering Accuracy"""assert len(y_pred) == len(y)u = np.unique(np.concatenate((y, y_pred)))n_clusters = len(u)mapping = dict(zip(u, range(n_clusters)))reward_matrix = np.zeros((n_clusters, n_clusters), dtype=np.int64)for y_pred_, y_ in zip(y_pred, y):if y_ in mapping:reward_matrix[mapping[y_pred_], mapping[y_]] += 1cost_matrix = reward_matrix.max() - reward_matrixrow_assign, col_assign = linear_sum_assignment(cost_matrix)# Construct optimal assignments matrixrow_assign = row_assign.reshape((-1, 1))  # (n,) to (n, 1) reshapecol_assign = col_assign.reshape((-1, 1))  # (n,) to (n, 1) reshapeassignments = np.concatenate((row_assign, col_assign), axis=1)optimal_reward = reward_matrix[row_assign, col_assign].sum() * 1.0return optimal_reward / y_pred.size, assignments

结果如下

第三种实现(purity score即acc)

import sklearn.metrics as metrics
from sklearn.metrics import adjusted_rand_score as ari, normalized_mutual_info_score as nmidef purity_score(y_true, y_pred):"""A function to compute cluster purity"""# compute contingency matrix (also called confusion matrix)contingency_matrix = metrics.cluster.contingency_matrix(y_true, y_pred)return np.sum(np.amax(contingency_matrix, axis=0)) / np.sum(contingency_matrix)

from typing import List, Optional, Union
import numpy as np
import torch
from scipy.optimize import linear_sum_assignmentdef unsupervised_clustering_accuracy(y: Union[np.ndarray, torch.Tensor], y_pred: Union[np.ndarray, torch.Tensor]
) -> tuple:"""Unsupervised Clustering Accuracy"""assert len(y_pred) == len(y)u = np.unique(np.concatenate((y, y_pred)))n_clusters = len(u)mapping = dict(zip(u, range(n_clusters)))reward_matrix = np.zeros((n_clusters, n_clusters), dtype=np.int64)for y_pred_, y_ in zip(y_pred, y):if y_ in mapping:reward_matrix[mapping[y_pred_], mapping[y_]] += 1cost_matrix = reward_matrix.max() - reward_matrixrow_assign, col_assign = linear_sum_assignment(cost_matrix)# Construct optimal assignments matrixrow_assign = row_assign.reshape((-1, 1))  # (n,) to (n, 1) reshapecol_assign = col_assign.reshape((-1, 1))  # (n,) to (n, 1) reshapeassignments = np.concatenate((row_assign, col_assign), axis=1)optimal_reward = reward_matrix[row_assign, col_assign].sum() * 1.0return optimal_reward / y_pred.size, assignments

测试1

y_true = np.array([1, 2, 2 ,3, 1, 3,2,2,1,2,1,1,1])
y_pred = np.array([1, 1, 2, 2, 3, 3,1,1,2,2,2,2,2])   ps=purity_score(y_true,y_pred)
uac = unsupervised_clustering_accuracy(y_true, y_pred)
print("purity socre={}".format(ps))
print("unsurpervised acc={}".format(uac[0]))

结果如下

测试2

本文标签： cluster

版权声明：本文标题：cluster 内容由网友自发贡献，该文观点仅代表作者本人，转载请联系作者并注明出处：http://www.freenas.com.cn/jishu/1687332337h90531.html，本站仅提供信息存储空间服务，不拥有所有权，不承担相关法律责任。如发现本站有涉嫌抄袭侵权/违法违规的内容，一经查实，本站将立刻删除。

技术交流 – FreeNAS中文网

cluster

cluster

实现1

实现2

第三种实现(purity score即acc)

测试1

测试2

更多相关文章

cluster

转 禁用HAIP,cluster

windows10下载安装启动nacos:“nacos is starting with cluster“

发表评论

推荐文章

7月8号

MySQL基础篇(DDL，DML，DQL，DCL详细讲解)

Windows7下配置JMeter安装环境

Windows7 搭建 PHP7 运行环境

macOS镜像下载(ISO、DMG)

热门文章

vue学习总结(三,案例用vue开发网易云音乐(已实现在线播放和下载))

老照片修复怎么操作？这个方法轻而易举

DB2 SQL 精萃

awk 精萃

源码搭建B2B2C体系电商多级分销商城，小程序、H5

狗日的系统之家下载的Windows 10 18031809系统不干净，捆绑自动安装腾讯关键等软件...

Windows7 默认网关自动清空

计算机安全更新无法卸载,出现windows 系统补丁无法卸载该怎么解决？简单几步即可解决...

重装win11系统后找不到WiFi

ASUS华硕魔霸7S枪神7笔记本Win11系统镜像包：重塑你的数字战场

最新文章

Raid技术

LSI_阵列卡操作手册

破解Centos7_root用户密码

Redhat重置Root用户密码方法

远程批量修改linux服务器密码的脚本

Windows7 系统安全设置权限技巧

（Windows系统）详细介绍Windows系统 含有英文版

最新Windows 11教育版下载：专为教育设计的系统！

Win7系统下搭建NFS服务器

零基础使用UltraISO制作并安装纯净Win10系统指南

转禁用HAIP,cluster

（Windows系统）详细介绍Windows系统含有英文版