1999.VLDB.Finding intensional knowledge of distance-based outliers

paper

pdf

main idea

intensional knowledge:

a description or an explanation of why an identified outlier is exceptional.

two main issues:

what kinds of intensional knowledge to provide;

how to optimize the computation of such knowledge.

contribution

1. Define two notions of outliers and the corresponding intensional knowledge: strongest outliers and weak outliers.

2. Develop naive and semi-naive algorithms for computing strongest and weak outliers and the corresponding intensional knowledge.

3. Share I/O effectively across the computations, and evaluate the approach experimentally.

method

citations by other papers

citation 1

The authors identify outliers in subspaces of the feature space using a distance-based anomaly detection method. This serves as an explanation because the identified anomalies are outliers in the specific subspaces found, meaning that the features constituting the subspace are those that best discriminate the instance. The authors introduce the notions of strongest, weak and trivial outliers. An outlier is non-trivial in a subspace A if it is not an outlier in any subspace included in A. A strongest outlier is an outlier in a strongest outlying feature space (if no outlier exists in any subspace included in A, then A is a strongest feature space). A weak outlier is a non-trivial outlier that is not a strongest outlier. Algorithms are provided to identify (and thus explain) strongest and weak outliers. This anomaly explanation method is model-specific because it is designed for distance-based methods. It is also local because it explains one outlier at a time.[1]
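The definitions quoted above can be restated compactly. The notation here is an assumption made for this note, not the paper's own: $\mathrm{Out}(A)$ denotes the set of distance-based outliers found in attribute subspace $A$, and $B$ ranges over the non-empty proper subsets of $A$.

$$
\begin{aligned}
o \text{ is non-trivial in } A &\iff o \in \mathrm{Out}(A) \ \wedge\ \forall B \subset A:\ o \notin \mathrm{Out}(B) \\
A \text{ is a strongest outlying space} &\iff \mathrm{Out}(A) \neq \emptyset \ \wedge\ \forall B \subset A:\ \mathrm{Out}(B) = \emptyset \\
o \text{ is a strongest outlier} &\iff o \in \mathrm{Out}(A) \text{ for some strongest outlying space } A \\
o \text{ is a weak outlier in } A &\iff o \text{ is non-trivial in } A \ \wedge\ A \text{ is not a strongest outlying space}
\end{aligned}
$$

Read this way, every outlier in $A$ that is not non-trivial (it is already an outlier in some $B \subset A$) is a trivial outlier in $A$.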

citation 2

For example, Knorr and Ng [50] define the outlier categories C = {"trivial outlier," "weak outlier," "strongest outlier"} to help gain better insights about the nature of outliers. They define an anomalous data point o as the "strongest outlier" in a subspace A if it meets two criteria: (i) o is not an outlier in any subspace B ⊂ A, and (ii) no outlier exists in any subspace B ⊂ A. If o does not satisfy the criteria in (i), then it is a "weak outlier." If it does not fit the two criteria, then it is a "trivial outlier." The terms "trivial outlier," "weak outlier," and "strongest outlier" are used to separate noise from meaningful abnormal data [2].

Figure 3 shows an illustration of strongest, weak, and trivial outliers in the 3D space {A, B, C}. P1 and P5 are non-trivial outliers in the subspace AB because they are not outliers in subspace A or subspace B. They are also the strongest outliers in AB because there is no other anomalous point in the subspace A or B. P20 is a weak outlier in the subspace AC because there is another outlier point P11 in the subspace C. P11 is a trivial outlier in the subspace AC because it is also an outlier in the subspace C.[3]
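The categories in the illustration above can be reproduced on a toy data set. The following is a minimal sketch, not the paper's naive or semi-naive algorithm: it enumerates every subspace exhaustively, and it assumes the usual DB(p, D) criterion for distance-based outliers (a point is an outlier if at least a fraction p of all points lie farther than D from it). The function names `db_outliers` and `classify`, the helper `pt`, the data, and the values of `p` and `D` are all hypothetical and chosen only for illustration; the data is not the data behind Figure 3.

```python
from itertools import combinations
import math


def db_outliers(points, dims, p, D):
    """Return indices of points that are outliers using only the attributes in `dims`.

    Assumed criterion: at least a fraction p of all points lie farther than
    Euclidean distance D from the point, measured in the subspace `dims`.
    """
    n = len(points)
    found = set()
    for i, a in enumerate(points):
        va = [a[d] for d in dims]
        far = sum(
            1 for j, b in enumerate(points)
            if j != i and math.dist(va, [b[d] for d in dims]) > D
        )
        if far >= p * n:
            found.add(i)
    return found


def classify(points, attrs, p, D):
    """Map every non-empty subspace to {point index: category}."""
    # Outlier sets for all non-empty subspaces (exponential in len(attrs)).
    outliers = {
        dims: db_outliers(points, dims, p, D)
        for k in range(1, len(attrs) + 1)
        for dims in combinations(attrs, k)
    }
    labels = {}
    for dims, found in outliers.items():
        # Non-empty proper subspaces of `dims`.
        subs = [s for k in range(1, len(dims)) for s in combinations(dims, k)]
        sub_has_outlier = any(outliers[s] for s in subs)
        labels[dims] = {}
        for i in sorted(found):
            if any(i in outliers[s] for s in subs):
                labels[dims][i] = "trivial"    # already an outlier in a smaller subspace
            elif sub_has_outlier:
                labels[dims][i] = "weak"       # non-trivial, but a smaller subspace has some outlier
            else:
                labels[dims][i] = "strongest"  # no outlier at all in any smaller subspace
    return labels


def pt(a, b, c):
    return {"A": a, "B": b, "C": c}


# Made-up data: two clusters plus three hand-placed points.
points = [
    pt(0, 0, 0), pt(1, 0, 0), pt(0, 1, 0), pt(0, 0, 1), pt(1, 1, 1),
    pt(10, 10, 10), pt(11, 10, 10), pt(10, 11, 10), pt(10, 10, 11), pt(11, 11, 11),
    pt(0, 10, 0),   # index 10: ordinary in A and in B, isolated in AB
    pt(0, 0, 30),   # index 11: extreme in C alone
    pt(10, 10, 0),  # index 12: ordinary in A and in C, isolated in AC
]

labels = classify(points, ("A", "B", "C"), p=0.8, D=5.0)
for dims, cats in labels.items():
    print("".join(dims), cats)
```

On this toy data, index 10 is a strongest outlier in AB (neither A nor B contains any outlier), index 12 is a weak outlier in AC (it is non-trivial there, but index 11 is an outlier in C), and index 11 is a trivial outlier in AC. Note that an outlier in a single-attribute subspace has no smaller subspace to check, so this sketch labels it strongest.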

definitions



experiment

reference

\[1\] [2022.DKE.Anomaly explanation A review: 3.1.1. Non-weighted feature importance](https://blog.csdn.net/shaoyue1234/article/details/142704911?fromshare=blogdetail&sharetype=blogdetail&sharerId=142704911&sharerefer=PC&sharesource=shaoyue1234&sharefrom=from_link)

\[2\] 2022.VLDB.A survey on outlier explanations: 2.1.2 Categorical ranking of outliers

\[3\] 2022.VLDB.A survey on outlier explanations: 5.2 Techniques to find categorical rankings of outliers
