@Reference(authors="C.C. Aggarwal, P. S. Yu", title="Outlier detection for high dimensional data", booktitle="Proc. ACM SIGMOD Int. Conf. on Management of Data (SIGMOD 2001), Santa Barbara, CA, 2001", url="http://dx.doi.org/10.1145/375663.375668") public abstract class AbstractAggarwalYuOutlier<V extends NumberVector<?,?>> extends AbstractAlgorithm<OutlierResult> implements OutlierAlgorithm
Reference:
Outlier detection for high dimensional data
C.C. Aggarwal, P. S. Yu
International Conference on Management of Data Proceedings of the 2001 ACM
SIGMOD international conference on Management of data 2001, Santa Barbara,
California, United States
Modifier and Type | Class and Description |
---|---|
static class |
AbstractAggarwalYuOutlier.Parameterizer
Parameterization class.
|
Modifier and Type | Field and Description |
---|---|
static int |
DONT_CARE
Symbolic value for subspaces not in use.
|
protected int |
k
The target dimensionality.
|
static OptionID |
K_ID
OptionID for the target dimensionality
|
protected int |
phi
The number of partitions for each dimension
|
static OptionID |
PHI_ID
OptionID for the grid size
|
Constructor and Description |
---|
AbstractAggarwalYuOutlier(int k,
int phi)
Constructor.
|
Modifier and Type | Method and Description |
---|---|
protected ArrayList<ArrayList<DBIDs>> |
buildRanges(Relation<V> database)
Grid discretization of the data:
Each attribute of data is divided into phi equi-depth ranges. |
protected DBIDs |
computeSubspace(Vector<IntIntPair> subspace,
ArrayList<ArrayList<DBIDs>> ranges)
Method to get the ids in the given subspace
|
protected DBIDs |
computeSubspaceForGene(int[] gene,
ArrayList<ArrayList<DBIDs>> ranges)
Get the DBIDs in the current subspace.
|
TypeInformation[] |
getInputTypeRestriction()
Get the input type restriction used for negotiating the data query.
|
protected double |
sparsity(int setsize,
int dbsize,
int k)
Method to calculate the sparsity coefficient of
|
getLogger, makeParameterDistanceFunction, run
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
run
public static final OptionID PHI_ID
public static final OptionID K_ID
public static final int DONT_CARE
protected int phi
protected int k
public AbstractAggarwalYuOutlier(int k, int phi)
k
- K parameterphi
- Phi parameterprotected ArrayList<ArrayList<DBIDs>> buildRanges(Relation<V> database)
database
- protected double sparsity(int setsize, int dbsize, int k)
setsize
- Size of subsetdbsize
- Size of databasek
- Dimensionalityprotected DBIDs computeSubspace(Vector<IntIntPair> subspace, ArrayList<ArrayList<DBIDs>> ranges)
subspace
- protected DBIDs computeSubspaceForGene(int[] gene, ArrayList<ArrayList<DBIDs>> ranges)
gene
- gene dataranges
- Database rangespublic TypeInformation[] getInputTypeRestriction()
AbstractAlgorithm
getInputTypeRestriction
in interface Algorithm
getInputTypeRestriction
in class AbstractAlgorithm<OutlierResult>