V
- the type of FeatureVector handled by this Algorithm@Title(value="Dependency Derivator: Deriving numerical inter-dependencies on data") @Description(value="Derives an equality-system describing dependencies between attributes in a correlation-cluster") @Reference(authors="Elke Achtert, Christian B\u00f6hm, Hans-Peter Kriegel, Peer Kr\u00f6ger, Arthur Zimek", title="Deriving Quantitative Dependencies for Correlation Clusters", booktitle="Proc. 12th Int. Conf. on Knowledge Discovery and Data Mining (KDD \'06)", url="https://doi.org/10.1145/1150402.1150408", bibkey="DBLP:conf/kdd/AchtertBKKZ06") @Priority(value=-5) public class DependencyDerivator<V extends NumberVector> extends AbstractNumberVectorDistanceBasedAlgorithm<V,CorrelationAnalysisSolution<V>>
Reference:
Elke Achtert, Christian Böhm, Hans-Peter Kriegel, Peer Kröger,
Arthur Zimek
Deriving Quantitative Dependencies for Correlation Clusters
Proc. 12th Int. Conf. on Knowledge Discovery and Data Mining (KDD '06)
Modifier and Type | Class and Description |
---|---|
static class |
DependencyDerivator.Parameterizer<V extends NumberVector>
Parameterization class.
|
Modifier and Type | Field and Description |
---|---|
private EigenPairFilter |
filter
Filter to select eigenvectors.
|
private static Logging |
LOG
The logger for this class.
|
private java.text.NumberFormat |
nf
Number format for output of solution.
|
private PCARunner |
pca
Holds the object performing the pca.
|
private boolean |
randomsample
Flag for random sampling vs. kNN
|
private int |
sampleSize
Holds the value of
DependencyDerivator.Parameterizer.SAMPLE_SIZE_ID . |
distanceFunction
ALGORITHM_ID
DISTANCE_FUNCTION_ID
Constructor and Description |
---|
DependencyDerivator(NumberVectorDistanceFunction<? super V> distanceFunction,
java.text.NumberFormat nf,
PCARunner pca,
EigenPairFilter filter,
int sampleSize,
boolean randomsample)
Constructor.
|
Modifier and Type | Method and Description |
---|---|
CorrelationAnalysisSolution<V> |
generateModel(Relation<V> db,
DBIDs ids)
Runs the pca on the given set of IDs.
|
CorrelationAnalysisSolution<V> |
generateModel(Relation<V> relation,
DBIDs ids,
double[] centroid)
Runs the pca on the given set of IDs and for the given centroid.
|
TypeInformation[] |
getInputTypeRestriction()
Get the input type restriction used for negotiating the data query.
|
protected Logging |
getLogger()
Get the (STATIC) logger for this class.
|
CorrelationAnalysisSolution<V> |
run(Database database,
Relation<V> relation)
Computes quantitatively linear dependencies among the attributes of the
given database based on a linear correlation PCA.
|
getDistanceFunction
run
private static final Logging LOG
private final int sampleSize
DependencyDerivator.Parameterizer.SAMPLE_SIZE_ID
.private final PCARunner pca
private final EigenPairFilter filter
private final java.text.NumberFormat nf
private final boolean randomsample
public DependencyDerivator(NumberVectorDistanceFunction<? super V> distanceFunction, java.text.NumberFormat nf, PCARunner pca, EigenPairFilter filter, int sampleSize, boolean randomsample)
distanceFunction
- distance functionnf
- Number formatpca
- PCA runnerfilter
- Eigenvector filtersampleSize
- sample sizerandomsample
- flag for random samplingpublic CorrelationAnalysisSolution<V> run(Database database, Relation<V> relation)
database
- the database to run this DependencyDerivator onrelation
- the relation to usepublic CorrelationAnalysisSolution<V> generateModel(Relation<V> db, DBIDs ids)
db
- the databaseids
- the set of idspublic CorrelationAnalysisSolution<V> generateModel(Relation<V> relation, DBIDs ids, double[] centroid)
relation
- the databaseids
- the set of idscentroid
- the centroidpublic TypeInformation[] getInputTypeRestriction()
AbstractAlgorithm
getInputTypeRestriction
in interface Algorithm
getInputTypeRestriction
in class AbstractAlgorithm<CorrelationAnalysisSolution<V extends NumberVector>>
protected Logging getLogger()
AbstractAlgorithm
getLogger
in class AbstractAlgorithm<CorrelationAnalysisSolution<V extends NumberVector>>
Copyright © 2019 ELKI Development Team. License information.