O
- Object type@Reference(authors="A. McCallum, K. Nigam, L. H. Ungar", title="Efficient Clustering of High Dimensional Data Sets with Application to Reference Matching", booktitle="Proc. 6th ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining", url="https://doi.org/10.1145/347090.347123", bibkey="DBLP:conf/kdd/McCallumNU00") public class CanopyPreClustering<O> extends AbstractDistanceBasedAlgorithm<O,Clustering<PrototypeModel<O>>> implements ClusteringAlgorithm<Clustering<PrototypeModel<O>>>
Reference:
A. McCallum, K. Nigam, L. H. Ungar
Efficient Clustering of High Dimensional Data Sets with Application to
Reference Matching
Proc. 6th ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining
Modifier and Type | Class and Description |
---|---|
static class |
CanopyPreClustering.Parameterizer<O>
Parameterization class
|
Modifier and Type | Field and Description |
---|---|
private static Logging |
LOG
Class logger.
|
private double |
t1
Threshold for inclusion
|
private double |
t2
Threshold for removal
|
ALGORITHM_ID
DISTANCE_FUNCTION_ID
Constructor and Description |
---|
CanopyPreClustering(DistanceFunction<? super O> distanceFunction,
double t1,
double t2)
Constructor.
|
Modifier and Type | Method and Description |
---|---|
TypeInformation[] |
getInputTypeRestriction()
Get the input type restriction used for negotiating the data query.
|
protected Logging |
getLogger()
Get the (STATIC) logger for this class.
|
Clustering<PrototypeModel<O>> |
run(Database database,
Relation<O> relation)
Run the algorithm
|
getDistanceFunction
run
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
run
private static final Logging LOG
private double t1
private double t2
public CanopyPreClustering(DistanceFunction<? super O> distanceFunction, double t1, double t2)
distanceFunction
- Distance functiont1
- Inclusion thresholdt2
- Exclusion thresholdpublic Clustering<PrototypeModel<O>> run(Database database, Relation<O> relation)
database
- Databaserelation
- Relation to processpublic TypeInformation[] getInputTypeRestriction()
AbstractAlgorithm
getInputTypeRestriction
in interface Algorithm
getInputTypeRestriction
in class AbstractAlgorithm<Clustering<PrototypeModel<O>>>
protected Logging getLogger()
AbstractAlgorithm
getLogger
in class AbstractAlgorithm<Clustering<PrototypeModel<O>>>
Copyright © 2019 ELKI Development Team. License information.