O
- Object type@Reference(authors="A. McCallum, K. Nigam, L.H. Ungar", title="Efficient Clustering of High Dimensional Data Sets with Application to Reference Matching", booktitle="Proc. 6th ACM SIGKDD international conference on Knowledge discovery and data mining", url="http://dx.doi.org/10.1145%2F347090.347123") public class CanopyPreClustering<O> extends AbstractDistanceBasedAlgorithm<O,Clustering<PrototypeModel<O>>> implements ClusteringAlgorithm<Clustering<PrototypeModel<O>>>
A. McCallum, K. Nigam, L.H. Ungar
Efficient Clustering of High Dimensional Data Sets with Application to
Reference Matching
Proc. 6th ACM SIGKDD international conference on Knowledge discovery and data
mining
Modifier and Type | Class and Description |
---|---|
static class |
CanopyPreClustering.Parameterizer<O>
Parameterization class
|
Modifier and Type | Field and Description |
---|---|
private static Logging |
LOG
Class logger.
|
private double |
t1
Threshold for inclusion
|
private double |
t2
Threshold for removal
|
DISTANCE_FUNCTION_ID
Constructor and Description |
---|
CanopyPreClustering(DistanceFunction<? super O> distanceFunction,
double t1,
double t2)
Constructor.
|
Modifier and Type | Method and Description |
---|---|
TypeInformation[] |
getInputTypeRestriction()
Get the input type restriction used for negotiating the data query.
|
protected Logging |
getLogger()
Get the (STATIC) logger for this class.
|
Clustering<PrototypeModel<O>> |
run(Database database,
Relation<O> relation)
Run the algorithm
|
getDistanceFunction
makeParameterDistanceFunction, run
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
run
private static final Logging LOG
private double t1
private double t2
public CanopyPreClustering(DistanceFunction<? super O> distanceFunction, double t1, double t2)
distanceFunction
- Distance functiont1
- Inclusion thresholdt2
- Exclusion thresholdpublic Clustering<PrototypeModel<O>> run(Database database, Relation<O> relation)
database
- Databaserelation
- Relation to processpublic TypeInformation[] getInputTypeRestriction()
AbstractAlgorithm
getInputTypeRestriction
in interface Algorithm
getInputTypeRestriction
in class AbstractAlgorithm<Clustering<PrototypeModel<O>>>
protected Logging getLogger()
AbstractAlgorithm
getLogger
in class AbstractAlgorithm<Clustering<PrototypeModel<O>>>
Copyright © 2015 ELKI Development Team, Lehr- und Forschungseinheit für Datenbanksysteme, Ludwig-Maximilians-Universität München. License information.