ParallelGeneralizedDBSCAN (ELKI: Environment for DeveLoping KDD-Applications Supported by Index-Structures)

java.lang.Object
- de.lmu.ifi.dbs.elki.algorithm.AbstractAlgorithm<Clustering<Model>>
- - de.lmu.ifi.dbs.elki.algorithm.clustering.gdbscan.parallel.ParallelGeneralizedDBSCAN

All Implemented Interfaces:

Algorithm, ClusteringAlgorithm<Clustering<Model>>
```
@Reference(prefix="closely related",
           authors="M. Patwary, D. Palsetia, A. Agrawal, W. K. Liao, F. Manne, A. Choudhary",
           title="A new scalable parallel DBSCAN algorithm using the disjoint-set data structure",
           booktitle="IEEE Int. Conf. for High Performance Computing, Networking, Storage and Analysis (SC)",
           url="https://doi.org/10.1109/SC.2012.9",
           bibkey="DBLP:conf/sc/PatwaryPALMC12")
public class ParallelGeneralizedDBSCAN
extends AbstractAlgorithm<Clustering<Model>>
implements ClusteringAlgorithm<Clustering<Model>>
```
Parallel version of DBSCAN clustering.
This is the archetype of a non-linear shared-memory DBSCAN: instead of sequentially expanding one cluster at a time, it processes points in arbitrary order and merges clusters whenever neighboring core points are found.
Because labeling points requires synchronization, the speedup is only sublinear in the number of cores. However, particularly without an index and on large data, most of the work lies in finding the neighbors, not in labeling the points.
Reference:
Please cite the latest ELKI version.
Related is the following publication, whose "disjoint set data structure" appears to be a union-find approach similar to ours, and whose DSDBSCAN appears rather similar. The main benefit of our approach is that we do not use the union-find data structure for every object, but only for merging clusters.
M. Patwary, D. Palsetia, A. Agrawal, W. K. Liao, F. Manne, A. Choudhary
A new scalable parallel DBSCAN algorithm using the disjoint-set data structure
In IEEE Int. Conf. for High Performance Computing, Networking, Storage and Analysis (SC)
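
The idea described above can be illustrated with a minimal, self-contained sketch. It is not the ELKI implementation: the class `ParallelDbscanSketch`, the helper `ClusterUnionFind`, the one-dimensional data, the brute-force neighbor search, and the single global lock are all assumptions made for illustration. It shows points being processed in arbitrary order by a parallel stream, the neighbor search running outside the lock, and a union-find structure that holds cluster ids only and is touched only when clusters merge.

```java
import java.util.Arrays;
import java.util.stream.IntStream;

/** Illustrative sketch only; this is NOT the ELKI implementation. */
final class ParallelDbscanSketch {
  static final int UNPROCESSED = -1, NOISE = -2;

  /** Union-find over cluster ids only; data points never enter this structure. */
  static final class ClusterUnionFind {
    private final int[] parent;
    private int size = 0;

    ClusterUnionFind(int maxClusters) { parent = new int[maxClusters]; }

    /** Create a new cluster id that is its own root. */
    int create() { parent[size] = size; return size++; }

    int find(int c) {
      while (parent[c] != c) {
        parent[c] = parent[parent[c]]; // path halving
        c = parent[c];
      }
      return c;
    }

    /** Merge two clusters and return the surviving root (the smaller id). */
    int union(int a, int b) {
      a = find(a);
      b = find(b);
      if (a == b) return a;
      int root = Math.min(a, b);
      parent[Math.max(a, b)] = root;
      return root;
    }
  }

  /** Cluster 1-d points; returns one label per point (NOISE or a cluster id). */
  static int[] cluster(double[] x, double eps, int minPts) {
    final int n = x.length;
    final int[] label = new int[n];
    final boolean[] core = new boolean[n];
    Arrays.fill(label, UNPROCESSED);
    final ClusterUnionFind uf = new ClusterUnionFind(n); // at most one id per core point
    final Object lock = new Object();

    IntStream.range(0, n).parallel().forEach(i -> {
      // Expensive part, done without the lock: brute-force neighbor search.
      int[] nb = IntStream.range(0, n).filter(j -> Math.abs(x[i] - x[j]) <= eps).toArray();
      boolean isCore = nb.length >= minPts;
      // Cheap part, synchronized: labeling and merging.
      synchronized (lock) {
        if (!isCore) {
          if (label[i] == UNPROCESSED) label[i] = NOISE; // may still become a border point
          return;
        }
        core[i] = true;
        // Reuse the cluster that already claimed this point, else start a new one.
        int c = label[i] >= 0 ? uf.find(label[i]) : uf.create();
        label[i] = c;
        for (int j : nb) {
          if (core[j]) c = uf.union(c, label[j]); // neighboring core point: merge clusters
        }
        label[i] = c;
        for (int j : nb) {
          if (label[j] < 0) label[j] = c; // claim unlabeled or noise neighbors
        }
      }
    });

    // Resolve merged cluster ids to their final representatives.
    for (int i = 0; i < n; i++) {
      if (label[i] >= 0) label[i] = uf.find(label[i]);
    }
    return label;
  }

  public static void main(String[] args) {
    double[] x = {0.0, 0.1, 0.2, 0.3, 5.0, 5.1, 5.2, 9.0};
    System.out.println(Arrays.toString(cluster(x, 0.15, 3)));
  }
}
```

In this sketch the concrete cluster ids depend on the processing order, and a border point within reach of two clusters goes to whichever cluster claims it first, so only the grouping of core points is deterministic.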

Since:

0.7.5

Author:

Erich Schubert

Nested Class Summary

Nested Classes
Modifier and Type	Class and Description
`static class`	`ParallelGeneralizedDBSCAN.Instance<T>` Instance for a particular data set.
`static class`	`ParallelGeneralizedDBSCAN.Parameterizer` Parameterization class.

Field Summary

Fields
Modifier and Type	Field and Description
`protected boolean`	`coremodel` Track which objects are "core" objects.
`protected CorePredicate<?>`	`corepred` The core predicate factory.
`private static Logging`	`LOG` The class logger for this algorithm.
`protected NeighborPredicate<?>`	`npred` The neighborhood predicate factory.

Fields inherited from class de.lmu.ifi.dbs.elki.algorithm.AbstractAlgorithm
ALGORITHM_ID

Constructor Summary

Constructors
Constructor and Description
`ParallelGeneralizedDBSCAN(NeighborPredicate<?> npred, CorePredicate<?> corepred, boolean coremodel)` Constructor for parameterized algorithm.

Method Summary

Modifier and Type	Method and Description
`TypeInformation[]`	`getInputTypeRestriction()` Get the input type restriction used for negotiating the data query.
`protected Logging`	`getLogger()` Get the (STATIC) logger for this class.
`Clustering<Model>`	`run(Database database)` Runs the algorithm.

Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

- Field Detail
  - LOG
```
private static final Logging LOG
```
    The class logger for this algorithm.
  - npred
```
protected NeighborPredicate<?> npred
```
    The neighborhood predicate factory.
  - corepred
```
protected CorePredicate<?> corepred
```
    The core predicate factory.
  - coremodel
```
protected boolean coremodel
```
    Track which objects are "core" objects.
- Constructor Detail
  - ParallelGeneralizedDBSCAN
```
public ParallelGeneralizedDBSCAN(NeighborPredicate<?> npred,
                                 CorePredicate<?> corepred,
                                 boolean coremodel)
```
    Constructor for parameterized algorithm.
    
    Parameters:
    
    npred - Neighbor predicate.
    
    corepred - Core point predicate.
    
    coremodel - Keep track of core points.
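    
    The roles of the two predicates can be sketched without any ELKI types. The example below is an assumption-laden illustration: `GeneralizedPredicatesSketch`, `epsilonNeighbors`, `minPtsCore` and `isCore` are hypothetical names, and plain `java.util.function` interfaces stand in for `NeighborPredicate` and `CorePredicate`, whose actual signatures are not shown on this page. The neighbor predicate decides which objects count as neighbors of a query object; the core predicate decides from that neighborhood whether the query object is a core point.

```java
import java.util.List;
import java.util.function.BiPredicate;
import java.util.function.Predicate;
import java.util.stream.Collectors;

/** Hypothetical stand-ins for the two predicates; not the ELKI interfaces. */
final class GeneralizedPredicatesSketch {
  /** Neighbor predicate: classic epsilon range on 1-d values. */
  static BiPredicate<Double, Double> epsilonNeighbors(double eps) {
    return (a, b) -> Math.abs(a - b) <= eps;
  }

  /** Core predicate: classic "at least minPts neighbors" rule. */
  static Predicate<List<Double>> minPtsCore(int minPts) {
    return neighbors -> neighbors.size() >= minPts;
  }

  /** Collect the neighborhood of q, then ask the core predicate about it. */
  static boolean isCore(List<Double> data, double q,
      BiPredicate<Double, Double> npred, Predicate<List<Double>> corepred) {
    List<Double> neighbors = data.stream()
        .filter(o -> npred.test(q, o))
        .collect(Collectors.toList());
    return corepred.test(neighbors);
  }
}
```

    Swapping either predicate for a different rule changes what "neighbor" or "core" means without touching the clustering loop, which is the point of the generalized formulation.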
- Method Detail
  - run
```
public Clustering<Model> run(Database database)
```
    Description copied from interface: Algorithm
    
    Runs the algorithm.
    
    Specified by:
    
    run in interface Algorithm
    
    Specified by:
    
    run in interface ClusteringAlgorithm<Clustering<Model>>
    
    Overrides:
    
    run in class AbstractAlgorithm<Clustering<Model>>
    
    Parameters:
    
    database - the database to run the algorithm on
    
    Returns:
    
    the Result computed by this algorithm
  - getInputTypeRestriction
```
public TypeInformation[] getInputTypeRestriction()
```
    Description copied from class: AbstractAlgorithm
    
    Get the input type restriction used for negotiating the data query.
    
    Specified by:
    
    getInputTypeRestriction in interface Algorithm
    
    Specified by:
    
    getInputTypeRestriction in class AbstractAlgorithm<Clustering<Model>>
    
    Returns:
    
    Type restriction
  - getLogger
```
protected Logging getLogger()
```
    Description copied from class: AbstractAlgorithm
    
    Get the (STATIC) logger for this class.
    
    Specified by:
    
    getLogger in class AbstractAlgorithm<Clustering<Model>>
    
    Returns:
    
    the static logger
