ELKI

ELKI: Environment for DeveLoping KDD-Applications Supported by Index-Structures.

See: Description

Algorithms 
Package Description
de.lmu.ifi.dbs.elki.algorithm
Algorithms suitable as a task for the KDDTask main routine.
de.lmu.ifi.dbs.elki.algorithm.benchmark
Benchmarking pseudo algorithms.
de.lmu.ifi.dbs.elki.algorithm.clustering
Clustering algorithms.
de.lmu.ifi.dbs.elki.algorithm.clustering.affinitypropagation
Affinity Propagation (AP) clustering.
de.lmu.ifi.dbs.elki.algorithm.clustering.biclustering
Biclustering algorithms.
de.lmu.ifi.dbs.elki.algorithm.clustering.correlation
Correlation clustering algorithms
de.lmu.ifi.dbs.elki.algorithm.clustering.correlation.cash
Helper classes for the CASH algorithm.
de.lmu.ifi.dbs.elki.algorithm.clustering.gdbscan
Generalized DBSCAN.
de.lmu.ifi.dbs.elki.algorithm.clustering.hierarchical  
de.lmu.ifi.dbs.elki.algorithm.clustering.kmeans
K-means clustering and variations.
de.lmu.ifi.dbs.elki.algorithm.clustering.kmeans.quality
Quality measures for k-Means results.
de.lmu.ifi.dbs.elki.algorithm.clustering.onedimensional
Clustering algorithms for one-dimensional data.
de.lmu.ifi.dbs.elki.algorithm.clustering.subspace
Axis-parallel subspace clustering algorithms The clustering algorithms in this package are instances of both, projected clustering algorithms or subspace clustering algorithms according to the classical but somewhat obsolete classification schema of clustering algorithms for axis-parallel subspaces.
de.lmu.ifi.dbs.elki.algorithm.clustering.subspace.clique
Helper classes for the CLIQUE algorithm.
de.lmu.ifi.dbs.elki.algorithm.clustering.trivial
Trivial clustering algorithms: all in one, no clusters, label clusterings These methods are mostly useful for providing a reference result in evaluation.
de.lmu.ifi.dbs.elki.algorithm.outlier
Outlier detection algorithms
de.lmu.ifi.dbs.elki.algorithm.outlier.lof
LOF family of outlier detection algorithms.
de.lmu.ifi.dbs.elki.algorithm.outlier.meta
Meta outlier detection algorithms: external scores, score rescaling.
de.lmu.ifi.dbs.elki.algorithm.outlier.spatial
Spatial outlier detection algorithms
de.lmu.ifi.dbs.elki.algorithm.outlier.spatial.neighborhood
Spatial outlier neighborhood classes
de.lmu.ifi.dbs.elki.algorithm.outlier.spatial.neighborhood.weighted
Weighted Neighborhood definitions.
de.lmu.ifi.dbs.elki.algorithm.outlier.subspace
Subspace outlier detection methods.
de.lmu.ifi.dbs.elki.algorithm.outlier.trivial
Trivial outlier detection algorithms: no outliers, all outliers, label outliers.
de.lmu.ifi.dbs.elki.algorithm.statistics
Statistical analysis algorithms The algorithms in this package perform statistical analysis of the data (e.g. compute distributions, distance distributions etc.)
Databases and Index Structures 
Package Description
de.lmu.ifi.dbs.elki.database
ELKI database layer - loading, storing, indexing and accessing data
de.lmu.ifi.dbs.elki.database.datastore
General data store layer API (along the lines of Map<DBID, T> - use everywhere!)
de.lmu.ifi.dbs.elki.database.datastore.memory
Memory data store implementation for ELKI.
de.lmu.ifi.dbs.elki.database.ids
Database object identification and ID group handling API.
de.lmu.ifi.dbs.elki.database.ids.distance
Distance-DBID pairs, lists and heaps.
de.lmu.ifi.dbs.elki.database.ids.generic
Database object identification and ID group handling - generic implementations.
de.lmu.ifi.dbs.elki.database.ids.integer
Integer-based DBID implementation -- do not use directly - always use DBIDUtil.
de.lmu.ifi.dbs.elki.database.query
Database queries - computing distances, neighbors, similarities - API and general documentation.
de.lmu.ifi.dbs.elki.database.query.distance
Prepared queries for distances.
de.lmu.ifi.dbs.elki.database.query.knn
Prepared queries for k nearest neighbor (kNN) queries.
de.lmu.ifi.dbs.elki.database.query.range
Prepared queries for ε-range queries.
de.lmu.ifi.dbs.elki.database.query.rknn
Prepared queries for reverse k nearest neighbor (rkNN) queries.
de.lmu.ifi.dbs.elki.database.query.similarity
Prepared queries for similarity functions.
de.lmu.ifi.dbs.elki.database.relation
Relations, materialized and virtual (views).
de.lmu.ifi.dbs.elki.datasource
Data normalization (and reconstitution) of data sets.
de.lmu.ifi.dbs.elki.datasource.bundle
Object bundles - exchange container for multi-represented objects.
de.lmu.ifi.dbs.elki.datasource.filter
Data filtering, in particular for normalization and projection.
de.lmu.ifi.dbs.elki.datasource.filter.normalization
Data normalization.
de.lmu.ifi.dbs.elki.datasource.filter.transform
Data space transformations.
de.lmu.ifi.dbs.elki.datasource.parser
Parsers for different file formats and data types.
de.lmu.ifi.dbs.elki.index
Index structure implementations
de.lmu.ifi.dbs.elki.index.lsh
Locality Sensitive Hashing
de.lmu.ifi.dbs.elki.index.lsh.hashfamilies
Hash function families for LSH.
de.lmu.ifi.dbs.elki.index.lsh.hashfunctions
Hash functions for LSH
de.lmu.ifi.dbs.elki.index.preprocessed
Index structure based on preprocessors
de.lmu.ifi.dbs.elki.index.preprocessed.knn
Indexes providing KNN and rKNN data.
de.lmu.ifi.dbs.elki.index.preprocessed.localpca
Index using a preprocessed local PCA.
de.lmu.ifi.dbs.elki.index.preprocessed.preference
Indexes storing preference vectors.
de.lmu.ifi.dbs.elki.index.preprocessed.snn
Indexes providing nearest neighbor sets
de.lmu.ifi.dbs.elki.index.preprocessed.subspaceproj
Index using a preprocessed local subspaces.
de.lmu.ifi.dbs.elki.index.projected
Projected indexes for data.
de.lmu.ifi.dbs.elki.index.tree
Tree-based index structures
de.lmu.ifi.dbs.elki.index.tree.metrical
Tree-based index structures for metrical vector spaces.
de.lmu.ifi.dbs.elki.index.tree.metrical.mtreevariants
M-Tree and variants.
de.lmu.ifi.dbs.elki.index.tree.metrical.mtreevariants.mktrees
Metrical index structures based on the concepts of the M-Tree supporting processing of reverse k nearest neighbor queries by using the k-nn distances of the entries.
de.lmu.ifi.dbs.elki.index.tree.metrical.mtreevariants.mktrees.mkapp
de.lmu.ifi.dbs.elki.index.tree.metrical.mtreevariants.mktrees.mkcop
de.lmu.ifi.dbs.elki.index.tree.metrical.mtreevariants.mktrees.mkmax
de.lmu.ifi.dbs.elki.index.tree.metrical.mtreevariants.mktrees.mktab
de.lmu.ifi.dbs.elki.index.tree.metrical.mtreevariants.mtree
de.lmu.ifi.dbs.elki.index.tree.metrical.mtreevariants.query
Classes for performing queries (knn, range, ...) on metrical trees.
de.lmu.ifi.dbs.elki.index.tree.metrical.mtreevariants.strategies
Strategies for M-Trees (and variants).
de.lmu.ifi.dbs.elki.index.tree.metrical.mtreevariants.strategies.insert
Insertion (choose path) strategies of nodes in an M-Tree (and variants).
de.lmu.ifi.dbs.elki.index.tree.metrical.mtreevariants.strategies.split
Splitting strategies of nodes in an M-Tree (and variants).
de.lmu.ifi.dbs.elki.index.tree.query
Classes related to generic tree queries.
de.lmu.ifi.dbs.elki.index.tree.spatial
Tree-based index structures for spatial indexing.
de.lmu.ifi.dbs.elki.index.tree.spatial.kd
K-d-tree and variants.
de.lmu.ifi.dbs.elki.index.tree.spatial.rstarvariants
R*-Tree and variants.
de.lmu.ifi.dbs.elki.index.tree.spatial.rstarvariants.deliclu
de.lmu.ifi.dbs.elki.index.tree.spatial.rstarvariants.query
Queries on the R-Tree family of indexes: kNN and range queries.
de.lmu.ifi.dbs.elki.index.tree.spatial.rstarvariants.rstar
de.lmu.ifi.dbs.elki.index.tree.spatial.rstarvariants.strategies
Various strategies for R-Trees and variants.
de.lmu.ifi.dbs.elki.index.tree.spatial.rstarvariants.strategies.bulk
Packages for bulk-loading R*-Trees.
de.lmu.ifi.dbs.elki.index.tree.spatial.rstarvariants.strategies.insert
Insertion strategies for R-Trees
de.lmu.ifi.dbs.elki.index.tree.spatial.rstarvariants.strategies.overflow
Overflow treatment strategies for R-Trees
de.lmu.ifi.dbs.elki.index.tree.spatial.rstarvariants.strategies.reinsert
Reinsertion strategies for R-Trees
de.lmu.ifi.dbs.elki.index.tree.spatial.rstarvariants.strategies.split
Splitting strategies for R-Trees
de.lmu.ifi.dbs.elki.index.tree.spatial.rstarvariants.util
Utilities for R*-Tree and variants.
de.lmu.ifi.dbs.elki.index.vafile
Vector Approximation File
de.lmu.ifi.dbs.elki.persistent
Persistent data management.
Datatypes and Distance Functions 
Package Description
de.lmu.ifi.dbs.elki.data
Basic classes for different data types, database object types and label types.
de.lmu.ifi.dbs.elki.data.images
Package for processing image data (e.g. compute color histograms)
de.lmu.ifi.dbs.elki.data.model
Cluster models classes for various algorithms.
de.lmu.ifi.dbs.elki.data.projection
Data projections.
de.lmu.ifi.dbs.elki.data.spatial
Spatial data types - interfaces and utilities.
de.lmu.ifi.dbs.elki.data.synthetic
Generators for synthetic data sets
de.lmu.ifi.dbs.elki.data.synthetic.bymodel
Generator using a distribution model specified in an XML configuration file.
de.lmu.ifi.dbs.elki.data.type
Data type information, also used for type restrictions.
de.lmu.ifi.dbs.elki.distance
de.lmu.ifi.dbs.elki.distance.distancefunction
Distance functions for use within ELKI.
de.lmu.ifi.dbs.elki.distance.distancefunction.adapter
Distance functions deriving distances from e.g. similarity measures
de.lmu.ifi.dbs.elki.distance.distancefunction.colorhistogram
Distance functions using correlations.
de.lmu.ifi.dbs.elki.distance.distancefunction.correlation
Distance functions using correlations.
de.lmu.ifi.dbs.elki.distance.distancefunction.external
Distance functions using external data sources.
de.lmu.ifi.dbs.elki.distance.distancefunction.geo
Geographic (earth) distance functions.
de.lmu.ifi.dbs.elki.distance.distancefunction.histogram
Distance functions for one-dimensional histograms.
de.lmu.ifi.dbs.elki.distance.distancefunction.minkowski
Minkowski space L_p norms such as the popular Euclidean and Manhattan distances.
de.lmu.ifi.dbs.elki.distance.distancefunction.probabilistic
Distance from probability theory, mostly divergences such as K-L-divergence, J-divergence.
de.lmu.ifi.dbs.elki.distance.distancefunction.strings
Distance functions for strings.
de.lmu.ifi.dbs.elki.distance.distancefunction.subspace
Distance functions based on subspaces.
de.lmu.ifi.dbs.elki.distance.distancefunction.timeseries
Distance functions designed for time series.
de.lmu.ifi.dbs.elki.distance.distanceresultlist
Classes for building and storing the results of distance-based queries
de.lmu.ifi.dbs.elki.distance.distancevalue
Distance values, i.e. object storing an actual distance value along with comparison functions and value parsers.
de.lmu.ifi.dbs.elki.distance.similarityfunction
Similarity functions.
de.lmu.ifi.dbs.elki.distance.similarityfunction.kernel
Kernel functions.
Evaluation 
Package Description
de.lmu.ifi.dbs.elki.evaluation
Functionality for the evaluation of algorithms.
de.lmu.ifi.dbs.elki.evaluation.clustering
Evaluation of clustering results.
de.lmu.ifi.dbs.elki.evaluation.clustering.pairsegments
Pair-segment analysis of multiple clusterings.
de.lmu.ifi.dbs.elki.evaluation.histogram
Functionality for the evaluation of algorithms using histograms.
de.lmu.ifi.dbs.elki.evaluation.index
Simple index evaluation methods
de.lmu.ifi.dbs.elki.evaluation.outlier
Evaluate an outlier score using a misclassification based cost model.
de.lmu.ifi.dbs.elki.evaluation.roc
Evaluation of rankings using ROC AUC (Receiver Operation Characteristics - Area Under Curve)
de.lmu.ifi.dbs.elki.evaluation.similaritymatrix
Render a distance matrix to visualize a clustering-distance-combination.
GUI and Visualization 
Package Description
de.lmu.ifi.dbs.elki.gui
Graphical User Interfaces for ELKI
de.lmu.ifi.dbs.elki.gui.configurator
Configurator components
de.lmu.ifi.dbs.elki.gui.icons
Icons for ELKI GUI.
de.lmu.ifi.dbs.elki.gui.minigui
A very simple UI to build ELKI command lines.
de.lmu.ifi.dbs.elki.gui.multistep
Multi-step GUI for ELKI
de.lmu.ifi.dbs.elki.gui.multistep.panels
Panels for the multi-step GUI
de.lmu.ifi.dbs.elki.gui.util
Utility classes for GUIs (e.g. a class to display a logging panel)
de.lmu.ifi.dbs.elki.visualization
Visualization package of ELKI.
de.lmu.ifi.dbs.elki.visualization.batikutil
Commonly used functionality useful for Apache Batik.
de.lmu.ifi.dbs.elki.visualization.colors
Color scheme handling for ELKI.
de.lmu.ifi.dbs.elki.visualization.css
Managing CSS styles / classes.
de.lmu.ifi.dbs.elki.visualization.gui
Package to provide a visualization GUI.
de.lmu.ifi.dbs.elki.visualization.gui.detail
Classes for managing a detail view.
de.lmu.ifi.dbs.elki.visualization.gui.overview
Classes for managing the overview plot.
de.lmu.ifi.dbs.elki.visualization.opticsplot
Code for drawing OPTICS plots
de.lmu.ifi.dbs.elki.visualization.projections
Visualization projections
de.lmu.ifi.dbs.elki.visualization.projector
Projectors are responsible for finding appropriate projections for data relations.
de.lmu.ifi.dbs.elki.visualization.savedialog
Save dialog for SVG plots.
de.lmu.ifi.dbs.elki.visualization.style
Style management for ELKI visualizations.
de.lmu.ifi.dbs.elki.visualization.style.lines
Generate line styles for plotting in CSS
de.lmu.ifi.dbs.elki.visualization.style.marker
Draw plot markers
de.lmu.ifi.dbs.elki.visualization.svg
Base SVG functionality (generation, markers, thumbnails, export, ...).
de.lmu.ifi.dbs.elki.visualization.visualizers
Visualizers for various results
de.lmu.ifi.dbs.elki.visualization.visualizers.histogram
Visualizers based on 1D projected histograms.
de.lmu.ifi.dbs.elki.visualization.visualizers.optics
Visualizers that do work on OPTICS plots
de.lmu.ifi.dbs.elki.visualization.visualizers.pairsegments
Visualizers for inspecting cluster differences using pair counting segments.
de.lmu.ifi.dbs.elki.visualization.visualizers.parallel
Visualizers based on parallel coordinates.
de.lmu.ifi.dbs.elki.visualization.visualizers.parallel.cluster
Visualizers for clustering results based on parallel coordinates.
de.lmu.ifi.dbs.elki.visualization.visualizers.parallel.index
Visualizers for index structure based on parallel coordinates.
de.lmu.ifi.dbs.elki.visualization.visualizers.parallel.selection
Visualizers for object selection based on parallel projections.
de.lmu.ifi.dbs.elki.visualization.visualizers.scatterplot
Visualizers based on scatterplots.
de.lmu.ifi.dbs.elki.visualization.visualizers.scatterplot.cluster
Visualizers for clustering results based on 2D projections.
de.lmu.ifi.dbs.elki.visualization.visualizers.scatterplot.density
Visualizers for data set density in a scatterplot projection.
de.lmu.ifi.dbs.elki.visualization.visualizers.scatterplot.index
Visualizers for index structures based on 2D projections.
de.lmu.ifi.dbs.elki.visualization.visualizers.scatterplot.outlier
Visualizers for outlier scores based on 2D projections.
de.lmu.ifi.dbs.elki.visualization.visualizers.scatterplot.selection
Visualizers for object selection based on 2D projections.
de.lmu.ifi.dbs.elki.visualization.visualizers.thumbs
Thumbnail "Visualizers" (that take care of refreshing thumbnails)
de.lmu.ifi.dbs.elki.visualization.visualizers.visunproj
Visualizers that do not use a particular projection.
Utilities and Miscellaneous 
Package Description
de.lmu.ifi.dbs.elki
ELKI framework "Environment for Developing KDD-Applications Supported by Index-Structures" KDDTask is the main class of the ELKI-Framework for command-line interaction.
de.lmu.ifi.dbs.elki.application
Base classes for stand alone applications.
de.lmu.ifi.dbs.elki.application.cache
Utility applications for the persistence layer such as distance cache builders.
de.lmu.ifi.dbs.elki.application.geo
Application for exploring geo data.
de.lmu.ifi.dbs.elki.application.greedyensemble
Greedy ensembles for outlier detection.
de.lmu.ifi.dbs.elki.application.internal
Internal utilities for development.
de.lmu.ifi.dbs.elki.application.jsmap
JavaScript based map client - server architecture.
de.lmu.ifi.dbs.elki.logging
Logging facility for controlling logging behavior of the complete framework.
de.lmu.ifi.dbs.elki.logging.progress
Progress status objects (for UI)
de.lmu.ifi.dbs.elki.logging.statistics
Classes for logging various statistics.
de.lmu.ifi.dbs.elki.math
Mathematical operations and utilities used throughout the framework.
de.lmu.ifi.dbs.elki.math.dimensionsimilarity
Functions to compute the similarity of dimensions (or the interestingness of the combination).
de.lmu.ifi.dbs.elki.math.geodesy  
de.lmu.ifi.dbs.elki.math.geometry
Algorithms from computational geometry.
de.lmu.ifi.dbs.elki.math.linearalgebra
Linear Algebra package provides classes and computational methods for operations on matrices.
de.lmu.ifi.dbs.elki.math.linearalgebra.fitting
Function to numerically fit a function (such as a Gaussian distribution to given data.
de.lmu.ifi.dbs.elki.math.linearalgebra.pca
Principal Component Analysis (PCA) and Eigenvector processing.
de.lmu.ifi.dbs.elki.math.linearalgebra.pca.weightfunctions
Weight functions used in weighted PCA via WeightedCovarianceMatrixBuilder
de.lmu.ifi.dbs.elki.math.linearalgebra.randomprojections
Random projection families.
de.lmu.ifi.dbs.elki.math.scales
Scales handling for plotting.
de.lmu.ifi.dbs.elki.math.spacefillingcurves
Space filling curves.
de.lmu.ifi.dbs.elki.math.statistics
Statistical tests and methods.
de.lmu.ifi.dbs.elki.math.statistics.distribution
Standard distributions, with random generation functionalities.
de.lmu.ifi.dbs.elki.math.statistics.distribution.estimator
Estimators for statistical distributions.
de.lmu.ifi.dbs.elki.math.statistics.distribution.estimator.meta
Meta estimators: estimators that do not actually estimate themselves, but instead use other estimators, e.g. on a trimmed data set, or as an ensemble.
de.lmu.ifi.dbs.elki.math.statistics.kernelfunctions
Kernel functions from statistics.
de.lmu.ifi.dbs.elki.math.statistics.tests
Statistical tests.
de.lmu.ifi.dbs.elki.result
Result types, representation and handling
de.lmu.ifi.dbs.elki.result.optics
Result classes for OPTICS.
de.lmu.ifi.dbs.elki.result.outlier
Outlier result classes
de.lmu.ifi.dbs.elki.result.textwriter
Text serialization (CSV, Gnuplot, Console, ...)
de.lmu.ifi.dbs.elki.result.textwriter.naming
Naming schemes for clusters (for output when an algorithm doesn't generate cluster names).
de.lmu.ifi.dbs.elki.result.textwriter.writers
Serialization handlers for individual data types.
de.lmu.ifi.dbs.elki.utilities
Utility and helper classes - commonly used data structures, output formatting, exceptions, ...
de.lmu.ifi.dbs.elki.utilities.datastructures
Basic memory structures such as heaps and object hierarchies.
de.lmu.ifi.dbs.elki.utilities.datastructures.arraylike
Common API for accessing objects that are "array-like", including lists, numerical vectors, database vectors and arrays.
de.lmu.ifi.dbs.elki.utilities.datastructures.arrays
Utilities for arrays: advanced sorting for primitvie arrays.
de.lmu.ifi.dbs.elki.utilities.datastructures.heap
Heap structures and variations such as bounded priority heaps.
de.lmu.ifi.dbs.elki.utilities.datastructures.hierarchy
Delegate implementation of a hierarchy.
de.lmu.ifi.dbs.elki.utilities.datastructures.histogram
Classes for computing histograms.
de.lmu.ifi.dbs.elki.utilities.datastructures.iterator
ELKI Iterator API.
de.lmu.ifi.dbs.elki.utilities.documentation
Documentation utilities: Annotations for Title, Description, Reference
de.lmu.ifi.dbs.elki.utilities.ensemble
Utility classes for simple ensembles.
de.lmu.ifi.dbs.elki.utilities.exceptions
Exception classes and common exception messages.
de.lmu.ifi.dbs.elki.utilities.optionhandling
Parameter handling and option descriptions.
de.lmu.ifi.dbs.elki.utilities.optionhandling.constraints
Constraints allow to restrict possible values for parameters.
de.lmu.ifi.dbs.elki.utilities.optionhandling.parameterization
Configuration managers See the de.lmu.ifi.dbs.elki.utilities.optionhandling package for documentation!
de.lmu.ifi.dbs.elki.utilities.optionhandling.parameters
Classes for various typed parameters.
de.lmu.ifi.dbs.elki.utilities.pairs
Pairs and triples utility classes.
de.lmu.ifi.dbs.elki.utilities.referencepoints
Package containing strategies to obtain reference points Shared code for various algorithms that use reference points.
de.lmu.ifi.dbs.elki.utilities.scaling
Scaling functions: linear, logarithmic, gamma, clipping, ...
de.lmu.ifi.dbs.elki.utilities.scaling.outlier
Scaling of Outlier scores, that require a statistical analysis of the occurring values
de.lmu.ifi.dbs.elki.utilities.xml
XML and XHTML utilities.
de.lmu.ifi.dbs.elki.workflow
Work flow packages, e.g. following the usual KDD model, closely related to CRISP-DM
Other Packages 
Package Description
tutorial
Classes for ELKI tutorials.
tutorial.clustering
Classes from the tutorial on implementing a custom k-means variation.
tutorial.distancefunction
Classes from the tutorial on implementing distance functions.
tutorial.outlier  
Release 0.6.0 (2014-01-10_1610)