All Classes and Interfaces (CLUS 2.12.8 API)

This class could be used in any kind of classification setting (e.g., hierarchical multilabel) and basically stores the statistics, which enables as to compute the number of TP, TN, FP and FN (T - true, F - false, P - positives, and N - negatives) for any given threshold.

BitList

BitMapSelection

BitVectorStat

BitwiseNominalAttrType

CalcStatisticProcessor

calibrateByLabelCardinality

Threshold calibration method by choosing the threshold that minimizes the difference in label cardinality between the training data and the predictions for the test data.

CDTuneSizeConstrPruning

ChebyshevDistance

ClassesAttrType

ClassesAttrTypeSingleLabel

ClassesProbabilities

Calculate the confidence of prediction for the multi-target classification as follows: For each target, get the max classification probability (i.e., majority) among possible class values.

ClassHierarchyPreproc

Classic

ClassificationStat

Classification statistics about the data.

ClassTerm

Cloner

Cloner: deep clone objects.

CloningException

thrown if cloning fails

ClusAttrType.AttributeType

ClusAttrType.AttributeUseType

ClusAttrType.Status

ClusAttrType.ValueType

ClusBeam

ClusBeamAttrSelector

ClusBeamHeuristic

ClusBeamHeuristicError

ClusBeamHeuristicMEstimate

ClusBeamHeuristicMorishita

ClusBeamHeuristicSS

ClusBeamInduce

ClusBeamModel

ClusBeamModelDistance

ClusBeamSearch

ClusBeamSimClassStat

ClusBeamSimilarityOutput

ClusBeamSimRegrStat

ClusBeamSizeConstraintInfo

ClusBeamSizeConstraints

ClusBeamSyntacticConstraint

ClusCalcRuleErrorProc

ClusEnsembleClassifier

ClusEnsembleFeatureRanking

ClusEnsembleFeatureRankings

ClusEnsembleInduce

ClusEnsembleInduce.ParallelTrap

Random tree depths for different iterations, used for tree to rules optimization procedures.

ClusEnsembleInduceOptClassification

ClusEnsembleInduceOptimization

ClusEnsembleInduceOptRegHMLC

ClusEnsemblePredictionWriter

Writing the predictions from the ensemble in a separate file, their standard deviations from the voting procedure and the respective votes from each base classifier.

ClusExhaustiveDFSearch

Ensemble of decision trees.

Created by Vanja Mileski on 12/15/2016.

ClusHMTRNode

Created by Vanja Mileski on 12/16/2016.

ClusInductionAlgorithm

Subclasses should implement: public ClusModel induceSingleUnpruned(ClusRun cr); In addition, subclasses may also want to implement (to return more than one model): public void induceAll(ClusRun cr);

ClusInductionAlgorithmType

For each type of algorithm there should be a ClusClassifier object.

ClusInvalidSettingsException

ClusLearner

ClusLogger

This is the logging class.

ClusMisc

ClusModel

ClusModelCollectionIO

ClusNormalizedAttributeWeights

ClusNumberFormat

Clus formatter for the numbers that differently formats numbers whose absolute value is at least 1, and the others.

ClusNumericError

ClusOOBErrorEstimate

ClusOOBWeights

Class that holds OOB weights.

ClusOptionNode

ClusOptionTree

ClusOutput

Class for outputting the training and testing results to .out file.

ClusReliefFeatureRanking

Class that holds OOB weights for ROS ensembles.

ClusRule

ClusRuleClassifier

ClusRuleConstraintInduce

ClusRuleConstraintInduceTest

ClusRuleFromTreeInduce

Create rules by decision tree ensemble algorithms (forests).

ClusRuleHelperMethods

ClusRuleHeuristicDispersion

ClusRuleHeuristicDispersionAdt

ClusRuleHeuristicDispersionMlt

ClusRuleHeuristicError

ClusRuleHeuristicHierarchical

ClusRuleHeuristicMEstimate

ClusRuleHeuristicRDispersionAdt

ClusRuleHeuristicRDispersionMlt

ClusRuleHeuristicSSD

ClusRuleInduce

ClusRuleLinearTerm

A linear term that has been included in the rule set.

ClusRuleProbabilisticRuleSetInduce

ClusRuleProbabilisticRuleSetInduceOLD

ClusRuleProbabilisticRuleSetInduceWeights

Helper class for SLS algorithm weights.

ClusRuleSet

Class representing a set of predictive clustering rules.

ClusRulesForAttrs

Create one rule for each value of each nominal attribute

ClusRulesFromTree

Rule set created from a tree.

ClusRulesRandom

ClusRun

ClusSchema

ClusSchemaInitializer

ClusSelection

ClusSelfTrainingFTFInduce

Self-training that operates without confidence score Implemented on the basis of: Culp and Michailidis, An iterative algorithm for extending learners to a semi-supervised setting, Journal of Computational and Graphical Statistics, 2008

ClusSelfTrainingInduce

ClusSemiSupervisedClassifier

ClusSemiSupervisedInduce

ClusSemiSupervisedPCTs

Statistics about the data set.

ClusStatManager

Statistics manager Includes information about target attributes and weights etc.

ClusStatManager.Mode

ClusStatManager.TargetType

ClusStopCriterion

ClusStopCriterionMinNbExamples

ClusStopCriterionMinWeight

ClusStructuredDistance

CorrelationMatrixComputer

Coverage

CriterionBasedSelection

CurrentBestTestAndHeuristic

DataPreprocs

DataTuple

DEAlgorithm

Differential evolution algorithm.

DebugFile

DEIndividual

Class representing a DE individual

DEPopulation

Class representing the population.

DEProblem

Class representing a Differential evolution optimization problem.

DepthFirstInduce

DepthFirstInduceSparse

DepthFirstInduceWithOptions

DerivedConstraintsComputer

Parent class of the DoubleBooleanCount that stores statistics that are used when building ROC- and PR-curves.

DoubleBooleanCount

Class that stores prediction statistics that are used when building ROC- and PR-curves.

Structure that contains two doubles

A class that returns an Enumeration that returns only a subset of a given Enumeration using a certain filter.

ErrorOutput

ErrorVisitor

EuclideanDistance

EuclideanDistance works on all type of attributes.

Evaluator

Functions to evaluate predictions

Executer

ExtensionFilter

A Filter Class to filter Files by Extension.

FastClonerConcurrentHashMap

FastClonerCustomCollection<T extends Collection>

FastClonerCustomMap<T extends Map>

FastClonerHashMap

FastClonerHashSet

FastClonerLinkedHashMap

A Helper Class for working with Files on the Persistent Storage.

FindNeighboursCallable

Class for gradient descent optimization.

GDProblem

Class representing a gradient descent optimization problem.

GeneralException

GenerateData

Deprecated.

GeneticDistanceHeuristic

GeneticDistanceHeuristicMatrix

Deprecated.

Hamming loss is used in multi-label classification scenario.

Some handy functions

HierarchicalMultiLabelDistance

HierBasicDistance

HierClassTresholdPruner

HierClassWiseAccuracy

HierRemoveInsigClasses

HierRMSError

HierSingleLabelStat

HierSumPairwiseDistancesStat

HierThresholdCalibration

HierWeightSPath

HierWPenalty

HMCAverageNodeWiseModels

HMCAverageSingleClass

ICVPairwiseDistancesError

IDeepCloner

used by fast cloners to deep clone objects

IDumpCloned

IFastCloner

allows a custom cloner to be created for a specific class.

IFreezable

IInstantiationStrategy

marks the specific class as immutable and the cloner avoids cloning it

ImplicitLinearTerms

Class for including all the linear terms implicitly in the weight optimization procedure.

IndexAttrType

IndexedItem

IndexMergeSorter

Merge sort which returns the indexes of the target array, not the target array.

IndiceValuePair

Structure that contains one int and one double

INIFileEnum<T extends Enum<T>>

INIFileEnumList<T extends Enum<T>>

INIFileInt

INIFileNode

INIFileNominal

Corresponds to a nominal settings file field.

INIFileNominalOrDoubleOrVector

INIFileNominalOrIntOrVector

INIFileSection

INIFileSectionGroup

INIFileString

INIFileStringOrDouble

Deprecated.

Returns maximum of per-target reliability scores, i.e., an example is considered as reliable as its most reliable component

MComparator

MDouble

MDoubleArray

MDoubleArrayComparator

Returns minimum of per-target reliability scores, i.e., an example is considered as reliable as its least reliable component

MinkowskiDistance

MinMaxNormalization

Normalises per-target confidence scores to [0,1].

MIntArray

MisclassificationError

MissingTargetImputation

Use if you want to compute MLC-measures in HMLC case.

MLROCAndPRCurve.CurveType

MLweightedAUPRC

MMatrix

MNumber

ModelProcessorCollection

ModifiedGainHeuristic

Deprecated.

MultiDelimStringTokenizer

Implements simple insertion algorithm for maintaining k nearest neighbors.

Attribute of nominal value.

NominalBasicDistance

This class represents the distance between 2 values of a certain Nominal Attribute type.

This class stores some useful statistics for a Nominal Attribute of certain data.

Does nothing, no normalization is performed

Normalization

Class for normalization of per-target confidence scores

NoStopSearch

NoWeighting

Doesn't weights any attributes.

NumericAttrBase

NumericAttribute

NumericAttrType

Attribute of numeric (continuous) value.

NumericBasicDistance

This class represents the distance between 2 values of a certain Numerical Attribute type.

NumericStatistic

This class stores some useful statistics for a Numeric Attribute of certain data.

NumericTarget

NumericTest

ObjectLoadStream

A class implementing an interface to the loading of objects from a file

ObjectSaveStream

A class implementing an interface to the saving of objects to a file

ObjenesisInstantiationStrategy

OneBagResults

OneError

OneTarget

Fake learner which returns the maintarget.

OOBSelection

OptimizationAlgorithm

Abstract super class for optimization of weights of base learners.

OptimizationProblem

Class representing a optimization problem.

OptimizationProblem.OptimizationParameter

Parameters for optimization algorithm.

OptimizationProblem.RulePred

Predictions of rule type base functions.

OptimizationProblem.TrueValues

True values of the instances.

Deprecated.

Deprecated.

Provides reliability scores on the basis of 'actual error', which is not attainable in practice, i.e., if true unlabeled data are used.

OverSample

Pair<T1,T2>

Implements 2-Tuple.

Parallel

Parallel.Operation<T>

Quadruple<T1,T2,T3,T4>

Implements 4-Tuple.

RandomForestWeighting

RandomScore

Returns random numbers as reliability scores, two modes are possible: RANDOM_UNIFORM: random numbers are generated uniformly in [0,1] RANDOM_GAUSSIAN: random numbers are normally distribution in [0,1] with mean 0.5 and std.

RandomSelection

Ranking

On the basis of the given per-target confidence scores, provides ranking based confidence scores: per-target scores are ranked, independently for each target

RankingLoss

Recall

ReducedErrorHeuristic

RegressionStat

RegressionStatBase

RegressionStatBinaryNomiss

This class stores a cache of TargetSets, testdata and the predictions for that testdata TODO: better search in stored results.

RForestProximities

Class which determines reliability score of an unlabeled example e_u as follows: r(e_u) = sum_{e_l} w_l * oobError(e_l), where w_l is random forest proximity of e_u to labeled example e_l, and oonError return out-of-bag error of labeled example e_u.

RMSError

ROCAndPRCurve

RowData

Multiple rows (tuples) of data.

RowDataSortHelper

RRMSError

Relative root mean squared error.

RuleNormalization

Information about rule normalization.

SaveLoadNeighbours

SearchAlgorithm

SearchAlgorithmImpl

Abstract implementation of the SearchAlgo interface.

SearchDistance

SemiSupMinLabeledWeightStopCrit

All the settings.

SettingsData.NormalizeDataValues

SettingsEnsemble

SettingsEnsemble.EnsembleBootstrapping

Section: Ensemble methods *

SettingsEnsemble.EnsembleMethod

SettingsEnsemble.EnsembleRanking

SettingsEnsemble.EnsembleROSAlgorithmType

SettingsEnsemble.EnsembleROSVotingType

How ROS ensemble make predictions

SettingsEnsemble.EnsembleVotingType

SettingsEnsemble.RandomAttributeTypeSelection

SettingsExhaustiveSearch

SettingsExperimental

SettingsGeneral

SettingsGeneral.ResourceInfoLoad

Section: General - ResourceInfo loaded *

SettingsGeneric

SettingsHMLC

SettingsHMLC.HierarchyDistance

SettingsHMLC.HierarchyMeasures

SettingsHMLC.HierarchyType

Section: Hierarchical multi-label classification *

SettingsHMLC.HierarchyWeight

SettingsHMTR

SettingsHMTR.HierarchyAggregationsHMTR

SettingsHMTR.HierarchyDistanceHMTR

SettingsHMTR.HierarchyTypesHMTR

Section: Hierarchical multi-target regression *

SettingsILevelC

SettingsKNN

SettingsKNN.Distance

SettingsKNN.DistanceWeights

SettingsKNN.SearchMethod

SettingsKNNTree

SettingsMLC

SettingsMLC.MultiLabelMeasures

SettingsMLC.MultiLabelThresholdOptimization

SettingsOutput.ConvertRules

SettingsOutput.PythonModelType

SettingsOutput.ShowInfo

SettingsOutput.ShowModels

Section: Output - Show info in .out file *

SettingsOutput.WritePredictions

Section: Output - Write predictions to file *

SettingsPhylogeny

SettingsPhylogeny.PhylogenyCriterion

SettingsPhylogeny.PhylogenyDistanceMeasure

Section: Phylogeny *

SettingsPhylogeny.PhylogenySequence

SettingsRelief

SettingsRelief.MissingTargetHandling

SettingsRelief.MultilabelDistance

SettingsRelief.ReliefStatisticsType

SettingsRules

SettingsRules.CoveringMethod

SettingsRules.GDExternalMethodValues

For external GD binary, do we use GD or brute force method

SettingsRules.InitialRuleGeneratingMethod

How the initial rules are generated when using SampledRuleSet covering method

SettingsRules.OptimizationGDAddLinearTerms

SettingsRules.OptimizationGDMTCombineGradient

GD optimization.

SettingsRules.OptimizationLinearTermNormalizeValues

SettingsRules.OptimizationLossFunction

WEIGHT OPTIMIZATION Differential evolution algorithm

SettingsRules.OptimizationNormalization

SettingsRules.RuleAddingMethod

SettingsRules.RulePredictionMethod

SettingsSIT

SettingsSSL

SettingsSSL.SSLAggregation

Aggregation of per target reliability scores

SettingsSSL.SSLConfidenceMeasure

Confidence (i.e., reliability) score for Self-Training

SettingsSSL.SSLMethod

SettingsSSL.SSLNormalization

Normalization of per target reliability scores

SettingsSSL.SSLOOBErrorCalculation

Specifies which data will be used for calculation of OOB error, only originally labeled data or all examples (including the ones with predicted labeles with Self-training

SettingsSSL.SSLStoppingCriteria

Stopping criteria for self training

SettingsSSL.SSLUnlabeledCriteria

unlabeled criteria is the criteria by which the unlabeled data will be added to the training set (used by the Self Training algorithm)

SettingsTimeSeries

SettingsTimeSeries.TimeSeriesDistanceMeasure

Section: Time series *

SettingsTimeSeries.TimeSeriesPrototypeComplexity

SettingsTree

SettingsTree.EntropyType

SettingsTree.Heuristic

Section: Tree - Heuristic *

SettingsTree.InductionOrder

SettingsTree.MissingClusteringAttributeHandlingType

Determines how we handle the case where when searching evaluating candidate split all examples have only missing values for a clustering attriute, in one of the branches.

SettingsTree.MissingTargetAttributeHandlingType

SettingsTree.PruningMethod

Section: Tree - Pruning method *

SettingsTree.SetDistance

Section: Tree - SetDistance *

SettingsTree.SpatialMatrixType

SettingsTree.SpatialMeasure

SettingsTree.SplitPositions

SettingsTree.TimeSeriesDistanceMeasure

Section: Tree - TimeSeriesDistance *

SettingsTree.TreeOptimizeValues

Section: Tree *