Package org.apache.spark.ml.fpm
Interface PrefixSpanParams
- All Superinterfaces:
Identifiable
,Params
,Serializable
- All Known Implementing Classes:
PrefixSpan
-
Method Summary
Modifier and TypeMethodDescriptionlong
int
double
Param for the maximum number of items (including delimiters used in the internal storage format) allowed in a projected database before local processing (default:32000000
).Param for the maximal pattern length (default:10
).Param for the minimal support level (default:0.1
).Param for the name of the sequence column in dataset (default "sequence"), rows with nulls in this column are ignored.Methods inherited from interface org.apache.spark.ml.util.Identifiable
toString, uid
Methods inherited from interface org.apache.spark.ml.param.Params
clear, copy, copyValues, defaultCopy, defaultParamMap, explainParam, explainParams, extractParamMap, extractParamMap, get, getDefault, getOrDefault, getParam, hasDefault, hasParam, isDefined, isSet, onParamChange, paramMap, params, set, set, set, setDefault, setDefault, shouldOwn
-
Method Details
-
getMaxLocalProjDBSize
long getMaxLocalProjDBSize() -
getMaxPatternLength
int getMaxPatternLength() -
getMinSupport
double getMinSupport() -
getSequenceCol
String getSequenceCol() -
maxLocalProjDBSize
LongParam maxLocalProjDBSize()Param for the maximum number of items (including delimiters used in the internal storage format) allowed in a projected database before local processing (default:32000000
). If a projected database exceeds this size, another iteration of distributed prefix growth is run.- Returns:
- (undocumented)
-
maxPatternLength
IntParam maxPatternLength()Param for the maximal pattern length (default:10
).- Returns:
- (undocumented)
-
minSupport
DoubleParam minSupport()Param for the minimal support level (default:0.1
). Sequential patterns that appear more than (minSupport * size-of-the-dataset) times are identified as frequent sequential patterns.- Returns:
- (undocumented)
-
sequenceCol
Param for the name of the sequence column in dataset (default "sequence"), rows with nulls in this column are ignored.- Returns:
- (undocumented)
-