Returns the global average of the score returned by the calculate method.
Evaluation information
Query
Predicted result
Actual result
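For illustration, a minimal sketch of an accuracy-style metric extending AverageMetric is shown below. The Query, PredictedResult, and ActualResult classes are hypothetical placeholders, not part of this package.

import org.apache.predictionio.controller.{AverageMetric, EmptyEvaluationInfo}

// Hypothetical engine classes used only in this sketch.
case class Query(features: Array[Double])
case class PredictedResult(label: Double)
case class ActualResult(label: Double)

// Scores each (query, predicted, actual) tuple with 1.0 or 0.0; the metric
// result is the global average of these scores, i.e. the accuracy.
case class Accuracy()
  extends AverageMetric[EmptyEvaluationInfo, Query, PredictedResult, ActualResult] {
  def calculate(query: Query, predicted: PredictedResult, actual: ActualResult): Double =
    if (predicted.label == actual.label) 1.0 else 0.0
}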
If your query class cannot be automatically serialized/deserialized to/from JSON, implement a trait by extending this trait, and overriding the querySerializer member with your custom JSON4S serializer.
Algorithm and serving classes using your query class would only need to mix in the trait to enable the custom serializer.
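A hedged sketch of such a trait is shown below. The Query class and its JSON field names are hypothetical, and it is assumed that querySerializer holds a JSON4S Formats value, as in the default implementation.

import org.json4s._
import org.json4s.JsonDSL._
import org.apache.predictionio.controller.CustomQuerySerializer

// Hypothetical query class whose JSON representation differs from its fields.
case class Query(user: String, num: Int)

// Hypothetical JSON4S serializer mapping {"uid": ..., "n": ...} to Query.
class QueryJsonSerializer extends CustomSerializer[Query](_ => (
  {
    case jv: JValue =>
      implicit val formats = DefaultFormats
      Query((jv \ "uid").extract[String], (jv \ "n").extract[Int])
  },
  {
    case q: Query =>
      ("uid" -> q.user) ~ ("n" -> q.num)
  }
))

// Mix this trait into algorithm and serving classes that use Query.
// Assumption: querySerializer is a JSON4S Formats value, so a custom
// serializer can be appended to DefaultFormats.
trait MyQuerySerializer extends CustomQuerySerializer {
  @transient override lazy val querySerializer = DefaultFormats + new QueryJsonSerializer
}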
Defines a deployment that contains an Engine
Empty actual result.
Empty algorithm parameters.
Empty data parameters.
Empty data source parameters.
Empty evaluation info.
Empty metrics parameters.
Empty model.
A concrete implementation of Params representing empty parameters.
Empty preparator parameters.
Empty prepared data.
Empty serving parameters.
Empty training data.
This class chains up the entire data process. PredictionIO uses this information to create workflows and deployments. In Scala, you should implement an object that extends the EngineFactory trait similar to the following example.
object ItemRankEngine extends EngineFactory {
  def apply() = {
    new Engine(
      classOf[ItemRankDataSource],
      classOf[ItemRankPreparator],
      Map(
        "knn" -> classOf[KNNAlgorithm],
        "rand" -> classOf[RandomAlgorithm],
        "mahoutItemBased" -> classOf[MahoutItemBasedAlgorithm]),
      classOf[ItemRankServing])
  }
}
Training data class.
Evaluation info class.
Prepared data class.
Input query class.
Output prediction class.
Actual value class.
If you intend to let PredictionIO create workflows and deploy serving automatically, you will need to implement an object that extends this class and returns an Engine.
This class serves as a logical grouping of all of an engine's required parameters.
Defines an engine parameters generator.
Implementations of this trait can be supplied to "pio eval" as the second command line argument.
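As an illustration, a minimal sketch of such a generator follows. The parameter classes are hypothetical, and it is assumed that the generator populates the engineParamsList member, as is commonly done in PredictionIO evaluation templates.

import org.apache.predictionio.controller.{EngineParams, EngineParamsGenerator, Params}

// Hypothetical parameter classes used only in this sketch.
case class MyDataSourceParams(appName: String, evalK: Int) extends Params
case class MyAlgoParams(regularization: Double) extends Params

// Supplied to "pio eval" as the second command line argument.
object MyEngineParamsList extends EngineParamsGenerator {
  private[this] val baseEP = EngineParams(
    dataSourceParams = MyDataSourceParams(appName = "MyApp", evalK = 5))

  // Sweep over a few regularization values; each EngineParams set is evaluated.
  engineParamsList = Seq(0.01, 0.1, 1.0).map { reg =>
    baseEP.copy(algorithmParamsList = Seq(("myalgo", MyAlgoParams(reg))))
  }
}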
Defines an evaluation that contains an engine and a metric.
Implementations of this trait can be supplied to "pio eval" as the first argument.
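For example, a minimal sketch of an Evaluation object might look like the following, reusing the hypothetical Accuracy metric from the AverageMetric sketch above together with a hypothetical MyEngine factory; it is assumed that the evaluation is defined by assigning the engineMetric member, as in PredictionIO evaluation templates.

import org.apache.predictionio.controller.Evaluation

// Supplied to "pio eval" as the first command line argument, e.g.
//   pio eval com.example.MyAccuracyEvaluation com.example.MyEngineParamsList
object MyAccuracyEvaluation extends Evaluation {
  // Pair the engine under evaluation with the metric used to score it.
  // MyEngine() and Accuracy() are hypothetical placeholders.
  engineMetric = (MyEngine(), new Accuracy())
}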
:: Experimental :: FastEvalEngine is a subclass of Engine that exploits the immutability of controllers to optimize the evaluation process
:: Experimental :: Workflow based on FastEvalEngine
A helper concrete implementation of org.apache.predictionio.core.BasePreparator that passes training data through without any special preparation. This can be used in place of both PPreparator and LPreparator.
Training data class.
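For example, an engine factory can plug IdentityPreparator in directly when no preparation step is needed. This is a hedged sketch: MyDataSource, MyAlgorithm, and MyServing are hypothetical classes, and it assumes the IdentityPreparator companion object accepts the data source class, as described further below.

import org.apache.predictionio.controller.{Engine, EngineFactory, IdentityPreparator}

object MyEngine extends EngineFactory {
  def apply() = {
    new Engine(
      classOf[MyDataSource],
      // Pass training data straight through to the algorithm.
      IdentityPreparator(classOf[MyDataSource]),
      Map("myalgo" -> classOf[MyAlgorithm]),
      classOf[MyServing])
  }
}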
Base class of a local algorithm.
A local algorithm runs locally within a single machine and produces a model that can fit within a single machine.
If your input query class requires custom JSON4S serialization, the most idiomatic way is to implement a trait that extends CustomQuerySerializer, and mix that into your algorithm class, instead of overriding querySerializer directly.
Prepared data class.
Trained model class.
Input query class.
Output prediction class.
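A minimal sketch of a local algorithm follows; the prepared data, model, query, and prediction classes are hypothetical placeholders, and the train/predict signatures shown are assumptions based on the description above.

import org.apache.predictionio.controller.LAlgorithm

// Hypothetical classes used only in this sketch.
case class MyPreparedData(ratings: Seq[(String, Double)])   // (item, rating)
case class MyQuery(item: String)
case class MyPrediction(score: Double)
class MyModel(val itemMeans: Map[String, Double]) extends Serializable

class MeanLAlgorithm
  extends LAlgorithm[MyPreparedData, MyModel, MyQuery, MyPrediction] {

  // Both training and the resulting model stay on a single machine.
  def train(pd: MyPreparedData): MyModel = {
    val means = pd.ratings
      .groupBy(_._1)
      .map { case (item, rs) => (item, rs.map(_._2).sum / rs.size) }
    new MyModel(means)
  }

  def predict(model: MyModel, query: MyQuery): MyPrediction =
    MyPrediction(model.itemMeans.getOrElse(query.item, 0.0))
}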
A concrete implementation of LServing that returns the average of all algorithms' predictions, where the prediction class is expected to be Double.
Base class of a local data source.
A local data source runs locally within a single machine and returns data that can fit within a single machine.
Training data class.
Evaluation Info class.
Input query class.
Actual value class.
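A minimal sketch of a local data source follows; the training data and query classes are hypothetical, and it is assumed that readTraining(), taking no arguments, is the method to implement.

import org.apache.predictionio.controller.{EmptyActualResult, EmptyEvaluationInfo, LDataSource}

// Hypothetical classes used only in this sketch.
case class MyTrainingData(ratings: Seq[(String, String, Double)])  // (user, item, rating)
case class MyQuery(user: String)

class MyLocalDataSource
  extends LDataSource[MyTrainingData, EmptyEvaluationInfo, MyQuery, EmptyActualResult] {

  // Everything is read into the driver's memory; suitable only for data that
  // fits on a single machine.
  def readTraining(): MyTrainingData =
    MyTrainingData(Seq(("u1", "i1", 4.0), ("u1", "i2", 1.0), ("u2", "i1", 5.0)))
}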
A concrete implementation of LServing returning the first algorithm's prediction result directly without any modification.
DEPRECATED. Use IdentityPreparator instead.
Training data class.
Base class of a local preparator.
A local preparator runs locally within a single machine and produces prepared data that can fit within a single machine.
Training data class.
Prepared data class.
Base class of serving.
Input query class.
Output prediction class.
This trait is a convenience helper for persisting your model to the local filesystem. This trait and LocalFileSystemPersistentModelLoader contain concrete implementations and need not be further implemented.
The underlying implementation is Utils.save.
class MyModel extends LocalFileSystemPersistentModel[MyParams] {
  ...
}

object MyModel extends LocalFileSystemPersistentModelLoader[MyParams, MyModel] {
  ...
}
Algorithm parameters class.
Implement an object that extends this trait for PredictionIO to support loading a persisted model from local filesystem during serving deployment.
The underlying implementation is Utils.load.
Algorithm parameters class.
Model class.
Base class of a Metric.
Evaluation information
Query
Predicted result
Actual result
Metric result
:: DeveloperApi :: Do not use this directly. Use MetricEvaluator$ instead. This is an implementation of org.apache.predictionio.core.BaseEvaluator that evaluates prediction performance based on metric scores.
Evaluation information type
Query class
Predicted result class
Actual result class
Metric result class
Contains all results of a MetricEvaluator
Type of the primary metric score
The best score among all iterations
The set of engine parameters that yielded the best score
The index of iteration that yielded the best score
Brief description of the primary metric score
Brief descriptions of other metric scores
All sets of engine parameters and corresponding metric scores
An optional output path where scores are saved
Case class storing a primary score, and other scores
Type of the primary metric score
Primary metric score
Other scores this metric might have
Returns the global average of the non-None score returned by the calculate method.
Evaluation information
Query
Predicted result
Actual result
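For example, a precision-style metric can return None to exclude queries it does not care about from the average. The engine classes below are hypothetical placeholders.

import org.apache.predictionio.controller.{EmptyEvaluationInfo, OptionAverageMetric}

// Hypothetical engine classes used only in this sketch.
case class Query(features: Array[Double])
case class PredictedResult(label: Double)
case class ActualResult(label: Double)

// Precision for one class: only queries predicted as `label` contribute to the
// average; all other queries are skipped by returning None.
case class Precision(label: Double)
  extends OptionAverageMetric[EmptyEvaluationInfo, Query, PredictedResult, ActualResult] {
  def calculate(query: Query, predicted: PredictedResult, actual: ActualResult): Option[Double] =
    if (predicted.label == label) Some(if (actual.label == label) 1.0 else 0.0)
    else None
}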
Returns the global standard deviation of the non-None score returned by the calculate method.
This method uses the org.apache.spark.util.StatCounter library; a one-pass method is used for the calculation.
Evaluation information
Query
Predicted result
Actual result
Base class of a parallel-to-local algorithm.
A parallel-to-local algorithm can be run in parallel on a cluster and produces a model that can fit within a single machine.
If your input query class requires custom JSON4S serialization, the most idiomatic way is to implement a trait that extends CustomQuerySerializer, and mix that into your algorithm class, instead of overriding querySerializer directly.
Prepared data class.
Trained model class.
Input query class.
Output prediction class.
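A minimal sketch of a parallel-to-local algorithm follows; the prepared data, model, query, and prediction classes are hypothetical, and the train/predict signatures shown are assumptions based on the description above.

import org.apache.spark.SparkContext
import org.apache.spark.rdd.RDD
import org.apache.predictionio.controller.P2LAlgorithm

// Hypothetical classes used only in this sketch.
case class MyPreparedData(ratings: RDD[(String, Double)])   // (item, rating)
case class MyQuery(item: String)
case class MyPrediction(score: Double)
class MyModel(val itemMeans: Map[String, Double]) extends Serializable

class MeanP2LAlgorithm
  extends P2LAlgorithm[MyPreparedData, MyModel, MyQuery, MyPrediction] {

  // Training runs on the cluster; the model is collected to a single machine.
  def train(sc: SparkContext, pd: MyPreparedData): MyModel = {
    val means = pd.ratings
      .mapValues(r => (r, 1L))
      .reduceByKey { case ((s1, c1), (s2, c2)) => (s1 + s2, c1 + c2) }
      .mapValues { case (sum, count) => sum / count }
      .collectAsMap()
    new MyModel(means.toMap)
  }

  def predict(model: MyModel, query: MyQuery): MyPrediction =
    MyPrediction(model.itemMeans.getOrElse(query.item, 0.0))
}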
Base class of a parallel algorithm.
A parallel algorithm can be run in parallel on a cluster and produces a model that can also be distributed across a cluster.
If your input query class requires custom JSON4S serialization, the most idiomatic way is to implement a trait that extends CustomQuerySerializer, and mix that into your algorithm class, instead of overriding querySerializer directly.
To provide the evaluation feature, one must override and implement the batchPredict method. Otherwise, an exception will be thrown when pio eval is used.
Prepared data class.
Trained model class.
Input query class.
Output prediction class.
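A minimal sketch of a parallel algorithm whose model is an RDD follows, including a batchPredict implementation for evaluation. All classes are hypothetical, and the batchPredict signature (an RDD of indexed queries in, an RDD of indexed predictions out) is an assumption based on the description above.

import org.apache.spark.SparkContext
import org.apache.spark.rdd.RDD
import org.apache.predictionio.controller.PAlgorithm

// Hypothetical classes used only in this sketch.
case class MyPreparedData(ratings: RDD[(String, Double)])   // (item, rating)
case class MyQuery(item: String)
case class MyPrediction(score: Double)
case class MyModel(itemMeans: RDD[(String, Double)])        // the model stays distributed

class MeanPAlgorithm
  extends PAlgorithm[MyPreparedData, MyModel, MyQuery, MyPrediction] {

  def train(sc: SparkContext, pd: MyPreparedData): MyModel =
    MyModel(pd.ratings.groupByKey().mapValues(rs => rs.sum / rs.size))

  // Single-query prediction against the distributed model.
  def predict(model: MyModel, query: MyQuery): MyPrediction = {
    val scores = model.itemMeans.lookup(query.item)
    MyPrediction(if (scores.nonEmpty) scores.head else 0.0)
  }

  // Needed for "pio eval": score many queries in one pass over the model.
  override def batchPredict(
      model: MyModel, queries: RDD[(Long, MyQuery)]): RDD[(Long, MyPrediction)] = {
    val byItem = queries.map { case (index, q) => (q.item, index) }
    byItem.leftOuterJoin(model.itemMeans).map {
      case (_, (index, score)) => (index, MyPrediction(score.getOrElse(0.0)))
    }
  }
}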
Base class of a parallel data source.
A parallel data source runs locally within a single machine, or in parallel on a cluster, to return data that is distributed across a cluster.
Training data class.
Evaluation Info class.
Input query class.
Actual value class.
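A minimal sketch of a parallel data source follows; the training data and query classes are hypothetical, and it is assumed that readTraining(sc) is the method to implement.

import org.apache.spark.SparkContext
import org.apache.spark.rdd.RDD
import org.apache.predictionio.controller.{EmptyActualResult, EmptyEvaluationInfo, PDataSource}

// Hypothetical classes used only in this sketch.
case class MyTrainingData(ratings: RDD[(String, String, Double)])  // (user, item, rating)
case class MyQuery(user: String)

class MyParallelDataSource
  extends PDataSource[MyTrainingData, EmptyEvaluationInfo, MyQuery, EmptyActualResult] {

  def readTraining(sc: SparkContext): MyTrainingData = {
    // A real data source would read from the event store; an in-memory
    // collection is parallelized here as a placeholder.
    MyTrainingData(sc.parallelize(Seq(("u1", "i1", 4.0), ("u2", "i1", 5.0))))
  }
}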
DEPRECATED. Use IdentityPreparator instead.
Training data class.
Base class of a parallel preparator.
A parallel preparator can be run in parallel on a cluster and produces prepared data that is distributed across a cluster.
Training data class.
Prepared data class.
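A minimal sketch of a parallel preparator follows, using the same hypothetical training data shape as the parallel data source sketch above; it is assumed that prepare(sc, trainingData) is the method to implement.

import org.apache.spark.SparkContext
import org.apache.spark.rdd.RDD
import org.apache.predictionio.controller.PPreparator

// Hypothetical classes used only in this sketch.
case class MyTrainingData(ratings: RDD[(String, String, Double)])  // (user, item, rating)
case class MyPreparedData(ratings: RDD[(String, String, Double)])

class MyPreparator extends PPreparator[MyTrainingData, MyPreparedData] {

  def prepare(sc: SparkContext, trainingData: MyTrainingData): MyPreparedData = {
    // A trivial example of preparation: drop non-positive ratings.
    MyPreparedData(trainingData.ratings.filter { case (_, _, r) => r > 0.0 })
  }
}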
Base trait for all kinds of parameters that will be passed to constructors of different controller classes.
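Such parameters are typically defined as simple case classes, for example (hypothetical class and fields):

import org.apache.predictionio.controller.Params

// Passed by PredictionIO to the corresponding controller's constructor.
case class MyAlgorithmParams(rank: Int, numIterations: Int, lambda: Double) extends Params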
Mix in and implement this trait if your model cannot be persisted by PredictionIO automatically. A companion object extending IPersistentModelLoader is required for PredictionIO to load the persisted model automatically during deployment.
Note that models generated by PAlgorithm cannot, by nature, be persisted automatically and must implement these traits if model persistence is desired.
class MyModel extends PersistentModel[MyParams] {
  def save(id: String, params: MyParams, sc: SparkContext): Boolean = {
    ...
  }
}

object MyModel extends PersistentModelLoader[MyParams, MyModel] {
  def apply(id: String, params: MyParams, sc: Option[SparkContext]): MyModel = {
    ...
  }
}
In Java, all you need to do is to implement this interface, and add a static method with 3 arguments of type String, Params, and SparkContext.
public class MyModel implements PersistentModel<MyParams>, Serializable {
  ...
  public boolean save(String id, MyParams params, SparkContext sc) {
    ...
  }

  public static MyModel load(String id, Params params, SparkContext sc) {
    ...
  }
  ...
}
Algorithm parameters class.
Implement an object that extends this trait for PredictionIO to support loading a persisted model during serving deployment.
Algorithm parameters class.
Model class.
Trait for a metric that returns a score based on Query, PredictedResult, and ActualResult.
Query class
Predicted result class
Actual result class
Metric result class
Extend a data class with this trait if you want PredictionIO to automatically perform a sanity check on your data classes during training. This is very useful when you need to debug your engine.
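For example, a training data class can mix in this trait and assert basic properties. This is a hedged sketch: the class and its fields are hypothetical, and it assumes sanityCheck() is the method invoked during training.

import org.apache.spark.rdd.RDD
import org.apache.predictionio.controller.SanityCheck

// Hypothetical training data class used only in this sketch.
case class MyTrainingData(ratings: RDD[(String, String, Double)]) extends SanityCheck {

  // Invoked during training when this trait is mixed in.
  override def sanityCheck(): Unit = {
    require(ratings.count() > 0, "ratings cannot be empty")
    require(ratings.filter { case (_, _, r) => r < 0.0 }.count() == 0,
      "ratings must be non-negative")
  }
}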
Base class of several helper types that represent emptiness
SimpleEngine has only one algorithm, and uses the default preparator and serving layer. The current default preparator is IdentityPreparator and the default serving is FirstServing.
Training data class.
Evaluation info class.
Input query class.
Output prediction class.
Actual value class.
This shorthand class serves the SimpleEngine class.
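A hedged usage sketch, assuming SimpleEngine's constructor takes the data source class and the single algorithm class (MyDataSource and MyAlgorithm are hypothetical):

import org.apache.predictionio.controller.{EngineFactory, SimpleEngine}

object MySimpleEngine extends EngineFactory {
  // The default preparator (IdentityPreparator) and serving (FirstServing) are used.
  def apply() = new SimpleEngine(classOf[MyDataSource], classOf[MyAlgorithm])
}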
Returns the global standard deviation of the score returned by the calculate method.
This method uses the org.apache.spark.util.StatCounter library; a one-pass method is used for the calculation.
Evaluation information
Query
Predicted result
Actual result
Returns the sum of the score returned by the calculate method.
Evaluation information
Query
Predicted result
Actual result
Result class; the output of the calculate function must be Numeric.
Returns zero. Useful as a placeholder during evaluation development when not all components are implemented.
Evaluation information
Query
Predicted result
Actual result
DEPRECATED. Use EngineFactory instead.
(Since version 0.9.2) Use EngineFactory instead.
DEPRECATED. Use LocalFileSystemPersistentModel instead.
(Since version 0.9.2) Use LocalFileSystemPersistentModel instead.
DEPRECATED. Use LocalFileSystemPersistentModelLoader instead.
(Since version 0.9.2) Use LocalFileSystemPersistentModelLoader instead.
DEPRECATED. Use PersistentModel instead.
(Since version 0.9.2) Use PersistentModel instead.
DEPRECATED. Use PersistentModelLoader instead.
(Since version 0.9.2) Use PersistentModelLoader instead.
Mix in this trait for queries that contain prId (PredictedResultId). This is useful when your engine expects queries to also be associated with prId keys when the feedback loop is enabled.
(Since version 0.9.2) To be removed in future releases.
DEPRECATED. Use CustomQuerySerializer instead.
(Since version 0.9.2) Use CustomQuerySerializer instead.
This object contains concrete implementation for some methods of the Engine class.
Companion object for creating EngineParams instances.
:: Experimental :: Workflow based on FastEvalEngine
Companion object of IdentityPreparator that conveniently returns an instance of the class of IdentityPreparator for use with EngineFactory.
A concrete implementation of LServing that returns the average of all algorithms' predictions, where the prediction class is expected to be Double.
A concrete implementation of LServing returning the first algorithm's prediction result directly without any modification.
DEPRECATED. Use IdentityPreparator instead.
Companion object of MetricEvaluator
DEPRECATED. Use IdentityPreparator instead.
Controller utilities.
Companion object of ZeroMetric
Provides building blocks for writing a complete prediction engine consisting of DataSource, Preparator, Algorithm, Serving, and Evaluation.
Start Building an Engine
The starting point of a prediction engine is the Engine class.
The DASE Paradigm
The building blocks together form the DASE paradigm. Learn more about DASE here.
Types of Building Blocks
Depending on the problem you are solving, you would need to pick appropriate flavors of building blocks.
Engines
There are 3 typical engine configurations:
1. PDataSource -> PPreparator -> P2LAlgorithm -> LServing
2. PDataSource -> PPreparator -> PAlgorithm -> LServing
3. LDataSource -> LPreparator -> LAlgorithm -> LServing
In both configurations 1 and 2, data is sourced and prepared in a parallelized fashion, with the data represented as RDDs.
The difference between configurations 1 and 2 comes at the algorithm stage. In configuration 1, the algorithm operates on potentially large data as RDDs in the Spark cluster, and eventually outputs a model that is small enough to fit in a single machine.
On the other hand, configuration 2 outputs a model that is potentially too large to fit in a single machine, and must reside in the Spark cluster as RDD(s).
With configuration 1 (P2LAlgorithm), PredictionIO will automatically try to persist the model to local disk or HDFS if the model is serializable.
With configuration 2 (PAlgorithm), PredictionIO will not automatically try to persist the model, unless the model implements the PersistentModel trait.
In special circumstances where both the data and the model are small, configuration 3 may be used. Beware that RDDs cannot be used with configuration 3.
Data Source
PDataSource is probably the most commonly used data source base class, with the ability to process RDD-based data. LDataSource cannot handle RDD-based data; use it only when you have a special requirement.
Preparator
With PDataSource, you must pick PPreparator. The same applies to LDataSource and LPreparator.
Algorithm
The workhorse of the engine comes in 3 different flavors.
P2LAlgorithm
Produces a model that is small enough to fit in a single machine from PDataSource and PPreparator. The model cannot contain any RDD. If the produced model is serializable, PredictionIO will try to automatically persist it. In addition, P2LAlgorithm.batchPredict is already implemented for evaluation purposes.
PAlgorithm
Produces a model that could contain RDDs from PDataSource and PPreparator. PredictionIO will not try to persist it automatically unless the model implements PersistentModel. PAlgorithm.batchPredict must be implemented for Evaluation.
LAlgorithm
Produces a model that is small enough to fit in a single machine from LDataSource and LPreparator. The model cannot contain any RDD. If the produced model is serializable, PredictionIO will try to automatically persist it. In addition, LAlgorithm.batchPredict is already implemented for evaluation purposes.
Serving
The serving component comes in only one flavor: LServing. At the serving stage, it is assumed that the result being served is already at a human-consumable size.
Model Persistence
PredictionIO tries its best to persist trained models automatically. Please refer to LAlgorithm.makePersistentModel, P2LAlgorithm.makePersistentModel, and PAlgorithm.makePersistentModel for descriptions on different strategies.