
Class SelectPercentileScikitsLearnNode

Select features according to a percentile of the highest scores.

This node has been automatically generated by wrapping the ``sklearn.feature_selection.univariate_selection.SelectPercentile`` class
from the ``sklearn`` library.  The wrapped instance can be accessed
through the ``scikits_alg`` attribute.

Read more in the scikit-learn User Guide, section "Univariate feature selection".


Parameters

score_func : callable
    Function taking two arrays X and y, and returning a pair of arrays
    (scores, pvalues).

percentile : int, optional, default=10
    Percent of features to keep.


Attributes

``scores_`` : array-like, shape=(n_features,)
    Scores of features.

``pvalues_`` : array-like, shape=(n_features,)
    p-values of feature scores.


Notes

Ties between features with equal scores will be broken in an unspecified way.

See also

f_classif: ANOVA F-value between label/feature for classification tasks.
chi2: Chi-squared stats of non-negative features for classification tasks.
f_regression: F-value between label/feature for regression tasks.
SelectKBest: Select features based on the k highest scores.
SelectFpr: Select features based on a false positive rate test.
SelectFdr: Select features based on an estimated false discovery rate.
SelectFwe: Select features based on family-wise error rate.
GenericUnivariateSelect: Univariate feature selector with configurable mode.
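
As a rough illustration of what "keep a percentile of the highest scores" means, the pure-Python sketch below ranks features by score and keeps the top share. The function name `select_percentile`, the rounding rule (`int(n_features * percentile / 100)`), and the tie-breaking order are assumptions made for this example; they are not taken from the wrapped sklearn class, which leaves tie-breaking unspecified.

```python
def select_percentile(scores, percentile=10):
    """Return the sorted indices of the features kept for a given percentile.

    Hypothetical helper for illustration only, not part of mdp or sklearn.
    """
    if not 0 <= percentile <= 100:
        raise ValueError("percentile must be between 0 and 100")
    # Number of features to keep (rounded down; an assumption of this sketch).
    n_keep = int(len(scores) * percentile / 100)
    # Rank feature indices by descending score. sorted() is stable, so tied
    # scores keep their original order here; that choice is arbitrary.
    ranked = sorted(range(len(scores)), key=lambda i: -scores[i])
    return sorted(ranked[:n_keep])


scores = [0.1, 5.0, 2.5, 0.3, 9.2, 1.1, 0.0, 4.4, 3.3, 0.7]
selected = select_percentile(scores, percentile=30)
print(selected)  # [1, 4, 7]
```

With ten features and `percentile=30`, three features survive: those with scores 9.2, 5.0 and 4.4.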

Instance Methods
__init__(self, input_dim=None, output_dim=None, dtype=None, **kwargs)
Select features according to a percentile of the highest scores.
_execute(self, x)
_get_supported_dtypes(self)
Return the list of dtypes supported by this node. The types can be specified in any format allowed by numpy.dtype.
_stop_training(self, **kwargs)
Concatenate the collected data in a single array.
execute(self, x)
Reduce X to the selected features.
stop_training(self, **kwargs)
Run score function on (X, y) and get the appropriate features.

Inherited from unreachable.newobject: __long__, __native__, __nonzero__, __unicode__, next

Inherited from object: __delattr__, __format__, __getattribute__, __hash__, __new__, __reduce__, __reduce_ex__, __setattr__, __sizeof__, __subclasshook__

    Inherited from Cumulator
_train(self, *args)
Collect all input data in a list.
train(self, *args)
Collect all input data in a list.
    Inherited from Node
__add__(self, other)
__call__(self, x, *args, **kwargs)
Calling an instance of Node is equivalent to calling its execute method.
_check_input(self, x)
_check_output(self, y)
_check_train_args(self, x, *args, **kwargs)
_inverse(self, x)
_pre_execution_checks(self, x)
This method contains all pre-execution checks.
_pre_inversion_checks(self, y)
This method contains all pre-inversion checks.
_refcast(self, x)
Helper function to cast arrays to the internal dtype.
_set_dtype(self, t)
_set_input_dim(self, n)
_set_output_dim(self, n)
copy(self, protocol=None)
Return a deep copy of the node.
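get_current_train_phase(self)
Return the index of the current training phase.
get_dtype(self)
Return dtype.
get_input_dim(self)
Return input dimensions.
get_output_dim(self)
Return output dimensions.
get_remaining_train_phase(self)
Return the number of training phases still to accomplish.
get_supported_dtypes(self)
Return dtypes supported by the node as a list of dtype objects.
has_multiple_training_phases(self)
Return True if the node has multiple training phases.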
inverse(self, y, *args, **kwargs)
Invert y.
is_training(self)
Return True if the node is in the training phase, False otherwise.
save(self, filename, protocol=-1)
Save a pickled serialization of the node to filename. If filename is None, return a string.
set_dtype(self, t)
Set internal structures' dtype.
set_input_dim(self, n)
Set input dimensions.
set_output_dim(self, n)
Set output dimensions.
Static Methods
is_invertible()
Return True if the node can be inverted, False otherwise.
is_trainable()
Return True if the node can be trained, False otherwise.
Properties

Inherited from object: __class__

    Inherited from Node
List of tuples:
Input dimensions
Output dimensions
Supported dtypes
Method Details

__init__(self, input_dim=None, output_dim=None, dtype=None, **kwargs)

Select features according to a percentile of the highest scores.

This node has been automatically generated by wrapping the ``sklearn.feature_selection.univariate_selection.SelectPercentile`` class
from the ``sklearn`` library.  The wrapped instance can be accessed
through the ``scikits_alg`` attribute.

Read more in the scikit-learn User Guide, section "Univariate feature selection".


Parameters

score_func : callable
    Function taking two arrays X and y, and returning a pair of arrays
    (scores, pvalues).

percentile : int, optional, default=10
    Percent of features to keep.


Attributes

``scores_`` : array-like, shape=(n_features,)
    Scores of features.

``pvalues_`` : array-like, shape=(n_features,)
    p-values of feature scores.


Notes

Ties between features with equal scores will be broken in an unspecified way.

See also

f_classif: ANOVA F-value between label/feature for classification tasks.
chi2: Chi-squared stats of non-negative features for classification tasks.
f_regression: F-value between label/feature for regression tasks.
SelectKBest: Select features based on the k highest scores.
SelectFpr: Select features based on a false positive rate test.
SelectFdr: Select features based on an estimated false discovery rate.
SelectFwe: Select features based on family-wise error rate.
GenericUnivariateSelect: Univariate feature selector with configurable mode.

Overrides: object.__init__

_execute(self, x)

Overrides: Node._execute


_get_supported_dtypes(self)

Return the list of dtypes supported by this node. The types can be specified in any format allowed by numpy.dtype.
Overrides: Node._get_supported_dtypes

_stop_training(self, **kwargs)

Concatenate the collected data in a single array.
Overrides: Node._stop_training

execute(self, x)


Reduce X to the selected features.


Parameters

X : array of shape [n_samples, n_features]
    The input samples.


Returns

X_r : array of shape [n_samples, n_selected_features]
    The input samples with only the selected features.
Overrides: Node.execute

is_invertible()
Static Method

Return True if the node can be inverted, False otherwise.
Overrides: Node.is_invertible
(inherited documentation)

is_trainable()
Static Method

Return True if the node can be trained, False otherwise.
Overrides: Node.is_trainable

stop_training(self, **kwargs)


Run score function on (X, y) and get the appropriate features.


Parameters

X : array-like, shape = [n_samples, n_features]
    The training input samples.
y : array-like, shape = [n_samples]
    The target values (class labels in classification, real numbers in regression).


Returns

self : object
    Returns self.
Overrides: Node.stop_training
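
The method summaries above describe a three-step lifecycle: train collects data, stop_training concatenates it and runs the score function, and execute reduces new input to the selected features. The sketch below mimics that flow with a toy class in pure Python. It is not MDP or sklearn code: `ToyPercentileSelector`, `mean_gap_score`, the choice to pass `y` to `train`, and the `selected_` attribute are all hypothetical, and the p-values are placeholders because no significance test is run.

```python
class ToyPercentileSelector:
    """Toy stand-in (not MDP code) for the collect / score / select lifecycle."""

    def __init__(self, score_func, percentile=10):
        self.score_func = score_func
        self.percentile = percentile
        self._x_chunks, self._y_chunks = [], []
        self.selected_ = None  # hypothetical attribute: indices of kept features

    def train(self, X, y):
        # Collect all input data in a list; nothing is computed yet.
        self._x_chunks.append(X)
        self._y_chunks.append(y)

    def stop_training(self):
        # Concatenate the collected chunks, then score every feature once.
        X = [row for chunk in self._x_chunks for row in chunk]
        y = [label for chunk in self._y_chunks for label in chunk]
        self.scores_, self.pvalues_ = self.score_func(X, y)
        # Keep the top share of features (rounding down is an assumption).
        n_keep = int(len(self.scores_) * self.percentile / 100)
        ranked = sorted(range(len(self.scores_)), key=lambda i: -self.scores_[i])
        self.selected_ = sorted(ranked[:n_keep])
        return self

    def execute(self, X):
        # Reduce X to the selected feature columns.
        return [[row[i] for i in self.selected_] for row in X]


def mean_gap_score(X, y):
    """Hypothetical score: |mean(class 1) - mean(class 0)| for each feature."""
    n_features = len(X[0])
    scores = []
    for j in range(n_features):
        pos = [row[j] for row, label in zip(X, y) if label == 1]
        neg = [row[j] for row, label in zip(X, y) if label == 0]
        scores.append(abs(sum(pos) / len(pos) - sum(neg) / len(neg)))
    pvalues = [None] * n_features  # placeholder: no significance test is run
    return scores, pvalues


node = ToyPercentileSelector(mean_gap_score, percentile=50)
node.train([[1.0, 0.0, 5.0, 2.0], [1.1, 1.0, 9.0, 2.0]], [0, 1])
node.train([[0.9, 0.2, 4.0, 2.0], [1.0, 1.2, 8.0, 2.0]], [0, 1])
node.stop_training()
X_r = node.execute([[1.0, 0.3, 7.0, 2.0]])
print(node.selected_, X_r)  # [1, 2] [[0.3, 7.0]]
```

Deferring the scoring to stop_training means the score function sees the whole training set at once, however many chunks were passed to train.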