Feature Selection Package - Algorithms - Chi Square
Description
The Chi-Square distribution is one of the most widely-used probability
distributions.
Usage
Method Signature:
[out] = fsChiSquare(X,Y)
Output:
out: A struct containing the following fields
- W - The distribution at each data point.
- fList - The list of features that are deemed useful.
- prf - This means that the smaller the feature weight is, the
more useful it will be to the user.
Input:
X: the features on current trunk,
each column is a feature vector on all instances, and each row
is a part of the instance.
Y: the label of instances, in single column form: 1 2 3 4 5 ...
Code Example
% Using the wine.dat data set, which can be found at
% [fspackage_location]/classifiers/knn/wine.mat
fsChiSquare(X,Y)
Keyword in Evaluator Framework
chi2
Paper
BibTex entry for:
H. Liu and R. Setiono, "Chi2: Feature selection and discretization of numeric attributes"
@INPROCEEDINGS{Liu-Seti95,
author = {Liu, H. and Setiono, R.},
title = {Chi2: Feature Selection and Discretization of Numeric Attributes},
booktitle = {Proceedings of the {S}eventh {IEEE} {I}nternational {C}onference on {T}ools with {A}rtificial {I}ntelligence, November 5-8, 1995},
year = {1995},
editor = {Vassilopoulos, J.F.},
pages = {388--391},
address = {Herndon, Virginia},
publisher = {IEEE Computer Society},
owner = {alan},
timestamp = {2008.10.1}
}