New CRAN release after archival.
type
argument.R CMD CHECK
. #38Fixed an issue with the HDRDA classifier's predict
function. The posterior
probabilities did not sum to 1 because they were unnormalized. #34
Fixed another issue with the HDRDA classifier's predict
function, where the
class names were incorrect when predicting a single observation. #34
Improved docs throughout the package to pass R CMD CHECK
. #35
The predict
function now returns posterior-probability estimates for each
classifier.
The object returned by cv_hdrda()
can be plotted. A heatmap is produced
using ggplot2
to illustrate the cross-validation error rate for each
tuning-parameter pair considered.
The predict
function for the HDRDA classifier is now substantially faster
when classifying a large number of observations. #33
The cross-validation helper function cv_hdrda()
for the HDRDA classifier now
returns a trained classifier rather than the optimal model.
cv_hdrda()
also has an optional verbose
argument to dump summary
information while the cross-validation is running.
Fixed issue with classifiers' documentation not appearing in help index. #26
Better handling of HDRDA when its tuning parameters are both 0.
Corrected calculation of W_k and Q_k in HDRDA classifier.
Added unit tests for HDRDA.
Can now specify population means in generate_blockdiag()
.
Added unit tests for generate_blockdiag()
.
Updated man docs with roxygen2 4.0.
Added log_determinant()
helper function to calculate the log-determinant of
a matrix.
The High-Dimensional Regularized Discriminant Analysis (HDRDA) classifier from
Ramey, Stein, and Young (2014) implemented in hdrda()
has been revamped to
improve its computational performance.
lda_pseudo()
is an implementation of Linear Discriminant Analysis (LDA) with
the Moore-Penrose Pseudo-Inverse
lda_schafer()
is an implementation of Linear Discriminant Analysis (LDA)
using the covariance matrix estimator from Schafer and Strimmer (2005)
lda_thomaz()
is an implementation of Linear Discriminant Analysis (LDA)
using the covariance matrix estimator from Thomaz, Kitani, and Gillies (2006)
mdeb()
is an implementation of the Minimum Distance Empirical Bayesian
Estimator (MDEB) classifier from Srivistava and Kubokawa (2007)
mdmeb()
is an implementation of the Minimum Distance Rule using Modified
Empirical Bayes (MDMEB) classifier from Srivistava and Kubokawa (2007)
mdmp()
is an implementation of the Minimum Distance Rule using Moore-Penrose
Inverse (MDMP) classifier from Srivistava and Kubokawa (2007)
smdlda()
is an implementation of the Shrinkage-mean-based Diagonal Linear
Discriminant Analysis (SmDLDA) from Tong, Chen, and Zhao (2012)
smdqda()
is an implementation of the Shrinkage-mean-based Diagonal Quadratic
Discriminant Analysis (SmDQDA) from Tong, Chen, and Zhao (2012)
Added a summary function for hdrda
classifiers
First version of the sparsediscrim
package. With this package, we aim to
provide a large collection of regularized and sparse discriminant analysis
classifiers intended for high-dimensional classification.
hdrda()
is an implementation of the High-Dimensional Regularized
Discriminant Analysis classifier from Ramey, Stein, and Young (2014).
dlda()
is an implementation of the Diagonal Linear Discriminant Analysis
classifier from Dudoit, Fridlyand, and Speed (2002).
dqda()
is an implementation of the Diagonal Quadratic Discriminant Analysis
classifier from Dudoit, Fridlyand, and Speed (2002).
sdlda()
is an implementation of the Shrinkage-based Diagonal Linear
Discriminant Analysis classifier from Pang, Tong, and Zhao (2009).
sdqda()
is an implementation of the Shrinkage-based Diagonal Quadratic
Discriminant Analysis classifier from Pang, Tong, and Zhao (2009).
generate_blockdiag()
generates random variates from K multivariate normal
populations, where each class is generated with a constant mean vector and a
covariance matrix consisting of block-diagonal autocorrelation matrices.
generate_intraclass()
generates random variates from K multivariate normal
populations, where class is generated with a constant mean vector and an
intraclass covariance matrix.
cv_partition()
randomly partitions data for cross-validation.
no_intercept()
removes the intercept term from a formula if it is included.
cov_mle()
computes the maximum likelihood estimator for the sample
covariance matrix under the assumption of multivariate normality.
cov_pool()
computes the pooled maximum likelihood estimator for the common
covariance matrix under the assumption of multivariate normality.
cov_eigen()
computes the eigenvalue decomposition of the maximum likelihood
estimators of the covariance matrices for the given data matrix. We provide an
option to calculate the eigenvalue decomposition using the Fast Singular Value
Decomposition, which can greatly expedite the eigenvalue decomposition for
very tall data (large n, small p) or very wide data (small n, large p).