caret - Classification and Regression Training
Misc functions for training and plotting classification and regression models.
Last updated 3 months ago
19.24 score 1.6k stars 303 dependents 61k scripts 198k downloads
recipes - Preprocessing and Feature Engineering Steps for Modeling
A recipe prepares your data for modeling. We provide an extensible framework for pipeable sequences of feature engineering steps provides preprocessing tools to be applied to data. Statistical parameters for the steps can be estimated from an initial data set and then applied to other data sets. The resulting processed output can then be used as inputs for statistical or machine learning models.
Last updated 3 days ago
18.71 score 584 stars 380 dependents 7.2k scripts 185k downloads
parsnip - A Common API to Modeling and Analysis Functions
A common interface is provided to allow users to specify a model without having to remember the different argument names across different functions or computational engines (e.g. 'R', 'Spark', 'Stan', 'H2O', etc).
Last updated 18 hours ago
16.37 score 612 stars 69 dependents 3.4k scripts 46k downloads
tidymodels - Easily Install and Load the 'Tidymodels' Packages
The tidy modeling "verse" is a collection of packages for modeling and statistical analysis that share the underlying design philosophy, grammar, and data structures of the tidyverse.
Last updated 20 days ago
16.34 score 780 stars 15 dependents 66k scripts 32k downloads
tune - Tidy Tuning Tools
The ability to tune models is important. 'tune' contains functions and classes to be used in conjunction with other 'tidymodels' packages for finding reasonable values of hyper-parameters in models, pre-processing methods, and post-processing steps.
Last updated 9 days ago
14.27 score 293 stars 39 dependents 756 scripts 34k downloadsCubist - Rule- And Instance-Based Regression Modeling
Regression modeling using rules with added instance-based corrections.
Last updated 8 months ago
12.38 score 40 stars 18 dependents 2.8k scripts 19k downloads
probably - Tools for Post-Processing Predicted Values
Models can be improved by post-processing class probabilities, by: recalibration, conversion to hard probabilities, assessment of equivocal zones, and other activities. 'probably' contains tools for conducting these operations as well as calibration tools and conformal inference techniques for regression models.
Last updated 5 months ago
12.07 score 115 stars 1 dependents 21k scripts 1.9k downloadsC50 - C5.0 Decision Trees and Rule-Based Models
C5.0 decision trees and rule-based models for pattern recognition that extend the work of Quinlan (1993, ISBN:1-55860-238-0).
Last updated 2 years ago
11.99 score 50 stars 13 dependents 1.3k scripts 13k downloadsmodeldata - Data Sets Useful for Modeling Examples
Data sets used for demonstrating or testing model-related packages are contained in this package.
Last updated 5 months ago
10.66 score 22 stars 17 dependents 2.2k scripts 26k downloads
tidyposterior - Bayesian Analysis to Compare Models using Resampling Statistics
Bayesian analysis used here to answer the question: "when looking at resampling results, are the differences between models 'real'?" To answer this, a model can be created were the performance statistic is the resampling statistics (e.g. accuracy or RMSE). These values are explained by the model types. In doing this, we can get parameter estimates for each model's affect on performance and make statistical (and practical) comparisons between models. The methods included here are similar to Benavoli et al (2017) <https://jmlr.org/papers/v18/16-305.html>.
Last updated 5 months ago
8.44 score 102 stars 273 scripts 591 downloads
finetune - Additional Functions for Model Tuning
The ability to tune models is important. 'finetune' enhances the 'tune' package by providing more specialized methods for finding reasonable values of model tuning parameters. Two racing methods described by Kuhn (2014) <arXiv:1405.6974> are included. An iterative search method using generalized simulated annealing (Bohachevsky, Johnson and Stein, 1986) <doi:10.1080/00401706.1986.10488128> is also included.
Last updated 7 months ago
8.36 score 62 stars 1 dependents 704 scripts 1.5k downloads
baguette - Efficient Model Functions for Bagging
Tree- and rule-based models can be bagged (<doi:10.1007/BF00058655>) using this package and their predictions equations are stored in an efficient format to reduce the model objects size and speed.
Last updated 1 months ago
8.26 score 25 stars 1 dependents 624 scripts 1.6k downloads
modeldb - Fits Models Inside the Database
Uses 'dplyr' and 'tidyeval' to fit statistical models inside the database. It currently supports KMeans and linear regression models.
Last updated 1 years ago
databasedbplyrdplyrggplot2modelingrlangsqltidyevalvisualization
7.59 score 79 stars 62 scripts 313 downloads
brulee - High-Level Modeling Functions with 'torch'
Provides high-level modeling functions to define and train models using the 'torch' R package. Models include linear, logistic, and multinomial regression as well as multilayer perceptrons.
Last updated 1 months ago
7.49 score 69 stars 214 scripts 856 downloadsAmesHousing - The Ames Iowa Housing Data
Raw and processed versions of the data from De Cock (2011) <http://ww2.amstat.org/publications/jse> are included in the package.
Last updated 5 years ago
7.29 score 14 stars 2 dependents 524 scripts 2.2k downloadsAppliedPredictiveModeling - Functions and Data Sets for 'Applied Predictive Modeling'
A few functions and several data set for the Springer book 'Applied Predictive Modeling'.
Last updated 2 years ago
6.89 score 37 stars 1.2k scripts 3.6k downloadsusemodels - Boilerplate Code for 'Tidymodels' Analyses
Code snippets to fit models using the tidymodels framework can be easily created for a given data set.
Last updated 5 months ago
6.88 score 84 stars 128 scripts 315 downloadsplsmod - Model Wrappers for Projection Methods
Bindings for additional regression models for use with the 'parsnip' package, including ordinary and spare partial least squares models for regression and classification (Rohart et al (2017) <doi:10.1371/journal.pcbi.1005752>).
Last updated 5 months ago
mixomics
6.47 score 14 stars 1 dependents 59 scripts 517 downloadssfd - Space-Filling Design Library
A collection of pre-optimized space-filling designs, for up to ten parameters, is contained here. Functions are provided to access designs described by Husslage et al (2011) <doi:10.1007/s11081-010-9129-8> and Wang and Fang (2005) <doi:10.1142/9789812701190_0040>. The design types included are Audze-Eglais, MaxiMin, and uniform.
Last updated 1 years ago
6.32 score 53 dependents 5 scripts 26k downloadssparseLDA - Sparse Discriminant Analysis
Performs sparse linear discriminant analysis for Gaussians and mixture of Gaussian models.
Last updated 8 years ago
5.45 score 7 stars 3 dependents 45 scripts 806 downloadsdesirability - Function Optimization and Ranking via Desirability Functions
S3 classes for multivariate optimization using the desirability function by Derringer and Suich (1980).
Last updated 8 years ago
4.74 score 1 stars 1 dependents 37 scripts 606 downloadsdesirability2 - Desirability Functions for Multiparameter Optimization
In-line functions for multivariate optimization via desirability functions (Derringer and Suich, 1980, <doi:10.1080/00224065.1980.11980968>) with easy use within `dplyr` pipelines.
Last updated 5 months ago
4.53 score 10 stars 17 scripts 229 downloadssparsediscrim - Sparse and Regularized Discriminant Analysis
A collection of sparse and regularized discriminant analysis methods intended for small-sample, high-dimensional data sets. The package features the High-Dimensional Regularized Discriminant Analysis classifier from Ramey et al. (2017) <arXiv:1602.01182>. Other classifiers include those from Dudoit et al. (2002) <doi:10.1198/016214502753479248>, Pang et al. (2009) <doi:10.1111/j.1541-0420.2009.01200.x>, and Tong et al. (2012) <doi:10.1093/bioinformatics/btr690>.
Last updated 4 years ago
4.11 score 3 stars 86 scripts 956 downloadsbeans - Data on Dried Beans
These data contain morphological image measurements for dried beans from Koklu and Ozkan (2020) <doi:10.1016/j.compag.2020.105507>.
Last updated 4 years ago
3.51 score 1 stars 65 scripts 234 downloadsQSARdata - Quantitative Structure Activity Relationship (QSAR) Data Sets
Molecular descriptors and outcomes for several public domain data sets
Last updated 14 years ago
2.83 score 68 scripts 621 downloads