caret - Classification and Regression Training
Misc functions for training and plotting classification and regression models.
Last updated 1 months ago
19.18 score 1.6k stars 289 dependents 61k scripts 183k downloadsrecipes - Preprocessing and Feature Engineering Steps for Modeling
A recipe prepares your data for modeling. We provide an extensible framework for pipeable sequences of feature engineering steps provides preprocessing tools to be applied to data. Statistical parameters for the steps can be estimated from an initial data set and then applied to other data sets. The resulting processed output can then be used as inputs for statistical or machine learning models.
Last updated 4 hours ago
18.36 score 578 stars 369 dependents 6.5k scripts 106k downloadstidymodels - Easily Install and Load the 'Tidymodels' Packages
The tidy modeling "verse" is a collection of packages for modeling and statistical analysis that share the underlying design philosophy, grammar, and data structures of the tidyverse.
Last updated 3 months ago
16.11 score 774 stars 14 dependents 65k scripts 25k downloadsparsnip - A Common API to Modeling and Analysis Functions
A common interface is provided to allow users to specify a model without having to remember the different argument names across different functions or computational engines (e.g. 'R', 'Spark', 'Stan', 'H2O', etc).
Last updated 1 days ago
16.03 score 606 stars 68 dependents 3.4k scripts 20k downloadstune - Tidy Tuning Tools
The ability to tune models is important. 'tune' contains functions and classes to be used in conjunction with other 'tidymodels' packages for finding reasonable values of hyper-parameters in models, pre-processing methods, and post-processing steps.
Last updated 2 months ago
14.22 score 288 stars 38 dependents 768 scripts 26k downloadsCubist - Rule- And Instance-Based Regression Modeling
Regression modeling using rules with added instance-based corrections.
Last updated 7 months ago
12.45 score 40 stars 18 dependents 2.8k scripts 23k downloadsC50 - C5.0 Decision Trees and Rule-Based Models
C5.0 decision trees and rule-based models for pattern recognition that extend the work of Quinlan (1993, ISBN:1-55860-238-0).
Last updated 2 years ago
12.16 score 50 stars 13 dependents 1.3k scripts 19k downloadsprobably - Tools for Post-Processing Predicted Values
Models can be improved by post-processing class probabilities, by: recalibration, conversion to hard probabilities, assessment of equivocal zones, and other activities. 'probably' contains tools for conducting these operations as well as calibration tools and conformal inference techniques for regression models.
Last updated 3 months ago
12.01 score 115 stars 1 dependents 21k scripts 1.7k downloadsmodeldata - Data Sets Useful for Modeling Examples
Data sets used for demonstrating or testing model-related packages are contained in this package.
Last updated 3 months ago
10.88 score 22 stars 14 dependents 2.1k scripts 27k downloadsfinetune - Additional Functions for Model Tuning
The ability to tune models is important. 'finetune' enhances the 'tune' package by providing more specialized methods for finding reasonable values of model tuning parameters. Two racing methods described by Kuhn (2014) <arXiv:1405.6974> are included. An iterative search method using generalized simulated annealing (Bohachevsky, Johnson and Stein, 1986) <doi:10.1080/00401706.1986.10488128> is also included.
Last updated 5 months ago
8.64 score 62 stars 1 dependents 708 scripts 1.2k downloadstidyposterior - Bayesian Analysis to Compare Models using Resampling Statistics
Bayesian analysis used here to answer the question: "when looking at resampling results, are the differences between models 'real'?" To answer this, a model can be created were the performance statistic is the resampling statistics (e.g. accuracy or RMSE). These values are explained by the model types. In doing this, we can get parameter estimates for each model's affect on performance and make statistical (and practical) comparisons between models. The methods included here are similar to Benavoli et al (2017) <https://jmlr.org/papers/v18/16-305.html>.
Last updated 3 months ago
8.42 score 102 stars 257 scripts 345 downloadsbaguette - Efficient Model Functions for Bagging
Tree- and rule-based models can be bagged (<doi:10.1007/BF00058655>) using this package and their predictions equations are stored in an efficient format to reduce the model objects size and speed.
Last updated 3 months ago
8.05 score 25 stars 1 dependents 624 scripts 779 downloadsmodeldb - Fits Models Inside the Database
Uses 'dplyr' and 'tidyeval' to fit statistical models inside the database. It currently supports KMeans and linear regression models.
Last updated 1 years ago
databasedbplyrdplyrggplot2modelingrlangsqltidyevalvisualization
7.59 score 79 stars 62 scripts 231 downloadsbrulee - High-Level Modeling Functions with 'torch'
Provides high-level modeling functions to define and train models using the 'torch' R package. Models include linear, logistic, and multinomial regression as well as multilayer perceptrons.
Last updated 3 months ago
7.48 score 67 stars 214 scripts 459 downloadsAmesHousing - The Ames Iowa Housing Data
Raw and processed versions of the data from De Cock (2011) <http://ww2.amstat.org/publications/jse> are included in the package.
Last updated 5 years ago
7.08 score 14 stars 2 dependents 540 scripts 1.3k downloadsusemodels - Boilerplate Code for 'Tidymodels' Analyses
Code snippets to fit models using the tidymodels framework can be easily created for a given data set.
Last updated 3 months ago
6.90 score 84 stars 134 scripts 243 downloadsAppliedPredictiveModeling - Functions and Data Sets for 'Applied Predictive Modeling'
A few functions and several data set for the Springer book 'Applied Predictive Modeling'.
Last updated 1 years ago
6.88 score 36 stars 1.2k scripts 3.6k downloadsplsmod - Model Wrappers for Projection Methods
Bindings for additional regression models for use with the 'parsnip' package, including ordinary and spare partial least squares models for regression and classification (Rohart et al (2017) <doi:10.1371/journal.pcbi.1005752>).
Last updated 3 months ago
mixomics
6.47 score 14 stars 1 dependents 58 scripts 526 downloadssfd - Space-Filling Design Library
A collection of pre-optimized space-filling designs, for up to ten parameters, is contained here. Functions are provided to access designs described by Husslage et al (2011) <doi:10.1007/s11081-010-9129-8> and Wang and Fang (2005) <doi:10.1142/9789812701190_0040>. The design types included are Audze-Eglais, MaxiMin, and uniform.
Last updated 1 years ago
6.25 score 52 dependents 5 scripts 23k downloadssparseLDA - Sparse Discriminant Analysis
Performs sparse linear discriminant analysis for Gaussians and mixture of Gaussian models.
Last updated 8 years ago
5.45 score 7 stars 3 dependents 45 scripts 807 downloadsdesirability - Function Optimization and Ranking via Desirability Functions
S3 classes for multivariate optimization using the desirability function by Derringer and Suich (1980).
Last updated 8 years ago
4.74 score 1 stars 1 dependents 37 scripts 480 downloadsdesirability2 - Desirability Functions for Multiparameter Optimization
In-line functions for multivariate optimization via desirability functions (Derringer and Suich, 1980, <doi:10.1080/00224065.1980.11980968>) with easy use within `dplyr` pipelines.
Last updated 3 months ago
4.53 score 10 stars 17 scripts 207 downloadssparsediscrim - Sparse and Regularized Discriminant Analysis
A collection of sparse and regularized discriminant analysis methods intended for small-sample, high-dimensional data sets. The package features the High-Dimensional Regularized Discriminant Analysis classifier from Ramey et al. (2017) <arXiv:1602.01182>. Other classifiers include those from Dudoit et al. (2002) <doi:10.1198/016214502753479248>, Pang et al. (2009) <doi:10.1111/j.1541-0420.2009.01200.x>, and Tong et al. (2012) <doi:10.1093/bioinformatics/btr690>.
Last updated 4 years ago
4.11 score 3 stars 86 scripts 648 downloadsbeans - Data on Dried Beans
These data contain morphological image measurements for dried beans from Koklu and Ozkan (2020) <doi:10.1016/j.compag.2020.105507>.
Last updated 4 years ago
3.51 score 1 stars 65 scripts 229 downloadsQSARdata - Quantitative Structure Activity Relationship (QSAR) Data Sets
Molecular descriptors and outcomes for several public domain data sets
Last updated 14 years ago
2.83 score 68 scripts 515 downloads