caret - Classification and Regression Training
Misc functions for training and plotting classification and regression models.
Last updated 2 years ago
18.87 score 1.6k stars 287 packages 56k scripts 202k downloads
recipes - Preprocessing and Feature Engineering Steps for Modeling
A recipe prepares your data for modeling. We provide an extensible framework for pipeable sequences of feature engineering steps that provide preprocessing tools to be applied to data. Statistical parameters for the steps can be estimated from an initial data set and then applied to other data sets. The resulting processed output can then be used as input for statistical or machine learning models.
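The estimate-then-apply workflow described above can be sketched as follows. This is a minimal illustration, not an example taken from the package listing: the built-in `mtcars` data, the row split, and the choice of `step_normalize()` are assumptions made for demonstration.

```r
library(recipes)

# Assumed split of a built-in data set into an "initial" set and an "other" set.
train <- mtcars[1:24, ]
test  <- mtcars[25:32, ]

# Define the sequence of steps; nothing is estimated at this point.
rec <- recipe(mpg ~ ., data = train) |>
  step_normalize(all_numeric_predictors())

# prep() estimates the means and standard deviations from `train` only.
rec_trained <- prep(rec, training = train)

# bake() applies those stored statistics to another data set.
test_processed <- bake(rec_trained, new_data = test)

# new_data = NULL returns the processed version of the training set itself.
train_processed <- bake(rec_trained, new_data = NULL)
```

Because the statistics come from `train` alone, the predictors in `test_processed` are centered and scaled with the training means and standard deviations, not their own.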
Last updated 8 days ago
18.57 score 570 stars 360 packages 6.8k scripts 173k downloads
parsnip - A Common API to Modeling and Analysis Functions
A common interface is provided to allow users to specify a model without having to remember the different argument names across different functions or computational engines (e.g. 'R', 'Spark', 'Stan', or 'H2O').
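As a rough sketch of what a common interface looks like in practice, the model type is declared first and the computational engine is chosen separately. The data set (`mtcars`) and the formula are illustrative assumptions, not part of the listing.

```r
library(parsnip)

# Declare the model type, then pick the engine that will fit it.
spec <- linear_reg() |>
  set_engine("lm")

# Fit with a formula; the same call shape is used regardless of engine.
fitted <- fit(spec, mpg ~ wt + hp, data = mtcars)

# Predictions come back as a tibble with a standardized `.pred` column.
preds <- predict(fitted, new_data = mtcars[1:3, ])
```

Switching to another engine would, under this design, mean changing only the `set_engine()` call, provided the corresponding engine package is installed.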
Last updated 15 days ago
16.21 score 597 stars 64 packages 3.1k scripts 32k downloads
tidymodels - Easily Install and Load the 'Tidymodels' Packages
The tidy modeling "verse" is a collection of packages for modeling and statistical analysis that share the underlying design philosophy, grammar, and data structures of the tidyverse.
Last updated 1 month ago
16.06 score 771 stars 12 packages 44k scripts 39k downloads
tune - Tidy Tuning Tools
The ability to tune models is important. 'tune' contains functions and classes to be used in conjunction with other 'tidymodels' packages for finding reasonable values of hyper-parameters in models, pre-processing methods, and post-processing steps.
Last updated 18 hours ago
14.24 score 282 stars 34 packages 716 scripts 29k downloads
C50 - C5.0 Decision Trees and Rule-Based Models
C5.0 decision trees and rule-based models for pattern recognition that extend the work of Quinlan (1993, ISBN:1-55860-238-0).
Last updated 2 years ago
12.10 score 50 stars 12 packages 1.2k scripts 19k downloads
probably - Tools for Post-Processing Predicted Values
Models can be improved by post-processing class probabilities through recalibration, conversion to hard probabilities, assessment of equivocal zones, and other activities. 'probably' contains tools for conducting these operations as well as calibration tools and conformal inference techniques for regression models.
Last updated 1 month ago
11.64 score 115 stars 21k scripts 2.1k downloads
Cubist - Rule- And Instance-Based Regression Modeling
Regression modeling using rules with added instance-based corrections.
Last updated 5 months ago
11.49 score 39 stars 17 packages 2.8k scripts 28k downloads
modeldata - Data Sets Useful for Modeling Examples
Data sets used for demonstrating or testing model-related packages are contained in this package.
Last updated 1 month ago
10.92 score 22 stars 14 packages 2.3k scripts 29k downloads
finetune - Additional Functions for Model Tuning
The ability to tune models is important. 'finetune' enhances the 'tune' package by providing more specialized methods for finding reasonable values of model tuning parameters. Two racing methods described by Kuhn (2014) <arXiv:1405.6974> are included. An iterative search method using generalized simulated annealing (Bohachevsky, Johnson and Stein, 1986) <doi:10.1080/00401706.1986.10488128> is also included.
Last updated 3 months ago
8.46 score 62 stars 708 scripts 1.6k downloads
tidyposterior - Bayesian Analysis to Compare Models using Resampling Statistics
Bayesian analysis is used here to answer the question: "when looking at resampling results, are the differences between models 'real'?" To answer this, a model can be created where the outcome is the resampling performance statistic (e.g. accuracy or RMSE). These values are explained by the model types. In doing this, we can get parameter estimates for each model's effect on performance and make statistical (and practical) comparisons between models. The methods included here are similar to Benavoli et al. (2017) <https://jmlr.org/papers/v18/16-305.html>.
Last updated 1 month ago
8.39 score 102 stars 238 scripts 433 downloads
modeldb - Fits Models Inside the Database
Uses 'dplyr' and 'tidyeval' to fit statistical models inside the database. It currently supports KMeans and linear regression models.
Last updated 11 months ago
database dbplyr dplyr ggplot2 modeling rlang sql tidyeval visualization
7.89 score 79 stars 62 scripts 241 downloads
baguette - Efficient Model Functions for Bagging
Tree- and rule-based models can be bagged (<doi:10.1007/BF00058655>) using this package, and their prediction equations are stored in an efficient format to reduce the size of the model objects and improve speed.
Last updated 1 month ago
7.53 score 25 stars 566 scripts 922 downloads
brulee - High-Level Modeling Functions with 'torch'
Provides high-level modeling functions to define and train models using the 'torch' R package. Models include linear, logistic, and multinomial regression as well as multilayer perceptrons.
Last updated 1 month ago
7.47 score 67 stars 212 scripts 661 downloads
AmesHousing - The Ames Iowa Housing Data
Raw and processed versions of the data from De Cock (2011) <http://ww2.amstat.org/publications/jse> are included in the package.
Last updated 4 years ago
7.42 score 13 stars 2 packages 530 scripts 3.2k downloads
usemodels - Boilerplate Code for 'Tidymodels' Analyses
Code snippets to fit models using the tidymodels framework can be easily created for a given data set.
Last updated 1 month ago
6.89 score 84 stars 132 scripts 292 downloads
AppliedPredictiveModeling - Functions and Data Sets for 'Applied Predictive Modeling'
A few functions and several data sets for the Springer book 'Applied Predictive Modeling'.
Last updated 1 year ago
6.85 score 36 stars 1.1k scripts 3.7k downloads
sfd - Space-Filling Design Library
A collection of pre-optimized space-filling designs, for up to ten parameters, is contained here. Functions are provided to access designs described by Husslage et al (2011) <doi:10.1007/s11081-010-9129-8> and Wang and Fang (2005) <doi:10.1142/9789812701190_0040>. The design types included are Audze-Eglais, MaxiMin, and uniform.
Last updated 11 months ago
6.60 score 48 packages 5 scripts 27k downloads
plsmod - Model Wrappers for Projection Methods
Bindings for additional regression models for use with the 'parsnip' package, including ordinary and sparse partial least squares models for regression and classification (Rohart et al. (2017) <doi:10.1371/journal.pcbi.1005752>).
Last updated 1 month ago
mixomics
5.97 score 14 stars 55 scripts 408 downloads
sparseLDA - Sparse Discriminant Analysis
Performs sparse linear discriminant analysis for Gaussian and mixture-of-Gaussians models.
Last updated 8 years ago
5.06 score 7 stars 3 packages 36 scripts 1.0k downloads
desirability - Function Optimization and Ranking via Desirability Functions
S3 classes for multivariate optimization using the desirability function by Derringer and Suich (1980).
Last updated 8 years ago
4.73 score 1 star 1 package 36 scripts 780 downloads
desirability2 - Desirability Functions for Multiparameter Optimization
In-line functions for multivariate optimization via desirability functions (Derringer and Suich, 1980, <doi:10.1080/00224065.1980.11980968>) with easy use within `dplyr` pipelines.
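For orientation, the "larger is better" desirability function in the Derringer and Suich formulation can be written as below. The symbols are notation assumed here for illustration (response $y_i$, lower limit $L_i$, target $T_i$, scale exponent $s_i$, number of responses $k$); they are not identifiers from either package.

$$
d_i(y_i) =
\begin{cases}
0, & y_i < L_i \\
\left( \dfrac{y_i - L_i}{T_i - L_i} \right)^{s_i}, & L_i \le y_i \le T_i \\
1, & y_i > T_i
\end{cases}
\qquad
D = \left( \prod_{i=1}^{k} d_i \right)^{1/k}
$$

Each response is mapped to the interval $[0, 1]$, and the overall desirability $D$ is the geometric mean, so a single unacceptable response ($d_i = 0$) makes the whole candidate unacceptable.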
Last updated 30 days ago
4.53 score 10 stars 17 scripts 190 downloads
sparsediscrim - Sparse and Regularized Discriminant Analysis
A collection of sparse and regularized discriminant analysis methods intended for small-sample, high-dimensional data sets. The package features the High-Dimensional Regularized Discriminant Analysis classifier from Ramey et al. (2017) <arXiv:1602.01182>. Other classifiers include those from Dudoit et al. (2002) <doi:10.1198/016214502753479248>, Pang et al. (2009) <doi:10.1111/j.1541-0420.2009.01200.x>, and Tong et al. (2012) <doi:10.1093/bioinformatics/btr690>.
Last updated 3 years ago
4.17 score 3 stars 85 scripts 1.2k downloads
beans - Data on Dried Beans
These data contain morphological image measurements for dried beans from Koklu and Ozkan (2020) <doi:10.1016/j.compag.2020.105507>.
Last updated 3 years ago
3.48 score 1 star 60 scripts 241 downloads
QSARdata - Quantitative Structure Activity Relationship (QSAR) Data Sets
Molecular descriptors and outcomes for several public domain data sets.
Last updated 14 years ago
2.83 score 68 scripts 640 downloads