![](https://github.com/tidymodels/bonsai/raw/HEAD/man/figures/logo.png)
bonsai - Model Wrappers for Tree-Based Models
Bindings for additional tree-based model engines for use with the 'parsnip' package. Models include gradient boosted decision trees with 'LightGBM' (Ke et al, 2017.), conditional inference trees and conditional random forests with 'partykit' (Hothorn and Zeileis, 2015. and Hothorn et al, 2006. <doi:10.1198/106186006X133933>), and accelerated oblique random forests with 'aorsf' (Jaeger et al, 2022 <doi:10.5281/zenodo.7116854>).
Last updated 4 days ago
45 stars 3.23 score 44 dependencies![](https://github.com/tidymodels/stacks/raw/HEAD/man/figures/logo.png)
stacks - Tidy Model Stacking
Model stacking is an ensemble technique that involves training a model to combine the outputs of many diverse statistical models, and has been shown to improve predictive performance in a variety of settings. 'stacks' implements a grammar for 'tidymodels'-aligned model stacking.
Last updated 24 days ago
285 stars 6.21 score 88 dependencies 1 dependents![](https://github.com/tidymodels/workflows/raw/HEAD/man/figures/logo.png)
workflows - Modeling Workflows
Managing both a 'parsnip' model and a preprocessor, such as a model formula or recipe from 'recipes', can often be challenging. The goal of 'workflows' is to streamline this process by bundling the model alongside the preprocessor, all within the same object.
Last updated 1 months ago
200 stars 6.55 score 42 dependencies 36 dependents![](https://github.com/tidymodels/broom/raw/HEAD/man/figures/logo.png)
broom - Convert Statistical Objects into Tidy Tibbles
Summarizes key information about statistical objects in tidy tibbles. This makes it easy to report results, create plots and consistently work with large numbers of models at once. Broom provides three verbs that each provide different types of information about a model. tidy() summarizes information about model components such as coefficients of a regression. glance() reports information about an entire model, such as goodness of fit measures like AIC and BIC. augment() adds information about individual observations to a dataset, such as fitted values or influence measures.
Last updated 2 months ago
modelingtidy-data
1.4k stars 14.47 score 22 dependencies 1432 dependentsshinymodels - Interactive Assessments of Models
Launch a 'shiny' application for 'tidymodels' results. For classification or regression models, the app can be used to determine if there is lack of fit or poorly predicted points.
Last updated 2 months ago
shiny
46 stars 3.19 score 123 dependencies![](https://github.com/tidymodels/infer/raw/HEAD/man/figures/logo.png)
infer - Tidy Statistical Inference
The objective of this package is to perform inference using an expressive statistical grammar that coheres with the tidy design framework.
Last updated 3 months ago
707 stars 8.45 score 39 dependencies 13 dependentsworkflowsets - Create a Collection of 'tidymodels' Workflows
A workflow is a combination of a model and preprocessors (e.g, a formula, recipe, etc.) (Kuhn and Silge (2021) <https://www.tmwr.org/>). In order to try different combinations of these, an object can be created that contains many workflows. There are functions to create workflows en masse as well as training them and visualizing the results.
Last updated 4 months ago
88 stars 5.12 score 84 dependencies 16 dependentsdetectors - Prediction Data from GPT Detectors
Researchers carried out a series of experiments passing a number of essays to different GPT detection models. Juxtaposing detector predictions for papers written by native and non-native English writers, the authors argue that GPT detectors disproportionately classify real writing from non-native English writers as AI-generated.
Last updated 5 months ago
7 stars 1.76 score 0 dependenciesgbfs - Interface with Live Bikeshare Data
Supplies a set of functions to interface with bikeshare data following the General Bikeshare Feed Specification, allowing users to query and accumulate tidy datasets for specified cities/bikeshare programs.
Last updated 5 months ago
36 stars 2.71 score 39 dependenciesreadmission - Hospital Readmission Data for Patients with Diabetes
Clinical care data from 130 U.S. hospitals in the years 1999-2008. Each row describes an "encounter" with a patient with diabetes, including variables on demographics, medications, patient history, diagnostics, payment, and readmission.
Last updated 7 months ago
1 stars 0.92 score 0 dependencies![](https://github.com/simonpcouch/anyflights/raw/HEAD/man/figures/logo.png)
anyflights - Query 'nycflights13'-Like Air Travel Data for Given Years and Airports
Supplies a set of functions to query air travel data for user- specified years and airports. Datasets include on-time flights, airlines, airports, planes, and weather.
Last updated 10 months ago
45 stars 3.15 score 67 dependencies