Get the data

readr

the goal of readr is to provide a fast and friendly way to read rectangular data. readr - vignettes

rio

The idea behind rio is to simplify the process of importing data into R and exporting data from R. rio - vignettes

dplyr

dplyr is a grammar of data manipulation, providing a consistent set of verbs that help you solve the most common data manipulation challenges dplyr - website dplyr - vignettes

MonetDBLite

MonetDBLite for R, a fully embedded version of MonetDB that installs like any other R package.

bigreadr

Convert a CSV to SQLite by parts

bigreadr - website

Data Trasformation

forcats

The goal of the forcats package is to provide a suite of useful tools that solve common problems with factors.

forcats - website

Simulating dataset

Xy

Simulating Supervised Learning Data Xy – github

synthpop

A tool for producing synthetic versions of microdata containing confidential information synthpop – blogpost

holodeck

A Tidy Interface For Simulating Multivariate Data holodeck – github

fakir

Create Fake Data in R for tutorials fakir – github

Explorative data analysis

skimr

display summary statistics that the user can skim quickly to understand their data.

skimr - githun

SmartEDA

Summarize and Explore the Data

SmartEDA - vignettes

FunModeling

funModeling: data cleaning, importance variable analysis and model perfomance

funModeling - github

DataExplorer

Data exploration process for data analysis and model building, so that users could focus on understanding data and extracting insights. The package automatically scans through each variable and does data profiling.

DataExplorer - vignettes DataExplorer - github

naniar

naniar provides data structures and functions that facilitate the plotting of missing values and examination of imputations.

naniar - website naniar - github

dataMaid

Generating codebooks in R

Experimental Design

  • edibble facilitates statistical thinking of adapting experimental designs to different conditions by implementing the grammar of experimental design to generate new designs

edibble - github

Dimension Reduction (PCA)

FactoMineR

FactoMineR is an R package dedicated to multivariate Exploratory Data Analysis.

FactoMineR - website

umap

R implementation of Uniform Manifold Approximation and Projection

umap - github

Clustering

fpc

Various methods for clustering and cluster validation.

gower

Gower’s distance can be used to measure how different two records are.

gower - vignettes

ggbiplot

An implementation of the biplot using ggplot2. The package provides two functions: ggscreeplot() and ggbiplot().

ggbiplot - github

dimRed

A collection of dimensionality reduction techniques from R packages and a common interface for calling the methods.

dimRed - CRAN

Visualization

ggplot2

ggplot2 is a system for declaratively creating graphics, based on The Grammar of Graphics.

ggplot2 - website

GGally

GGally extends ggplot2 by adding several functions to reduce the complexity of combining geoms with transformed data.

GGally - website

ggpubr

The ggpubr package provides some easy-to-use functions for creating and customizing ‘ggplot2’- based publication ready plots.

ggpubr - sthda website

ggrepel

Repel overlapping text labels away from each other

ggrepel - vignettes

ggrough

’ggrough’ is an R package that converts your ggplot2 plots to rough/sketchy charts, using the excellent javascript roughjs library

ggrough – website

tabplot

The tableplot is a powerful visualization method to explore and analyse large multivariate datasets

tabplot - vignettes

ggstatsplot

creates graphics with details from statistical tests ggstatsplot - github

plotly

An R package for creating interactive web graphics.

plotly - github

patchwork

The goal of patchwork is to make it ridiculously simple to combine separate ggplots into the same graphic. patchwork - github

factoextra

factoextra is an R package making easy to extract and visualize the output of exploratory multivariate data analyses

factoextra - sthda website

ggiraphExtra

ggiraphExtra contains many useful functions for exploratoty plots.

ggiraphExtra - vignettes

r2d3

R Interface to D3 Visualizations

r2d3 – webiste

xda

R package for exploratory data analysis

xda - github

gghighlight

Highlight points and lines in ggplot2

gghighlight – github

bbplot

It helps create and export ggplot2 charts in the style used by the BBC News data team

bbplot – site

Machine Learning

SuperML

SuperML - website

caret

The caret package (short for Classification And REgression Training) contains functions to streamline the model training process for complex regression and classification problems. caret - vignettes

mlr

mlr is a framework for machine learning experiments in R. mlr - website mlr - blog

Metrics

An R package for common supervised machine learning metrics Metrics - github

iml

The iml package provides tools for analysing any black box machine learning model. iml - vignettes

lime

The purpose of lime is to explain the predictions of black box classifiers lime - vignettes lime - github

xgboostExplaine

An R package that makes xgboost models fully interpretable xgboostExplainer - tutorial

shapper

The SHAP method is used to calculate influences of variables on the particular observation.

shapper - blogpost

shinyMlr

Integration of the mlr package into shiny. With help of this package mlr can be accessed via a shiny interface.

shinyMlr - github shinyMlr - video tutorial

bounceR

bounceR provides methods and tools for automated feature selection for Machine Learning models. The methods are fit for situation in which the data scientists faces an exceedingly high number of features.

bounceR - github )

recipes

recipes is an alternative method for creating and preprocessing design matrices that can be used for modeling or visualization. recipes - github

modelDown

modelDown generates a website with HTML summaries for predictive models modelDown - vignettes

dalex

DALEX is a set of tools that help to understand how complex models are working. dalex – website

iBreakDown

Model Agnostic Explainers for Individual Predictions The iBreakDown package is a model agnostic tool for explanation of predictions from black boxes ML models. iBreakDown – website

rpart.plot

It extends the functions in the rpart package rpart.plot – website

Association rules

arules

Mining Association Rules and Frequent Itemsets with R arules - github

Time series

tsfknn

The package allows, with only a function, to specify the KNN model and to generate the forecasts. tsfknn - vignettes

prophet

Prophet is a procedure for forecasting time series data based on an additive model where non-linear trends are fit with yearly, weekly, and daily seasonality, plus holiday effects. prohet - website

breakDown

Model Agnostic Explainers for Individual Predictions

breakdown - website

ggTimeSeries

This R package offers novel time series visualisations. It is based on ggplot2 and offers geoms and pre-packaged functions for easily creating any of the offered charts. Some examples are listed below. ggTimeSeries - vignettes

tsbox

tsbox: Class-Agnostic Time Series in R

tsbox - website

tsutils

The tsutils package for R provides functions to support various aspects of time series and forecasting modelling.

tsutils - github

R Interface

sparkryl

sparklyr is an R interface for Apache Spark

sparklyr - website

RWeka

An R interface to Weka. Weka is a collection of machine learning algorithms for data mining tasks written in Java.

rbokeh

rbokeh is an R Interface for Bokeh. Bokeh is a visualization library that provides a flexible and powerful declarative framework for creating web-based plots rbokeh - vignettes

Spatial Analysis

cartography

This package allows various cartographic representations such as proportional symbols, choropleth, typology, flows or discontinuities maps. cartography – github

Deep Learning interface

h2o

R scripting functionality for ‘H2O’, the open source math engine for big data that computes parallel distributed machine learning algorithms such as generalized linear models, gradient boosting machines, random forests, and neural networks (deep learning) within various cluster environments.

h2o - website

h2o - tutorial

Keras

R interface to Keras keras - website

Big-Data

ff

The ff package provides data structures that are stored on disk but behave (almost) as if they were in RAM. ff - website

replyr

The replyr package attempts to provide practical data manipulation affordances to make code perform similarly on local or remote (big) data. replyr - github

disk.frame

Fast disk-based parallelized data manipulation framework for larger-than-RAM data

disk.frame - github

Deploy

aurelius

Provides tools for converting R objects and syntax into the Portable Format for Analytics (PFA). aurelius - vignettes

plumber

It converts your existing R code to a web API using a handful of special one-line comments. plumber – website

Shiny

shiny

shiny makes it easy to build interactive web apps straight from R shiny - website

shinydashboard

shinydashboard makes it easy to use Shiny to create dashboards shinydashboard - website

argonDash

Argon dashboard template argomDash – website

shinyproxy

ShinyProxy is your favourite way to deploy Shiny apps in an enterprise context. shinyproxy - website

shinyLP

A R package for Bootstrap Components to make Landing Home Pages for Shiny shinyLP - github

shinytemplate

RStudio Project Templates for Shinyapps shinytemplate - github

shinymodule

A Shiny module is a piece of a Shiny app. shinymodules - shiny website

dashboardthemes

custom theme support for R Shinydashboard applications dashboardthemes - github

fontawesome

The fontawesome package makes it very easy to insert FontAwesome icons within R Markdown documents and Shiny apps. fontawesome - github

shinyWidgets

Extend widgets available in shiny

shinyWidgets - website

shinydashboardPlus

Extensions for shinydashboard shinydashbordpPlus – website

bs4Dash

Bootstrap 4 shinydashboard using AdminLTE3 bs4Dash – website

yonder

A Reactive Web Framework Built on Shiny yonder - website

auth0

The goal of auth0 is to implement an authentication scheme to Shiny using OAuth Apps through the freemium service Auth0. auth0 - website

shinyEffects

Customize your shiny apps with these nice CSS effects shinyEffects - github

tablerDash

A new super light ✨shiny dashboard template. tablerDash - website

dragulaR

Drag’n’drop elements with dragulaR dragulaR - website

gglabeller

Shiny gadget for labeling points on ggplot gglabeller - github

ggiraph

Make ‘ggplot’ Graphics Interactive ggiraph – github

excelR

An R interface to jExcel library to create web-based interactive tables and spreadsheets compatible with ‘Excel’ or any other spreadsheet software. excelR – github

Reporting & Publishing results

rmarkdorwn

Convert R Markdown documents into a variety of formats. rmarkdown - website

icon

web icons for rmarkdown icon - website

officer

Access and manipulate ‘Microsoft Word’ and ‘Microsoft PowerPoint’ documents from R. officer - vignettes

kableExtra

The goal of kableExtra is to help you build common complex tables and manipulate table styles.

kableExtra - website

slidex

slidex is aimed at making the process of converting Microsoft PowerPoint slides to beautiful HTML xaringan slides as seamless as possible, maintaining tables,figures, links, and bulleted lists. slidex - github

xaringan

xaringan - github

gt

Make wonderful-looking tables with the gt package

gt- github

ODE & Quality Control

SixSigma

Functions and utilities to perform Statistical Analyses in the Six Sigma way.

SixSigma with R - ebook website Quality Control with R - ebook

qcc

qcc is a contributed R package for statistical quality control charts qcc - vignettes

ggQC

Plot single and multi-facet quality control charts within the ggplot framework using stat_QC and stat_mR. Calculate quality control limits using QC_Lines. ggQC - github

R API

dockerfiler

dockerfiler is a package that comes with a simple, object oriented API for Dockerfile creation, straight from your R session.

tutorial

cloudML

cloudml provides an R interface to Google Cloud Machine Learning Engine cloudML - website

googleComputeEngineR

provides an R interface to the Google Cloud Compute Engine API, for launching virtual machines. googleComputeEngineR - github

IDE

Rcode

R Analyticflow

Productivity

packagefinder

With ‘packagefinder’ you can search for multiple keywords in the name, title and description of the CRAN package, either case-sensitive or insensitive and define your own weighting scheme for the search results, if you like. packagefinder - CRAN

fs

fs provides a cross-platform, uniform interface to file system operations. fs uses libuv under the hood, which gives a rock solid cross-platform interface to the filesystem. fs - website

pacman

A package management tools for R pacman - github

RStudio Addin

caretTuneGrid

The caretTuneGrid is the RStudio addin for inserting tune parameters code for the caret package.

caretTuneGrid - github.

esquisse

RStudio add-in to make plots with ggplot2

esquisse - github

editData

RStudio addin for editing a ‘data.frame’ or a ‘tibble’

Editing A ‘data.frame’

Rcade

The goal of Rcade is to provide access to games in order to waste some times in RStudio. You can play 2048, Tetris, Mario, Pacman, Spider Solitaire and others.

taskscheduleR

Schedule R scripts/processes with the Windows task scheduler.

taskscheduleR - vignettes

R for FUN

hexSticker

Create hexagon sticker in R