[go: up one dir, main page]

Open Source R Software

R Software

R Clear Filters

Browse free open source R Software and projects below. Use the toggles on the left to filter open source R Software by OS, license, language, programming language, and project status.

  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • La version gratuite d'Auth0 s'enrichit ! Icon
    La version gratuite d'Auth0 s'enrichit !

    Gratuit pour 25 000 utilisateurs avec intégration Okta illimitée : concentrez-vous sur le développement de vos applications.

    Vous l'avez demandé, nous l'avons fait ! Les versions gratuite et payante d'Auth0 incluent des options qui vous permettent de développer, déployer et faire évoluer vos applications en toute sécurité. Utilisez Auth0 dès maintenant pour découvrir tous ses avantages.
    Essayez Auth0 gratuitement
  • 1
    ggplot2

    ggplot2

    An implementation of the Grammar of Graphics in R

    ggplot2 is a system written in R for declaratively creating graphics. It is based on The Grammar of Graphics, which focuses on following a layered approach to describe and construct visualizations or graphics in a structured manner. With ggplot2 you simply provide the data, tell ggplot2 how to map variables to aesthetics, what graphical primitives to use, and it will take care of the rest. ggplot2 is over 10 years old and is used by hundreds of thousands of people all over the world for plotting. In most cases using ggplot2 starts with supplying a dataset and aesthetic mapping (with aes()); adding on layers (like geom_point() or geom_histogram()), scales (like scale_colour_brewer()), and faceting specifications (like facet_wrap()); and finally, coordinating systems. ggplot2 has a rich ecosystem of community-maintained extensions for those looking for more innovation. ggplot2 is a part of the tidyverse, an ecosystem of R packages designed for data science.
    Downloads: 43 This Week
    Last Update:
    See Project
  • 2
    Statistics for Data Scientists

    Statistics for Data Scientists

    "Statistics for Data Scientists: 50 Essential Concepts"

    The “statistics-for-data-scientists” repository is a pedagogical resource designed to bridge rigorous statistics theory and practical data science workflows. The code and materials are intended to help data scientists and analysts grasp statistical principles (e.g. inference, regressions, hypothesis testing, probability, confidence intervals) in contexts relevant to real data analysis tasks. The repository includes Jupyter notebooks, R scripts, worked examples, and possibly problem sets that illustrate how statistical methods are applied to real datasets. It aims to demystify the bridge between textbook statistics and empirical modeling by walking through assumption checking, visualization, interpreting outputs, and pitfalls of misuse. Throughout, the content emphasizes clarity and accessibility, showing not just how to run statistical tests or build models, but what they mean and when one method is preferred over another.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 3
    Data Analysis for the Life Sciences

    Data Analysis for the Life Sciences

    Rmd source files for the HarvardX series PH525x

    This repository holds the R Markdown (.Rmd) source files for the PH525x / HarvardX course series (Data Analysis for the Life Sciences / Genomics) managed by GenomicsClass. It functions as the canonical source for course lab exercises, lecture modules, and reading materials in reproducible format. Students and learners use these R Markdown files to follow along, knit notebooks, run code samples, and complete the lab-based assignments. The repo is licensed under MIT, allowing reuse and modification. It is part of a larger ecosystem: the compiled HTML / book version of the labs is published via a companion “book” repository, which presents a polished, browsable version of the materials. The content covers topics such as data wrangling in R, statistical inference, genomics workflows, Bioconductor packages, and project-based analyses. Because it’s open and modular, contributors can suggest improvements, update modules, or add new exercises.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 4
    Statistical Rethinking 2024

    Statistical Rethinking 2024

    This course teaches data analysis

    The 2024 repository is the most recent version of the course, reflecting ongoing refinements in pedagogy, statistical modeling techniques, and coding practices. It provides updated notebooks, R scripts, and model examples, some streamlined and restructured compared to previous years. The 2024 repo also highlights the transition toward more robust Stan models and integration with newer Bayesian workflow practices, continuing to emphasize accessibility for learners while modernizing the tools. This version is designed for students following the 2024 lecture series, offering the most current set of examples, exercises, and teaching material aligned with the Statistical Rethinking framework. Online, flipped instruction. I will pre-record the lectures each week. We'll meet online once a week for an hour to discuss the material. The discussion time (3-4pm Berlin Time) should allow people in the Americas to join in their morning.
    Downloads: 9 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    generativeart

    generativeart

    Create Generative Art with R

    generativeart is an R package for creating algorithmic art by computing the positions of many thousands of points according to user-defined mathematical formulas with randomized parameters. Each render uses a seed to introduce controlled randomness, so every image is unique while remaining reproducible when the same seed and formula are reused. The package logs the seed, formula, and file name to a CSV, which makes it easy to catalog outputs, re-generate favorites, and track experiments. A small helper sets up a simple directory scaffold for “everything” versus “handpicked” images and a logfile folder, encouraging a tidy, iterative workflow. Rendering is performed with ggplot2, and users can select coordinate systems (Cartesian or polar), foreground/background colors, number of images to generate, and output format (PNG by default with other devices available).
    Downloads: 9 This Week
    Last Update:
    See Project
  • 6
    Introduction to Zig

    Introduction to Zig

    An open, technical and introductory book for the Zig programming lang

    This is the official repository for the book "Introduction to Zig: a project-based Book", written by Pedro Duarte Faria. To know more about the book, check out the About this book section below. You can read the current version of the book in your web browser. The book is built using the publishing system Quarto in conjunction with a little bit of R code (zig_engine.R), which is responsible for calling the Zig compiler to compile and run the Zig code examples.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 7
    Statistical Rethinking 2022

    Statistical Rethinking 2022

    Statistical Rethinking course winter 2022

    This repository hosts the 2022 version of the Statistical Rethinking course. It contains course materials such as R scripts, notebooks, and worked examples aligned with McElreath’s textbook. The code emphasizes Bayesian data analysis using R, the rethinking package, and Stan models. It includes lecture code files, example datasets, and structured exercises that parallel the topics covered in the lectures (probability, regression, model comparison, Bayesian updating). The repo functions as a direct hands-on reference for students following the 2022 recorded lecture series. There are 10 weeks of instruction. Links to lecture recordings will appear in this table. Weekly problem sets are assigned on Fridays and due the next Friday, when we discuss the solutions in the weekly online meeting.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 8
    Statistical Rethinking 2023

    Statistical Rethinking 2023

    Statistical Rethinking Course for Jan-Mar 2023

    The 2023 edition modernizes and expands on the same curriculum, adjusting exercises and code for newer versions of R, Stan, and supporting packages. It continues to provide scripts for lectures and tutorials, while integrating refinements to examples, notation, and computational workflows introduced that year. Compared with 2022, some models are rewritten for clarity, and teaching materials reflect refinements in McElreath’s evolving presentation of Bayesian data analysis. Students following the 2023 lecture videos use this repository as their coding reference. There are 10 weeks of instruction. Links to lecture recordings will appear in this table. Weekly problem sets are assigned on Fridays and due the next Friday, when we discuss the solutions in the weekly online meeting.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 9
    NYC Taxi Data

    NYC Taxi Data

    Import public NYC taxi and for-hire vehicle (Uber, Lyft)

    The nyc-taxi-data repository is a rich dataset and exploratory project around New York City taxi trip records. It collects and preprocesses large-scale trip datasets (fares, pickup/dropoff, timestamps, locations, passenger counts) to enable data analysis, modeling, and visualization efforts. The project includes scripts and notebooks for cleaning and filtering the raw data, memory-efficient processing for large CSV/Parquet files, and aggregation workflows (e.g. trips per hour, heatmaps of pickups/dropoffs). It also contains example analyses—spatial and temporal visualizations like maps, time-series plots, and hotspot detection—highlighting insights such as patterns of demand, peak times, and geospatial distributions. The repository is often used as a benchmark dataset and example for teaching, benchmarking, and demonstration purposes in the data science and urban analytics communities.
    Downloads: 7 This Week
    Last Update:
    See Project
  • ManageEngine Endpoint Central for IT Professionals Icon
    ManageEngine Endpoint Central for IT Professionals

    A one-stop Unified Endpoint Management (UEM) solution

    ManageEngine's Endpoint Central is a Unified Endpoint Management Solution, that takes care of enterprise mobility management (including all features of mobile application management and mobile device management), as well as client management for a diversified range of endpoints - mobile devices, laptops, computers, tablets, server machines etc. With ManageEngine Endpoint Central, users can automate their regular desktop management routines like distributing software, installing patches, managing IT assets, imaging and deploying OS, and more.
    Learn More
  • 10
    benchm-ml

    benchm-ml

    A minimal benchmark for scalability, speed and accuracy of commonly us

    This repository is designed to provide a minimal benchmark framework comparing commonly used machine learning libraries in terms of scalability, speed, and classification accuracy. The focus is on binary classification tasks without missing data, where inputs can be numeric or categorical (after one-hot encoding). It targets large scale settings by varying the number of observations (n) up to millions and the number of features (after expansion) to about a thousand, to stress test different implementations. The benchmarks cover algorithms like logistic regression, random forest, gradient boosting, and deep neural networks, and they compare across toolkits such as scikit-learn, R packages, xgboost, H2O, Spark MLlib, etc. The repository is structured in logical folders (e.g. “1-linear”, “2-rf”, “3-boosting”, “4-DL”) each corresponding to algorithm categories.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 11
    tidyverse

    tidyverse

    Easily install and load packages from the tidyverse

    tidyverse is a meta‑package that installs and loads a cohesive suite of R packages designed for data science, sharing underlying design principles, grammar, and data structures. Core components include ggplot2, dplyr, tidyr, readr, purrr, tibble, stringr, forcats, and more. It promotes tidy data workflows and consistency across tasks.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 12
    R Color Palettes

    R Color Palettes

    Comprehensive list of color palettes available in R

    This repository is a curated collection of color palettes crafted or curated for data visualization in R. The goal is to provide designers, data scientists, and R users with aesthetically pleasing, perceptually consistent color schemes that work well for plots, maps, and graphics. The repo contains static files listing palette definitions (e.g. hex codes, named hues), sample visualizations showing how each palette performs under different contexts (categorical, sequential, diverging), and helper functions/scripts to import or use the palettes in R. The author also documents palette provenance and usage guidance (contrast, readability, colorblind friendliness). While not a full package in itself, it’s often used as a reference or source of palette definitions for other R plotting or theming packages.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 13
    RStudio Cheatsheets

    RStudio Cheatsheets

    Curated collection of official cheat sheets for data science tools

    The cheatsheets repository from RStudio is a curated collection of official cheat sheets for R, RStudio, the tidyverse, Shiny, and related data science tools. Each cheat sheet is a single (or double) page PDF that condenses important syntax, functions, workflows, and best practices into a visually organized format ideal for quick reference. The repository contains source files (R Markdown or LaTeX) that generate the cheat sheets, version history, and metadata (title, author, description) for each. It covers topics such as data wrangling, data import, modeling, visualization, RStudio IDE shortcuts, Shiny development, and the tidyverse suite (dplyr, ggplot2, tidyr, purrr). These cheat sheets are widely used by R learners, educators, and practitioners as quick reference tools, and they often ship with RStudio by default or are linked from RStudio’s help/documentation pages. Users can also contribute new cheat sheet proposals, corrections, or translations via pull requests.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 14
    bbplot

    bbplot

    R package that helps create and export ggplot2 charts

    bbplot is an R package developed by the BBC visual journalism team aimed at helping data journalists and analysts produce chart styles consistent with BBC aesthetics. It provides functions and themes that make it easier to adopt BBC’s visual style (fonts, colors, annotations, layout) in ggplot2 plots. The package includes helper functions for axis labels, captions, legends, branding (e.g. BBC red lines or accents), and common chart types styled for editorial presentation. It offers templates and defaults that reduce styling overhead so users can focus on data and storytelling rather than aesthetic minutiae. Because visual consistency is important in media, bbplot helps non-designers build plots that align with professional publication standards. The repository includes documentation, vignettes, example plots, and guidelines for customization (e.g. switching colors, modifying typography).
    Downloads: 6 This Week
    Last Update:
    See Project
  • 15
    R Source

    R Source

    Read-only mirror of R source code

    The wch/r-source repository is a read-only mirror of the official R language source code, maintained to reflect the upstream Subversion (SVN) R core development tree. This mirror provides public visibility into R’s internals—everything from the interpreter, base and recommended packages, documentation, and C/Fortran code under the hood. It is updated hourly to stay in sync with the upstream SVN. Although it mirrors the R source for browsing and reference, it is not the “canonical development repo* (i.e. you can’t submit pull requests via that mirror). The repository includes build instructions, the full directory structure (src, src/library, doc, etc.), licensing information (GPL-2.0), and documentation. Developers, package authors, and curious users often browse this mirror to inspect implementation details, debug issues, or see how base functions are implemented in C or Fortran.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 16
    Paper2GUI

    Paper2GUI

    Convert AI papers to GUI

    Convert AI papers to GUI,Make it easy and convenient for everyone to use artificial intelligence technology。让每个人都简单方便的使用前沿人工智能技术 Paper2GUI: An AI desktop APP toolbox for ordinary people. It can be used immediately without installation. It already supports 40+ AI models, covering AI painting, speech synthesis, video frame complementing, video super-resolution, object detection, and image stylization. , OCR recognition and other fields. Support Windows, Mac, Linux systems. Paper2GUI: 一款面向普通人的 AI 桌面 APP 工具箱,免安装即开即用,已支持 40+AI 模型,内容涵盖 AI 绘画、语音合成、视频补帧、视频超分、目标检测、图片风格化、OCR 识别等领域。支持 Windows、Mac、Linux 系统。
    Downloads: 4 This Week
    Last Update:
    See Project
  • 17
    LabPlot

    LabPlot

    Data Visualization and Analysis

    LabPlot is a FREE, open source and cross-platform Data Visualization and Analysis software accessible to everyone.
    Downloads: 22 This Week
    Last Update:
    See Project
  • 18
    DataScienceR

    DataScienceR

    a curated list of R tutorials for Data Science, NLP

    The DataScienceR repository is a curated collection of tutorials, sample code, and project templates for learning data science using the R programming language. It includes an assortment of exercises, sample datasets, and instructional code that cover the core steps of a data science project: data ingestion, cleaning, exploratory analysis, modeling, evaluation, and visualization. Many of the modules demonstrate best practices in R, such as using the tidyverse, R Markdown, modular scripting, and reproducible workflows. The repository also shows examples of linking R with external resources — APIs, databases, and file formats — and integrating into larger pipelines. It acts as a learning scaffold for students or beginners transitioning to more advanced data science work in R, offering a hands-on, example-driven approach. The structure encourages modularity, readability, and reproducible practices, making it a useful reference repository for learners and educators alike.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 19
    Investing

    Investing

    Investing Returns on the Market as a Whole

    This repository, owned by the user zonination (Zoni Nation), presents a data visualization and analysis project on long-term returns from broad stock market indexes, especially the S&P 500. The author gathers historical price data (adjusted for inflation and dividends) and computes growth trajectories under a “buy and hold” strategy over decades. The key insight illustrated is that over sufficiently long holding periods (e.g. 40 years), the stock market stabilizes and nearly always yields positive returns, even accounting for extreme market crashes and recessions. The visualizations show “return curves” for different starting years and durations, and also illustrate the probability of losses over various time horizons. The project is centered on transparency in finance and encourages users to examine the data themselves; the code is shared in R and uses ggplot2 for plotting.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 20
    ProgrammingAssignment2

    ProgrammingAssignment2

    Repository for Programming Assignment 2 for R Programming on Coursera

    This repository contains the second programming assignment for an R course, focused on caching expensive computations by leveraging R’s scoping rules. The assignment walks you through creating a special matrix object that stores both a matrix and its cached inverse, avoiding repeated calls to costly operations. It builds on a worked example that caches the mean of a numeric vector, demonstrating how the operator preserves state across function calls. You then implement analogous logic for matrices via two functions, one to construct the cache-aware object and another to compute or retrieve the cached inverse. The instructions emphasize using solve for inversion and assuming that the supplied matrix is always invertible. The repository outlines the workflow for forking, editing the provided R stub, committing your solution, and submitting your repository URL as the final deliverable.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 21
    Downloads: 3 This Week
    Last Update:
    See Project
  • 22
    rvest

    rvest

    Simple web scraping for R

    rvest helps you scrape (or harvest) data from web pages. It is designed to work with magrittr to make it easy to express common web scraping tasks, inspired by libraries like beautiful soup and RoboBrowser. If you’re scraping multiple pages, I highly recommend using rvest in concert with polite. The polite package ensures that you’re respecting the robots.txt and not hammering the site with too many requests.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 23
    CausalImpact

    CausalImpact

    An R package for causal inference in time series

    The CausalImpact repository houses an R package that implements causal inference in time series using Bayesian structural time series models. Its goal is to estimate the effect of an intervention (e.g. a marketing campaign, policy change) on a time series outcome by predicting what would have happened in a counterfactual “no intervention” world. The package requires as input a response time series plus one or more control (covariate) time series that are assumed unaffected by the intervention, and it divides the time horizon into “pre-intervention” and “post-intervention” periods. It uses Bayesian modeling to fit a structural time series to the pre-period and extrapolate a counterfactual prediction for the post period, then compares observed vs predicted to infer the causal effect. The package supports plotting, summary tables, and verbal narratives for interpretive reports.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 24
    ExData Plotting1

    ExData Plotting1

    Plotting Assignment 1 for Exploratory Data Analysis

    This repository explores household energy usage over time using the “Individual household electric power consumption” dataset from the UC Irvine Machine Learning Repository. The dataset covers nearly four years of minute-level measurements, including power consumption, voltage, current intensity, and detailed sub-metering values for different household areas. For analysis, focus is placed on a two-day period in February 2007, highlighting short-term consumption trends. The data requires careful handling due to its size of more than 2 million rows and coded missing values. By processing the date and time fields into proper formats, it becomes possible to generate clear time-series plots of energy usage. The repository demonstrates effective exploratory data analysis practices in R with a reproducible workflow for transforming raw data into visual insights.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 25
    MetBrewer

    MetBrewer

    Color palette package inspired by Metropolitan Museum of Art in NY

    MetBrewer is an R package that provides color palettes inspired by artworks and collections in the Metropolitan Museum of Art (The Met). The idea is to draw on the rich visual heritage of fine art to generate palettes that are aesthetically pleasing and grounded in real-world artistic color usage. The palettes are curated, named after artworks or styles, and often include notes about colorblind-friendliness and contrast. The package supports both discrete and continuous palette types, with interpolation when more colors are requested than originally defined. It also provides ggplot2-friendly scale functions (scale_color_met_c, scale_fill_met_d, etc.) so integration into typical R plotting workflows is smooth. Internally, the package includes functions to list available palettes, check which are colorblind-friendly, and visualize all palettes at once.
    Downloads: 2 This Week
    Last Update:
    See Project