Browse free open source Data Science tools and projects for Linux below. Use the toggles on the left to filter open source Data Science tools by OS, license, language, programming language, and project status.
An implementation of the Grammar of Graphics in R
RStudio is an integrated development environment (IDE) for R
Positron, a next-generation data science IDE
Data science spreadsheet with Python & SQL
Graphical User Interface Toolkit for Python with minimal dependencies
High-Performance Serverless event and data processing platform
Course materials for the Data Science Specialization on Coursera
Slides and Jupyter notebooks for the Deep Learning lectures
Vector database for scalable similarity search and AI applications
Scalable and Flexible Gradient Boosting
Always know what to expect from your data
Function-oriented Make-like declarative workflows for R
The data science OS
Data science at the command line
A framework for real-life data science
Automatic extraction of relevant features from time series
Linux for content creation, web scraping, coding, and data analysis.
Adhoc Data Exploration - Live & Easy
Easy integration with Athena, Glue, Redshift, Timestream, Neptune
For building machine learning (ML) workflows and pipelines on AWS
Jupyter notebooks that demonstrate how to build models using SageMaker
A curated list of data mining papers about fraud detection
Streamline your ML workflow
Project structure for doing and sharing data science work