Data Warehousing Software for Linux

Browse free open source Data Warehousing software and projects for Linux below.

  • 1
    Greenplum Database

    Massively parallel data platform for analytics, machine learning and AI

    Rapidly create and deploy models for complex applications in cybersecurity, predictive maintenance, risk management, fraud detection, and many other areas. With its unique cost-based query optimizer designed for large-scale data workloads, Greenplum scales interactive and batch-mode analytics to large datasets in the petabytes without degrading query performance and throughput. Based on PostgreSQL, Greenplum provides you with more control over the software you deploy, reducing vendor lock-in, and allowing open influence on product direction. Greenplum reduces data silos by providing you with a single, scale-out environment for converging analytic and operational workloads, like streaming ingestion. All major Greenplum contributions are part of the Greenplum Database project and share the same database core, including the MPP architecture, analytical interfaces, and security capabilities.
    Downloads: 51 This Week
  • 2
    ReportServer Community Edition

    ReportServer is a modern and versatile business intelligence platform

    ReportServer is a modern and versatile open source business intelligence (BI) platform with powerful reporting features. With ReportServer you are not limited to one provider's solutions. ReportServer integrates Jasper, BIRT, Mondrian and Excel-based reporting: choose what best suits your needs! The source code is also available on GitHub at https://github.com/infofabrik/reportserver and ReportServer scripting samples are at https://github.com/infofabrik/reportserver-samples
    Downloads: 63 This Week
  • 3
    The aoetools are programs for users of the ATA over Ethernet (AoE) network storage protocol, a simple protocol for using storage over an Ethernet LAN. The vblade program (storage target) exports a block device using AoE.
    Downloads: 33 This Week
  • 4
    SIDU admin GUI : MySQL PostgreSQL SQLite
    SIDU is a free database web GUI written in PHP. It is a handy and powerful tool for MySQL, PostgreSQL, SQLite, and CUBRID. SIDU is simple and easy to use, and it has the features you need for database administration and web development. No installation is needed. It is a cross-platform, web-based database front end.
    Downloads: 9 This Week
  • 5
    Open Source Data Quality and Profiling

    World's first open source data quality & data preparation project

    This project is dedicated to open source data quality and data preparation solutions. Data quality includes profiling, filtering, governance, similarity checks, data enrichment alteration, real-time alerting, basket analysis, bubble chart Warehouse validation, single customer view, etc., as defined by Strategy. The tool is developing into a high-performance integrated data management platform that will seamlessly perform data integration, data profiling, data quality, data preparation, dummy data creation, metadata discovery, anomaly discovery, data cleansing, reporting, and analytics. It also has Hadoop (big data) support to move files to and from a Hadoop grid and to create, load, and profile Hive tables. This project is also known as "Aggregate Profiler". A RESTful API for this project is being built (beta version) at https://sourceforge.net/projects/restful-api-for-osdq/ and Apache Spark based data quality is being built at https://sourceforge.net/projects/apache-spark-osdq/
    Downloads: 8 This Week
  • 6
    Apache Doris

    MPP-based interactive SQL data warehousing for reporting and analysis

    Apache Doris is a modern MPP analytical database product. It provides sub-second queries and efficient real-time data analysis. With its distributed architecture, datasets up to the 10 PB level are well supported and easy to operate. Apache Doris can meet various data analysis demands, including historical data reports, real-time data analysis, interactive data analysis, and exploratory data analysis. Make your data analysis easier! It supports standard SQL and is compatible with the MySQL protocol. The main advantages of Doris are its simplicity (of developing, deploying, and using) and its ability to meet many data serving requirements in a single system. Doris mainly integrates the technology of Google Mesa and Apache Impala, is based on a column-oriented storage engine, and can communicate with MySQL clients.
    Downloads: 1 This Week
  • 7
    DataCleaner

    Data quality analysis, profiling, cleansing, duplicate detection +more

    DataCleaner is a data quality analysis application and a solution platform for DQ solutions. Its core is a strong data profiling engine, which is extensible and thereby adds data cleansing, transformations, enrichment, deduplication, matching and merging. Website: http://datacleaner.github.io
    Downloads: 7 This Week
  • 8
    MailArchiva is a powerful, full featured email archiving (email archiver) and compliance solution for mail systems such as Microsoft Exchange. It stores all incoming, outgoing and internal emails for long term storage. A web based user interface is avail
    Downloads: 4 This Week
  • 9
    Palo ETL Server is a Java-based tool for extraction, transformation, and loading of mass data into the Palo OLAP Server. Palo ETL Server is one part of the Palo Suite.
    Downloads: 10 This Week
  • 10
    apache spark data pipeline osDQ

    osDQ subproject dedicated to creating Apache Spark based data pipelines using JSON

    This is an offshoot of the open source data quality (osDQ) project (https://sourceforge.net/projects/dataquality/). This subproject creates an Apache Spark based data pipeline in which JSON-based metadata (a file) is used to run data processing, data pipeline, data quality, data preparation, and data modeling features for big data. It uses the Java API of Apache Spark and can also run in local mode. Get a JSON example at https://github.com/arrahtech/osdq-spark

    How to run: unzip the zip file, then run the command for your platform.

    Windows:
        java -cp .\lib\*;osdq-spark-0.0.1.jar org.arrah.framework.spark.run.TransformRunner -c .\example\samplerun.json

    Mac/UNIX:
        java -cp ./lib/*:./osdq-spark-0.0.1.jar org.arrah.framework.spark.run.TransformRunner -c ./example/samplerun.json

    On Windows, you need to have a Hadoop distribution unzipped on a local drive and HADOOP_HOME set. Also copy winutils.exe from here into HADOOP_HOME\bin
    Downloads: 10 This Week
  • 11
    LucidDB is a DBMS optimized for business intelligence. Besides architectural innovations such as column-store, it supports many advanced features from SQL:2003, including SQL/MED and user-defined transformations written in Java.
    Downloads: 9 This Week
  • 12
    CloverDX

    Design, automate, operate and publish data pipelines at scale

    Please visit www.cloverdx.com for the latest product versions. CloverDX is a data integration platform that can be used to transform, map, and manipulate data in batch and near-real-time modes. It supports various input/output formats (CSV, FIXLEN, Excel, XML, JSON, Parquet, Avro, EDI/X12, HL7, COBOL, LOTUS, etc.) and connects to RDBMS/JMS/Kafka/SOAP/REST/LDAP/S3/HTTP/FTP/ZIP/TAR. CloverDX offers 100+ specialized components, which can be further extended by creating "macros" (subgraphs) and libraries that are shareable with third parties. Simple data manipulation jobs can be created visually. More complex business logic can be implemented using Clover's domain-specific language CTL, in Java, or in languages like Python or JavaScript. Through its DataServices functionality, it lets you quickly turn data pipelines into REST API endpoints. The platform lets you easily scale your data jobs across multiple cores or nodes/machines. It supports Docker/Kubernetes deployments and offers AWS/Azure images in their respective marketplaces.
    Downloads: 3 This Week
  • 13
    OpenReports is a powerful, flexible, and easy to use web reporting solution that provides browser based, parameter driven, dynamic report generation and flexible report scheduling capabilities. Supports JasperReports, JFreeReport, JXLS, and Eclipse BIRT
    Downloads: 2 This Week
  • 14
    SQL*Plus Commander

    Text-based user interface to query data on Oracle DB in a smart way

    SQL*Plus Commander is a text-based user interface (TUI) and framework for querying data on Oracle DB in a smart way. It consists of a fully customizable script shell for bash and ksh. It executes custom queries or procedures on the database with SQL*Plus for Oracle. Query results can be browsed in a colorful text interface, and data returned by a query can be selected and passed dynamically as parameters to other queries or procedures. It may be useful for people who frequently run a limited number of queries and use the results as parameters for other queries. It is suggested for DBA activities and browsing log tables. The downloaded version contains a demo with the HR data model from oracle.com. Try it and let me know if you find it useful; any idea or suggestion will be appreciated.
    Downloads: 5 This Week
  • 15
    gravitino

    Unified metadata lake for data & AI assets.

    Apache Gravitino is a high-performance, geo-distributed, and federated metadata lake. It manages metadata directly in different sources, types, and regions, providing users with unified metadata access for data and AI assets.
    Downloads: 3 This Week
  • 16
    JBELT : link your design to the business

    JBELT is a collection of procedures for creating a web-based PLM system that connects the CAD application to the ERP database. It runs on JBoss and PostgreSQL and is developed in Java on the JBoss Seam framework.
    Downloads: 1 This Week
  • 17
    Archiweb will provide a CMS for managing architecture information and drawings, and for viewing projects in order to sell them. It is a good fit for real-estate companies or architecture and building companies. It is a ready-to-install PHPNuke module.
    Downloads: 1 This Week
  • 18
    CoreMan (Correspondence Management System) is a web-based document and correspondence management system that enables companies and organizations to develop an easily accessible digital document and correspondence archive that can be efficiently managed.
    Downloads: 1 This Week
  • 19
    DBBrowser is an open source (GPL license), cross-platform tool which can be used to view the contents of a database. It works with Oracle and MySQL. The user can view, modify, delete records without writing SQL.
    Downloads: 1 This Week
  • 20
    Comprehensive, high-performance data distribution and transformation system. Features a simple, user-friendly, event-driven scripting interface that transparently generates and executes highly efficient Perl/C code. Uses: ETL, data warehousing, statistics, data cleansing.
    Downloads: 1 This Week
  • 21
    Quipu
    In 2015, Qosqo, the software company that supported the development of Quipu, decided to abandon the open source project as it was. The latest open source release (version 2.x) is what you'll find here. Development moved on in closed source at http://www.datawarehousemanagement.org, currently (March 2016) at version 3.2. The open source description: Quipu is an open source data warehouse generation system that creates and monitors data warehouses. With Quipu you can implement a data warehouse much more quickly and easily.
    Downloads: 1 This Week
  • 22
    Vaultier - Team Password Manager

    Linux based collaborative password manager with strong encryption.

    Open source enterprise collaborative password manager and file storage. Download and install it on your own servers, or let us prepare your installation on a dedicated server. Store and share your sensitive data safely and manage the users who are allowed to access the information. Vaultier is based on advanced security technology (RSA over AES). All data stored in Vaultier databases are encrypted, and only authorized users with valid keys can read them. Encryption keys always stay on client machines. Vaultier is not an online service where all clients and users share the same database; what you always get is a standalone installation with your own database. Vaultier offers a free open source Community version and a paid Enterprise edition, which comes with more features and services in three subscription plans.
    Downloads: 1 This Week
  • 23
    A generic academic database that adds new use cases to meet requirements for extra functionality not covered by traditional academic databases.
    Downloads: 0 This Week
  • 24
    Aestel

    Applications for data management

    "Information is data in action", and, consequently, having good quality data is essential. The AESTEL package contains two highly configurable applications for data management: a data loader (DataLoader) and a reporting application (AEREA). The data loader applies user-defined instructions to validate, process, and load data. The reporting application provides a query builder and a spreadsheet template designer. Both applications work with any relational data model (Postgres and Oracle have been tested). The two applications were initially developed for small molecule drug discovery research, but they can be extended for use in other data domains.
    Downloads: 0 This Week
  • 25
    Alfresco Audit Analysis and Reporting
    Alfresco Audit Analysis and Reporting (A.A.A.R.) provides a solution to extract, store, and query audit data together with document and folder information at a very detailed level, with the goal of being useful to the end user in a very easy way. To reach that goal and make the data friendlier for the end user, the data are published in reports in well-known formats (PDF, Microsoft Excel, CSV, etc.) and stored directly in Alfresco as static documents that are organized in folders, versioned, authorized, and published. On top of the A.A.A.R. solution, A.A.A.R. Analytics is a set of powerful tools to analyze data in an interactive and customizable way, with a user console composed of dashboards, reports, and free analysis.
    Downloads: 0 This Week