ETL Tools for Windows

View 51 business solutions

Browse free open source ETL tools and projects for Windows below. Use the toggles on the left to filter open source ETL tools by OS, license, language, programming language, and project status.

  • Keep company data safe with Chrome Enterprise Icon
    Keep company data safe with Chrome Enterprise

    Protect your business with AI policies and data loss prevention in the browser

    Make AI work your way with Chrome Enterprise. Block unapproved sites and set custom data controls that align with your company's policies.
    Download Chrome
  • Level Up Your Cyber Defense with External Threat Management Icon
    Level Up Your Cyber Defense with External Threat Management

    See every risk before it hits. From exposed data to dark web chatter. All in one unified view.

    Move beyond alerts. Gain full visibility, context, and control over your external attack surface to stay ahead of every threat.
    Try for Free
  • 1
    Pentaho

    Pentaho

    Pentaho offers comprehensive data integration and analytics platform.

    Pentaho couples data integration with business analytics in a modern platform to easily access, visualize and explore data that impacts business results. Use it as a full suite or as individual components that are accessible on-premise, in the cloud, or on-the-go (mobile). Pentaho enables IT and developers to access and integrate data from any source and deliver it to your applications all from within an intuitive and easy to use graphical tool. The Pentaho Enterprise Edition Free Trial can be obtained from https://pentaho.com/download/
    Leader badge
    Downloads: 2,142 This Week
    Last Update:
    See Project
  • 2
    Logstash

    Logstash

    Centralize, transform and stash your data

    Logstash is a server-side data processing pipeline that dynamically ingests data from numerous sources, transforms it, and ships it to your favorite “stash” regardless of format or complexity. It supports and ingests data of all shapes, sizes and sources, dynamically transforms and prepares this data, and transports it to the output of your choice. Logstash is extensible, with over 200 plugins available to let you create and configure your pipeline how you choose.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 3
    AWS Data Wrangler

    AWS Data Wrangler

    Pandas on AWS, easy integration with Athena, Glue, Redshift, etc.

    An AWS Professional Service open-source python initiative that extends the power of Pandas library to AWS connecting DataFrames and AWS data-related services. Easy integration with Athena, Glue, Redshift, Timestream, OpenSearch, Neptune, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON, and EXCEL). Built on top of other open-source projects like Pandas, Apache Arrow and Boto3, it offers abstracted functions to execute usual ETL tasks like load/unload data from Data Lakes, Data Warehouses, and Databases. Convert the column name to be compatible with Amazon Athena and the AWS Glue Catalog. Run a query against AWS CloudWatchLogs Insights and convert the results to Pandas DataFrame. Get QuickSight dashboard ID given a name and fails if there is more than 1 ID associated with this name. List IAM policy assignments in the current Amazon QuickSight account.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4

    MyDBF2MySQL

    Extract, transform, and load DBF into MySQL

    This is an ETL software which loads data from DBF/XBase files into MySQL. This utility has command line interface, designed to work without user interaction.
    Leader badge
    Downloads: 22 This Week
    Last Update:
    See Project
  • Simple, Secure Domain Registration Icon
    Simple, Secure Domain Registration

    Get your domain at wholesale price. Cloudflare offers simple, secure registration with no markups, plus free DNS, CDN, and SSL integration.

    Register or renew your domain and pay only what we pay. No markups, hidden fees, or surprise add-ons. Choose from over 400 TLDs (.com, .ai, .dev). Every domain is integrated with Cloudflare's industry-leading DNS, CDN, and free SSL to make your site faster and more secure. Simple, secure, at-cost domain registration.
    Sign up for free
  • 5
    HPCC Systems

    HPCC Systems

    End-to-end big data in a massively scalable supercomputing platform.

    HPCC Systems® (www.hpccsystems.com) from LexisNexis® Risk Solutions is a proven, open source solution for Big Data insights that can be implemented by businesses of all sizes. With HPCC Systems, developers can design applications with Big Data at their core, enabling businesses to better analyze and understand data at scale, improving business time to results and decisions. HPCC Systems offers a consistent data-centric programming language, two processing platforms and a single, complete end-to-end architecture for efficient processing. Read our blog (http://hpccsystems.com/blog ), or connect with us on Twitter (@hpccsystems), Facebook (https://www.facebook.com/hpccsystems ) and LinkedIn (http://www.linkedin.com/company/hpcc-systems) HPCC Systems is available on AWS & can be configured through the Instant Cloud Solution.
    Leader badge
    Downloads: 6 This Week
    Last Update:
    See Project
  • 6
    Talend Spatial Module (aka Spatial Data Integrator or SDI) is an ETL tool for geospatial. Based on Talend Open Studio, input, output and transform geocomponents are available. IO components read/write GIS formats(eg.PostGIS, GeoRSS). Transformers all
    Downloads: 14 This Week
    Last Update:
    See Project
  • 7
    CloverDX

    CloverDX

    Design, automate, operate and publish data pipelines at scale

    Please, visit www.cloverdx.com for latest product versions. Data integration platform; can be used to transform/map/manipulate data in batch and near-realtime modes. Suppors various input/output formats (CSV,FIXLEN,Excel,XML,JSON,Parquet, Avro,EDI/X12,HL7,COBOL,LOTUS, etc.). Connects to RDBMS/JMS/Kafka/SOAP/Rest/LDAP/S3/HTTP/FTP/ZIP/TAR. CloverDX offers 100+ specialized components which can be further extended by creation of "macros" - subgraphs - and libraries, shareable with 3rd parties. Simple data manipulation jobs can be created visually. More complex business logic can be implemented using Clover's domain-specific-language CTL, in Java or languages like Python or JavaScript. Through its DataServices functionality, it allows to quickly turn data pipelines into REST API endpoints. The platform allows to easily scale your data job across multiple cores or nodes/machines. Supports Docker/Kubernetes deployments and offers AWS/Azure images in their respective marketplace
    Downloads: 4 This Week
    Last Update:
    See Project
  • 8
    Datapipe

    Datapipe

    Real-time, incremental ETL library for ML with record-level depend

    Datapipe is a real-time, incremental ETL library for Python with record-level dependency tracking. Datapipe is designed to streamline the creation of data processing pipelines. It excels in scenarios where data is continuously changing, requiring pipelines to adapt and process only the modified data efficiently. This library tracks dependencies for each record in the pipeline, ensuring minimal and efficient data processing.
    Leader badge
    Downloads: 2 This Week
    Last Update:
    See Project
  • 9
    Excel  AddIn :   In2Sql

    Excel AddIn : In2Sql

    ODBC Cloud SQL Explorer. Connection Manager. Query Editor.

    https://sourceforge.net/projects/in2sql Video for best usage https://rb.gy/tvl8lk This Excel Addin helps SQL analytic create an Excel report based on ODBC relational data. *Creates table base on data from a relational database *Generate a pivot report using the same external connection (1) *Some ad-hoc tools are available - like "keep only" and "remove only" *you can use the row limit option for exploring the largest dataset *The ODBC connection manager is available *auto-build query tool can create SQL select statement by using different database tables with matching them by column name * creating connections for PowerQuery news and updates -- change list -- v05 beta export tables and SQL to CSV files treat CSV like relational tables -- add Cloud ClickHouse Source resolve the problem with an untrusted source changed Sql Editor fixed behavior for "update rows"
    Downloads: 7 This Week
    Last Update:
    See Project
  • The All-in-One Commerce Platform for Businesses - Shopify Icon
    The All-in-One Commerce Platform for Businesses - Shopify

    Shopify offers plans for anyone that wants to sell products online and build an ecommerce store, small to mid-sized businesses as well as enterprise

    Shopify is a leading all-in-one commerce platform that enables businesses to start, build, and grow their online and physical stores. It offers tools to create customized websites, manage inventory, process payments, and sell across multiple channels including online, in-person, wholesale, and global markets. The platform includes integrated marketing tools, analytics, and customer engagement features to help merchants reach and retain customers. Shopify supports thousands of third-party apps and offers developer-friendly APIs for custom solutions. With world-class checkout technology, Shopify powers over 150 million high-intent shoppers worldwide. Its reliable, scalable infrastructure ensures fast performance and seamless operations at any business size.
    Learn More
  • 10
    Metl ETL Data Integration

    Metl ETL Data Integration

    Simple message-based, web-based ETL integration

    Metl is a simple, web-based ETL tool that allows for data integrations including database, files, messaging, and web services. Supports RDBMS, SOAP, HTTP, FTP, SFTP, XML, FIXLEN, CSV, JSON, ZIP, and more. Metl implements scheduled integration tasks without the need for custom coding or heavy infrastructure. It can be deployed in the cloud or in an internal data center, and it was built to allow developers to extend it with custom components.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 11
    GeoKettle
    GeoKettle is a powerful, metadata-driven spatial ETL (Extract, Transform and Load) tool dedicated to the integration of different data sources for building and updating geospatial databases, data warehouses and services.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 12
    Jaspersoft ETL
    Jaspersoft ETL is a data integration platform providing high performance data extract-transform-load (ETL) capabilities. Jaspersoft ETL is appropriate for all analytic and operational data integration needs. Activity on this project is located at jas
    Downloads: 5 This Week
    Last Update:
    See Project
  • 13
    RapidMiner -- Data Mining, ETL, OLAP, BI
    ETL, data warehousing, data mining, OLAP, business intelligence (BI) in Java. 500+ modules: extract, transform, load (ETL), data mining, data analysis + Weka, statistical forecasting, preprocessing, validation, visualization, OLAP, business intelligence.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 14
    Automatic Report Generator

    Automatic Report Generator

    Generate reports from Java applications directly.

    Automatic Report Generator is a mini-ETL API which allows to retrieve data through an SQL query towards a structured file, would it be a CSV, XLSX, or XML. This API also supports BIRT reports, and, in this case, its respective template is required. The project is available on Maven: https://mvnrepository.com/artifact/net.sf.automatic-report-generator Version 3: https://mvnrepository.com/artifact/net.sf.ennahdi.automatic-report-generator Checkout the documentation for both version 2 and version 3: https://sourceforge.net/p/automatic-report-generator/wiki/Home/
    Downloads: 2 This Week
    Last Update:
    See Project
  • 15
    AvaSattva

    AvaSattva

    Search replace files or pipe

    See https://github.com/qualiu/msr/ Match/Search/Replace: msr.exe/msr-Win32.exe/msr.cygwin/msr.gcc**/msr-i386.gcc** Match/Search/Replace/Execute/* Files/Pipe Lines/Blocks. Filter/Load/Extract/Transform/Stats/* Files/Pipe Lines/Blocks. Not-IN-latter: nin.exe/nin-Win32.exe/nin.cygwin/nin.gcc**/nin-i386.gcc** Get Exclusive/Mutual Line-Set or Key-Set; Remove Line-Set or Key-Set matched in latter file/pipe; Get Unique/Mutual/Distribution/Stats/* Files/Pipe Line-Set or Key-Set. Match/Search/Replace files/pipe text with plain/Regex syntax. And for ETL alike work like Load and filter files -> Extract -> Transform output. For replacing files, you can preview and backup, in multiple directories and files or pipe, with plain text matching or using general Regex as C++, C#, Java, Scala; So msr is a good tool to learn and test Regex since it has different colors for matched groups captured by the Regex pattern.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 16

    Data Migrator for Oracle

    Migrate/Copy your data between Oracle database and 13 major DBs.

    Command line data Copy/Migration tool for Oracle. Supports Oracle 7.3, Oracle 8i, Oracle 9i, Oracle 10G, Oracle 11G and 13 major databases. 1. Exadata 2. Sybase ASE 3. Informix Innovator C 4. Sybase SQL Anywhere 5. DB2 UDB 6. CSV 7. SQLServer 8. MariaDB 9. Sybase IQ 10. PostgreSQL 11. MySQL 12. Informix IDS 13. TimesTen
    Downloads: 2 This Week
    Last Update:
    See Project
  • 17
    KETL(tm) is a production ready ETL platform. The engine is built upon an open, multi-threaded, XML-based architecture. KETL's is designed to assist in the development and deployment of data integration efforts which require ETL and scheduling
    Downloads: 2 This Week
    Last Update:
    See Project
  • 18
    apache spark data pipeline osDQ

    apache spark data pipeline osDQ

    osDQ dedicated to create apache spark based data pipeline using JSON

    This is an offshoot project of open source data quality (osDQ) project https://sourceforge.net/projects/dataquality/ This sub project will create apache spark based data pipeline where JSON based metadata (file) will be used to run data processing , data pipeline , data quality and data preparation and data modeling features for big data. This uses java API of apache spark. It can run in local mode also. Get json example at https://github.com/arrahtech/osdq-spark How to run Unzip the zip file Windows : java -cp .\lib\*;osdq-spark-0.0.1.jar org.arrah.framework.spark.run.TransformRunner -c .\example\samplerun.json Mac UNIX java -cp ./lib/*:./osdq-spark-0.0.1.jar org.arrah.framework.spark.run.TransformRunner -c ./example/samplerun.json For those on windows, you need to have hadoop distribtion unzipped on local drive and HADOOP_HOME set. Also copy winutils.exe from here into HADOOP_HOME\bin
    Downloads: 2 This Week
    Last Update:
    See Project
  • 19
    CSVSplitter
    # CSV Splitter Uma ferramenta para dividir arquivos CSV em múltiplos arquivos com base na quantidade de registros especificada, mantendo a integridade dos dados e permitindo configurações de charset, separador e formatação. Ideal para lidar com grandes arquivos CSV que precisam ser fragmentados para melhor manuseio e processamento. ## Funcionalidades - **Divisão de CSV**: Divide o arquivo original em múltiplos arquivos CSV, com o número de registros por arquivo definido pelo usuário. - **Detecção Automática de Charset e Separador**: O charset e o separador do arquivo de origem podem ser detectados automaticamente ou especificados manualmente. - **Configuração de Destino Personalizável**: Permite definir charset e separador de destino. - **Formatação de Dados**: Formatação opcional para os padrões BR, EUA, EU e UK, com exemplos para ajudar na escolha do formato desejado. - **Interface Gráfica Intuitiva**: Interface com `Tkinter`, incluindo barra de progresso e log do proc
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    A ETL made in VB.NET.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    A command line utility to read a text file containing lines of data, clean up any CR/LF anomalies, and output the lines of text with clean CR/LF terminators to standard output. The binary is a Windows 32 bit console app.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    Toolsverse ETL Framework

    Toolsverse ETL Framework

    Open source Extract Transform Load engine written in Java

    ETL Framework is a standalone Extract Transform Load engine written in Java. It includes executables for all major platforms and can be easily integrated into other applications. Key Features: * embeddable, open source and free * fast and scalable * uses target database features to do transformations and loads * manual and automatic data mapping * data streaming * bulk data loads * data quality features using SQL, JavaScript? and regex * data transformations Requirements * Java 1.6 and up * At least 4 MB of RAM New in 3.2 (01/18/2013) * Improved auto-update functionality * Bug fixes
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23
    Utility that performs bulk user import to Active Directory from selected data sources. It can perform data mapping and generate required fields using existing info( generate userPrincipleName from name, surname and patronymic of user for example). This is still a beta-release, so things can work not so well sometimes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24

    BIAutomationTool

    Tool created to aggregate commands to disparate ETL tools

    This project was created to allow executing ETL jobs/tasks from a single command line tool with the same syntax, no matter what tool you were executing in. As long as you have a command line client for the ETL tool, you can configure the BIAutomationTool to use it.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    COBOL Data Definitions
    Parse, analyze and -- most importantly -- use COBOL data definitions. This gives you access to COBOL data from Python programs. Write data analyzers, one-time data conversion utilities and Python programs that are part of COBOL systems. Really.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • Next
Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.