[go: up one dir, main page]

Browse free open source Python ETL Tools and projects below. Use the toggles on the left to filter open source Python ETL Tools by OS, license, language, programming language, and project status.

  • Gen AI apps are built with MongoDB Atlas Icon
    Gen AI apps are built with MongoDB Atlas

    The database for AI-powered applications.

    MongoDB Atlas is the developer-friendly database used to build, scale, and run gen AI and LLM-powered apps—without needing a separate vector database. Atlas offers built-in vector search, global availability across 115+ regions, and flexible document modeling. Start building AI apps faster, all in one place.
    Start Free
  • Parasoft: Automated Testing to Deliver Superior Quality Software Icon
    Parasoft: Automated Testing to Deliver Superior Quality Software

    Parasoft provides test automation for every phase of the software development life cycle.

    Parasoft helps organizations continuously deliver high-quality software with its AI-powered software testing platform and automated test solutions. Supporting the embedded, enterprise, and IoT markets, Parasoft’s proven technologies reduce the time, effort, and cost of delivering secure, reliable, and compliant software by integrating everything from deep code analysis and unit testing to web UI and API testing, plus service virtualization and complete code coverage, into the delivery pipeline. Bringing all this together, Parasoft’s award-winning reporting and analytics dashboard provides a centralized view of quality, enabling organizations to deliver with confidence and succeed in today’s most strategic ecosystems and development initiatives—security, safety-critical, Agile, DevOps, and continuous testing.
    Learn More
  • 1
    Datapipe

    Datapipe

    Real-time, incremental ETL library for ML with record-level depend

    Datapipe is a real-time, incremental ETL library for Python with record-level dependency tracking. Datapipe is designed to streamline the creation of data processing pipelines. It excels in scenarios where data is continuously changing, requiring pipelines to adapt and process only the modified data efficiently. This library tracks dependencies for each record in the pipeline, ensuring minimal and efficient data processing.
    Downloads: 123 This Week
    Last Update:
    See Project
  • 2
    AWS Data Wrangler

    AWS Data Wrangler

    Pandas on AWS, easy integration with Athena, Glue, Redshift, etc.

    An AWS Professional Service open-source python initiative that extends the power of Pandas library to AWS connecting DataFrames and AWS data-related services. Easy integration with Athena, Glue, Redshift, Timestream, OpenSearch, Neptune, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON, and EXCEL). Built on top of other open-source projects like Pandas, Apache Arrow and Boto3, it offers abstracted functions to execute usual ETL tasks like load/unload data from Data Lakes, Data Warehouses, and Databases. Convert the column name to be compatible with Amazon Athena and the AWS Glue Catalog. Run a query against AWS CloudWatchLogs Insights and convert the results to Pandas DataFrame. Get QuickSight dashboard ID given a name and fails if there is more than 1 ID associated with this name. List IAM policy assignments in the current Amazon QuickSight account.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 3

    MyDBF2MySQL

    Extract, transform, and load DBF into MySQL

    This is an ETL software which loads data from DBF/XBase files into MySQL. This utility has command line interface, designed to work without user interaction.
    Leader badge">
    Downloads: 44 This Week
    Last Update:
    See Project
  • 4
    Ethereum ETL

    Ethereum ETL

    Python scripts for ETL (extract, transform and load) jobs for Ethereum

    Python scripts for ETL (extract, transform and load) jobs for Ethereum blocks, transactions, ERC20 / ERC721 tokens, transfers, receipts, logs, contracts, internal transactions. Data is available in Google BigQuery. Ethereum ETL lets you convert blockchain data into convenient formats like CSVs and relational databases.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Houzz Pro is the #1 business management software for home construction and design professionals. Icon
    Houzz Pro is the #1 business management software for home construction and design professionals.

    Get the all-in-one tool for marketing, project and client management built specifically for remodeling and design professionals.

    Get an all-in-one solution that spans the full customer lifecycle, including marketing, CRM, estimation & proposal building, project management, a 3D Floor Plan builder, an online invoicing and payment portal, as well as a client portal and collaboration tools. Start a free trial today to see why thousands of Pros run their business on Houzz Pro. Plans available for all business sizes.
    Learn More
  • 5
    CSVSplitter
    # CSV Splitter Uma ferramenta para dividir arquivos CSV em múltiplos arquivos com base na quantidade de registros especificada, mantendo a integridade dos dados e permitindo configurações de charset, separador e formatação. Ideal para lidar com grandes arquivos CSV que precisam ser fragmentados para melhor manuseio e processamento. ## Funcionalidades - **Divisão de CSV**: Divide o arquivo original em múltiplos arquivos CSV, com o número de registros por arquivo definido pelo usuário. - **Detecção Automática de Charset e Separador**: O charset e o separador do arquivo de origem podem ser detectados automaticamente ou especificados manualmente. - **Configuração de Destino Personalizável**: Permite definir charset e separador de destino. - **Formatação de Dados**: Formatação opcional para os padrões BR, EUA, EU e UK, com exemplos para ajudar na escolha do formato desejado. - **Interface Gráfica Intuitiva**: Interface com `Tkinter`, incluindo barra de progresso e log do proc
    Downloads: 4 This Week
    Last Update:
    See Project
  • 6

    Data Migrator for Oracle

    Migrate/Copy your data between Oracle database and 13 major DBs.

    Command line data Copy/Migration tool for Oracle. Supports Oracle 7.3, Oracle 8i, Oracle 9i, Oracle 10G, Oracle 11G and 13 major databases. 1. Exadata 2. Sybase ASE 3. Informix Innovator C 4. Sybase SQL Anywhere 5. DB2 UDB 6. CSV 7. SQLServer 8. MariaDB 9. Sybase IQ 10. PostgreSQL 11. MySQL 12. Informix IDS 13. TimesTen
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7

    PDF*Merger for Windows

    Merge/concatenate PDF files into one PDF file

    Merge your PDF files for upload to reporting engine or other needs. Command line, win32 Written in Python. Compiled with PyInstaller.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8

    TabZilla

    Ad-hoc data replication for Oracle database.

    #FreeUkraine #SaveUkraine #StopRussia #StopPutin #CrimeaIsUkraine #UnitedForUkraine #RussiaInvadedUkraine UI written using wxPython. Allows you, to copy tables between Oracle databases using drag-n-drop interface. AKA filezilla, but for tables, not files.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    COBOL Data Definitions
    Parse, analyze and -- most importantly -- use COBOL data definitions. This gives you access to COBOL data from Python programs. Write data analyzers, one-time data conversion utilities and Python programs that are part of COBOL systems. Really.
    Downloads: 0 This Week
    Last Update:
    See Project
  • The top-rated AI recruiting platform for faster, smarter hiring. Icon
    The top-rated AI recruiting platform for faster, smarter hiring.

    Humanly is an AI recruiting platform that automates candidate conversations, screening, and scheduling.

    Humanly is an AI-first recruiting platform that helps talent teams hire in days, not months—without adding headcount. Our intuitive CRM pairs with powerful agentic AI to engage and screen every candidate instantly, surfacing top talent fast. Built on insights from over 4 million candidate interactions, Humanly delivers speed, structure, and consistency at scale—engaging 100% of interested candidates and driving pipeline growth through targeted outreach and smart re-engagement. We integrate seamlessly with all major ATSs to reduce manual work, improve data flow, and enhance recruiter efficiency and candidate experience. Independent audits ensure our AI remains fair and bias-free, so you can hire confidently.
    Learn More
  • 10

    CSV*Extractor Pro (Windows command line)

    Spool you scalar data in CSV format from 14 major Databases.

    Command line tool for data export from major relational data stores (RDBMS). DB2 Advanced Enterprise Server DB2 Advanced Workgroup Server DB2 Developer Edition DB2 Enterprise Server DB2 Express DB2 Express C DB2 Workgroup Server Exadata Infobright Informix IDS Informix Innovator C MariaDB MySQL Oracle Oracle XE PostgreSQL SAP Sybase ASE SQL Lite SQL Server Enterprise SQL Server Express Sybase IQ Sybase SQL Anywhere TimesTen
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    CSV*Extractor for RDBMS (command line)

    Extract table data is CSV format from 14 databases.

    Spools data for a given query or table from 14 databases. Windows command-line application.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12

    CSV*Loader PRO (Windows command line)

    Loads CSV file to14 databases

    Windows command line tool for CSV data load to 14 relational stores.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13

    CSV*Loader for Oracle

    Simplified CSV turbo loader to Oracle

    Tired of writing control files? No problem! CSV*Loader will generate control file for SQL*Loader. Too slow? No problem! CSV*Loader turbo mode may load it 10x faster to your Oracle database than your good old Perl::DBI script.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14

    Data Migration Tools for RDBMS

    DataMigrator for 14 major databases

    Touch and go Windows command line data migration tool for 14 databases: 1. Sybase ASE 2. Informix Innovator C 3. Sybase SQL Anywhere 4. DB2 UDB 5. SQLServer 6. MariaDB 7. Sybase IQ 8. PostgreSQL 9. MySQL 10. Informix IDS 11. TimesTen 12. Oracle 13. SQL Lite 14. Exadata
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15

    Data Spooler For Oracle #SaveUkraine

    Simplified turbo spooler for Oracle.

    #SaveUkraine #StopRussia #FreeUkraine #StopPutin #CrimeaIsUkraine #UnitedForUkraine #RussiaInvadedUkraine Exports/Spools scalar data on disk for a given Oracle table. Turbo mode spools 5x faster.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16

    Data Spooler for SQLServer #SaveUkraine

    Extracts table or query data from SQL Server 2005, 2008, 2012

    #SaveUkraine #StopRussia #FreeUkraine #StopPutin #CrimeaIsUkraine #UnitedForUkraine #RussiaInvadedUkraine Spools/extracts/dump table or query data from SQL Server 2015, 2008,2012. Serial spool creates single dump file. Turbo mode offers 5x spool performance improvement. Sharded turbo more creates multiple files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17

    DataCopy For SQLServer

    Data Copy tool for SQL Server and Oracle

    Migrate your data from SQLServer to Oracle without creating single dump file. Input is a SQLServer query file defining dataset you want to copy to Oracle. Target table has to exist for copy to go through. Turbo mode offers 5x copy performance improvement.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18

    DataMule

    Extract-Copy-Load (ECL) tool for 14 databases.

    Extract, Copy and Load operations for: 1. Sybase ASE 2. Informix Innovator C 3. Sybase SQL Anywhere 4. DB2 UDB 5. SQLServer 6. MariaDB 7. Sybase IQ 8. PostgreSQL 9. MySQL 10. Informix IDS 11. TimesTen 12. Oracle 13. SQL Lite 14. Exadata Total 224 data copy vectors. CSV -> DB. DB->DB DB->CSV
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Drop is a graphical interface for the GPL project Wets (hosted in SourceForge). Wets is an ETL software.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20

    SQLServer ->SQLServer Data Migrator

    Copy data between your SQLServer instances

    Ad-hoc data replication for SQLServer 2005,2008,2010 and 2012. Touch-and-go design requires you to provide just login info, query file with your SQL and target table name.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Pypes is a framework which allows users to break complex data processing logic down into a series of smaller less complex tasks. These tasks, referred to as components, can then be connected so that the output of one becomes the input to another.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next