A lightweight, opinionated ETL framework, halfway between plain scripts and Apache Airflow
Dex: The Data Explorer -- A data visualization tool written in Java/Groovy/JavaFX capable of powerful ETL and publishing web visualizations....
Pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLSe...
Actively curated list of awesome BI tools. PRs welcome!
DataSphereStudio is a one-stop data application development & management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/min...
Python scripts for ETL (extract, transform and load) jobs for Ethereum blocks, transactions, ERC20 / ERC721 tokens, transfers, receipts, logs, contracts, internal...
[DEPRECATED] Detect threats with log data and improve cloud security posture
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
Streaming reference architecture for ETL with Kafka and Kafka-Connect. You can find more on...
React components to build CSV files on the fly based on an array or literal object of data
Data ETL & Analysis on the dataset 'Baby Names from Social Security Card Applications - National Data'....
Example project implementing best practices for PySpark ETL jobs and applications.
A hackable data integration & analysis tool to enable non-technical users to edit data processing jobs and visualise data on demand....
Archived repository. For current repo, see: https://github.com/etlegacy/etlegacy
Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases....