ETL framework for .NET (Parser / Writer for CSV, Flat, Xml, JSON, Key-Value, Parquet, Yaml, Avro formatted files)...
SmartCode = IDataSource -> IBuildTask -> IOutput => Build Everything!!!
Set of .NET libraries written in C# to create Listeners, Extractors, Writers and possibly more. These libraries allow you to (a) listen for events, (b) load data i...
ETL & Data Enrichment with Spark.NET and ML.NET Automated (Auto) ML
Laughing Waffle is a helper library for doing bulk insert and update (read: upsert) work with SQL Server. Specifically providing help and code generation around the...
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted...
Apache Doris is an easy-to-use, high-performance and unified analytics database.
An orchestration platform for the development, production, and observation of data assets.
🧙 Build, run, and manage data pipelines for integrating and transforming data.
Best-in-class stream processing, analytics, and management. Perform continuous analytics, or build event-driven applications, real-time ETL pipelines, and feature...
Zero-ETL, infinite possibilities. Live query APIs, code & more with SQL. No DB required.
The open-source, high-performance ELT framework powered by Apache Arrow
Privacy and Security focused Segment-alternative, in Golang and React
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, Pos...
Open source data anonymization and synthetic data orchestration for developers. Create high fidelity synthetic data and sync it across your environments....
No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents