Developer-first embedded analytics
Updated Jun 12, 2024 - TypeScript
The open source high performance ELT framework powered by Apache Arrow
A repository for scraping IPL Hawkeye data
My personal project for the Data Engineering Zoomcamp
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Infinitely scalable, event-driven, language-agnostic orchestration and scheduling platform to manage millions of workflows declaratively in code.
SQL stream processing, analytics, and management. We decouple storage and compute to offer instant failover, dynamic scaling, speedy bootstrapping, and efficient joins.
Apache Superset is a Data Visualization and Data Exploration Platform
Apply Data Engineering to Personal Finance
fluent iteration
Turns Data and AI algorithms into production-ready web applications in no time.
🐚 Python-powered, cross-platform, Unix-gazing shell.
Home of the Open Data Contract Standard (ODCS).
Egeria core
Open Source Feature Flagging and A/B Testing Platform
Service for bulk-loading data to databases with automatic schema management (Redshift, Snowflake, BigQuery, ClickHouse, Postgres, MySQL)
Privacy and Security focused Segment-alternative, in Golang and React
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
The Open-Source Enterprise Data Platform in a single Portal
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️