datalake
Here are 225 public repositories matching this topic...
This project is about building a data lake and creating an ETL pipeline in Spark that loads data from Amazon S3, processes the data into analytics tables, and loads them back into S3
-
Updated
Jun 15, 2021 - Python
Datalake on AW
-
Updated
Oct 18, 2022 - Python
An insanely customizable framework for key-value storage 💾
-
Updated
Apr 7, 2024 - Python
Collection of data on Formula One Racing
-
Updated
Dec 21, 2022 - Python
A simple search engine for the Datalake
-
Updated
Apr 30, 2018 - JavaScript
MSc. Data Engineering Project at Data ScienceTech Institute (DSTI )
-
Updated
Mar 8, 2021 - HTML
Repositório para armazenar códigos do projeto.
-
Updated
Dec 2, 2021 - Python
Personal, cloud based (AWS), data lake for experimenting with cloud services.
-
Updated
Mar 6, 2022 - HCL
Azure_Synapse_Project_NYC_TAXI_DATA--Sayantan Barat
-
Updated
Sep 30, 2022 - TSQL
Solução para buscar tweets com uma determinada “HashTag” e armazená-los em formato Parquet
-
Updated
Feb 14, 2023 - Jupyter Notebook
Nayco(内湖) is all in one micro DataLake for IoT
-
Updated
Jan 6, 2023 - JavaScript
How to combine smart store and ingest action for datalake use case
-
Updated
Jan 15, 2024 - Python
Soccer Players Data Analyst and Similar Players Finder
-
Updated
May 6, 2024 - Jupyter Notebook
An Ansible Role to Configure and setup Hadoop Job Tracker Node.
-
Updated
May 18, 2021 - Jinja
Improve this page
Add a description, image, and links to the datalake topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the datalake topic, visit your repo's landing page and select "manage topics."