hadoop
Here are 3,341 public repositories matching this topic...
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
-
Updated
Jun 11, 2024 - Jupyter Notebook
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
-
Updated
Jun 11, 2024 - Java
Apache Ignite
-
Updated
Jun 11, 2024 - Java
CDP Public Cloud is an integrated analytics and data management platform deployed on cloud services. It offers broad data analytics and artificial intelligence functionality along with secure user access and data governance features.
-
Updated
Jun 11, 2024 - Java
Scalable, redundant, and distributed object store for Apache Hadoop
-
Updated
Jun 11, 2024 - Java
Libraries and tools for interoperability between Hadoop-related open-source software and Google Cloud Platform.
-
Updated
Jun 11, 2024 - Java
Alluxio, data orchestration for analytics and machine learning in the cloud
-
Updated
Jun 11, 2024 - Java
IT Knowledge Base from 20 years in DevOps, Linux, Cloud, Big Data, AWS, GCP etc - gradually porting my large private knowledge base to public
-
Updated
Jun 11, 2024 - Shell
AI on Hadoop
-
Updated
Jun 11, 2024 - Java
Smart Automation Tool for building modern Data Lakes and Data Pipelines
-
Updated
Jun 11, 2024 - Scala
Open Source Phasor Data Concentrator
-
Updated
Jun 11, 2024 - C#
Improve this page
Add a description, image, and links to the hadoop topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the hadoop topic, visit your repo's landing page and select "manage topics."