Libraries and tools for interoperability between Hadoop-related open-source software and Google Cloud Platform.
Updated Jun 12, 2024 - Java
lakeFS - Data version control for your data lake | Git for data
An easy way to write Java objects to Apache ORC files.
Library for per-file client-side encryption in Hadoop FileSystems such as HDFS or S3.
Average Temperature - Hadoop - Mapper - Reducer
The OctopuFS library helps manage cloud storage, specifically ADLS Gen2. It lets you operate on files (moving, copying, setting ACLs) in a very efficient manner. Designed to work on Databricks, but should work on any other platform as well.
A tool for scale and performance testing of HDFS with a specific focus on the NameNode.
Data Engineering Project with Hadoop HDFS and Kafka
Develops an Airbnb database and a pipeline using MongoDB and Hadoop architecture to ease managing, loading, processing, querying, and analyzing Airbnb data based on location.
A Java API for interacting with the Hadoop Distributed File System (HDFS). This API provides functionality for reading and writing data in HDFS.
SFTP server that runs on top of HDFS. It is based on Apache sshd and lets you access and operate HDFS through the SFTP protocol.
A distributed cloud storage system based on Hadoop 🌴
Design and implementation of different MapReduce jobs used to analyze a dataset on Covid-19 disease created by Our World In Data
Worked on Hadoop file streaming
Hadoop utility to compact small files
The project aimed to help consumers find the most suitable health insurance plan among the pool of available plans. It used Hadoop and HiveQL to store, process, and analyze large amounts of health insurance marketplace data, resulting in a 40% increase in data processing efficiency.
Kafka Connect FileSystem Connector