  • Disney Streaming
  • San Francisco Bay Area, CA

jiegzhan/README.md

Hi there 👋

  • 🔭 I am Zhang Jie (张 杰), a Senior Software Engineer on the Roku 💜 Big Data Platform team, where I provide data infrastructure and data solutions for both large-scale 🔵 real-time stream processing and data lake batch processing.

  • 🌱 Tech Stack: Flink, Spark, Kafka, Kafka Connect, Presto, Iceberg, Hive, Hadoop, Airflow, Kubernetes, Docker, AWS, Datadog, Jupyter Notebook, Superset, Looker.

Real-Time Stream Processing ✅

Led a Flink- and Kubernetes-powered real-time streaming platform that lets teams build Flink streaming applications and run them seamlessly on Kubernetes clusters. Onboarded other engineering teams and promoted streaming best practices.

Data Lake Batch Processing ✅

Built and maintained a data lake based on Spark, Hive, S3, and Airflow; architected and implemented distributed data ingestion and processing pipelines.

Pinned

  1. multi-class-text-classification-cnn (Public)

    Classify Kaggle Consumer Finance Complaints into 11 classes. Build the model with a CNN (Convolutional Neural Network) and word embeddings on TensorFlow.

    Python, 428 stars, 200 forks

  2. multi-class-text-classification-cnn-rnn (Public)

    Classify Kaggle San Francisco Crime Descriptions into 39 classes. Build the model with a CNN, an RNN (GRU and LSTM), and word embeddings on TensorFlow.

    Python, 592 stars, 263 forks

  3. image-classification-rnn (Public)

    Classify the MNIST image dataset into 10 classes. Build an image classifier with a Recurrent Neural Network (RNN: LSTM) on TensorFlow.

    Python, 86 stars, 48 forks

  4. trinodb/trino (Public)

    Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)

    Java, 9.6k stars, 2.8k forks

  5. apache/hudi (Public)

    Upserts, Deletes And Incremental Processing on Big Data.

    Java, 5.1k stars, 2.4k forks

  6. prestodb/presto (Public)

    The official home of the Presto distributed SQL query engine for big data

    Java, 15.6k stars, 5.3k forks