Hi there 👋

My report:

HuixiangDou: Overcoming Group Chat Scenarios with LLM-based Technical Assistance (arXiv:2401.08772)

My favorite projects:

  • LLaMA in ONNX format, with a standalone demo that runs without torch

  • How to optimize GEMM: armv7, aarch64, aarch64-int8, cuda, cuda-int4, and vulkan are all supported

  • MegFlow, an ML solution for long-tailed demands, implemented in Rust and Python