Yeqiao Fu

Student, University of Hong Kong

u3597466@connect.hku.hk

ParaMind: Collaborative Large Language Model Inference Platform

FYP student – system architect & implementer

Time. 2025–present
Affiliation. The University of Hong Kong (Final Year Project)
Role. FYP owner (architecture + implementation)

Tagline. P2P collaborative inference platform that aims to stitch consumer GPUs into one logical accelerator.

Summary. ParaMind is an ongoing FYP exploring how the heterogeneous devices in a household can jointly serve large language models by combining pipeline and tensor parallelism, NAT traversal, and transparent scheduling. The current focus is on designing the end-to-end architecture and building a working multi-peer prototype.

Highlights.

Keywords. Distributed inference, P2P, pipeline parallelism, systems for ML.

Links. Work in progress – FYP repo, report, and slides to be released after project completion.