prima.cpp: Speeding up 70B-scale LLM inference on low-resource everyday home clusters
#计算机科学#Source code of the paper "Private Collaborative Edge Inference via Over-the-Air Computation".
Official impl. of ACM MM paper "Identity-Aware Attribute Recognition via Real-Time Distributed Inference in Mobile Edge Clouds". A distributed inference model for pedestrian attribute recognition with...