A PyTorch implementation, based on YOLOv4, of the paper "Complex-YOLO: Real-time 3D Object Detection on Point Clouds"
SmartFD: Efficient and Scalable Functional Dependency Discovery on Distributed Data-Parallel Platforms
CUDA C parallel implementation of the Merge operation.
Finite Field Arithmetic Benchmarking on Accelerators, using SYCL
CUDA C parallel implementations of some well-known algorithms.