z-lab/pile-val-backup
Viewer
•
Updated
•
215k
•
204
Efficient AI
DFlash: Block Diffusion for Flash Speculative Decoding
ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference