RSS Bot@lemmy.bestiver.seMB to Hacker News@lemmy.bestiver.seEnglish · 12 days agoCutting inference cold starts by 40x with LP, FUSE, C/R, and CUDA-checkpointmodal.comexternal-linkmessage-square0linkfedilinkarrow-up11arrow-down10file-text
arrow-up11arrow-down1external-linkCutting inference cold starts by 40x with LP, FUSE, C/R, and CUDA-checkpointmodal.comRSS Bot@lemmy.bestiver.seMB to Hacker News@lemmy.bestiver.seEnglish · 12 days agomessage-square0linkfedilinkfile-text