CUDA kernels for 4-bit/8-bit quantization are incompatible with standard PyTorch device movement, necessitating device-specific handling

#416
No description provided.
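Since the PR has no description, the following is only a minimal sketch of the kind of device-specific handling the title refers to, not the merged change itself. It assumes the model is a Hugging Face `transformers` model, which sets the `is_loaded_in_4bit` / `is_loaded_in_8bit` flags when loaded with bitsandbytes quantization. The helper name `move_to_device` is hypothetical.

```python
def move_to_device(model, device):
    """Move `model` to `device` unless it was loaded in 4-bit or 8-bit.

    Hypothetical helper: quantized models are placed on their device at
    load time, and calling `.to()` on them may raise an error, so the
    call is skipped for them.
    """
    # Assumption: these flags are set by transformers on quantized models;
    # unquantized models fall back to False.
    quantized = getattr(model, "is_loaded_in_4bit", False) or getattr(
        model, "is_loaded_in_8bit", False
    )
    if quantized:
        # Leave the model where the quantized loader put it.
        return model
    # Standard PyTorch device movement for unquantized models.
    return model.to(device)
```

A caller would replace a direct `model.to(device)` with `model = move_to_device(model, device)`, so the same code path works for both quantized and unquantized models.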
madhavanvenkatesh changed pull request status to closed
madhavanvenkatesh changed pull request status to open
ctheodoris changed pull request status to merged
